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Streptococcus pneumoniae Polynucleotides and Sequences 

FIELD OF THE INVENTION 

5 The present invention relates to the field of molecular biology. In 

particular, it relates to, among other things, nucleotide sequences of Streptococcus 
pneumoniae, contigs, ORFs, fragments, probes, primers and related 
polynucleotides thereof, peptides and polypeptides encoded by the sequences, and 
uses of the polynucleotides and sequences thereof, such as in fermentation, 

10 polypeptide production, assays and pharmaceutical development, among others. 

BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae has been one of the most extensively studied 

15 microorganisms since its first isolation in 1881. It was the object of many 
investigations that led to important scientific discoveries. In 1928, Griffith 
observed that when heat-killed encapsulated pneumococci and live strains 
constitutively lacking any capsule were concomitantly injected into mice, the 
nonencapsulated could be converted into encapsulated pneumococci with the same 

20 capsular type as the heat-killed strain. Years later, the nature of this "transforming 
principle," or carrier of genetic information, was shown to be DNA. (Avery, O.T., 
etaLJ. Exp. Med., 79; 1 37- 157 (1944)). 

In spite of the vast number of publications on 5. pneumoniae many 
questions about its virulence are still unanswered, and this pathogen remains a 

25 major causative agent of serious human disease, especially community-acquired 
pneumonia, (Johnston, R.B., era/., Rev. Infect, Dis. 7J(Suppl. 6):S509-517 
(1991)). In addition, in developing countries, the pneumococcus is responsible for 
the death of a large number of children under the age of 5 years from pneumococcal 
pneumonia. The incidence of pneumococcal disease is highest in infants under 2 

30 years of age and in people over 60 years of age. Pneumococci are the second most 
frequent cause (after Haemophilus influenzae type b) of bacterial meningitis and 
otitis media in children. With the recent introduction of conjugate vaccines for H . 
influenzae type b, pneumococcal meningitis is likely to become increasingly 
prominent. S. pneumoniae is the most important etiologic agent of community- 
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acquired pneumonia in adults and is the second most common cause of bacterial 
meningitis behind Neisseria meningitidis. 

The antibiotic generally prescribed to treat S. pneumoniae is 
benzylpenicillin, although resistance to this and to other antibiotics is found 

5 occasionally. Pneumococcal resistance to penicillin results from mutations in its 
penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused by 
a sensitive strain, treatment with penicillin is usually successful unless started too 
late. Erythromycin or clindamycin can be used to treat pneumonia in patients 
hypersensitive to penicillin, but resistant strains to these drugs exist. Broad 

10 spectrum antibiotics (e.g., the tetracyc lines) may also be effective, although 
tetracycline-resistant strains are not rare. In spite of the availability of antibiotics, 
the mortality of pneumococcal bacteremia in the last four decades has remained 
stable between 25 and 29%. (Gillespie, S.H., et a/., J. Med. Microbiol. 28:231- 
248 (1989). 

15 S. pneumoniae is carried in the upper respiratory tract by many healthy 

individuals. It has been suggested that attachment of pneumococci is mediated by a 
disaccharide receptor on fibronectin. present on human pharyngeal epithelial cells. 
(Anderson, B.J., et al % J. Immunol 742:2464-2468 (1989). The mechanisms by 
which pneumococci translocate from the nasopharynx to the lung, thereby causing 

20 pneumonia, or migrate to the blood, giving rise to bacteremia or septicemia, are 
poorly understood. (Johnston, R.B., et a/.. Rev. Infect. Dis. /3(SuppI. 6):S509- 
517 (1991). 

Various proteins have been suggested to be involved in the pathogenicity of 
S. pneumoniae, however, only a few of them have actually been confirmed as 

25 virulence factors. Pneumococci produce an IgAl protease that might interfere with 
host defense at mucosal surfaces. (Kornfield, S.J., et a/.. Rev. Inf. Dis. 5:521- 
534 (1981). S. pneumoniae also produces neuraminidase, an enzyme that may 
facilitate attachment to epithelial cells by cleaving sialic acid from the host 
glycolipids and gangliosides. Partially purified neuraminidase was observed to 

30 induce meningitis-like symptoms in mice: however, the reliability of this finding 
has been questioned because the neuraminidase preparations used were probably 
contaminated with cell wall products. Other pneumococcal proteins besides 
neuraminidase are involved in the adhesion of pneumococci to epithelial and 
endothelial cells. These pneumococcal proteins have as yet not been identified. 

35 Recently, Cundell et. reported that peptide permeases can modulate 
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pneumococcal adherence to epithelial and endothelial cells. It was. however, 
unclear whether these permeases function directly as adhesions or whether they 
enhance adherence by modulating the expression of pneumococcal adhesions. 
(DeVelasco, E.A., et aL % Micro. Rev. 59:591-603 (1995). A better understanding 
5 of the virulence factors determining its pathogenicity will need to be developed to 
cope with the devastating effects of pneumococcal disease in humans. 

Ironically, despite the prominent role of 5. pneumoniae in the discovery of 
DNA, little is known about the molecular genetics of the organism. The 5. 
pneumoniae genome consists of one circular, covalently closed, double-stranded 

10 DNA and a collection of so-called variable accessory elements, such as prophages, 
plasinids, transposons and the like. Most physical characteristics and almost all of 
the genes of 5. pneumoniae are unknown. Among the few that have been 
identified, most have not been physically mapped or characterized in detail. Only a 
few genes of this organism have been sequenced. (See, for instance current 

1 5 versions of GENBANK and other nucleic acid databases, and references that relate 
to the genome of S. pneumoniae such as those set out elsewhere herein.) 

It is clear that the etiology of diseases mediated or exacerbated by 5. 
pneumoniae, infection involves the programmed expression of S. pneumoniae 
genes, and that characterizing the genes and their patterns of expression would add 

20 dramatically to our understanding of the organism and its host interactions. 
Knowledge of 5. pneumoniae genes and genomic organization would improve our 
understanding of disease etiology and lead to improved and new ways of 
preventing, ameliorating, arresting and reversing diseases. Moreover, 
characterized genes and genomic fragments of S. pneumoniae would provide 

25 reagents for, among other things, detecting, characterizing and controlling 5. 
pneumoniae infections. There is a need to characterize the genome of 5. 
pneumoniae and for polynucleotides of this organism. 
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SUMMARY OF THE INVENTION 

The present invention is based on the sequencing of fragments of the 
5 Streptococcus pneumoniae genome. The primary nucleotide sequences which were 
generated are provided in SEQ ID NOS: 1-391. 

The present invention provides the nucleotide sequence of several hundred 
contigs of the Streptococcus pneumoniae genome, which are listed in tables below 
and set out in the Sequence Listing submitted herewith, and representative 

i() fragments thereof, in a form which can be readily used, analyzed, and interpreted 
by a skilled artisan. In one embodiment, the present invention is provided as 
contiguous strings of primary sequence information corresponding to the 
nucleotide sequences depicted in SEQ ID NOS: 1-391. 

The present invention further provides nucleotide sequences which are at 

15 least 95% identical to the nucleotide sequences of SEQ ID NOS: 1-391. 

The nucleotide sequence of SEQ ID NOS: 1-39 1, a representative fragment 
thereof, or a nucleotide sequence which is at least 95% identical to the nucleotide 
sequence of SEQ ID NOS: I -391 may be provided in a variety of mediums to 
facilitate its use. In one application of this embodiment, the sequences of the 

20 present invention are recorded on computer readable media. Such media includes, 
but is not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape: optical storage media such as CD-ROM: 
electrical storage media such as RAM and ROM; and hybrids of these categories 
such as magnetic/optical storage media. 

25 The present invention further provides systems, particularly computer- 

based systems which contain the sequence information herein described stored in a 
data storage means. Such systems are designed to identify commercially important 
fragments of the Streptococcus pneumoniae genome. 

Another embodiment of the present invention is directed to fragments of the 

30 Streptococcus pneumoniae genome having particular structural or functional 
attributes. Such fragments of the Streptococcus pneumoniae genome of the present 
invention include, but are not limited to, fragments which encode peptides, 
hereinafter referred to as open reading frames or ORFs. fragments which modulate 
the expression of an operably linked ORF, hereinafter refenred to as expression 

35 modulating fragments or EMFs. and fragments which can be used to diagnose the 
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presence of Streptococcus pneumoniae in a sample, hereinafter referred to as 
diagnostic fragments or DFs. 

Each of the ORFs in fragments of the Streptococcus pneumoniae genome 
disclosed in Tables 1-3, and the EMFs found 5* to the ORFs, can be used in 
5 numerous ways as polynucleotide reagents. For instance, the sequences can be 
used as diagnostic probes or amplification primers for detecting or determining the 
presence of a specific microbe in a sample, to selectively control gene expression in 
a host and in the production of polypeptides, such as polypeptides encoded by 
ORFs of the present invention, particular those polypeptides that have a 

to pharmacological activity. 

The present invention further includes recombinant constructs comprising 
one or more fragments of the Streptococcus pneumoniae genome of the present 
invention. The recombinant constructs of the present invention comprise vectors, 
such as a plasmid or viral vector, into which a fragment of the Streptococcus 

1 5 pneumoniae has been inserted. 

The present invention further provides host cells containing any of the 
isolated fragments of the Streptococcus pneumoniae genome of the present 
invention. The host cells can be a higher eukaryotic host cell, such as a mammalian 
cell, a lower eukaryotic cell, such as a yeast cell, or a procaryotic cell such as a 

20 bacterial cell. 

The present invention is further directed to isolated polypeptides and 
proteins encoded by ORFs of the present invention. A variety of methods, well 
known to those of skill in the art, routinely may be utilized to obtain any of the 
polypeptides and proteins of the present invention. For instance, polypeptides and 

25 proteins of the present invention having relatively short, simple amino acid 
sequences readily can be synthesized using commercially available automated 
peptide synthesizers. Polypeptides and proteins of the present invention also may 
be purified from bacterial cells which naturally produce the protein. Yet another 
alternative is to purify polypeptide and proteins of the present invention from cells 

30 which have been altered to express them. 

The invention further provides methods of obtaining homologs of the 
fragments of the Streptococcus pneumoniae genome of the present invention and 
homologs of the proteins encoded by the ORFs of the present invention. 
Specifically, by using the nucleotide and amino acid sequences disclosed herein as 
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a probe or as primers, and techniques such as_ECR cloning and colony/plaque 
hybridization, one skilled in the an can obtain homoiogs. 

The invention further provides antibodies which selectively bind 
polypeptides and proteins of the present invention. Such antibodies include both 

5 monoclonal and polyclonal antibodies. 

The invention further provides hybridomas which produce the above- 
described antibodies. A hybridoma is an immortalized cell line which is capable of 
secreting a specific monoclonal antibody. 

The present invention further provides methods of identifying test samples 

10 derived from cells which express one of the ORFs of the present invention, or a 
homolog thereof. Such methods comprise incubating a test sample with one or 
more of the antibodies of the present invention, or one or more of the DFs of the 
present invention, under conditions which allow a skilled artisan to determine if the 
sample contains the ORF or product produced therefrom. 

15 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
comprising one of the antibodies, or one of the DFs of the present invention; and 

20 (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of bound antibodies or hybridized 
DFs. 

Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents capable of binding to 

25 a polypeptide or protein encoded by one of the ORFs of the present invention. 
Specifically, such agents include, as further described below, antibodies, peptides, 
carbohydrates, pharmaceutical agents and the like. Such methods comprise steps 
of: (a) contacting an agent with an isolated protein encoded by one of the ORFs of 
the present invention; and (b) determining whether the agent binds to said protein. 

30 The present genomic sequences of Streptococcus pneumoniae will be of 

great value to all laboratories working with this organism and for a variety of 
commercial purposes. Many fragments of the Streptococcus pneumoniae genome 
will be immediately identified by similarity searches against GenBank or protein 
databases and will be of immediate value to Streptococcus pneumoniae researcher's 
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and for immediate commercial value for the production of proteins or to control 
gene expression. 

The methodology and technology for elucidating extensive genomic 
sequences of bacterial and other genomes has and will greatly enhance the ability to 
5 analyze and understand chromosomal organization. In particular, sequenced 
contigs and genomes will provide the models for developing tools for the analysis 
of chromosome structure and function, including the ability to identify genes within 
large segments of genomic DNA, the structure, position, and spacing of regulatory 
elements, the identification of genes with potential industrial applications, and the 
10 ability to do comparative genomic and molecular phylogeny. 

DESCRIPTION OF THE FIGURES 

FIGURE 1 is a block diagram of a computer system (102) that can be 
15 used to implement computer-based systems of present invention. 

FIGURE 2 is a schematic diagram depicting the data flow and computer 
programs used to collect, assemble, edit and annotate the contigs of the 
Streptococcus pneumoniae genome of the present invention. Both Macintosh and 

20 Unix platforms are used to handle the AB 373 and 377 sequence data files, largely 
ds described in Xeriavage et aL. Proceedings of the Twenty-Sixth Annual Hawaii 
International Conference on System Sciences. 585, IEEE Computer Society Press, 
Washington D.C. (1993). Factura (AB) is a Macintosh program designed for 
automatic vector sequence removal and end-trimming of sequence files. The 

25 program Loadis runs on a Macintosh platform and parses the feature data extracted 
from the sequence files by Factura to the Unix based Streptococcus pneumoniae 
relational database. Assembly of contigs (and whole genome sequences) is 
accomplished by retrieving a specific set of sequence files and their associated 
features using Extrseq, a Unix utility for retrieving sequences from an SQL 

3D database. The resulting sequence file is processed by seq_filter to trim portions of 
the sequences with more than 2% ambiguous nucleotides. The sequence files were 
assembled using TIGR Assembler, an assembly engine designed at The Institute 
for Genomic Research ( TIGR ) for rapid and accurate assembly of thousands of 
sequence fragments. The collection of contigs generated by the assembly step is 

35 loaded into the database with the lassie program. Identification of open reading 
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frames (ORFs) is accomplished by processing concigs with zorf or GenMark. The 
ORFs are searched against S. pneumoniae sequences from GenBank and against all 
protein sequences using the BLASTN and BLASTP programs, described in 
Aitschul et a/.. J. MoL Biol. 215: 403-410 (1990)). Results of the ORF 
5 determination and similarity searching steps were loaded into the database. As 
described below, some results of the determination and the searches are set out in 
Tables 1-3. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

.10 

The present invention is based on the sequencing of fragments of the 
Streptococcus pneumoniae genome and analysis of the sequences. The primary 
nucleotide sequences generated by sequencing the fragments are provided in SEQ 
ID NOS: 1-391. (As used herein* the "primary sequence" refers to the nucleotide 

1 5 sequence represented by the IUPAC nomenclature system.) 

In addition to the aforementioned Streptococcus pneumoniae polynucleotide 
and polynucleotide sequences, the present invention provides the nucleotide 
sequences of SEQ ID NOS: 1-39 1, or representative fragments thereof, in a form 
which can be readily used, analyzed, and interpreted by a skilled artisan. 

20 As used herein, a "representative fragment of the nucleotide sequence 

depicted in SEQ ID NOS: i-39i" refers to any portion of the SEQ ID NOS: 1-39 1 
which is not presently represented within a publicly available database. Preferred 
representative fragments of the present invention are Streptococcus pneumoniae 
open reading frames ( ORFs ), expression modulating fragment ( EMFs ) and 

25 fragments which can be used to diagnose the presence of Streptococcus 
pneumoniae in sample ( DFs ). A non-limiting identification of preferred 
representative fragments is provided in Tables 1-3. As discussed in detail below, 
the information provided in SEQ DD NOS: 1-391 and in Tables 1-3 together with 
routine cloning, synthesis, sequencing and assay methods will enable those skilled 

30 in the art to clone and sequence all "representative fragments" of interest, including 
open reading frames encoding a large variety of Streptococcus pneumoniae 
proteins. 

While the presently disclosed sequences of SEQ ID NOS: 1-39 1 are highly 
accurate, sequencing techniques are not perfect and, in relatively rare instances, 
35 further investigation of a fragment or sequence of the invention may reveal a 
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nucleotide sequence error present in a nucleotide sequence disclosed in SEQ ID 
NOS: I -391. However, once the present invention is made available (i.e.. once the 
information in SEQ ID NOS: 1-391 and Tables 1-3 has been made available), 
resolving a rare sequencing error in SEQ ID NOS: 1-391 will be well within the 
5 skill of the art. The present disclosure makes available sufficient sequence 
information to allow any of the described contigs or portions thereof to be obtained 
readily by straightforward application of routine techniques. Further sequencing of 
such polynucleotide may proceed in like manner using manual and automated 
sequencing methods which are employed ubiquitous in the an. Nucleotide 

10 sequence editing software is publicly available. For example. Applied Biosy stem's 
(AB) AutoAssembler can be used as an aid during visual inspection of nucleotide 
sequences. By employing such routine techniques potential errors readily may be 
identified and the correct sequence then may be ascertained by targeting further 
sequencing effort, also of a routine nature, to the region containing the potential 

15 error. 

Even if all of the very rare sequencing errors in SEQ ED NOS: 1-391 were 
corrected, the resulting nucleotide sequences would still be at least 95% identical, 
nearly all would be at least 99% identical, and the great majority would be at least 
99.9% identical to the nucleotide sequences of SEQ ID NOS : 1 -39 1 . 

20 As discussed elsewhere herein, polynucleotides of the present invention 

readily may be obtained by routine application of well known and standard 
procedures for cloning and sequencing DNA. Detailed methods for obtaining 
libraries and for sequencing are provided below, for instance. A wide variety of 
Streptococcus pneumoniae strains that can be used to prepare 5. pneumoniae 

25 genomic DNA for cloning and for obtaining polynucleotides of the present 
invention are available to the public from recognized depository institutions, such 
as the American Type Culture Collection ( ATCC ). While the present invention is 
enabled by the sequences and other information herein disclosed, the S. 
pneumoniae strain that provided the DNA of the present Sequence Listing, Strain 

30 7/87 14.8.91, has been deposited in the ATCC, as a convenience to those of skill 
in the art. As a further convenience, a library of 5. pneumoniae genomic DNA. 
derived from the same strain, also has been deposited in the ATCC. The 5. 
pneumoniae strain was deposited on October 10, 1996, and was given Deposit No. 
55840, and the cDNA library was deposited on October 1 1, 1996 and was given 

35 Deposit No. 97755. The genomic fragments in the library are 15 to 20 kb 
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fragments generated by partial Sau3Al digestion and they are inserted into the 
BamHI site in the well-known lambda-derived vector lambda DASH II (Stratagene, 
La Jolla, CA). The provision of the deposits is not a waiver of any rights of the 
inventors or their assignees in the present subject matter. 
5 The nucleotide sequences of the genomes from different strains of 

Streptococcus pneumoniae differ somewhat. However, the nucleotide sequences 
of the genomes of all Streptococcus pneumoniae strains will be at least 95% 
identical, in corresponding part, to the nucleotide sequences provided in SEQ ID 
NOS: 1-391. Nearly all will be at least 99% identical and the great majority will be 

10 99.9% identical. 

Thus, the present invention further provides nucleotide sequences which 
are at least 95%, preferably 99% and most preferably 99.9% identical to the 
nucleotide sequences of SEQ ID NOS: 1-391, in a form which can be readily used, 
analyzed and interpreted by the skilled artisan. 

15 Methods for determining whether a nucleotide sequence is at least 95%, at 

least 99% or at least 99.9% identical to the nucleotide sequences of SEQ ID 
NOS: 1-39 1 are routine and readily available to the skilled artisan. For example, the 
well known fasta algorithm described in Pearson and Lipman, Proc. Natl. Acad. 
ScL USA 85: 2444 (1988) can be used to generate the percent identity of nucleotide 

.0 sequences. The BLASTN program also can be used to generate an identity score 
or polynucleotides compared to one another. 

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS: 1-391, a representative 
25 fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% 
and most preferably at least 99.9% identical to a polynucleotide sequence of SEQ 
ID NOS: 1-391 may be "provided" in a variety of mediums to facilitate use thereof. 
As used herein, provided refers to a manufacture, other than an isolated nucleic 
acid molecule, which contains a nucleotide sequence of the present invention; i.e.. 
30 a nucleotide sequence provided in SEQ ID NOS: 1-39 1, a representative fragment 
thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most 
preferably at least 99.9% identical to a polynucleotide of SEQ ID NOS: 1-391. 
Such a manufacture provides a large portion of the Streptococcus pneumoniae 
genome and parts thereof (e.g., a Streptococcus pneumoniae open reading frame 
35 (ORF)) in a form which allows a skilled artisan to examine the manufacture using 
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means not directly applicable to examining the Streptococcus pneumoniae genome 
or a subset thereof as it exists in nature or in purified form. 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
5 readable media" refers to any medium which carv-be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, 
such as floppy discs, hard disc storage medium, and magnetic tape; optical storage 
media such as CD- ROM; electrical storage media such as RAM and ROM; and 
hybrids of these categories, such as magnetic/optical storage media. A skilled 

10 artisan can readily appreciate how any of the presently known computer readable 
mediums can be used to create a manufacture comprising computer readable 
medium having recorded thereon a nucleotide sequence of the present invention. 
Likewise, it will be clear to those of skill how additional computer readable media 
that may be developed also can be used to create analogous manufactures having 

1 5 recorded thereon a nucleotide sequence of the present invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable medium. A skilled artisan can readily adopt any of the presently 
know methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present 

20 invention. A variety of data storage structures are available to a skilled artisan 
for creating a computer readable medium having recorded thereon a nucleotide 
sequence of the present invention. The choice of the data storage structure will 
generally be based on the means chosen to access the stored information. In 
addition, a variety of data processor programs and formats can be used to store the 

25 nucleotide sequence information of the present invention on computer readable 
medium. The sequence information can be represented in a word processing text 
file, formatted in commercially- available software such as WordPerfect and 
MicroSoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily 

30 adapt any number of data-processor structuring formats (e.g., text file or database) 
in order to obtain computer readable medium having recorded thereon the 
nucleotide sequence information of the present invention. 

Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Thus, by 

35 providing in computer readable form the nucleotide sequences of SEQ CD NOS:l- 
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391, a representative fragment thereof, or a nucleotide sequence at least 95%, 
preferably at least 99% and most preferably at least 99.9% identical to a sequence 
of SEQ ID NOS: 1-391 the present invention enables the skilled artisan routinely to 
access the provided sequence information for a wide variety of purposes. 
5 The examples which follow demonstrate how software which implements 

the BLAST (Altschul et a/., J. Mol Biol. 2/5:403^10 (1990)) and BLAZE 
(Brutlag et a/., Comp. Chem. 77:203-207 (1993)) search algorithms on a Sybase 
system was used to identify open reading frames (ORFs) within the Streptococcus 
pneumoniae genome which contain homology to ORFs or proteins from both 

10 Streptococcus pneumoniae and from other organisms. Among the ORFs discussed 
herein are protein encoding fragments of the Streptococcus pneumoniae genome 
useful in producing commercially important proteins, such as enzymes used in 
fermentation reactions and in the production of commercially useful metabolites. 

The present invention further provides systems, particularly computer- 

15 based systems, which contain the sequence information described herein. Such 
systems are designed to identify, among other things, commercially important 
fragments of the Streptococcus pneumoniae genome. 

As used herein, "a computer-based system" refers to the hardware means, 
software means, and data storage means used to analyze the nucleotide sequence 

20 information of the present invention. The minimum hardware means of the 
computer-based systems of the present invention comprises a central processing 
unit (CPU), input means, output means, and data storage means. A skilled artisan 
can readily appreciate that any one of the currently available computer-based 
systems are suitable for use in the present invention. 

25 As stated above, the computer-based systems of the present invention 

comprise a data storage means having stored therein a nucleotide sequence of the 
present invention and the necessary hardware means and software means for 
supporting and implementing a search means. 

As used herein, 'data storage means" refers to memory which can store 

30 nucleotide sequence information of the present invention, or a memory access 
means which can access manufactures having recorded thereon the nucleotide 
sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target 

35 structural motif with the sequence information stored within the data storage 
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means. Search means are used to identify fragments or regions of the present 
genomic sequences which match a particular target sequence or target motif. A 
variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the 
5 computer-based systems of the present invention. Examples of such software 
includes, but is not limited to, MacPattem (EMBL), BLASTN and BLASTX 
(NCBIA). A skilled artisan can readily recognize that any one of the available 
algorithms or implementing software packages for conducting homology searches 
can be adapted for use in the present computer-based systems. 

10 As used herein, a "target sequence" can be any DNA or amino acid 

sequence of six or more nucleotides or two or more amino acids. A skilled artisan 
can readily recognize that the longer a target sequence is, the less likely a target 
sequence will be present as a random occurrence in the database. The most 
preferred sequence length of a target sequence is from about 10 to 100 amino acids 

15 or from about 30 to 300 nucleotide residues. However, it is well recognized that 
searches for commercially important fragments, such as sequence fragments* 
involved in gene expression and protein processing, may be of shorter length. 

As used herein, "a target structural motif," or "target motif," refers to any 
rationally selected sequence or combination of sequences in which the sequence(sX 

20 are chosen based on a three-dimensional configuration which is formed upon the 
folding of the target motif. There arc a variety of target motifs known in the art. 
Protein target motifs include, but are not limited to. enzymic active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, promoter 
sequences, hairpin structures and inducible expression elements (protein binding 

25 sequences). 

A variety of structural formats for the input and output means can be used 
to input and output the information in the computer-based systems of the present 
invention. A preferred format for an output means ranks fragments of the 
Streptococcus pneumoniae genomic sequences possessing varying degrees of 

30 homology to the target sequence or target motif. Such presentation provides a 
skilled artisan with a ranking of sequences which contain various amounts of the 
target sequence or target motif and identifies the degree of homology contained in 
the identified fragment. 

A variety of comparing means can be used to compare a target sequence or 

35 target motif with the data storage means to identify sequence fragments of the 



WO 98/18931 



14 



PCT/US97/19588 



Streptococcus pneumoniae genome. In the present examples, implementing 
software which implement the BLAST and BLAZE algorithms, described in 
Altschul et a/., 7. MoL Biol, 215: 403-410 (1990), is used to identify open reading 
frames within the Streptococcus pneumoniae genome. A skilled artisan can readily 
5 recognize that any one of the publicly available homology search programs can be 
used as the search means for the computer-based systems of the present invention. 
Of course, suitable proprietary systems that may be known to those of skill also 
may be employed in this regard. 

Figure 1 provides a block diagram of a computer system illustrative of 

10 embodiments of this aspect of present invention. The computer system 102 
includes a processor 106 connected to a bus 104. Also connected to the bus 104 
are a main memory 108 (preferably implemented as random access memory, RAM) 
and a variety of secondary storage devices 1 10, such as a hard drive 1 12 and a 
removable medium storage device 1 14. The removable medium storage device 1 14 

15 may represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape 
drive, etc. A removable storage medium 1 16 (such as a floppy disk, a compact 
disk, a magnetic tape, etc) containing control logic and/or data recorded therein 
may be inserted into the removable medium storage device 1 14. The computer 
system 102 includes appropriate software for reading the control logic and/or the 

20 data from the removable medium storage device 1 14, once it is inserted into the 
removable medium storage device 114. 

A nucleotide sequence of the present invention may be stored in a well 
known manner in the main memory 108, any of the secondary storage devices 1 10. 
and/or a removable storage medium 116. During execution, software for accessing 

25 and processing the genomic sequence (such as search tools, comparing tools, etc.) 
reside in main memory 108, in accordance with the requirements and operating 
parameters of the operating system, the hardware system and the software program 
or programs. 
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BIOCHEMICAL EMBODIMENTS 

Other embodiments of the present invention are directed to isolated 
fragments of the Streptococcus pneumoniae genome. The fragments of the 
5 Streptococcus pneumoniae genome of the present invention include, but are not 
limited to fragments which encode peptides and polypeptides, hereinafter open 
reading frames (ORFs), fragments which modulate the expression of an operabfy 
linked ORF, hereinafter expression modulating fragments (EMFs) and fragments 
which can be used to diagnose the presence of Streptococcus pneumoniae in a 

to sample, hereinafter diagnostic fragments (DFs). 

As used herein, an 'isolated nucleic acid molecule" or an "isolated fragment 
of the Streptococcus pneumoniae genome" refers to a nucleic acid molecule 
possessing a specific nucleotide sequence which has been subjected to purification 
means to reduce, from the composition, the number of compounds which arc 

15 normally associated with the composition. Particularly, the term refers to the 
nucleic acid molecules having the sequences set out in SEQ ID NOS: 1-39L* to 
representative fragments thereof as described above, to polynucleotides at least 
95%, preferably at least 99% and especially preferably at least 99.9% identical in 
sequence thereto, also as set out above. : 

20 A variety of purification means can be used to generate the isolated 

fragments of the present invention. These include, but are not limited to methods 
which separate constituents of a solution based on charge, solubility, or size. 

In one embodiment. Streptococcus pneumoniae DNA can be enzymaucally 
sheared to produce fragments of 15-20 kb in length. These fragments can then be 

25 used to generate a Streptococcus pneumoniae library by inserting them into lambda 
clones as described in the Examples below. Primers flanking, for example, an 
ORF, such as those enumerated in Tables 1-3 can then be generated using 
nucleotide sequence information provided in SEQ ID NOS: 1-391. Well known 
and routine techniques of PCR cloning then can be used to isolate the ORF from 

30 the lambda DNA library or Streptococcus pneumoniae genomic DNA. Thus, given 
the availability of SEQ ID NOS: 1-391, the information in Tables 1. 2 and 3, and 
the information that may be obtained readily by analysis of the sequences of SEQ 
ID NOS: 1-391 using methods set out above, those of skill will be enabled by the 
present disclosure to isolate any ORF-containing or other nucleic acid fragment of 

35 the present invention. 
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The isolated nucleic acid molecules of the: present invention include, but arc 
not limited to single stranded and double stranded DNA, and single stranded RNA. 

As used herein, an "open reading frame," ORF, means a series of triplets 
coding for amino acids without any termination codons and is a sequence 
5 translatable into protein. 

Tables 1, 2, and 3 list ORFs in the Streptococcus pneumoniae genomic 
contigs of the present invention that were identified as putative coding regions by 
the GeneMark software using organism-specific second-order Markov probability 
transition matrices. It will be appreciated that other criteria can be used, in 
10 accordance with well known analytical methods, such as those discussed herein, to 
generate more inclusive, more restrictive, or more selective lists. 

Table 1 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that over a continuous region of at least 50 bases are 95% or 
more identical (by BLAST analysis) to a nucleotide sequence available through 
15 GenBank in October, 1997. 

Table 2 sets out ORFs in the Streptococcus pneumoniae contigs of the 
present invention that are not in Table 1 and match, with a BLASTP probability 
score of 0.01 or less, a polypeptide sequence available through GenBank in 
October, 1997. 

20 Table 3 sets out ORFs in the Streptococcus pneumoniae contigs of the 

present invention that do not match significantly, by BLASTP analysis, a 
polypeptide sequence available through GenBank in October, 1997. 

In each table, the first and second columns identify the ORF by, 
respectively, contig number and ORF number within the contig; the third column 

25 indicates the first nucleotide of the ORF (actually the first nucleotide of the stop 
codon immediately preceeding the ORF). counting from the 5' end of the contig 
strand; and the fourth column, "stop (nt) M indicates the last nucleotide of the stop 
codon defining the 3'end of the ORF. 

In Tables 1 and 2, column five, lists the Reference for the closest 

30 matching sequence available through GenBank. These reference numbers are the 
databases entry numbers commonly used by those of skill in the art, who will be 
familiar with their denominators. Descriptions of the nomenclature are available 
from the National Center for Biotechnology Information. Column six in Tables 1 
and 2 provides the gene name of the matching sequence; column seven provides 

35 the BLAST identity score and column eight the BLAST similarity score from the 
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comparison of the ORF and the homologous gene : and column nine indicates the 
length in nucleotides of the highest scoring segment pair identified by the BLAST 
identity analysis. 

Each ORF described in the tables is defined by "start (nt)" (5') and "stop 
5 (nt)" (3') nucleotide position numbers. These position numbers refer to the 
boundaries of each ORF and provide orientation with respect to whether the 
forward or reverse strand is the coding strand and which reading frame the coding 
sequence is contained. The "start" position is the first nucleotide of the triplet 
encoding a stop codon just 5' to the ORF and the "stop" position is the last 

10 nucleotide of the triplet encoding the next in-frame stop codon (i.e., the stop codon 
at the 3' end of the ORF). Those of ordinary skill in the art appreciate that 
preferred fragments within each ORF described in the table include fragments of 
each ORF which include the entire sequence from the delineated ''start" and "stop" 
positions excepting the first and last three nucleotides since these encode stop 

15 codons. Thus, polynucleotides set out as ORFs in the tables but lacking the three 
(3) 5* nucleotides and the three (3) 3* nucleotides are encompassed by the present 
invention. Those of skill also appreciate that particularly preferred are fragments 
within each ORF that are polynucleotide fragments comprising polypeptide coding 
sequence. As defined herein, "coding sequence" includes the fragment within an 

20 ORF beginning at the first in-framc ATG (triplet encoding methionine) and ending 
with the iast nucleotide prior to the triplet encoding the 3' stop codon. Preferred 
are fragments comprising the entire coding sequence and fragments comprising the 
entire coding sequence, excepting the coding sequence for the N-terminal 
methionine. Those of skill appreciate that the N-terminal methionine is often 

25 removed during post-translational processing and that polynucleotides lacking the 
ATG can be used to facilitate production of N-termainal fusion proteins which may 
be benefical in the production or use of genetically engineered proteins. Of course, 
due to the degeneracy of the genetic code many polynucleotides can encode a given 
polypeptide. Thus, the invention further includes polynucleotides comprising a 

30 nucleotide sequence encoding a polypeptide sequence itself encoded by the coding 
sequence within an ORF described in Tables 1-3 herein. Further, polynucleotides 
at least 95%, preferably at least 99% and especially preferably at least 99.9% 
identical in sequence to the foregoing polynucleotides, are contemplated by the 
present invention. 
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Polypeptides encoded by polynucleotides described above and elsewhere 
herein are also provided by the present invention as are polypeptide comprising a 
an amino acid sequence at least about 95%, preferably at least 97% and even more 
preferably 99% identical to the amino acid sequence of a polypeptide encoded by an 
5 ORF shown in Tables 1-3. These polypeptides may or may not comprise an N- 
terminal methionine. 

The concepts of percent identity and percent similarity of two polypeptide 
sequences is well understood in the art. For example, two polypeptides 10 amino 
acids in length which differ at three amino acid positions (e.g., at positions I, 3 

]0 and 5) are said to have a percent identity of 70%. However, the same two 
polypeptides would be deemed to have a percent similarity of 80% if, for example 
at position 5, the amino acids moieties, although not identical, were "similar* (i.e., 
possessed similar biochemical characteristics). Many programs for analysis of 
nucleotide or amino acid sequence similarity, such as fasta and BLAST specifically 

15 list percent identity of a matching region as an output parameter. Thus, for 
instance, Tables 1 and 2 herein enumerate the percent identity of the highest 
scoring segment pair in each ORF and its listed relative. Further details 
concerning the algorithms and criteria used for homology searches are provided 
below and are described in the pertinent literature highlighted by the citations 

20 provided below. 

It will be appreciated that other criteria can be used to generate more 
inclusive and more exclusive listings of the types set out in the tables. As those of 
skill will appreciate, narrow and broad searches both are useful. Thus, a skilled 
artisan can readily identify ORFs in contigs of the Streptococcus pneumoniae 

25 genome other than those listed in Tables 1-3. such as ORFs which are overlapping 
or encoded by the opposite strand of an identified ORF in addition to those 
ascertainable using the computer-based systems of the present invention. 

As used herein, an "expression modulating fragment," EMF, means a 
series of nucleotide molecules which modulates the expression of an operably 

30 linked ORF or EMF. 
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As used herein, a sequence is said to "modulate the expression of an 
operably linked sequence" when the expression of the sequence is altered by the 
presence of the EMF. EMFs include, but are not limited to, promoters, and 
promoter modulating sequences (inducible elements). One class of EMFs are 
5 fragments which induce the expression or an operably linked ORF in response to a 
specific regulatory factor or physiological event, 

EMF sequences can be identified within the contigs of the Streptococcus 
pneumoniae genome by their proximity to the ORFs provided in Tables 1-3. An 
intergenic segment, or a fragment of the intergenic segment, from about 10 to 200 

10 nucleotides in length, taken from any one of the ORFs of Tables 1-3 will modulate 
the expression of an operably linked ORF in a fashion similar to that found with the 
naturally 1'nked ORF sequence. As used herein, an "intergenic segment" refers to 
fragments of the Streptococcus pneumoniae genome which are between two 
ORF(s) herein described. EMFs also can be identified using known EMFs as a 

15 target sequence or target motif in the computer-based systems of the present 
invention. Further, the two methods can be combined and used together. 

The presence and activity of an EMF can be confirmed using an EMF trap 
vector. An EMF trap vector contains a cloning site linked to a marker sequence. A 
marker sequence encodes an identifiable phenotype, such as antibiotic resistance or 

20 a complementing nutrition auxotrophic factor, which can be identified or assayed 
when the EMF trap vector is placed within an appropriate host under appropriate 
conditions. As described above, a EMF will modulate the expression of an 
operably linked marker sequence. A more detailed discussion of various marker 
sequences is provided below. A sequence which is suspected as being an EMF is 

25 cloned in all three reading frames in one or more restriction sites upstream from the 
marker sequence in the EMF trap vector. The vector is then transformed into an 
appropriate host using known procedures and the phenotype of the transformed 
host in examined under appropriate conditions. As described above, an EMF will 
modulate the expression of an operably linked marker sequence. 

30 As used herein, a "diagnostic fragment," DF, means a series of nucleotide 

molecules which selectively hybridize to Streptococcus pneumoniae sequences. 
DFs can be readily identified by identifying unique sequences within contigs of the 
Streptococcus pneumoniae genome, such as by using well-known computer 
analysis software, and by generating and testing probes or amplification primers 
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consisting of the DF sequence in an appropriate diagnostic format which 
determines amplification or hybridization selectivity. 

The sequences falling within the scope of the present invention are not 
limited to the specific sequences herein described, but also include allelic and 
5 species variations thereof. Allelic and species variations can be routinely 
determined by comparing the sequences provided in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95%, preferably 
at least 99% and most at least preferably 99.9% identical to SEQ ID NOS: 1-391, 
with a sequence from another isolate of the same species. Furthermore, to 

10 accommodate codon variability, the invention includes nucleic acid molecules 
coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF, substitution of one codon 
for another which encodes the same amino acid is expressly contemplated. Any 
specific sequence disclosed herein can be readily screened for errors by 

15 resequencing a particular fragment, such as an ORF, in both directions (i.e., 
sequence both strands). Alternatively, error screening can be performed by 
sequencing corresponding polynucleotides of Streptococcus pneumoniae origin 
isolated by using part or all of the fragments in question as a probe or primer. 

Preferred DFs of the present invention comprise at least about 17, 

20 preferrably at least about 20, and more preferrably at least about 50 contiguous 
nucleotides within an ORF set out in Tables 1-3. Most highly preferred DFs 
specifically hybridize to a polynucleotide containing the sequence of the ORF from 
which they are derived. Specific hybridization occurs even under stringent 
conditions defined elsewhere herein. 

25 Each of the ORFs of the Streptococcus pneumoniae genome disclosed in 

Tables 1, 2 and 3, and the EMFs found 5' to the ORFs, can be used as 
polynucleotide reagents in numerous ways. For example, the sequences can be 
used as diagnostic probes or diagnostic amplification primers to detect the presence 
of a specific microbe in a sample, particularly Streptococcus pneumoniae. 

30 Especially preferred in this regard are ORFs such as those of Table 3, which do not 
match previously characterized sequences from other organisms and thus are most 
likely to be highly selective for Streptococcus pneumoniae. Also particularly 
preferred are ORFs that can be used to distinguish between strains of Streptococcus 
pneumoniae, particularly those that distinguish medically important strain, such as 

35 drug-resistant strains. 
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In addition, the fragments of the present invention, as broadly described, 
can be used to control gene expression through triple helix formation or antisense 
DNA or RNA, both of which methods are based on the binding of a polynucleotide 
sequence to DNA or RNA. Triple helix- formation optimally results in a shut-off of 
5 RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Information from the 
sequences of the present invention can be used to design antisense and triple helix- 
forming oligonucleotides. Polynucleotides suitable for use in these methods are 
usually 20 to 40 bases in length and are designed to be complementary to a region 

10 of the gene involved in transcription, for triple-helix formation, or to the mRNA 
itself, for antisense inhibition. Both techniques have been demonstrated to be 
effective in model systems, and the requisite techniques are well known and 
involve routine procedures. Triple helix techniques are discussed in, for example. 
Lee et a/., Nuci Acids Res, 6:3073 (1979); Cooney et a/.. Science 247:456 

15 (1988); and Dervan et a/.. Science 257:1360 (1991). Antisense techniques in 
general are discussed in, for instance, Okano, J. Neurochem. 56:560 (1991) and 
Oligodeoxy nucleotides as Antisense Inhibitors of Gene Expression, CRC Press. 
Boca Raton, FL (1988)). 

The present invention further provides recombinant constructs comprising 

20 one or more fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention. Certain preferred recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a 
fragment of the Streptococcus pneumoniae genome has been inserted, in a forward 
or reverse orientation. In the case of a vector comprising one of the ORFs of the 

25 present invention, the vector may further comprise regulatory sequences, including 
for example, a promoter, operably linked to the ORF. For vectors comprising the 
EMFs of the present invention, the vector may further comprise a marker sequence 
or heterologous ORF operably linked to the EMF. 

Large numbers of suitable vectors and promoters are known to those of 

30 skill in the an and are commercially available for generating the recombinant 
constructs of the present invention. The following vectors are provided by way of 
example. Useful bacterial vectors include phagescript, PsiX174, pBluescript SK, 
pBS KS, pNH8a, pNH16a, pNH18a, pNH46a (available from Stratagene); 
pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (available from Pharmacia). 

35 Useful eukaryotic vectors include pWLneo, pSV2cat, pOG44, pXTl. pSG 
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(available from Stratagene) pSVK3, pBPV, pMSG, pSVL (available from 
Pharmacia). 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
5 Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad. lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein- I. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the art. 

10 The present invention further provides host cells containing any one of the 

isolated fragments of the Streptococcus pneumoniae genomic fragments and 
contigs of the present invention, wherein the fragment has been introduced into the 
host cell using known methods. The host cell can be a higher eukaryotic host 
cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or 

1 5 a procaryotic cell, such as a bacterial cell. 

A polynucleotide of the present invention, such as a recombinant construct 
comprising an ORF of the present invention, may be introduced into the host by a 
variety of well established techniques that are standard in the art, such as calcium 
phosphate transfection, DEAE, dextran mediated transfection and electroporation, 

20 which are described in, for instance. Davis, L. et a/., BASIC METHODS IN 
MOLECULAR BIOLOGY (1986). 

A host cell containing one of the fragments of the Streptococcus 
pneumoniae genomic fragments and contigs of the present invention, can be used 
in conventional manners to produce the gene product encoded by the isolated 

25 fragment (in the case of an ORF) or can be used to produce a heterologous protein 
under the control of the EMF. The present invention further provides 

isolated polypeptides encoded by the nucleic acid fragments of the present 
invention or by degenerate variants of the nucleic acid fragments of the present 
invention. By "degenerate variant" is intended nucleotide fragments which differ 

30 from a nucleic acid fragment of the present invention {e.g.. an ORF) by nucleotide 
sequence but, due to the degeneracy of the Genetic Code, encode an identical 
polypeptide sequence. 

Preferred nucleic acid fragments of the present invention are the ORFs and 
subfragments thereof depicted in Tables 2 and 3 which encode proteins. 
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A variety of methodologies known in the an can be utilized to obtain any 
one of the isolated polypeptides or proteins of the present invention. At the 
simplest level, the amino acid sequence can be synthesized using commercially 
available peptide synthesizers. This is particularly useful in producing small 
5 peptides and fragments of larger polypeptides. -Such short fragments as may be 
obtained most readily by synthesis are useful, for example, in generating antibodies 
against the native polypeptide, as discussed further below. 

In an alternative method, the polypeptide or protein is purified from 
bacterial cells which naturally produce the polypeptide or protein. One skilled in 

10 the art can readily employ well-known methods for isolating polypeptides and 
proteins to isolate and purify polypeptides or proteins of the present invention 
produced naturally by a bacterial strain, or by other methods. Methods for 
isolation and purification that can be employed in this regard include, but are not 
limited to, immunochromatography, HPLC, size-exclusion chromatography, ion- 

15 exchange chromatography, and immuno-affinity chromatography. 

The polypeptides and proteins of the present invention also can be purified- 
from cells which have been altered to express the desired polypeptide or protein. 
As used herein, a cell is said to be altered to express a desired polypeptide or 
protein when the cell, through genetic manipulation, is made to produce a« 

20 polypeptide or protein which it normally docs not produce or which the cell 
normally produces at a lower level. Those skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic 
sequences into eukaryotic or prokaryotic cells in order to generate a cell which 
produces one of the polypeptides or proteins of the present invention. 

25 Any host/vector system can be used to express one or more of the ORFs of 

the present invention. These include, but are not limited to, eukaryotic hosts such 
as HeLa cells, CV-1 cell, COS cells, and Sf9 cells, as well as prokaryotic host 
such as E. coli and B. subtilis. The most preferred cells are those which do not 
normally express the particular polypeptide or protein or which expresses the 

30 polypeptide or protein at low natural level. 
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"Recombinant," as used herein, means that a polypeptide or protein is 
derived from recombinant {e.g., microbial or mammalian) expression systems. 
"Microbial" refers to recombinant polypeptides or proteins made in bacterial or 
fungal {e.g., yeast) expression systems. As a product, 'recombinant 
5 microbiaTdefines a polypeptide or protein essentially free of native endogenous 
substances and unaccompanied by associated native glycosylation. Polypeptides or 
proteins expressed in most bacterial cultures, e.g., E. coli. will be free of 
glycosylation modifications; polypeptides or proteins expressed in yeast will have a 
glycosylation pattern different from that expressed in mammalian cells. 

10 "Nucleotide sequence" refers to a heteropolymer of deoxy ribonucleotides. 

Generally, DNA segments encoding the polypeptides and proteins provided by this 
invention are assembled from fragments of the Streptococcus pneumoniae genome 
and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a 
synthetic gene which is capable of being expressed in a recombinant transcriptional 

15 unit comprising regulatory elements derived from a microbial or viral operon. 

Recombinant expression vehicle or vector" refers to a plasmid or phage or 
virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. The 
expression vehicle can comprise a transcriptional unit comprising an assembly of 
(1) a genetic regulatory elements necessary for gene expression in the host, 

20 including elements required to initiate and maintain transcription at a level sufficient 
for suitable expression of the desired polypeptide, including, for example, 
promoters and, where necessary, an enhancer and a polyadenylation signal; (2) a 
structural or coding sequence which is transcribed into mRNA and translated into 
protein, and (3) appropriate signals to initiate translation at the beginning of the 

25 desired coding region and terminate translation at its end. Structural units intended 
for use in yeast or eukaryotic expression systems preferably include a leader 
sequence enabling extracellular secretion of translated protein by a host cell. 
Alternatively, where recombinant protein is expressed without a leader or transport 
sequence, it may include an N-terminal methionine residue. This residue may or 

30 may not be subsequently cleaved from the expressed recombinant protein to 
provide a final product. 

"Recombinant expression system" means host cells which have stably 
integrated a recombinant transcriptional unit into chromosomal DNA or carry the 
recombinant transcriptional unit extra chromosomaily. The cells can be prokaryotic 

35 or eukaryotic. Recombinant expression systems as defined herein will express 
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heterologous polypeptides or proteins upon induction of the regulatory elements 
linked to the DNA segment or synthetic gene to be expressed. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or 
other cells under the control of appropriate promoters. Cell-free translation 
5 systems can also be employed to produce such proteins using RNAs derived from 
the DNA constructs of the present invention. Appropriate cloning and expression 
vectors for use with prokaryotic and eukaryotic hosts are described in Sambrook ei 
a/.. Molecular Cloning: A Laboratory Manual 2 nd Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989), the disclosure of which 

10 is hereby incorporated by reference in its entirety. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, e.g., 
the ampicillin resistance gene of £. coli and S. cerevisiae TRPl gene, and a 
promoter derived from a highly expressed gene to direct transcription of a 

15 downstream structural sequence. Such promoters can be derived from operdns 
encoding glycolytic enzymes such as 3- phosphoglycerate kinase (PGK), alpha- 
factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and 
termination sequences, and preferably, a leader sequence capable of directing 

20 secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N- 
terminai identification peptide imparting desired characteristics, e.g., stabilization 
or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 

25 structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic selectable 
markers and an origin of replication to ensure maintenance of the vector and, when 
desirable, provide amplification within the host. 

30 Suitable prokaryotic hosts for transformation include strains of E. coli, B. 

subtilis. Salmonella ryphimurium and various species within the genera 
Pseudomonas and Streptomyces. Others may, also be employed as a matter of 
choice. 

As a representative but non-limiting example, useful expression vectors for 
35 bacterial use can comprise a selectable marker and bacterial origin of replication 
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derived from commercially available plasmids comprising genetic elements of the 
well known cloning vector pBR322 (ATCC 37017). Such commercial vectors 
include, for example, pKK223-3 (available form Pharmacia Fine Chemicals, 
Uppsala, Sweden) and GEM 1 (available from Promega Biotec, Madison, WI. 
5 USA). These pBR322 "backbone" sections -are combined with an appropriate 
promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter, where it is inducible, is 
derepressed or induced by appropriate means (e.g., temperature shift or chemical 

10 induction) and cells are cultured for an additional period to provide for expression 
of the induced gene product. Thereafter cells are typically harvested, generally by 
centnfugation, disrupted to release expressed protein, generally by physical or 
chemical means, and the resulting crude extract is retained for further purification. 

Various mammalian cell culture systems can also be employed to express 

15 recombinant protein. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts, described in Gluzrnan, Cell 23:175 
( 198 1), and other cell lines capable of expressing a compatible vector, for example, 
the CI 27, 3T3, CHO, HeLa and BHK ceil lines. 

Mammalian expression vectors will comprise an origin of replication, a 

20 suitable promoter and enhancer, and also any necessary ribosome binding sites, 
polyadenyiation site, splice donor and acceptor sites, transcriptional termination 
sequences, and 5* flanking nontranscribed sequences. DNA sequences derived 
from the S V40 viral genome, for example, SV40 origin, early promoter, enhancer, 
splice, and polyadenyiation sites may be used to provide the required 

25 nontranscribed genetic elements. 

Recombinant polypeptides and proteins produced in bacterial culture is 
usually isolated by initial extraction from cell pellets, followed by one or more 
salting-out, aqueous ion exchange or size exclusion chromatography steps. 
Microbial cells employed in expression of proteins can be disrupted by any 

30 convenient method, including freeze-thaw cycling, sonication, mechanical 
disruption, or use of cell lysing agents. Protein refolding steps can be used, as 
necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification 
steps. 
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The present invention further includes isolated polypeptides, proteins and 
nucleic acid molecules which are substantially equivalent to those herein described. 
As used herein, substantially equivalent can refer both to nucleic acid and amino 
acid sequences, for example a mutant sequence, that varies from a reference 
5 sequence by one or more substitutions, deletions, or additions, the net effect of 
which does not result in an adverse functional dissimilarity between reference and 
subject sequences. For purposes of the present invention, sequences having 
equivalent biological activity, and equivalent expression characteristics are 
considered substantially equivalent. For purposes of determining equivalence, 

10 truncation of the mature sequence should be disregarded. 

The invenuon further provides methods of obtaining homologs from other 
strains of Streptococcus pneumoniae, of the fragments of the Streptococcus 
pneumoniae genome of the present invention and homologs of the proteins encoded 
by the ORFs of the present invention. As used herein, a sequence or protein of 

15 Streptococcus pneumoniae is defined as a homolog of a fragment of the 
Streptococcus pneumoniae fragments or contigs or a protein encoded by one of the 
ORFs of the present invention, if it shares significant homology to one of the 
fragments of the Streptococcus pneumoniae genome of the present invention or a 
protein encoded by one of the ORFs of the present invention. Specifically, by 

20 using the sequence disclosed herein as a probe or as primers, and techniques such 
as PCR cloning and colony/plaque hybridization, one skilled in the art can obtain 
homologs. 

As used herein, two nucleic acid molecules or proteins are said to "share 
significant homology" if the two contain regions which possess greater than 85% 

25 sequence (amino acid or nucleic acid) homology. Preferred homologs in this 
regard are those with more than 90% homology. Especially preferred are those 
with 93% or more homology. Among especially preferred homologs those with 
95% or more homology are particularly preferred. Very particularly preferred 
among these are those with 97% and even more particularly preferred among those 

30 are homologs with 99% or more homology. The most preferred homologs among 
these are those with 99.9% homology or more. It will be understood that, among 
measures of homology, identity is particularly preferred in this regard. 

Region specific primers or probes derived from the nucleotide sequence 
provided in SEQ ID NOS:l-391 or from a nucleotide sequence at least 95%, 

35 particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ 
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ID NOS:l-39l can be used to prime DNA synthesis and PCR amplification, as 
well as to identify colonies containing cloned DNA encoding a homolog. Methods 
suitable to this aspect of the present invention are well known and have been 
described in great detail in many publications such as, for example, Innis et ai. 
5 PCR Protocols. Academic Press, San Diego, CA ( 1 990)). 

When using primers derived from SEQ ID NOS: 1-391 or from a nucleotide 
sequence having an aforementioned identity to a sequence of SEQ ED NOS: 1-391, 
one skilled in the art will recognize that by employing high stringency conditions 
(e.g.. annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50- 

10 65°C in 0.5X SSPC) only sequences which are greater than 75% homologous to 
the primer will be amplified. By employing lower stringency conditions (e.g.. 
hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C 
in 0.5X SSPC), sequences which are greater than 40-50% homologous to the 
primer will also be amplified. 

15 When using DNA probes derived from SEQ ID NOS: 1-391, or from a 

nucleotide sequence having an aforementioned identity to a sequence of SEQ ID 
NOS: 1-391, for colony/plaque hybridization, one skilled in the an will recognize 
that by employing high stringency conditions (e.g., hybridizing at 50- 65°C in 5X 
SSPC and 50% formamide, and washing at 50- 65°C in 0.5X SSPC), sequences 

20 having regions which are greater than 90% homologous to the probe can be 
obtained, and that by employing lower stringency conditions (e.g., hybridizing at 
35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in 0.5X 
SSPC), sequences having regions which are greater than 35-45% homologous to 
the probe will be obtained. 

25 Any organism can be used as the source for homologs of the present 

invention so long as the organism naturally expresses such a protein or contains 
genes encoding the same. The most preferred organism for isolating homologs are 
bacteria which are closely related to Streptococcus pneumoniae. 

30 ILLUSTRATIVE USES OF COMPOSITIONS OF THE 

INVENTION 

Each ORF provided in Tables 1 and 2 is identified with a function by 
homology to a known gene or polypeptide. As a result, one skilled in the art can 
use the polypeptides of the present invention for commercial, therapeutic and 
35 industrial purposes consistent with the type of putative identification of the 
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polypeptide. Such identifications permit one skjlled in the art to use the 
Streptococcus pneumoniae ORFs in a manner similar to the known type of 
sequences for which the identification is made; for example, to ferment a particular 
sugar source or to produce a particular metabolite. A variety of reviews illustrative 
5 of this aspect of the invention are available, including the following reviews on the 
industrial use of enzymes, for example, BIOCHEMICAL ENGINEERING AND 
BIOTECHNOLOGY HANDBOOK, 2nd Ed., MacMillan Publications. Ltd. NY 
(1991) and B IOC AT AL YSTS IN ORGANIC SYNTHESES, Tramper et ai. 9 Eds., 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). A variety of 
10 exemplary uses that illustrate this and similar aspects of the present invention are 
discussed below. 

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the catalytic 

15 reactions involved in intermediary and macromolecular metabolism, the 
biosynthesis of small molecules, cellular processes and other functions includes 
enzymes involved in the degradation of the intermediary products of metabolism, 
enzymes involved in central intermediary metabolism, enzymes involved in 
respiration, both aerobic and anaerobic, enzymes involved in fermentation, 

20 enzymes involved in ATP proton motor force conversion, enzymes involved in 
broad regulatory function, enzymes involved in amino acid synthesis, enzymes 
involved in nucleotide synthesis, enzymes involved in cofactor and vitamin 
synthesis, can be used for industrial biosynthesis. 

The various metabolic pathways present in Streptococcus pneumoniae can 

25 be identified based on absolute nutritional requirements as well as by examining the 
various enzymes identified in Table 1-3 and SEQ ID NOS: 1-391. 

Of particular interest are polypeptides involved in the degradation of 
intermediary metabolites as well as non-macromolecular metabolism. Such 
enzymes include amylases, glucose oxidases, and catalase. 

30 Proteolytic enzymes are another class of commercially important enzymes. 

Proteolytic enzymes find use in a number of industrial processes including the 
processing of flax and other vegetable fibers, in the extraction, clarification and 
depectinization of fruit juices, in the extraction of vegetables' oil and in the 
maceration of fruits and vegetables to give unicellular fruits. A detailed review of 

35 the proteolytic enzymes used in the food industry is provided in Rombouts et al., 
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Symbiosis 21:19 (1986) and Voragen et ai in Biocatalysts In Agricultural 
Biotechnology, Whitaker et a/., Eds., American Chemical Society Symposium 
Series 389:93 (1989) . 

The metabolism of sugars is an important aspect of the primary metabolism 
5 of Streptococcus pneumoniae. Enzymes involved in the degradation of sugars, 
such as, particularly, glucose, galactose, fructose and xylose, can be used in 
industrial fermentation. Some of the important sugar transforming enzymes, from 
a commercial viewpoint, include sugar isomerases such as glucose isomerase. 
Other metabolic enzymes have found commercial use such as glucose oxidases 

10 which produces ketogulonic acid (KG A). KG A is an intermediate in the 
commercial production of ascorbic acid using the Reichsteins procedure, as 
described in Krueger et aL, Biotechnology 6(A) . Rhine et ai, Eds., Verlag Press. 
Weinhcim. Germany (1984). 

Glucose oxidase (GOD) is commercially available and has been used in 

15 purified form as well as in an immobilized form for the deoxygenation of beer. 
See, for instance, Hartmeir et aL, Biotechnology Letters 1:21 (1979). The most 
important application of GOD is the industrial scale fermentation of gluconic acid- 
Market for gluconic acids which are used in the detergent, textile, leather, 
photographic, pharmaceutical, food, feed and concrete industry, as described, for 

20 example, in Bigelis et aL, beginning on page 357 in GENE MANIPULATIONS 
AND FUNGI; Benett et aL Eds., Academic Press, New York (1985), In addition 
to industrial applications, GOD has found applications in medicine for quantitative 
determination of glucose in body fluids recently in biotechnology for analyzing 
syrups from starch and cellulose hydrosylates. This application is described in 

25 Owusu et a/., Biochem. et Biophysica. Acta. 872;%3 (1986), for instance. 

The main sweetener used in the world today is sugar which comes from 
sugar beets and sugar cane. In the field of industrial enzymes, the glucose 
isomerase process shows the largest expansion in the market today. Initially, 
soluble enzymes were used and later immobilized enzymes were developed 

30 (Krueger et ai. Biotechnology, The Textbook of Industrial Microbiology, Sinauer 
Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of 
glucose- produced high fructose syrups is by far the largest industrial business 
using immobilized enzymes. A review of the industrial use of these enzymes is 
provided by Jorgensen, Starch 40:307 (1988). 
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Proteinases, such as alkaline serine proteinases, are used as detergent 
additives and thus represent one of the largest volumes of microbial enzymes used 
in the industrial sector. Because of their industrial importance, there is a large body 
of published and unpublished information regarding the use of these enzymes in 
5 industrial processes. (See Faultman et al, Acid Proteases Structure Function and 
Biology, Tang, J., ed. f Plenum Press, New York (1977) and Godfrey et ai y 
Industrial Enzymes, MacMillan Publishers, Surrey, UK (1983) and Hepner et al^ 
Report Industrial Enzymes by 1990, Hel Hepner & Associates, London ( 1986)), 

Another class of commercially usable proteins of the present invention are 

10 the microbial lipases, described by, for instance, Macrae et a/., Philosophical 
Transactions of the Chiral Society of London 310:221 ( 1985) and Poserke, Journal 
of the American Oil Chemist Society 61: 1758 (1984). A major use of lipases is in 
the fat and oil industry for the production of neutral glycerides using lipase 
catalyzed inter-esterification of readily available triglycerides. Application of 

15 lipases include the use as a detergent additive to facilitate the removal of fats from 
fabrics in the course of the washing procedures. 

The use of enzymes, and in particular microbial enzymes, as catalyst' for 
key steps in the synthesis of complex organic molecules is gaining popularity at a 
great rate. One area of great interest is the preparation of chiral intermediates. 

20 Preparation of chiral intermediates is of interest to a wide range of synthetic 
chemists particularly those scientists involved with the preparation of new 
pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et aL % Recent 
Advances in the Generation of Chiral Intermediates Using Enzymes. CRC Press, 
Boca Raton, Florida (1990)). The following reactions catalyzed by enzymes are of 

25 interest to organic chemists: hydrolysis of carboxylic acid esters, phosphate esters, 
amides and nitriles, esterification reactions, trans-esterification reactions, synthesis 
of amides, reduction of alkanones and oxoalkanates, oxidation of alcohols to 
carbonyl compounds, oxidation of sulfides to sulfoxides, and carbon bond forming 
reactions such as the aldol reaction. 

30 When considering the use of an enzyme encoded by one of the ORFs of the 

present invention for biotransformation and organic synthesis it is sometimes 
necessary to consider the respective advantages and disadvantages of using a 
microorganism as opposed to an isolated enzyme. Pros and cons of using a whole 
cell system on the one hand or an isolated partially purified enzyme on the other 
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hand, has been described in detail by Bud et aL. Chemistry in Britain (1987), p. 
127. 

Amino transferases, enzymes involved in the biosynthesis and metabolism 
of amino acids, are useful in the catalytic production of amino acids. The 

5 advantages of using microbial based enzyme systems is that the amino transferase 
enzymes catalyze the stereo- selective synthesis of only L-amino acids and 
generally possess uniformly high catalytic rates. A description of the use of amino 
transferases for amino acid production is provided by Roselle-David, Methods of 
Enzymology 136:479 (1987). 

10 Another category of useful proteins encoded by the ORFs of the present 

invention include enzymes involved in nucleic acid synthesis, repair, and 
recombination. 

2. Generation of Antibodies 

15 As described here, the proteins of the present invention, as well as 

homologs thereof, can be used in a variety of procedures and methods known in 
the an which are currently applied to other proteins. The proteins of the present 
invention can further be used to generate an antibody which selectively binds the 
protein. Such antibodies can be either monoclonal or polyclonal antibodies, as well 

20 fragments of these antibodies, and humanized forms. 

The invention further provides antibodies which selectively bind to one of 
the proteins of the present invention and hybridomas which produce ihese 
antibodies. A hybridoma is an immortalized cell line which is capable of secreting 
a specific monoclonal antibody. 

25 In general, techniques for preparing polyclonal and monoclonal antibodies 

as well as hybridomas capable of producing the desired antibody are well known in 
the art (Campbell, A. M., Monoclonal Antibody Technology: Laboratory 
Techniques In Biochemistry And Molecular Biology, Elsevier Science Publishers, 
Amsterdam, The Netherlands (1984); St. Groth et a/., /. Immunol. Methods 35: I- 

30 21 (1980), Kohler and Milstein, Nature 256:495-497 (1975)), the trioma 
technique, the human B-cell hybridoma technique (Kozbor et aL, Immunology 
Today 4:12 (1983), pgs. 77-96 of Cole et a/., in Monoclonal Antibodies And 
Cancer Therapy, Alan R. Liss, Inc. (1985)). Any animal (mouse, rabbit, 

etc.) which is known to produce antibodies can be immunized with the pseudogene 

35 polypeptide. Methods for immunization are well known in the art. Such methods 
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include subcutaneous or interperitoneal injection of the polypeptide. One skilled in 
the an will recognize that the amount of the protein encoded by the ORF of the 
present invention used for immunization will vary based on the animal which is 
immunized, the antigenicity of the peptide and the site of injection. 
5 The protein which is used as an immunogen may be modified or 

administered in an adjuvant in order to increase the protein's antigenicity. Methods 
of increasing the antigenicity of a protein are well known in the art and include, but 
are not limited to coupling the antigen with a heterologous protein (such as globulin 
or galactosidase) or through the inclusion of an adjuvant during immunization. 

10 For monoclonal antibodies, spleen cells from the immunized animals are 

removed, fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and 
allowed to become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art can be used to 
identify the hybridoma cell which produces an antibody with the desired 

15 characteristics. These include screening the hybridomas with an ELISA assay, 
western blot analysis, or radioimmunoassay (Lutz et aL % Exp. Cell Res. 175: 109- 
124 (1988)}. 

Hybridomas secreting the desired antibodies are cloned and the class and 
subclass is determined using procedures known in the art (Campbell, A. M., 
20 Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and 
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands 

(1984)). 

Techniques described for the production of single chain antibodies (U. S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to proteins of 
25 the present invention. 

For polyclonal antibodies, antibody containing antisera is isolated from the 
immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. 

The present invention further provides the above- described antibodies in 
30 detectably labelled form. Antibodies can be detectably labelled through the use of 
radioisotopes, affinity labels (such as biotin, avidin. etc.), enzymatic labels (such 
as horseradish peroxidase, alkaline phosphatase, etc.) fluorescent labels (such as 
FITC or rhodamine, etc.), paramagnetic atoms, etc. Procedures for accomplishing 
such labeling are well-known in the art, for example see Sternberger et a/., 7. 
35 Histochem. Cytochem. 18:315 (1970); Bayer, E. A. et aL, Meth. Enzym. 62:308 
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(1979); Engval, E. et at.. Immunol. 109:129 (1972); Goding, J. W., J. Immunol. 
Meth. 13:215 (1976)). 

The labeled antibodies of the present invention can be used for in vitro, in 
vivo, and in situ assays to identify cells or tissues in which a fragment of the 
5 Streptococcus pneumoniae genome is expressed. 

The present invention further provides the above-described antibodies 
immobilized on a solid support. Examples of such solid supports include plastics 
such as polycarbonate, complex carbohydrates such as agarose and sepharose, 
acrylic resins and such as polyacrylamide and latex beads. Techniques for 

10 coupling antibodies to such solid supports are well known in the art (Weir. D. M. 
et aL, "Handbook of Experimental Immunology" 4th Ed., Blackwell Scientific 
Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et a/., Meth. 
Enzym. 34 Academic Press, N. Y. (1974)). The immobilized antibodies of the 
present invention can be used for in vitro, in vivo, and in situ assays as well as for 

15 immunoaffinity purification of the proteins of the present invention. 

3. Diagnostic Assays and Kits 

The present invention further provides methods to identify the expression 
of one of the ORFs of the present invention, or homolog thereof, in a test sample, 

20 using one of the DFs or antibodies of the present invention. 

In detail, such methods comprise incubating a test sample with one or more 
of the antibodies or one or more of the DFs of the present invention and assaying 
for binding of the DFs or antibodies to components within the test sample. 

Conditions for incubating a DF or antibody with a test sample vary. 

25 Incubation conditions depend on the format employed in the assay, the detection 
methods employed, and the type and nature of the DF or antibody used in the 
assay. One skilled in the art will recognize that any one of the commonly available 
hybridization, amplification or immunological assay formats can readily be adapted 
to employ the DFs or antibodies of the present invention. Examples of such assays 

30 can be found in Chard, T., An. Introduction to Radioimmunoassay and Related 
Techniques, Elsevier Science Publishers, Amsterdam. The Netherlands (1986); 
Bullock, G. R. et aL, Techniques in Immunocytochemistry, Academic Press, 
Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and 
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Molecular Biology. Elsevier Science Publishers, Amsterdam, The Netherlands 
(1985). 

The test samples of the present invention include cells, protein or membrane 
extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or 
5 urine. The test sample used in the above-described method will vary based on the 
assay format, nature of the detection method and the tissues, cells or extracts used 
as the sample to be assayed. Methods for preparing protein extracts or membrane 
extracts of cells are well known in the an and can be readily be adapted in order to 
obtain a sample which is compatible with the system utilized. 

10 In another embodiment of the present invention, kits are provided which 

contain the necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in 
close confinement, one or more containers which comprises: (a) a first container 
comprising one of the DFs or antibodies of the present invention; and (b) one or 

15 more other containers comprising one or more of the following: wash reagents, 
reagents capable of detecting presence of a bound DF or antibody. 

In detail, a compartmentalized kit includes any kit in which reagents are 
contained in separate containers. Such containers include small glass containers, 
plastic containers or strips of plastic or paper. Such containers allows one to 

20 efficiently transfer reagents from one compartment to another compartment such 
that the samples and reagents are not cross-contaminated, and the agents or 
solutions of each container can be added in a quantitative fashion from one 
compartment to another. Such containers will include a container which will accept 
the test sample, a container which contains the antibodies used in the assay, 

25 containers which contain wash reagents (such as phosphate buffered saline, Tris- 
buffers, etc), and containers which contain the reagents used to detect the bound 
antibody or DF. 

Types of detection reagents include labelled nucleic acid probes, labelled 
secondary antibodies, or in the alternative, if the primary antibody is labelled, the 
30 enzymatic, or antibody binding reagents which are capable of reacting with the 
labelled antibody. One skilled in the an will readily recognize that the disclosed 
DFs and antibodies of the present invention can be readily incorporated into one of 
the established kit formats which are well known in the an. 

35 4. Screening Assay for Binding Agents 
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Using the isolated proteins of the present invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a 
protein encoded by one of the ORFs of the present invention or to one of the 
fragments and the Streptococcus pneumoniae fragment and contigs herein 
5 described. 

In general, such methods comprise steps of: 

(a) contacting an agent with an isolated protein encoded by one of the 
ORFs of the present invention, or an isolated fragment of the Streptococcus 
pneumoniae genome; and 

to (b) determining whether the agent binds to said protein or said fragment. 

The agents screened in the above assay can be, but are not limited to, 
peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. The 
agents can be selected and screened at random or rationally selected or designed 
using protein modeling techniques. 

15 For random screening, agents such as peptides, carbohydrates, 

pharmaceutical agents and the like are selected at random and are assayed for their 
ability to bind to the protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used 
herein, an agent is said to be "rationally selected or designed" when the agent is 

20 chosen based on the configuration of the particular protein. For example, one 
skilled in the art can readily adapt currently available procedures to generate 
peptides, pharmaceutical agents and the like capable of binding to a specific peptide 
sequence in order to generate rationally designed antipeptide peptides, for example 
see Hurby et a/., "Application of Synthetic Peptides: Antisense Peptides," in 

25 Synthetic Peptides, A User's Guide. W. H. Freeman, NY (1992), pp. 289-307, 
and Kaspczak et aL % Biochemistry 28:9230-$ (1989), or pharmaceutical agents, or 
the like. 

In addition to the foregoing, one class of agents of the present invention, as 
broadly described, can be used to control gene expression through binding to one 
30 of the ORFs or EMFs of the present invention. As described above, such agents 
can be randomly screened or rationally designed/selected. Targeting the ORF or 
EMF allows a skilled artisan to design sequence specific or element specific agents, 
modulating the expression of either a single ORF or multiple ORFs which rely on 
the same EMF for expression control. 



W098/18931 



37 



PCT/US97/19588 



One class of DNA binding agents are agents which contain base residues 
which hybridize or form a triple helix by binding to DNA or RNA. Such agents 
can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a 
variety of sulfhydryl or polymeric derivatives which have base attachment capacity. 
5 Agents suitable for use in these methods usually contain 20 to 40 bases and 

are designed to be complementary to a region of the gene involved in transcription 
(triple helix - see Lee et ai y NucL Acids Res. 6:3073 (1979); Cooney et aL. 
Science 241:456 (1988); and Dervan et al % Science 257:1360 (1991)) or to the 
mRNA itself (antisense - Okano, 7. Neurochem. 56:560 (1991); 

10 Oligodeoxynucteotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix- formation optimally results in a shut-off of 
RNA transcription from DNA, while antisense RNA hybridization blocks 
translation of an mRNA molecule into polypeptide. Both techniques have been 
demonstrated to be effective in model systems. Information contained in the 

15 sequences of the present invention can be used to design antisense and triple helix- 
forming oligonucleotides, and other DNA binding agents. 

5. Pharmaceutical Compositions and Vaccines 

The present invention further provides pharmaceutical agents which can be 

20 used to modulate the growth or pathogenicity of Streptococcus pneumoniae, or 
another related organism, in vivo or in vitro. As used herein, a "pharmaceutical 
agent" is defined as a composition of matter which can be formulated using known 
techniques to provide a pharmaceuticaJ compositions. As used herein, the 
"pharmaceutical agents of the present invention" refers the pharmaceutical agents 

25 which are derived from the proteins encoded by the ORFs of the present invention 
or are agents which are identified using the herein described assays. 

As used herein, a pharmaceutical agent is said to "modulate the growth 
pathogenicity of Streptococcus pneumoniae or a related organism, in vivo or in 
vitro:' when the agent reduces the rate of growth, rate of division, or viability of 

30 the organism in question. The pharmaceutical agents of the present invention can 
modulate the growth or pathogenicity of an organism in many fashions, although 
an understanding of the underlying mechanism of action is not needed to practice 
the use of the pharmaceutical agents of the present invention. Some agents will 
modulate the growth by binding to an important protein thus blocking the biological 

35 activity of the protein, while other agents may bind to a component of the outer 
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surface of the organism blocking attachment or rendering the organism more prone 
to act the bodies nature immune system. Alternatively, the agent may comprise a 
protein encoded by one of the ORFs of the present invention and serve as a 
vaccine. The development and use of a vaccine based on outer membrane 
5 components are well known in the art. — 

As used herein, a "related organism" is a broad term which refers to any 
organism whose growth can be modulated by one of the pharmaceutical agents of 
the present invention. In general, such an organism will contain a homolog of the 
protein which is the target of the pharmaceutical agent or the protein used as a 
10 vaccine. As such, related organisms do not need to be bacterial but may be fungal 
or viral pathogens. 

The pharmaceutical agents and compositions of the present invention may 
be administered in a convenient manner, such as by the oral, topical, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes. The 

15 pharmaceutical compositions are administered in an amount which is effective for 
treating and/or prophylaxis of the specific indication. In general, they are 
administered in an amount of at least about 1 n;gAg body weight and in most cases 
they will be administered in an amount not in excess of about 1 g/kg body weight 
per day. In most cases, the dosage is from about 0. 1 mg/kg to about 10 g/kg body 

20 weight daily, taking into account the routes of administration, symptoms, etc. 

The agents of the present invention can be used in native form or can be 
modified to form a chemical derivative. As used herein, a molecule is said to be a 
"chemical derivative" of another molecule when it contains additional chemical 
moieties not normally a pan of the molecule. Such moieties may improve the 

25 molecules solubility, absorption, biological half life, etc. The moieties may 
alternatively decrease the toxicity of the molecule, eliminate or attenuate any 
undesirable side effect of the molecule, etc. Moieties capable of mediating such 
effects are disclosed in, among other sources, REMINGTON'S 
PHARMACEUTICAL SCIENCES ( 1980) cited elsewhere herein. 

30 For example, such moieties may change an immunological character of the 

functional derivative, such as affinity for a given antibody. Such changes in 
immunomodulation activity are measured by the appropriate assay, such as a 
competitive type immunoassay. Modifications of such protein properties as redox 
or thermal stability, biological half-life, hydrophobicity, susceptibility to proteolytic 

35 degradation or the tendency to aggregate with carriers or into multimers also may 
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be effected in this way and can be assayed by methods well known to the skilled 
artisan. 

The therapeutic effects of the agents of the present invention may be 
obtained by providing the agent to a patient by any suitable means (e.g., inhalation, 
5 intravenously, intramuscularly, subcuianeously, enterally, or parenterally). It is 
preferred to administer the agent of the present invention so as to achieve an 
effective concentration within the blood or tissue in which the growth of the 
organism is to be controlled. To achieve an effective blood concentration, the 
preferred method is to administer the agent by injection. The administration may be 
10 by continuous infusion, or by single or multiple injections. 

In providing a patient with one of the agents of the present invention, the 
dosage of the administered agent will vary depending upon such factors as the 
patient's age, weight, height, sex, general medical condition, previous medical 
history, etc. In general, it is desirable to provide the recipient with a dosage of 
t5 agent which is in the range of from about 1 pg/kg to 10 mg/kg (body weight of 
patient), although a lower or higher dosage may be administered. The 
therapeutically effective dose can be lowered by using combinations of the agents 
of the present invention or another agent. 

As used herein, two or more compounds or agents are said to be 
20 administered "in combination" with each other when either (1) the physiological 
effects of each compound, or (2) the serum concentrations of each compound can 
be measured at the same time. The composition of the present invention can be 
administered concurrently with, prior to. or following the administration of the 
other agent. 

25 The agents of the present invention are intended to be provided to recipient 

subjects in an amount sufficient to decrease the rate of growth (as defined above) of 
the target organism. 

The administration of the agent(s) of the invention may be for either a 
"prophylactic" or "therapeutic" purpose. When provided prophylactically, the 

30 agent(s) are provided in advance of any symptoms indicative of the organisms 
growth. The prophylactic administration of the agent(s) serves to prevent, 
attenuate, or decrease the rate of onset of any subsequent infection. When 
provided therapeutically, the agent(s) are provided at (or shortly after) the onset of 
an indication of infection. The therapeutic administration of the compound(s) 
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serves to attenuate the pathological symptoms of the infection and to increase the 
rate of recovery. 

The agents of the present invention are administered to a subject, such as a 
mammal, or a patient, in a pharmaceutically acceptable form and in a therapeutically 
5 effective concentration. A composition is said to be "pharmacologically acceptable" 
if its administration can be tolerated by a recipient patient. Such an agent is said to 
be administered in a "therapeutically effective amount" if the amount administered 
is physiologically significant. An agent is physiologically significant if its presence 
results in a detectable change in the physiology of a recipient patient. 

10 The agents of the present invention can be formulated according to known 

methods to prepare pharmaceutically useful compositions, whereby these materials, 
or their functional derivatives, are combined in a mixture with a pharmaceutically 
acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of 
other human proteins, e.g., human serum albumin, are described, for example, in 

15 REMINGTON S PHARMACEUTICAL SCIENCES, 16 th Ed., Osol, A., Ed., 
Mack Publishing, Easton PA (1980). In order to form a pharmaceutically 
acceptable composition suitable for effective administration, such compositions will 
contain an effective amount of one or more of the agents of the present invention, 
together with a suitable amount of carrier vehicle. 

20 Additional pharmaceutical methods may be employed to control the duration 

of action. Control release preparations may be achieved through the use of 
polymers to complex or absorb one or more of the agents of the present invention. 
The controlled delivery may be effectuated by a variety of well known techniques, 
including formulation with macromolecules such as. for example, polyesters, 

25 polyamino acids, polyvinyl, pyrrolidone, ethylenevinylacetate, methylcellulose, 
carboxymethylcellulose, or protamine, sulfate, adjusting the concentration of the 
macromolecules and the agent in the formulation, and by appropriate use of 
methods of incorporation, which can be manipulated to effectuate a desired time 
course of release. Another possible method to control the duration of action by 

30 controlled release preparations is to incorporate agents of the present invention into 
particles of a polymeric materia] such as polyesters, poiyamino acids, hydrogels, 
poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of 
incorporating these agents into polymeric panicles, it is possible to entrap these 
materials in microcapsules prepared, for example, by coacervation techniques or by 

35 interfacial polymerization with, for example, hydroxymethylcellulose or gelatine- 
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microcapsules and poly(methylmethacylate) microcapsules, respectively, or in 
colloidal drug delivery systems, for example, liposomes, albumin microspheres, 
microemuisions, nanoparticles, and nanocapsules or in macroemulsions. Such 
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES 
5 (1980). 

The invention further provides a pharmaceutical pack or kit comprising one 
or more containers filled with one or more of the ingredients of the pharmaceutical 
compositions of the invention. Associated with such container(s) can be a notice in 
the form prescribed by a governmental agency regulating the manufacture, use or 
10 sale of pharmaceuticals or biological products, which notice reflects approval by 
the agency of manufacture, use or sale for human administration. 

In addition, the agents of the present invention may be employed in 
conjunction with other therapeutic compounds. 

15 6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further demonstrates that a large sequence can be * 
sequenced using a random shotgun approach. This procedure, described in detail 
in the examples that follow, has eliminated the up front cost of isolating and 
ordering overlapping or contiguous subclones prior to the start of the sequencing 
20 protocols. 

Certain aspects of the present invention are described in greater detail in the 
examples that follow. The examples are provided by way of illustration. Other 
aspects and embodiments of the present invention are contemplated by the 
inventors, as will be clear to those of skill in the art from reading the present 
25 disclosure. 

ILLUSTRATIVE EXAMPLES 

LIBRARIES AND SEQUENCING 
30 L Shotgun Sequencing Probability Analysis 

The overall strategy for a shotgun approach to whole genome sequencing 

follows from the Lander and Waterman (Landerman and Waterman, Genomics 

2/23 1 (1988)) application of the equation for the Poisson distribution. According 

to this treatment, the probability, P , that any given base in a sequence of size L, in 

35 nucleotides, is not sequenced after a certain amount, n, in nucleotides, of random 

0 
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sequence has been determined can be calculated by the equation P = e" m , where m 
is L/n, the fold coverage. For instance, for a genome of 2.8 Mb, m=l when 2.8 
Mb of sequence has been randomly generated (IX coverage). APthat point, P = 
e~ ' = 0.37. The probability that any given base has not been sequenced is the same 

5 as the probability that any region of the whole sequence L has not been determined 
and, therefore, is equivalent to the fraction of the whole sequence that has yet to be 
determined. Thus, at one-fold coverage, approximately 37% of a polynucleotide of 
size L, in nucleotides has not been sequenced. When 14 Mb of sequence has been 
generated, coverage is 5X for a 2.8 Mb and the unsequenced fraction drops to 

10 .0067 or 0.67%. 5X coverage of a 2.8 Mb sequence can be attained by sequencing 
approximately 17,000 random clones from both insert ends with an average 
sequence read length of 410 bp. 

Similarly, the total gap length, G, is determined by the equation G = Le~ m . 
and the average gap size, g, follows the equation, g = L/n. Thus, 5X coverage 

15 leaves about 240 gaps averaging about 82 bp in size in a sequence of a 
polynucleotide 2.8 Mb long. 

The treatment above is essentially that of Lander and Waterman. Genomics 
2: 231 (1988). 

20 2. Random Library Construction 

In order to approximate the random model described above during actual 
sequencing, a nearly ideal library of cloned genomic fragments is required. The 
following library construction procedure was developed to achieve this end. 

Streptococcus pneumoniae DNA is prepared by phenol extraction. A 

25 mixture containing 200 jag DNA in 1 .0 ml of 300 mM sodium acetate, 10 mM Tris- 
HC1, 1 mM Na-EDTA, 50% glycerol is processed through a nebulizer (IPI Medical 
Products) with a stream of nitrogen adjusted to 35 Kpa for 2 minutes. The 
sonicated DNA is ethanol precipitated and redissolved in 500 \il TE buffer. 

To create blunt-ends, a 100 jil aliquot of the resuspended DNA is digested 

30 with 5 units of BAL3 1 nuclease (New England BioLabs) for 10 min at 30°C in 200 
jil BAL31 buffer. The digested DNA is phenol-extracted, ethanol-precipitated, 
redissolved in 100 ^1 TE buffer, and then size-fractionated by electrophoresis 
through a 1.0% low melting temperature agarose gel. The section containing DNA 
fragments 1.6-2.0 kb in size is excised from the gel, and the LGT agarose is melted 

35 and the resulting solution is extracted with phenol to separate the agarose from the 
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DNA. DNA is ethanol precipitated and redissolved in 20 \xl of TE buffer for 
ligation to vector. 

A two-step ligation procedure is used to produce a plasmid library with 
97% inserts, of which >99% were single inserts. The first ligation mixture (50 ul) 
5 contains 2 |ig of DNA fragments, 2 jag pUC18 DNA (Pharmacia) cut with Smal 
and dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase 
(GE8CO/BRL) and is incubated at 14°C for 4 hr. The ligation mixture then is 
phenol extracted and ethanoi precipitated, and the precipitated DNA is dissolved in 
20 \il TE buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete 

10 bands in a ladder are visualized by ethidium bromide-staining and UV illumination 
and identified by size as insert (I), vector (v), v+l, v+2i, v+3i, ere. The portion of 
the gel containing v+I DNA is excised and the v+I DNA is recovered and 
resuspended into 20 jil TE. The v+I DNA then is blunt-ended by T4 polymerase 
treatment for 5 min. at 37°C in a reaction mixture (50 ul) containing the v+I linears, 

15 500 |iM each of the 4 dNTPs, and 9 units of T4 polymerase (New England 
BioLabs), under recommended buffer conditions. After phenol extraction and 
ethanol precipitation the repaired v+I linears are dissolved in 20 \il TE. The final 
ligation to produce circles is carried out in a 50 p.1 reaction containing 5 jil of v+I 
linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the 

20 following day, the reaction mixture is stored at -20°C. 

This two-stage procedure results in a molecularly random collection of 
single-insert plasmid recombinants with minimal contamination from double-insert 
chimeras (<1%) or free vector (<3%). 

Since deviation from randomness can arise from propagation the DNA in 

25 the host, £. colt host cells deficient in all recombination and restriction functions 
(A. Greener, Strategies 3 (1):5 (1990)) are used to prevent rearrangements, 
deletions, and loss of clones by restriction. Furthermore, transformed cells are 
plated directly on antibiotic diffusion plates to avoid the usual broth recovery phase 
which allows multiplication and selection of the most rapidly growing cells. 

30 Plating is carried out as follows. A 100 (il aliquot of Epicurian Coli SURE 

II Supercompetent Cells (Stratagene 200152) is thawed on ice and transferred to a 
chilled Falcon 2059 tube on ice. A 1.7 |al aliquot of L42 M beta-mercaptoethanol 
is added to the aliquot of cells to a final concentration of 25 mM. Cells are 
incubated on ice for 10 min. A 1 |il aliquot of the final ligation is added to the cells 

35 and incubated on ice for 30 min. The cells are heat pulsed for 30 sec. at 42°C and 
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placed back on ice for 2 min. The outgrowth period, in liquid culture is eliminated 
from this protocol in order to minimize the preferential growth of any given 
transformed cell. Instead the transformation mixture is plated directly on a nutrient 
rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar: 20 g 
5 tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 
ml bottom layer is supplemented with 0.4 ml of 50 mg/ml ampicillin per 100 ml 
SOB agar. The 15 ml top layer of SOB agar is supplemented with I ml X-Gal 
(2%), t ml MgCl (I M), and 1 ml MgSO / 100 ml SOB agar. The 15 ml top layer 
is poured just prior to plating. Our titer is approximately 100 colonies/ 10 |il aliquot 
10 of transformation? 4 

All colonies are picked for template preparation regardless of size. Thus, 
only clones lost due to "poison" DNA or deleterious gene products are deleted from 
the library, resulting in a slight increase in gap number over that expected. 

15 3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates are prepared using a 
"boiling bead" method developed in collaboration with Advanced Genetic 
Technology Corp. (Gaithersburg, MD) (Adams et a/., Science 252:1651 (1991); 
Adams et a/.. Nature 355:632 (1992)). Plasmid preparation is performed in a 96- 

20 well format for all stages of DNA preparation from bacterial growth through final 
DNA purification. Template concentration is determined using Hoechst Dye and a 
Millipore Cytofluor. DNA concentrations are not adjusted, but low-yielding 
templates are identified where possible and not sequenced. 

Templates are also prepared from two Streptococcus pneumoniae lambda 

25 genomic libraries. An amplified library is constructed in the vector Lambda GEM- 
12 (Promega) and an unamplified library is constructed in Lambda DASH II 
(Stratagene). In particular, for the unamplified lambda library, Streptococcus 
pneumoniae DNA (> 100 kb) is partially digested in a reaction mixture (200 ul) 
containing 50 jag DNA, IX Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C 

30 The digested DNA was phenol-extracted and electrophoresed on a 0.5% low 
melting agarose gel at 2V/cm for 7 hours. Fragments from 15 to 25 kb are excised 
and recovered in a final volume of 6 ul. One [il of fragments is used with 1 |il of 
DASHII vector (Stratagene) in the recommended ligation reaction. One jil of the 
ligation mixture is used per packaging reaction following the recommended 

35 protocol with the Gigapack II XL Packaging Extract (Stratagene. #22771 1). Phage 
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are plated directly without amplification from the packaging mixture (after dilution 
with 500 jal of recommended SM buffer and chloroform treatment). Yield is about 
2.5x1 CP pfu/ul. The amplified library is prepared essentially as above except the 
lambda GEM- 12 vector is used. After packaging, about 3.5xl0 4 pfu are plated on 
5 the restrictive NM539 host. The lysate is harvested in 2 ml of SM buffer and 
stored frozen in 7% dimethylsulfoxide. The phage titer is approximately IxlO^ 
pfu/mL 

Liquid lysates (100 are prepared from randomly selected plaques (from 
the unamplified library) and template is prepared by long-range PCR using T7 and 

10 T3 vector-specific primers. 

Sequencing reactions are carried out on plasmid and/or PCR templates 
using the AB Catalyst LabStation with Applied Biosystems PRISM Ready 
Reaction Dye Primer Cycle Sequencing Kits for the M13 forward (Ml 3-21) and 
the M 13 reverse (M13RP1) primers (Adams et aL. Nature 368:414 (1994)). Dye 

15 terminator sequencing reactions are carried out on the lambda templates on a 
Perkin-Elmer 9600 Thermocycler using the Applied Biosystems Ready Reaction 
Dye Terminator Cycle Sequencing kits. T7 and SP6 primers are used to sequence 
the ends of the inserts from the Lambda GEM- 12 library and T7 and T3 primers are 
used to sequence the ends of the inserts from the Lambda DASH II library. 

20 Sequencing reactions are performed by eight individuals using an average of 
fourteen AB 373 DNA Sequencers per day. All sequencing reactions are analyzed 
using the Stretch modification of the AB 373 , primarily using a 34 cm well-to-read 
distance. The overall sequencing success rate very approximately is about 85% for 
Ml 3-21 and M13RP1 sequences and 65% for dye-terminator reactions. The 

25 average usable read length is 485 bp for Ml 3-2 1 sequences, 445bp for M13RP1 
sequences, and 375 bp for dye-terminator reactions. 

Richards et aL, Chapter 28 in AUTOMATED DNA SEQUENCING AND 
ANALYSIS, M. D. Adams, C. Fields, J. C. Venter, Eds., Academic Press, 
London, (1994) described the value of using sequence from both ends of 

30 sequencing templates to facilitate ordering of contigs in shotgun assembly projects 
of lambda and cosmid clones. We balance the desirability of both-end sequencing 
(including the reduced cost of lower total number of templates) against shorter 
read-lengths for sequencing reactions performed with the M13RP1 (reverse) primer 
compared to the M 13-21 (forward) primer. Approximately one-half of the 

35 templates are sequenced from both ends. Random reverse sequencing reactions are 
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done based on successful forward sequencing reactions. Some M13RPI 
sequences are obtained in a semi -directed fashion: MI 3-21: sequences pointing 
outward at the ends of contigs are chosen for M13RP1 sequencing in an effort to 
specifically order contigs. 

5 

4. Protocol for Automated Cycle Sequencing 

The sequencing is carried out using ABI Catalyst robots and AB 373 
Automated DNA Sequencers. The Catalyst robot is a publicly available 
sophisticated pipetting and temperature control robot which has been developed 

10 specifically for DNA sequencing reactions. The Catalyst combines pre-aliquoted 
templates and reaction mixes consisting of deoxy- and dideoxynucleotides, the 
thermostable Taq DNA polymerase, fluorescently-labelled sequencing primers, and 
reaction buffer. Reaction mixes and templates are combined in the wells of an 
aluminum 96-well thermocycling plate. Thirty consecutive cycles of linear 

15 amplification (i.e.., one primer synthesis) steps are performed including 
denaturation, annealing of primer and template, and extension; i.e., DNA 
synthesis. A heated lid with rubber gaskets on the thermocycling plate prevents 
evaporation without the need for an oil overlay. 

Two sequencing protocols are used: one for dye-labelled primers and a 

20 second for dye-labelled dideoxy chain terminators. The shotgun sequencing 
involves use of four dye-labelled sequencing primers, one for each of the four 
terminator nucleotide. Each dye-primer is labelled with a different fluorescent dye, 
permitting the four individual reactions to be combined into one lane of the 373 
DNA Sequencer for electrophoresis, detection, and base-calling. ABI currently 

25 supplies pre-mixed reaction mixes in bulk packages containing all the necessary 
non-template reagents for sequencing. Sequencing can be done with both plasmid 
and PCR- generated templates with both dye-primers and dye- terminators with 
approximately equal fidelity, although plasmid templates generally give longer 
usable sequences. 

30 Thirty-two reactions are loaded per AB373 Sequencer each day, for a total 

of 960 samples. Electrophoresis is run overnight following the manufacturer s 
protocols, and the data is collected for twelve hours. Following electrophoresis 
and fluorescence detection, the ABI 373 performs automatic lane tracking and base- 
calling. The lane-tracking is confirmed visually. Each sequence electropherogram 

35 (or fluorescence lane trace) is inspected visually and assessed for quality. Trailing 
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sequences of low quality are removed and the sequence itself is loaded via software 
to a Sybase database (archived daily to 8mm tape). Leading vector polylinker 
sequence is removed automatically by a software program. Average edited lengths 
of sequences from the standard ABI 373 are around 400 bp and depend mostly on 
5 the quality of the template used for the sequencing reaction. ABI 373 Sequencers 
converted to Stretch Liners provide a longer electrophoresis path prior to 
fluorescence detection and increase the average number of usable bases to 500-600 
bp. 

io INFORMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing 
lab have been developed. (For review see, for instance, Kerlavage et al.. 
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on 

15 System Sciences, IEEE Computer Society Press, Washington D. C, 585 (1993)) 
The system used to collect and assemble the sequence data was developed using the 
Sybase relational database management system and was designed to automate data 
flow wherever possible and to reduce user error. The database stores and 
correlates all information collected during the entire operation from template 

20 preparation to final analysis of the genome. Because the raw output of the ABI 373 
Sequencers was based on a Macintosh platform and the data management system 
chosen was based on a Unix platform, it was necessary to design and implement a 
variety of multi- user, client-server applications which allow the raw data as well as 
analysis results to flow seamlessly into the database with a minimum of user effort. 

25 

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and 
accurate assembly of thousands of sequence fragments was employed to generate 
contigs. The TIGR assembler simultaneously clusters and assembles fragments of 

30 the genome. In order to obtain the speed necessary to assemble more than 10 4 
fragments, the algorithm builds a hash table of 12 bp oligonucleotide subsequences 
to generate a list of potential sequence fragment overlaps. The number of potential 
overlaps for each fragment determines which fragments are likely to fall into 
repetitive elements. Beginning with a single seed sequence fragment. TIGR 

35 Assembler extends the. current contig by attempting to add the best matching 
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fragment based on oligonucleotide content. The contig and candidate fragment are 
aligned using a modified version of the Smith-Waterman algorithm which provides 
for optimal gapped alignments (Waterman, M. S., Methods in Enzymology 
164:165 (1988)). The contig is extended by the fragment only if strict criteria for 
5 the quality of the match are met. The match criteria- include the minimum length of 
overlap, the maximum length of an unmatched end. and the minimum percentage 
match. These criteria are automatically lowered by the algorithm in regions of 
minimal coverage and raised in regions with a possible repetitive element. The 
number of potential overlaps for each fragment determines which fragments are 

io likely to fall into repetitive elements. Fragments representing the boundaries of 
repetitive elements and potentially chimeric fragments are often rejected based on 
partial mismatches at the ends of alignments and excluded from the current contig. 
TIGR Assembler is designed to take advantage of clone size information coupled 
with sequencing from both ends of each template. It enforces the constraint that 

15 sequence fragments from two ends of the same template point toward one another 
in the contig and are located within a certain range of base pairs (definable for each 
clone based on the known clone size range for a given library). 

The process resulted in 391 contigs as represented by SEQ ID NOs: 1-391. 

-0 3. Identifying Genes 

The predicted coding regions of the Streptococcus pneumoniae genome 
were initially defined with the program GeneMark. which finds ORFs using a 
probabilistic classification technique. The predicted coding region .sequences were 
used in searches against a database of all nucleotide sequences from GenBank 

25 (October, 1997), using the BLASTN search method to identify overlaps of 50 or 
more nucleotides with at least a 95% identity. Those ORFs with nucleotide 
sequence matches are shown in Table L The ORFs without such matches were 
translated to protein sequences and compared to a non-redundant database of 
known proteins generated by combining the Swiss-prot, PIR and GenPept 

30 databases. ORFs that matched a database protein with BLASTP probability less 
than or equal to 0.01 are shown in Table 2. The table also lists assigned functions 
based on the closest match in the databases. ORFs that did not match protein or 
nucleotide sequences in the databases at these levels are shown in Table 3. 
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ILLUSTRATIVE APPLICATIONS 

L Production of an Antibody to a Streptococcus pneumoniae 
Protein 

Substantially pure protein or polypeptide is isolated from the transfected or 
5 transformed cells using any one of the methods known in the art. The protein can 
also be produced in a recombinant prokaryotic expression system, such as E. coli, 
or can be chemically synthesized. Concentration of protein in the final preparation 
is adjusted, for example, by concentration on an Amicon filter device, to the level 
of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can 
10 then be prepared as follows. 

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and 
isolated as described can be prepared from murine hybridomas according to the 

15 classical method of Kohler, G. and Milstein, C, Nature 256:495 (1975) or 
modifications of the methods thereof. Briefly, a mouse is repetitively inoculated 
with a few micrograms of the selected protein over a period of a few weeks. The 
mouse is then sacrificed, and the antibody producing cells of the spleen isolated. 
The spleen cells are fused by means of polyethylene glycol with mouse myeloma 

20 cells, and the excess unfused cells destroyed by growth of the system on selective 
media comprising aminopterin (HAT media). The successfully fused cells are 
diluted and aliquots of the dilution placed in wells of a microliter plate where 
growth of the culture is continued. Antibody-producing clones are identified by 
detection of antibody in the supernatant fluid of the wells by immunoassay 

25 procedures, such as ELISA, as originally described by Engvall, E., Meth. 
Enzymol. 70:419 (1980), and modified methods thereof. Selected positive clones 
can be expanded and their monoclonal antibody product harvested for use. Detailed 
procedures for monoclonal antibody production are described in Davis, L. et aL. 
Basic Methods in Molecular Biology. Elsevier, New York. Section 21-2 ( 1989). 



30 
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3. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogenous epitopes of a 
single protein can be prepared by immunizing suitable animals with the expressed 
protein described above, which can be unmodified or modified to enhance 
5 immunogenicity. Effective polyclonal antibody production is affected by many 
factors related both to the antigen and the host species. For example, small 
molecules tend to be less immunogenic than others and may require the use of 
carriers and adjuvant. Also, host animals vary in response to site of inoculations 
and dose, with both inadequate or excessive doses of antigen resulting in low titer 
10 antisera. Small doses (ng level) of antigen administered at multiple intradermal 
sites appears to be most reliable. An effective immunization protocol for rabbits 
can be found in Vaitukaitis, J. et «/., J. Clin. Endocrinol. Metab. 33:988-991 
(1971). 

Booster injections can be given at regular intervals, and antiserum harvested 

15 when antibody titer thereof, as determined semi-quantitatively, for example, by 
double immunodiffusion in agar against known concentrations of the antigen, 
begins to fall. See. for example. Ouchterlony, O. et aL. Chap, 19 in: Handbook of 
Experimental Immunology. Wier, D.. ed. Blackwell (1973). Plateau concentration 
of antibody is usually in the range of 0. 1 to 0.2 mg/ml of serum (about 12M). 

20 Affinity of the antisera for the antigen is determined by preparing competitive 
binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of 
Clinical Immunology, second edition. Rose and Friedman, eds., Amer. Soc. For 
Microbiology, Washington, D. C. (1980) 

Antibody preparations prepared according to either protocol are useful in 

25 quantitative immunoassays which determine concentrations of antigen-bearing 
substances in biological samples; they are also used semi- quantitatively or 
qualitatively to identify the presence of antigen in a biological sample. In addition, 
antibodies are useful in various animal models of pneumococcal disease as a means 
of evaluating the protein used to make the antibody as a potential vaccine target or 

30 as a means of evaluating the antibody as a potential immunotherapeutic or 
immunoprophylactic reagent. 
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4. Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Streprococctts pneumoniae genome, such as those 
of Tables 1-3 and SEQ TDNOS: 1-391 can be used, in accordance with the present 
invention, to prepare PCR primers for a variety of uses. The PCR primers are 
5 preferably at least 15 bases, and more preferably at least 18 bases in length. When 
selecting a primer sequence, it is preferred that the primer pairs have approximately 
the same G/C ratio, so that melting temperatures are approximately the same. The 
PCR primers and amplified DNA of this Example find use in the Examples that 
follow. 

10 

5. Gene expression from DNA Sequences Corresponding to 

ORFs 

A fragment of the Streptococcus pneumoniae genome provided in Tables I - 
3 is introduced into an expression vector using conventional technology. 

15 Techniques to transfer cloned sequences into expression vectors that direct protein 
translation in mammalian, yeast, insect or bacterial expression systems are well 
known in the art. Commercially available vectors and expression systems are 
available from a variety of suppliers including Stratagene (La Jolla. California), 
Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If 

20 desired, to enhance expression and facilitate proper protein folding, the codon 
context and codon pairing of the sequence may be optimized for the particular 
expression organism, as explained by Hatfield et «/.. U. S. Patent No. 5,082.767, 
incorporated herein by this reference. 
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The following is provided as one exemplary method to generate 
polypeptide(s) from cloned ORFs of the Streptococcus pneumoniae genome 
fragment. Bacterial ORFs generally lack a poly A addition signal. The addition 
signal sequence can be added to the construct by. for example, splicing out the poly 
5 A addition sequence from pSG5 (Stratagene) -using Bgll and Sail restriction 
endonuclease enzymes and incorporating it into the mammalian expression vector 
pXTl (Stratagene) for use in eukaryotic expression systems. pXTl contains the 
LTRs and a portion of the gag gene of Moloney Murine Leukemia Virus. The 
positions of the LTRs in the construct allow efficient stable transfection. The 

10 vector includes the Herpes Simplex thymidine kinase promoter and the selectable 
neomycin gene. The Streptococcus pneumoniae DNA is obtained by PCR from the 
bacterial vector using oligonucleotide primers complementary to the Streptococcus 
pneumoniae DNA and containing restriction endonuclease sequences for PstI 
incorporated into the 5* primer and Bglll at the 5' end of the corresponding 

15 Streptococcus pneumoniae DNA 3' primer, taking care to ensure that the 
Streptococcus pneumoniae DNA is positioned such that its followed with the poly 
A addition sequence. The purified fragment obtained from the resulting PCR 
reaction is digested with PstL blunt ended with an exonuclcasc, digested with 
Bglll, purified and ligated to pXTl, now containing a poly A addition sequence 

20 and digested Bglll. 

The ligated product is transfected into mouse NIH 3T3 cells using 
Lipofectin (Life Technologies, Inc., Grand Island, New York) under conditions 
outlined in the product specification. Positive transfectants are selected after 
growing the transfected cells in 600 ug/ml G418 (Sigma, St. Louis, Missouri). 

25 The protein is preferably released into the supernatant. However if the protein has 
membrane binding domains, the protein may additionally be retained within the cell 
or expression may be restricted to the cell surface. Since it may be necessary to 
purify and locate the transfected product, synthetic 15-mer peptides synthesized 
from the predicted Streptococcus pneumoniae DNA sequence are injected into mice 

30 to generate antibody to the polypeptide encoded by the Streptococcus pneumoniae 
DNA. 
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Alternatively and if antibody production 4s-not possible, the Streptococcus 
pneumoniae DNA sequence is additionally incorporated into eukaryotic expression 
vectors and expressed as, for example, a globin fusion. Antibody to the globin 
moiety then is used to purify the chimeric protein. Corresponding protease 
5 cleavage sites are engineered between the globin moiety and the polypeptide 
encoded by the Streptococcus pneumoniae DNA so that the latter may be freed 
from the formed by simple protease digestion. One useful expression vector for 
generating globin chimerics is pSG5 (Stratagene). This vector encodes a rabbit 
globin. Intron II of the rabbit globin gene facilitates splicing of the expressed 

10 transcript, and the polyadenylation signal incorporated into the construct increases 
the level of expression. These techniques are well known to those skilled in the an 
of molecular biology. Standard methods are published in methods texts such as 
Davis et aL, cited elsewhere herein, and many of the methods are available from the 
technical assistance representatives from Stratagene, Life Technologies, Inc., or 

15 Promega. Polypeptides of the invention also may be produced using in vitro 
translation systems such as in vitro ExpressTM Translation Kit (Stratagene). 

While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various 
changes in form and detail can be made without departing from the true scope of 

20 the invention. 

All patents, patent applications and publications referred to above are 
hereby incorporated by reference. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Charles Kunsch 

Gil H. Choi 
Patrick S. Dillon 
Craig A. Rosen 
Sceven C. Barash 
Michael R . Fannon 
Brian A . Dougherty 

(ii) TITLE OF INVENTION: Streptococcus pneumoniae Polynucleotides and Sequences 

(iii) NUMBER OF SEQUENCES: 39 L 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences. Inc. 

( B) STREET: 9 410 Key West Avenue 
:C) CITY: Roclcviile 

(D) STATE: Maryland 
(Ei COUNTRY: USA 
{ F ) 21?: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette. 3.50 inch, 1.4Mb storage 

(3) COMPUTER: HP Vectra 436/3 3 

(C) OPERATING SYSTEM: MSDOS version 6.2 

( D ) SOFTWARE : ASCII Text 

tvi) CURRENT APPLICATION DATA: 



WO 98/18931 



PCT/US97/19588 



149 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C> CLASSIFICATION: 



<viii PRIOR APPLICATION DATA: 
(A> APPLICATION NUMBER: 
(3) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION : 

{A) NAME: Brookes. A. Anders 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/ DOCKET NUMBER: PB340P1 



(vi) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (301) 309-8504 
<B) TELEFAX: (301) 309-8512 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5625 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCAAGCAAAA CCAGCTACAG CTAAAGGAAC TTACGTAACA AACTTGACTA TCACAACTAC 60 

TCAAGGTGTT GGTATCAAAG TTGACGTAAA CTCACTTTAA TCAGTAGTTA AAGTAATGTA 12 0 

AAAAAGTTGA AGACGCTATG TCTCAACTTT TTTTGATGTA CGACGGGCAT CTTGTATAGT . 180 

AGATGTGTAC TATTCTAGTT TCAATCTACT ATAGTAGCTC AGAAGTCGGT ACTTAAACGT 2 40 

GCTATATCAA AACCAGTCCT TGAAAAACGT GGACTGGTTT CGTGTTTGGA TTATTACCTT 3 00 

GAACGACATG CGTTAAAAGT TAGTTGAACC GCCGTATGCC GAACGGACGT ACGGTGGTGT 3 60 

GAGAGGGGCT AGAGATTATC CCCTACTCGA TTTCGAAATC TAGTGGAATG AATCTGGAAT 4 20 

AGTCCATCGA GCTTTCTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT 4 80 

GCCGTGCGTA TGGTTACTGA CTTCGTCAGT TCTATCCACA ACCTCAAAAC AGTGTTTTGA 54 0 

GCTGACTACG TCAGTTCCAT CTACAACCTC AAAACAGTGT TTTGAGCAAC CTGCGGCTAG 600 

TTTCCTAGTT TGCTCTTTGG TTTTC ATTG A GTATAACACA TTGTTAGAAG TTGGTTTAAA 660 

TTTCCTAATC AGTTTGTTCA CATTTACCTT CGATATATTA TATCCCATAG TTAAGGTTGG 72 0 

TCATACAGAT GATTATAGTC ATGGAGCCGT AAAACTTAGT GTTTCTTTAG TTGACAAAGA 7 80 

TGCCATGAAA AAAATATTTG TAACTGTAAT AGGATATTTT GAAATAAATA TAGATGAAAA 84 0 

TATCACCGAT ATTCTATACG TAAATGGTAC TGCTATTCTT TATCTTTATT TACGTTCAAT 900 

TGTTTCAATA GTTTCGGCAA TTGATAGCAG TGAAGCAATG TTGCTACCTA TCATTAATGT 9 60 

TTTAGAGTTA CTAGATAAAT CTCAACCTTT TGAAGAAGAA TAATTTATTA GCTCACTAAA 1020 

TTGAGGGTAA GGAAAAGTAA AAGCAGTAAG AAAAATGTCT TGCATTATAC AGCAACCTTT 1080 

TGGGAATGAG T GG AT GG ATT GAATAAAATT TGATTAAGAG TGGATGATTT ATCTGTAGAT 114 0 

TATTATTGGA CAGTTAGTCT TGAAGTAGTC TAAGAATTAG GTTATAATCA GTAGAAGCCT 1200 

TGCTAATAAT GAGGAGGTTA GTTTATGTAT AGT AG AC TG A ATCT AAAAT A GTACGAAACA 12 60 

ATTGCTAAAA CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTGTT CTCATCTTAT 13 20 

TTCAATCCGC TATATATTAT GGTATCGAAT CTTCATCAGA ATGAT AAAAT TAATCAATTG 1380 

AT ATCTG AT T ACAAACAGAA. TATGAAAGCT TTTTATATCA CTATTGAAAA ATTTATACGA . 14 4 0 
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CATGATGAAA GCCTTAAGTG TTATTTTATA AAGGTTATTT CAAGTCGTTC CAAGGTAACA 150 0 

ACTCTAGATC AGATTGAAGC TGATAAAACG ATACAAAGAA AATATTCAAG TGAGCTAAAA 1560 

AAATTTATTG GATTTTATAA TGAGATTATT TGTGAGGAAA ATAGTTTCCT ACATGTACGA 1620 

AAGAGGTGGT CGAGTTGGTT TAGGTAGTCG ATGCGTGAGT TGATAATTCT CAGGGTATGG 1680 

ACTTCTTTTT CATGAATGAG GTAAAAGAGC AGGTATTGTT TAGAGACAAT CATTCTGAGC 1740 

ATATTTTCTG GATAGAGGGA GTATCCGATT TTATGATCAA AGTTAATACC GCCCTCTGGT 1800 

GAGAAGATGA GTAGGTTGGT AATTTAAACT ATTAAACAGA AT TTTTG AT T AAAAGT ATT A 1860 

TTTCATGAGA GAAATCCTAA TTTCACAATC CATAGGCAAA CGCTTGCATT TCGTTTTTTA 192 0 

TTGGACTATA ATAGGTTGGT AT AAAG CCTT CTGTAGTAAT AAAATGTAGA AGGTGTAGAA 1980 

AGTAAGGATT TAGAATATTT GTAGTTAAAA ACACAATGTT GCTATTCCTT ACGATAGGGA 204 0 

GATAGATATG GCAATGATAG AAGTGGAACA TCTTCAGAAA AATTTTGTGA AGACTGTTAA 2100 

GGAACCGGGC TTGAAGGGGG CTTTGCGCTC CTTTATTCAT CCTGAAAAGC AGACCTTTGA 2160 

AGCGGTCAAG GATTTGACCT TTGAGGTTCC AAAAGGGCAG ATTTTAGGAT TTATCGGGGC 2220 

AAATGGTGCT GGGAAGTCGA CAACCATTAA AATGCTGACA GGAATTTTGA AACCAACATC 2280 

TCCTTTTTGT CGGATTAACG GCAAGATTCC CCAGGACAAT CGGCAAGATT ATGTCAAAGA 2 340 

TATTGGCGTA GTCTTTGGAC AACGCACCCA GCTATGGTGG GATTTGGCTC TGCAAGAGAC 2 400 

CTACACTGTC TTAAAAGAGA TTTATGATGT GCCAGACTCG CTCTTTCATA AG CGT ATGG A 2 4 60 

CTTTTTGAAT GAAGTCTTGG ATTTGAAGGA CTTTATCAAG GATCCCGTGC GGACTCTTTC 252 0 

ACTGGGACAA CGGATGCGGG CGGATATTGC GGCCTCCTTG CTCCACAATC CCAAGGTTCT 2 5 80 

TTTTTTAGAT GAGCCGACCA TTGGTTTGGA CGTTTCGGTT AAGGATAATA TTCGTCGGGC 2 64 0 

AATTACTCAG ATCAATCAAG AGGAAGAAAC TACCATTCTT TTGACCACTC ACGATTTGAG 2 700 

TG AT ATT GAG CAACTTTGTG ATCGGATTTT CATGATTGAC AAGGGGCAAG AGATTTTTGA 2 7 60 

TGGAACGGTG AGCCAACTCA AGGAGACCTT TGGTAAGATG AAGACTCTCT CTTTTGAACT 2 820 

GCTACCACC? CAAAGTCATC TCGTCTC TC A CTATGACGGT CTGTCTGATA TGACCATTGA 2 880 

TAGACAAGGA AACAGCCTCA ACATTGAATT TGATAGTTCT CGCTACCAGT CAGCTGACAT 294 0 

TATCAACCAA ACCCTGTCTG AT TTTG AAAT CCGCGATTTG AAGATGGTGG ATACGGATAT 3000 

TG AGG AT AT T ATCCGTCGCT TCTACCGAAA GGAGCTCTAG GATGATCAAA TTGTGGAGAC 3 060 

GTTATAAACC CTTTATCAAT GCAGGGGTTC AGGAGTTGAT TACTTACCGA GTCAACTTTA 312 0 

TTCTCTATCG GATTGGCGAT GTCATGGGGG CTTTTGTGGC CTTTTATCTC TCGAAGGCTG .3180 
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TCTTTGATTC 


TTCGCAAGAG 


TCTTTGATTC 


AGGGCTTCAG 


TATGGCGGAT 


ATCACCCTCT 


3240 


ACATCATCAT 


GAGTTTTGTG 


ACCAATCTTC 


TGACTAGATC 


CGATTCGTCC 


TTTATGATTG 


3300 


GGGAGGAGGT 


CAAGGATGGC 


TCCATTATCA 


TGCGTTTGTT 


GCGACCAGTG 


CATTTTGCGG 


3360 


CCTCCTATCT 


TTTCACCGAG 


CTTGGTTCCA 


AGTGGTTGAT 


TTTTATCAGC 


GTTGGCCTTC 


3420 


CATTTTTAAG 


TGTCATTGTC 


TTGATGAAAA 


TCATATCGGG 


TCAAGGTATT 


GTAGAGGTGC 


3480 


TAGGATTAAC 


TGTCATTTAT 


CTTTTTAGCT 


TAACGCTCGC 


CTATCTGATT 


AACTTTTTCT 


3540 


TTAATATTTG 


CTTTGGATTT 


TCAGCCTTTG 


TGTTTAAAAA 


TCTTTGGGGT 


TCCAACCTAC 


3600 


T T AAG ACTTC 


CATAGTGGCT 


TTTATGTCGG 


GGAGTTTGAT 


TCCCTTGGCA 


TTTTTTCCAA 


3660 


AGGTTGTTTC 


AG AT ATT CTC 


TCCTTTTTGC 


CTTTTTCATC 


CTTGATTTAT 


ACTCCAGTTA 


3720 


TGATCATTGT 


TGGAAAATAC 


GATGCCAGTC 


AGATTCTTCA 


GGCACTCCTT 


TTGCAGTTCT 


3780 


TCTGGCTCTT 


AGTGATGGTG 


GGATTGTCTC 


AGTTAATTTG 


GAAACGGGTC 


CAGTCCTTTA 


3840 


TCACCATTCA 


AGGAGGTTAG 


TATGAAAAAA 


TATCAACGAA 


TGCATCTGAT 


TTTTATCAGA 


3900 


CAATACATCA 


AACAAATCAT 


GGAATATAAG 


GTAGATTTTG 


TGGTTGGTGT 


CTTGGGAGTC 


3960 


TTTCTGACTC 


AAGGCTTGAA 


TCTCTTGTTT 


CTCAATGTCA 


TCTTTCAACA 


TATTCCATTC 


4020 


CTAGAAGGCT 


GGACCTTTCA 


AGAGATAGCT 


TTCATTTATG 


GA7TTTCCTT 


GATTCCCAAG 


4080 


GGAATGGACC 


ATCTCTTTTT 


TGACAATCTC 


TGCCCACTAG 


GGCAACGCCT 


AGTCCGAAAA 


4140 


GGGGAGTTTG 


ACAAGTATCT 


GACTCGTCCC 


ATCAATCCTC 


TCTTTCACAT 


CCTAGTTGAA 


4200 


ACCTTTCAGA 


TTGATGCCTT 


GGGTGAACTC 


TTAGTCGGTG 


GTATTTTATT 


GGGAACAACA 


4260 


GTG ACCAGC A 


TTGTTTGGAC 


TCTTCCAAAA 


TTCCTGCTTT 


TCCTAGTTTG 


TATTCCTTTT 


4320 


GCGACCTTGA 


TTT AT ACTTC 


TCTTAAAATC 


GCAACAGCCA 


GTATCGCCTT 


TTGGACTAAG 


4380 


CAGTCAGGCG 


CCATGATTTA 


CATCTTCTAT 


ATGTTCAATG 


ACTTTGCTAA 


GTATCCGATT 


4440 


TCTATTTACA 


ATTCTCTTCT 


TCGTTGGTTG 


ATT AGCTTT A 


TCGTGCCTTT 


CGCCTTTACA 


4500 


GCCTACTATC 


CAGCTAGCTA 


TTTCTT AC AG 


GAAAAGGATG 


TGTTCTTTAA 


CGTAGGAGGT 


4560 


TTGATCTTGA 


TTTCTCTGGT 


TTTCTTTGTT 


ATTTCCCTTA 


AACTTTGGG A 


TAAGGGCTTA 


4620 


GATTCCTACG 


AAAGTGCGGG 


TTCGTAAAAG 


CTAAAGTAAG 


ACTAAAATCA 


AGAAAGAAAC 


4680 


TTATGATGTT 


TGTAATTGAA 


GAAGTCAAGG 


ATGAAAATCA 


AAAAAAGGCA 


GTTGT CGCTG 


4740 


AGGTTTTGAA 


GGATTTGCCA 


GAATGGTTTG 


GAATCCCAGA 


AAGCACACAA 


GCCTATATAG 


4800 


AAGGAACCAC 


GACACTGCAA 


GTTTGGACCG 


CCTATCAGGA 


GAGTGATTTG 


ACT AG ATTTG 


4860 


TAAGCTTATC 


CTATTCGAGT 


GAAGATTGTG 


CAGAGATTGA 


TTGTCTCGGC 


GTAAAAAAGC 


4920 


TTATCAAGGT 


AG AAAAATT G 


GGAGCCAATT 


GCTTGCTACT 


TTAGAGAGTG 


AAGCTCGTAA 


4980 
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AAAAGTTGCT TATCTCCAGG TC AAAAC AG T GGCAGAAGGT TCTAATAAAG ATTATGATCG 504 0 

AACAAATGAC TTTTATCGAG GTCTTGGCTT TAAAAAGTTA GAGATTTTTC CTCAACTATG 5100 

GAATCCGCAA AATCCTTGTC AGATTTTGAT TAAAAAGCTT GAATAATATT ACTTGACATC 5160 

TATTCTCAGA GTGCTATACT GTAAGTGTAA TCGCCGATTT AGCTTACTTG GTAGAGCAAG 52 2 0 

GCACTCGTAA AGCCTAGGTT ATAGGTAGAT AAACG AC TG A GCATTTGAAA AAATAGATAG 5280 

GTAGAAGATA ACCGTTAAGC CTTACTCTTA G CGGTT ATTT ATATTGTTTA AT AG CGCT AA 534 0 

TATTTTATCA ATTAT GCCTG TTTTCGTGTT TCTGGTAGTT CTTCAAGTTT ATTGCTACTA 54 00 

TTTTTGATGG TATGAATGTG CTTATAATGT ATCCCGGTTA ACGAAAGTTT TGGACTTATA 54 60 

CTCTTCGAAA ATCTCTTCAA ACCACGTCAA CCTCCCCTTG CCGTGCGTAT GGTTATGACT 5 520 

TCGTCACTTC TATCCACAAC CTCAAAACAC TCTTTTCAGT GACTACCTCA GTTCCATCTA 5580 

CAACCTCAAA ACACTGTTTT GCCCAATCTG CGGCTAGTTT CCTAG 5 62 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7571 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEONESS : double 

( D ) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CTCTCCAGCT TTCCTTGCGA GTTGGCCATC TTGTGTCTTT AAGAACTCTA AAAATATCTC 60 

CAATAAAACG CATCGCTCTC TCCTATCTCG TTTCTCTGTG TGTAGTGTAC TTGCCACAAT 12 0 

GCTT ACAAAA TTT ATTT ACT TCTAGTCGTG TAGGCTTGAG GTTTCCGCTG ATCTTGATTG 180 

AATAGTTTCT CGAACCACAA ACCGCACAAG CTAGGCTTGC TTTTTTTAGT GCCATAACGC 2 40 

CTCCATCTTA TCCATTATAA CAAGAAAGCT AGGCTTTGAC AACCATCTTA GCGAAATAGA 300 

TTGACTATCG AATCCCATAT TGTTTGAGCC TTTTCCTTAA TCTTCGC AT C TGAGATAGCC 360 

CGGCTACCCT CATCTACTAG ACTTTGCGCA CGCCCTCGAA TATCAGACAA ATTATCATCT 42 0 

GTCTGGCTAT TATCATTGGT TTGTACTTGT CTTTTTGTAT TGGCTGGTGC AATTCCATTT 4 80 

TGCTTATAAG CATTTTCAAC CGTAAAGGTA CTTCCTGGCG TATAAGGTAA AATGGTATTG 54 0 

GCAATGTTTC TAAAGACATG AGCTGGACCG TTTGAACTAG AGCCAGCTAG ATAGTGGTTT 600 

TCATCAGTGG TCGGAAAGCC AAGCCAGTGG CTAATCACTA CATCCGGAGT ATAACCAATT 660 

ACCCACTGGT CACTTGTGTA CTCCGGATTG AAAACTGCTT CAGTTGTTCC AGTTTTCCCT 7 20 
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G C CAT G AC AT AGTCTGCAGG CGATGAACTA 
AT CAT ACTGG TCATCTTGTC AG CT AC AG AC 
TGACTCGCAA TAACTTGTCC ACTAGCATTT 
ATTAAACCTT CATTTGCAAA GGCGGCGTAT 
ACACCGCTTC CCAAGGCGAC ACCAAGAACA 
TTTTCGCCTG C CTC AAAAGC CTTGTCGACA 
AGATTAAGCG ATTCTGCCAA GGCTTGATAC 
GCATAGTTAT CAACCTTATA GCTGTCATAC 
AAAGCCCAGC TTGCTTCAAC TGCTGGCGTA 
GGACTACGCT TTGATTGGGT TGCATAGTTG 
GCAACTTGAC CGACAACTCC ACGAACTCCC 
GATTGAGCAA ACGTTCCATC CTCTGCCCTC 
TGCATATTTG CTTGGTAGTT TTGGTCCAGC 
ATCTCTTCCT CTGTT AG ATT ATACTTGGAA 
GAGGGGTAAC GGTAATCTGA GATTTTTCCT 
TCAACTTCAG CAGCTTTGGT TTCTTGGTTT 
TGCAAGACAG TATCGCGCCG ATTAGTAGAA 
TCCGGCCCCT TGAGCATCCC TGCCAGAGTC 
GAAACTCCAA AGTATTTCTT ACT C GC ATCT 
GCGTTGTTAA GGTACATGGT TAGAATTTGC 
GCAAGGAAAA ATTCTTTCGC TTTTCTCTCA 
TTAGCCAGCT GTTGGGTAAT GGTAGAGCCA 
AAGAAAAAAC GGCCATAGTT AATCCCGTCA 
ATAACAGCAT TCTGCAAGTT TTTACTGATG 
CCAGACAAGG CACCAGCCTC TTTTTCTTCA 
GCATTTTGCA AATCATTGAC ATTGGTCGAC 
AGCAAGCCTG CACTCAAACC TAGTATAAGG 
AATTTTCGAA TCGGACCTAC TTGGGCTAAT 
ATAGTAGAAT C AGAGTCCTC TAGTTCACTT 
TCAAATAATT TATCTAATTT CATGCGTTTA 
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ATACCGGTAC CGTTGGTGAA AGTCCCCAAC 7 30 

TTATCAATCA CCCCTTTTTG TGAATTTTTA 840 
TCAATTCTAC TAATAAAATG AGCTTCAGGC 900 
GCTTGAGCCA TTTGAAGAGG GTTGGTTTCA 9 60 

CGGTCGACCT TTTCCATGTT GAGTCCGAAT 102 0 

CCCAAATCAT TAACAGTGCC AACAGCAGGT 1080 

ATAGGAACTT CTCGACTCGT TTTGATCCCT 1140 

TGCATGGTAT GGTTATCCAA CTGCTTATTC 1200 

TAAACAACTA AAGGCTTAAT TGTAGAACCA 12 60 

AAATTCCGGA ATCCAGTTTT ATCATTGTCA 13 2 0 

CCTGTTTTCG GTTCGAGGGC TACACTTCCT 13 80 

GGAAATAGCG ATGTGTTTTC ATAAACAATC 14 40 

TCTGTG T AAA TGCGGTAGCC ATTATTGACA 1500 

ACACCTTCAT TAACCACCGC ATCAAAATAA 1560 

TCATACTTAT CGTGCAATTG CGAAGTCATA 162 0 

TTATCAATAT ATCCTCCTCC AACCATATTC 16 80 

TCTTCTACGG AATTCAAGGG ATTATACAGT 17 40 

GCAGCTTGAT CCAGACTCAC TTCTG ATGC A 1800 

TCTACACCCC ACACACCATT TCCAAAATAA 18 60 

TCCTTACTAT ATTTTTTG CT TAATTCTAAG 19 20 

AC A GTTTG AT CCTGCGATAA ATAGGCGTTT 19 80 

CCACCTGAAC GTCCAGCAGT GACAATAGCC 2 04 0 

TTTTTATAGA AAGAACGGTC TTCTGTCGCA 2100 

TCAGTCAGCT CAACATAGCT TCCCTTTTGA 2160 

CGGTCAAAAA TAAGAGTCCG AGTTTT CAAG 2 22 0 

TTGGCTACAG CAAACAAATA GATTCCAACT 22 80 

ATAATCTTTG TTAGATGATA ACGACGCCAG 2340 

TTTTTTCGAT C ACT ACG AG A GCGACGTAAG 24 00 

CTTTCTTTTT TAAAAAGAGA AAGAAATTTC 2 4 60 

TTTTATCATC TTCATCATAG GAAGACAAGA 2 52 0 
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ATTTAGCTAT TTCCTATCCA AATAGGGCTT TTTTTGTTAC AATATCTGTA TGCAATTCAC 2 580 

ATTTACATTA CCCGCCTCTC TACCTCAAAT GACAGTAAAG CAATTACTTG AGGAACAACT 2 64 0 

CCTCATCCCT AG AAAAATC C GTCATTTTTT GAGAATCAAG AAACATATTT TGATAAATCA 2700 

AGAACAACTC CACTGGAAGG AAAT CGT AAA TCCTGGAGAT GTTTGCCAGT TGACTTTTGA 27 60 

CGAGGAAGAT TATTCCCAAA AGACGATCCC TTGGGGCAAC CCAGACTTAG T GC AGG AAGT 2820 

TTATCAAGAT CAACACTTGA TTATTGTAAA CAAACCAGAG GGGATGAAAA CGCATGGTAA 2 880 

TCAACCAAAC GAAATTGCCC TTCTTAACCA TGTCAGTACC TATGTTGGCC AAAC CTGCT A 29 4 0 

TGTCGTTCAT CGTCTGGACA TGGAAACCAG TGGCTTAGTT CTCTTTGCCA AAAAT CCTTT 3 000 

TATCCTGCCC ATTCTCAATC GCTTATTGGA GAAAAAAGAG ATTTCTAGAG AATATTGGGC 3060 

. TCTAGTTGAT GGAAATATCA AC AG AAAAG A ACTTGTTT T C AGAGACAAAA TTGGACGTGA 3120 

TCGCCATGAT CGTAGAAAAA G AATAGTTG A TGCAAAAAAT GGGCAATATG CTGAAACGCA 3180 

TGTAAGCAGA TTAAAGCAAT TCTCAAACAA GACTTCCTTG GCTCATTGCA AGCTAAAGAC 3 2 40 

AGGGCGAACC CAT CAG ATTC GTGTGCACCT TTCGCATCAT AATCTTCCTA T C CTGGG AG A 3 300 

CCCTCTCTAT AATAGTAAAT CAAAGACAAG CCGGCTTATG CTTCATGCCT TCCGACTTTC 3 3 60 

CTTTACCCAC CCACTTACTT TAGAGAAGCT AACTTTCACT ACCCTTTCAA ATACATTTGA 3 42 0 

AAAAG AATTA AAAAAGAATG GATGATCGTG TCATCCATTT TTCCATATAA AAAAG C AAG A 3 480 

CCACAAAGCC TTGCTTTCTA TCAACTCAAG AATTATTTAG CAATTTTTGC GAAGTATTCA 3 54 0 

AGAGTACGAA CAAGTTGTGC AGTGTATGAC ATTTCGTTGT CGTACCATGA TACAACTTTA 3 600 

ACCAATTCTT TACCGTCAAC GTCAAGAACT TTAGTTTG AG TTGCGTCAAA CAATGAACCG 3 66 0 

TAAGACATAC CTACGATATC TGAAGATACG ATTGGATCTT CTGTGTAACC GTATGATTCG 372 0 

TTTGAAGCTG CTTTCATACC TGCGTTCACT TCATCAACAC TAACGTTCTT TTCAAGAACT 3730 

GCTACCAATT CAGTAACTGA TCCAGTTCGA GTTGGAACGC GTTGTGCAGA TCCGTCAAGT 384 0 

TTACCATTCA ATTC TGGG AT TACAAGACCG ATAGCTTTTG CAGCACCACT TGAGTTAGGA 3 900 

ACG AT GTTTG CAGCACCAGC GCGAGCACGG CGAAGGTCAC CACCACGGTG T GGTCCGT C A 3 9 60 

AGGATCATTT GGTCACCAGT GTAAGCGTGG ATAGTAGTCA TCAATCCTTC AACAACACCA 402 C 

AAGTTGTCTT GAAGAGCTTT AGCCATTGGA GCCAAGCAGT TTGTAGTACA TCAAGCACCT 4080 

GAGATAACTG TTTCAGTACC GTCAAGAACG TCGTGGTTAG TGTTGAATAC AACTGTTTTA 414 0 

ACGTCGTTTC CACCAGGAGC AGTGATAACA ACTTTTTTAG CTCCACCTTT AAGGTGTTTT 4 2 00 

TCAGC7GCTT CTTTCTTAGC AAAGAAACCA GTAGCTTCAA GAACGATTTC TACACCGTCA 4 2 60 
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GTAGCCCAGT 


CGATTTGTTC 


TGGATCACGT 


TCAGCAGAAA 


CTTTGAT^AA 


TTTACCGTTA 


4320 


ACTTCAAATC 


CACCTTCTTT 


AACTTCAACA 


GTACCGTCGA 


AACGACCTTG 


AGTTGTGTCG 


4380 


TATTTCAACA 


AGTGTGCAAG 


CATAACTGGA 


TCTGTAAGGT 


CGTTGATGCG 


TGTAACTTCA 


4440 


ACACCTTCTA 


C GTTTTGG AT 


ACG AC GG AAA 


GCAAGACGAC 


CGATACGTCC 


G AAACCGTT A 


4500 


ATACCAACTT 


TAACTACCAT 


TAGTGATTTC 


CTCCTTATGA 


AAATCATGAA 


ATTTTTATTG 


4560 


TGAAAAGAGT 


AACTTGAATC 


ACTACAAATC 


ACCTTTCAAC 


AAACCTATTA 


TACAACTATT 


4620 


TGAGTTGAAT 


TGCAAGTATG 


GCCATTGTTT 


TTCTATGTTA 


GTTTCTTTTT 


AAGACTGTAA 


4630 


ACCAAGGAAT 


CCCTTACTAT 


TCATAGCATA 


ACGATTCTAT 


AGGATCCATT 


TTACTAATCT 


4740 


TACGCGCCGG 


GAAGTAGGCT 


GAG AC AT AAC 


CAAGTAATAG 


AGCGAAAACT 


AGAGTTCCTA 


4800 


AAACAGATAA 


AAGATTTAAT 


TTAAAAACCT 


TAGTGATGGA 


TGGGTAAAAG 


TGACTTACAA 


4860 


TCGCATTCGC 


CAAACTTCCC 


AC CCCTTGTG 


CAACCAAAAA 


TGCCAGCAGC 


AAGGCGATGC 


4920 


CTACAATCCA 


GATAGCCTCG 


TAAATAAAAA 


TTCCTTTGAC 


ATCACGATTC 


TGATAACCAA 


4980 


C TGCTTT CAT 


GACACCTATT 


TCCTTGGAAC 


CTTGCATGAT 


ATTGATGTAA 


ATAATGATAC 


5040 


CAATCATAAC 


CGCTGCTACC 


AC AAT AG CTT 


GTGATGAAAG 


CACAATCAAT 


AATCC CTG AA 


5100 


TAACACGAAT 


AAAGGTAATC 


ACAATATCAA 


GAACTCTCTG 


TTGAGAAAGC 


ACAGTATACT 


5160 


TCTTATTTTT 


CTGTAATTCT 


T CTGTT ACT A 


CTTTTGTCTG 


TGATGGATCT 


TTGAGTTCCA 


5220 


AG AT AAAAT A 


AGATACAGCT 


TTCGTAAATC 


CAGCCTCTTT 


CAAAATCGTT 


TCCATTTGAT 


5280 


GAGACAGCAT 


GAAACTGTTG 


CTGTCCTCCA 


TGTCATCTTC 


AT C ATT G ATT 


ACACGTACAA 


5340 


TCTTCGTTTG 


AAATTGAGCA 


ATCTTACTAG 


TTTCGGCAGC 


ACTTTCTACA 


ATGCTGGCTG 


5400 


AGACTGATTT 


GCCAATAAGA 


TCATTAGCTG 


TCAAATTTTT 


TCCTGTCTGT 


TCATTCCAAT 


5460 


TTTTTAGTAA 


ACTGC TTGG A 


ATCGTTAATC 


CCTGTTC ATT 


TGTATCAGTA 


TAGAGGGATC 


5520 


CAGCCAACAC 


TTTGTCCGTC 


TC ATT ATT AC 


TAACAGAGAT 


ACTTGTATCA 


TCATAAAGAC 


5580 


TCACTACTTG 


AGCATAAGAA 


GGC AT CGTTT 


GACTCAGATC 


CATTTCTTGC 


CCATCTATAG 


5640 


TAATATTTGA 


CATGTTCATC 


CC AAAAGG AC 


TCTCCAAATA 


TTTAATAGCT 


TCTTTCCCAA 


5700 


CTGTATCCGT 


GATATATAGT 


CAATTGAAAC 


AAGAGCAGGA 


TAAAAAAGCC 


TCGT AAAAGG 


5760 


TATTGCAACT 


TGGTAATACC 


TTTTTGAGGT 


GCTTTTTGAT 


ATGAGCCCAT 


CTTTTCTCAA 


5820 


TAGGATTGTA 


CTCAGGCGAG 


TAGGGAGGAA 


GAGGTAAAAG 


TTTATGCCCA 


AACTCTT CGC 


5880 


ATAAAAGTTC 


TAGCTTCCCC 


ATTCTATGGA 


ATCTTACATT 


ATCCATAATA 


AT AACCG ATG 


5940 


GTGTGTTTAA 


TGTTGGTAAG 


AGAAAATTCT 


GAAACCAAGC 


TTCAAAAAAG 


. TCGCTCGTCA 


6000 


TCGTCTCTTC 


GTAAGTCATT 


GGAGCGATTA 


. ATTCACCATT 


1 TGTTAGACCT 


' GCAACCAAAG 


6060. 
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AAATCCTCTG 


ATATCTTCTT 


CCAGATACTT 


TGCCTCTTAT 


TAATTGACCT 


TTTAATGAGC 


6120 


GACCATATTC 


TCGATAAAAA 


TAAGTATCGA 


ATCCTGTTTC 


GTCAATCTAA 


ACAGGTGCTA 


6180 


GGTGCTTTAA 


ACTATTAAAA 


TTCTTAAGAA 


ATAAGGCT AC 


TTTTTCTGGG 


TCTTGTT C AT 


6240 


AGTAGGTGTG 


GTTCTTTTTT 


CGAGTGTAGC 


C CAT AGCTTT 


GAGCGTATAG 


TGGATGGTAG 


6300 


TTGGATGACA 


GCCAAATTCA 


GAAGCTATTT 


CAGTCAAATA 


AGCGTCTGGA 


TTGTCAGTAA 


6360 


GATAGTTTTT 


AAGTCTATCT 


CTATCAACCT 


TTCTTGGTTT 


TATTCCTTTT 


ACTTGGTGGT 


6420 


TTAGCTCTCC 


TGTTTTCTCT 


TTTAGCTTTA 


ACCAGCCATA 


AATGGTATTA 


CGTGAGATTT 


6480 


GGAAAACGTG 


TGATGCTTCT 


GTTATACTAC 


CTGTTCGCTC 


ACAATAAGAG 


AG AACTTT TT 


6540 


TACGAAAATC 


TATTGAATA? 


GCCATAAAAA 


GATTATACCA 


CATTGTGTAC 


TATTTTTGGT 


6600 


TCATTTTACT 


ATATTTGAAG 


AGGCGTTTAA 


ACT ATCTG AC 


ATAAAACTCG 


TT CT AG AGG A 


6660 


AAGACATCCT 


TTAAAAAGTT 


AGTTTATTTT 


ACAACTTAGA 


CATCAAGGTA 


GGTTAACCCC 


6720 


TTCATGGAAA 


AATCAAGACT 


CTT AGC ACT A 


TGGGTTAAAC 


TACCACTGGA 


GACGTAATCA 


6780 


ATCGCTAAAC 


CACGAAAACG 


GCTAATAGTG 


GTCATATCAA 


TATTTCCAGA 


ACATTCAATC 


6840 


CG AG AACGTC 


CTGCAATTAG 


GGTAATGGCC 


TGTTCAATCT 


GTTCCAATGA 


CATATTATCC 


6900 


AACATGATAA 


TATCAGCACC 


CGCCGCCGCA 


GCTTCTTCGG 


CAGCAGCAAG 


GCTTTCCACT 


6960 


TCCACCTCGA 


CCATTTTCAC 


AAAAGGGGCA 


TAGGCACGCG 


CTTGAGCAAT 


TGCCTTTTGA 


7020 


ACACTACCTA 


CTGCCGCAAT 


GTGATTGTCT 


TTTAGCACCA 


TAG C ATCTG A 


TAAATTAAAG 


7080 


CCATGATTAT 


AGCCACCGCC 


AACTCTCACG 


GCATATTTCT 


CAAAAAGACG 


TAAATTAGGA 


7140 


GTACTTTTTC 


GAGTATCAAA 


TACCTTAATG 


CAATCATCGC 


CTAAGGCTTC 


TACATAAGCA 


7200 


GCTGTCATCG 


AAGCAATCCC 


TGATAAATGT 


TGTAAAAAAT 


TCAACGCAAC 


GCGTTCACAT 


7260 


GTTAAGAGAC 


TTCTCACCGA 


GCCTATGATT 


TCTAAAACCA 


AATCGCCACT 


AGTCAAACGA 


7320 


TCCCCATCCT 


TAAATTGATG 


AGGATTCTGG 


AAGGTCACCT 


CGGCATCAAA 


TAGGGTAAAA 


7380 


AC CCTTTG AA 


AAACGGTTAG 


CCCCGCTAAA 


ACACCAGCTT 


CCTTGGCAAA 


AAGCGACACC 


7440 


TTGGCTTGGC 


CATGATGATC 


AAAAATGGCA 


TTGGTACTCT 


AATCTTCGGA 


ATGAACATCT 


7500 


TCTCGCAAGG 


CTGCTTTCAA 


TGTATCATCT 


ATTTGAAAAG 


GGGTTAAATC 


AGTTGAAATG 


7560 


ATTGACATCA 


. c 










7571 


(2> INFORMATION FOR SEQ ID NO: 3 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26385 base pairs 
(3) TYPE: nucleic acid 
{O STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTTGCTAGTG GCTTAAATTC TTCAGGAAAA TCAGGCGTAT CTAAAAGTCG TGTCGTTTTT 60 

GTTTCATCTA TATAAAGACT TCCTGCTCCC CCTACAACTA GAAAACGTGT CTGTGTTCCA 12 0 

GCAAGAAGCT GATTAAATAG TTCGATTGAT TTGCTGTGGA GCGGTAGCGT ATCTGGTGTA 180 

TAAGCACCAA ACGCTGAAAT AACAGCATCA AATCCAGTAA GATCATCTTT TGTCAACTCA 240 

AATAAATCTT TTTTAATAAT AGACTCAGCT TGACTTTTGT TTTCAGAACG AACAATAGCC 3 00 

GTTACTTCAT GTCCTCGTTT GACTGCTTCT TCAACAATTG CTTTCCCCGC TTGTCCATTT 3 60 

GCTGCAATAA CTGCTAGTTT CATTTTTTAT ACCT CTCTTG TTGTAATTAT TTTAGTTACA 4 20 

GAAATTGTGA CACTCTTAAT AATCAATGTC AATAGTCTTG CTT AATT ATT ATCAAAATAT 4 80 

TTCTACCAAG AAAACTAACC ATGATTCTAG TGAAAAAAAA TCTTCTTTGT CAACAAATTT 540 

ACTTTCTTGT TTTAAACATG CTATAATAAT CATAGCAAGA GATCTAAGTT GTCTGTTTTT 600 

TT AAAACG AG GTGATTATCA TGCGTAGATT CTATTCCCAT CTCCCCTACT ATCTGGTCAT 6 60 

ATTATTCTTT TATTGGCCAC TTTATGAGTT GTTCTTACTA GTTGTTTCTC ACCCCCTTAC 720 

ACTCAAGGGA CTCTATATAA ACAATCTTCT CTTCTTTACA CCTCTGGTAA TCTTGATTGT 7 80 

ATCGTTACTC TATAGCTACC GTTTCCGTTT CTCACTTTGA TGGTTAGTTG GTAACGGACT 840 

GCTCTTTTAC TTTACTATCA TAACCTTTGG TGAGTTTATA CTAATTTACT TGCTAATCTA 900 

TGAAACAGTT GCTCTGGTCG GCATGGATTC TGGTATTAGC ATCAAGCATA TTCTACAAAA 9 60 

AATGAAAAAC AAAAAACTTT CACAAAATCC TTGAAAAATC TCACAATCAT GCTATAATAA 1020 

T CC AT AG AG A CAAGTCACTT AGTCCCTTTC TACTAGAGAG TGCGTGGTTG CTGGAAACGC 1080 

ATAGGAAGTC T AAACTG AT A CTACTCTTGA GTTTTTTATG AAAACATAAA ACGGTGGCCA 1140 

CGTTAGAGCC GATCAGAGGT GTCCCTCTCT TTTGAGGTAC AT AAATG AAG CTCGAACCAC 12 00 

GTTGCGACGT CCTTTCGAGG ATGTCGCATT TTTTTATTAG GAT AC T AATT ATGGAGTTGC 12 60 

AAGAATTAGT GGAGCGCAGT TGGGCAATCC GACAAGCTTA TCACGAACTG GAACTTAAGC 1320 

ATCATGATTC CAAGTGGACG G TAG AAG AAG ACCTCTTGGC TTTATCTAAT GATATTGGAA 13 80 

ATTTCCAACG ACTGGTG AT G ACAAAGCAAG GACGCTACTA TGATGAAACA CCCTACACAC 14 4 0 

TGGAACAAAA A CTTTC AG AA AATATCTGGT GGCTATTAGA ACTTTCTCAA CGTTTGGATA 1500 

TAGACATTCT G AC GG AAATG GAAAACTTCC TCTCTGATAA AGAAAAGCAA TTCAACGTTA 15 60 

GGACTTGGAA CTAGTCTGCT GATAAAAAAT CAATGCTTAG AAACTATGAA ATAATAAAAA 1620 



WO 98/18931 



PCT/US97/19588 



159 

AGGAGAACAT CATGATTAAC ATTACTTTCC CAGATGGCGC TGTTCGTGAA TTCGAATCTC 1680 

GCGTAACAAC TTTTGAAATT GCCCAATCTA TCAGCAATTC CCTAGCTAAA AAAGCCTTGG 17 4 0 

CTGGTAAATT CAACGGCAAA CTCATCGACA CTACTCGCGC TATCACTGAA GATGGAAGCA 1800 

TCGAAATTGT GACACCTGAT CACGAAGATG CCCTTCCAAT CTTGCGTCAC TCAGCAGCTC 1860 

ACTTGTTCGC CCAAGCAGCT CGTCGTCTTT TCCCAGACAT TCACTTGGGA GTTGGTCCAG 1920 

CCATCGAAGA TGGTTTCTAC TACGATACTG AC AAC AC AG C TGGTCAAATC TCTAACCAAC 1980 

ACCTTCCTCG TATCGAAGAA GAAATGCAAA AAATCGTCAA AGAAAACTTC CCATCTATTC 2040 

GTGAAGAAGT GACTAAAGAC GAGGCACGTG AAATCTTCAA AAATGACCCT TACAAGTTGG 2100 

AATTGATTGA AGAACACTCA GAAGACGAAG GCGGTTTGAC TATCTATCGT CAGGCTGAAT 2160 

ATGTAGACCT CTGCCGTGGA CCTCACGTTC CATCAACAGG TCGTATCCAA ATCTTCCACC 2 220 

TTCTCCATGT AGCTGGTGCG TACTGGCGTG GAAACAGCGA CAACGCTATG ATGCAACGTA 2 2 80 

TCTACGGTAC AGCTT GGTTT GACAAGAAAG ACTTGAAAAA CTACCTTCAA ATGCGTGAAG 2 340 

AAGCTAAGGA ACGTGACCAC CGTAAACTTG GTAAAGAGCT TG AC CTCTTT ATGATTTCAC 2 400 

AAGAAGTGGG ACAAGGTTTG CCATTCTGGT TGCC AAATGG TGCG ACT ATC CGTCGTGAAT 2 4 60 

TGGAACCCTA CATCGTAAAC AAAGAGTTGG TTTCTGG CT A CCAACACCTC TACACTCCAC 2520 

CACTTGCTTC TGTTG AGCTT TACAAGACTT CTGGTCACTG GGATCATTAC CAAGAAGACA 2 580 

TGTTCCCAAC CATGGACATG GGTGACGGGG AAGAATTTGT CCTTCGTCCA ATCAACTGTC 2 640 

CGCACCACAT CCAAGTTTTC AAACACCATG TTCACTCTTA CCGTGAATTG CCAATCCGTA 2 700 

TCGCTGAAAT CGGTATGATG C AC CG TT AC G AAAAATCTGG TGCCCTCACT GGCCTTCAAC 2 7 60 

GTCTACGTGA AATGTCACTC AACGACGGTC ACCTATTCGT TACTCCAGAA CAAATCCAAG 2 820 

AAGAATTCCA ACGTGCCCTT CAGTTGATTA TCGATGTTTA TGAAGACTTC AACTTGACTG 2 8 AO 

ACTACCCCTT CCGCCTCTCT CTTCGTGACC CTCAAGATAC TCATAAGTAC TTTGATAACG 2 940 

ATGAGATGTG GGAAAATGCC CAAACCATGC TTCGTGCAGC TCTTGATGAA ATGGGCGTGG 3 000 

ACTACTTTGA AGCCGAAGGT GAAGCAGCCT TCTACGGACC AAAATTGGAT AT C C AG ATT A 3 060 

AAACTGCCCT TGGAAAAGAA GAAACCCTTT CTACTATCCA ACTTGATTTC TTGTTGCCAG 3120 

AACGCTTCGA CCTCAAATAC ATCGGAGCTG ATGGCGAAGA TCACCGTCCA GTCATGATCC 318 0 

ACCGTG GGG T TATCTCAACT ATGGAACGCT TCACAGCTAT CTTGATTGAG AACTACAAGG 324 0 

GGGCCTTCCC AACATGGCTG GCACCACACC AAGTAACCCT CATCCCAGTA TCTAACGAAA 3300 

AACACGTGGA CTACGCTTGG GAAGTGGCCA AGAAACTCCG TGACCGCGGT GTCCGTGCAG 33 6 0 
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ACGTAGATGA GCGCAATGAA AAAATGCAGT TCAAGATCCG TGCTTCACAA ACCAGCAAGA 3 4 20 

TTCCTTACCA ATTAATTGTT GGAGACAAAG AAATGGAAGA CGAAACAGTC AACGTTCGTC 3 4 90 

GCTACGGCCA AAAAGAAACA CAAACTGTCT CAGTTGATAA TTTTGTTCAA GCTATCCTAG 3 540 

CTGATATCGC CAACAAATCA CGCGTTGAGA AATAAGAGTC T AG CAT AAAA GCCTCCAATC 3 600 

TGGAGGCTTT TTCTCATCTA TTTTTACTCA AGGACTAAGT TCACTTGAGC AAACTGAATC 3660 

CGCACTGTCG TTCCTTTTCC GACCTCAGAC TCGATACGAA TCTGGTGCCC CAGTTCTTCA 37 20 

GAAATTTTCT TAGATAGATA AAGGCCAAGT CCAGAGGACT GCTGGGTCAA ACGGCCATTG 37 80 

TATCCTGAAA AGCCACGTTC AAATACTCGG AGGACATCAC TGTTTTTT AT CCCGATTCCC 384 0 

GTATCTTTGA TACAAAGCTC TTGGTCATCC ATATAAATCT CC AG ACCACC TTCCTTCGTC 3900 

TACTTGAGAC TGTTTG AG AT GATTTGCTCA ATAACCACTA GCACCCACTT TTTATCCGTC 3960 

ACGATTTCTT TATCAAGGTC ATGTAGATTG ACATTTAAGC CTTTTTGAAT AAAGAAAAGA 402 0 

GCATATTTAC GAATTATTTC CTTGACCAAG TCCTCAATTT GAACCTCCTT T AAG AC C AAA 4080 

« TCATCATGGA AACTTTCTAA ACGCAGGTAC TGTAAAACTA GGTTGGTATA GGAGTCGATT 4140 

TTGAAAATTT CCTGTTCTAG CTGCTGCTTC AGTTGGCGGT CGACCACTTC TGCAACTAAG 42 00 

AGTTGACTGG CTGCAATGGG GGTCTTTATC TGATGGACCC ACAAGGTATA GTAATCCAGC 4 260 

AAATCCGTCA GTTTTCTTTC TGCTTTTGAC CTCTGCTGAT AGAGTTCCAT CTCACGCGCT 4 3 20 

TCTAATTTTT CTGCTAAAGC TATTTCCAAA GGACACTTGG CTTCCCTCTC TC CAT AG AG A 4 3 80 

AGTTCCTGGC GATAGACCTG CGTTTCCACC AATATGTCCC AAGTGAAAAA TAATATGGTT 44 4 0 

ACAAAGCAAC AC AAG AAG AA AAAGTAGAGG AAGTAAATTC CTACACTGCC AAATAAAAAC 4 500 

TGAAAGAGTA AG AC AAG AAA TGCCAAAGAA AGCAGATAGA TAAAAAGACG ACTACGGGAG 4 5 60 

CGCAGATAGG CTAGAAAAAA TTGTTTCCAA TC AAG CAT G C TTCAATCCGT ACCCTATTCC 4 620 
TTTCTTGGTC TCGATAAATC CTACCAATCC CTGCTCCTCC AACTTTTTAC GCAAACGAGC 4 680 

CACATTGACA GAGAGGGTAT TATCATCAAT G AAAAAGT C A CTGTTCCAAA GTTCCCGCAT 47 40 

CAGGTCGTCA CCTGCTACGA TGTTGCCTGC ATGCTCAAAT AACACGCGTA AAATCTGGAA 4 800 

TTCATTCTTG GTCAAATTCA AGACTTGCCC TTGATAATGT AAATCCATGG AT TTGGT ATT 4 8 60 

GAGGATAACA CCAGCATATT CCAGCAAACT CTCATCACGC CCAAACTCAT AGGAACGACG 4920 
CAACAAGCCC TGAACCTTAG CTAAAAGAAC CTGCTGGTCA AAAGGCTTGG TCACAAAGTC 4 9 30 

ATCCGCCCCC ATATTGATTG CCATGACAAT ATCCATAGCC TGGTCTCTCG AAGAAAGAAA 5040 
CATGATAGGT ACCTTGG AAA TCTTGCGGAT TTCCTGACAC CAGTGATAAC CATTAAACAA 5100 
GGGCAAACCA ATATCCATGA GGACCAGATG AGGTTCCGAC TGAACAAATA GACTCAAAAC 5160 



WO 98/18931 



PCT/US97/19588 



TTCCATAAAG 
CTGTTGACGA 
TTCCATTATT 
TGTTTTTAGC 
TAAATAAAAA 
ATGATTCTTA 
CCTACCTCTC 
CTCAGGTTCC 
AATTGGCTCA 
ACCCTGAACT 
TGATTGTCTT 
CCTATATGAA 
CCCATCATCC 
TCTATGCTGC 
TTCATGCACC 
ATAGAAGCAA 
CTCACCAAAT 
GTTGGCACAT 
AAGTAGACAC 
ACTTCGACCT 
CTTCCTTAAA 
CAGAAGAAAG 
CATGGAAAAA 
ACATACTTTC 
AAGAGCATTG 
TT AT AG ATT A 
ATATAAAAAG 
AAATAACTTG 
TTCCAAAATG 



TCTTCTACCA GGACCACTTC 
ATGACCTGAT C AT CTTCTAT 
ATAACAGATT TTTCCATGCT 
CAGTACAAAC AGGCTATGCT 
G AAAGG AG CT CTTATGGCCA 
TT AC G AC CTT CCCTTGAATG 
CTTTGATAAT CTGGTCTCCA 
AAGAGATCCC ACCATGCTTA 
ACACAAGCGC TTCAAAAATT 
GCAAAAGCAA TTTGCGGCTA 
TCGTGGGACA GATGACAGTA 
GGAAATTCCT GCTCAAAAGC 
TAAGCAAAAG GTTATTCTAG 
TAGCCAAATT GAGCAAAGTT 
TGGTCTCCAT CAAGAATTGA 
GATATTCATT CCACAAGGTT 
CATCGTTCAG AGTACTGCCC 
TGAGGACAAG CACTTCGTCC 
AACCTTTAAA GAATGGGTGG 
CTTCTTTGGC ACTATTCTTG 
GGCGCTTGAA TACATTCATC 
AGAAACCTTG GGT CGCCTT A 
TACATAATAC TCTTGAAAAT 
ATGTGCATTC TAAGTCTTTT 
AGAGATAATG GGGCTTGGAA 
AAAAGATGCC ACTTAGAAAA 
CTAACTGAAC ATTCTCGTAT 
AAATGAGGGA TAATAAAAAT 
ACACTATAAA GGCTAATACA 
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AAATCCCCAT TCAGAGAGCA 
TAATAAAATC TTGTGCATGC 
AGATGGTCTG AAACTGAATT 
ACT AG CT AAT TTGAGGGAAA 
AT ATTTTTG A CTATCTGAAA 
AGTTAGACAT TCTAACCTTA 
CACTTCCTCA ACGTCTTTTA 
CTAGCAAAAA TCGCCTTCAA 
GCAAACTCTC CCATTTTATC 
TGACTTATCG TGTCAGCCTC 
TCATTGGCTG GAAGGAAGAT 
ACGCCCTTCG CTATTTAAAG 
CTGGGCATTC CAAGGGAGGA 
TGCAAAATCA GATCACAGCA 
CACAGACTGC GCGTTATCAA 
CCATTATCGG TATGATGCTG 
TGGGTGGCAT CGCCCAGCAC 
AACTGGATAA GACCAACAGT 
CCACAGTCCC TGACGAAGAA 
ATGCTGGTAT TAGCTCTATC 
ATCTCTTTGT CCAAGCTCAA 
CCCAGTTATT GATTGATACT 
TAAATGTATA CAAAACAAAA 
TAAATAGAAT CTAATAGTCA 
CGTCCCTCTC GCTTCAACAA 
AGCAAAAAAG GAAGTAAGAC 
CCATTTTTAT AAAAAAGGTA 
AATACTGGAT TCCACAAACT 
ATT C CT A T AA C GAG AT AC AT 



TTTTCCCAAT 5 2 20 

GCTTCTCCTT 5 2 80 

TGAAATAGCC 53 4 0 

TTTGCTAAGA 54 00 

GATGTCGCAT 54 60 

ATAGAAATCA 5 520 

GATCTAGCAC 55 80 

TT ATT AG ATG 5640 

AACGACATCG 57 00 

GATACCTATC 57 60 

TTCCACCTGA 58 20 

AACTTTTTTG 58 80 

AATCTCGCTA 5940 

CTTTATACAT 6000 

AGGATAATGG 6060 

GAAATTCCTG 612 0 

GATACCTTTA 6180 

GATAGCCAGC 62 4 0 

CTTCAGCTCT 6 3 00 

AATCACTTCC 6 3 60 

TCCCTCACTC 6 4 20 

CGTTACCAGG 64 8 0 

GACCTAGAAT 6 54 0 

ATAAAAATCA 66 00 

AATGACCCCA 6 6 60 

AAAGGCAAAT 67 2 0 

GGATAGATAA 67 80 

TCTATTATCC 6 84 0 

TTCTTACTCC 6 9 00 
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TTTAATAGCT AC ATTTT AT C ATAATTATCC AAAGAAAAAA GAGGGCATTT ATCCCTCTTA 6 960 

ATCCTTCATC TGACTCTCTC CATCGGCCAC CACTTTTTCT AGACTGGTTT GACCAAGTTC 70 20 

TGCCTCCATA GTCAACTGAA TTCTCTCCAA TTTTTGATCC AAAACATCAT GAATATGAGC 7080 

TCCTACAGGG CAATTTGGAT TCGGATTGTC ATGGAAACTG AAGAGTTGAC CTGTCTTACC 7140 

AAGACATTCG AC CGCCTG AT AAACATCTAA AAGACTAATA TTTCTTAAGGT CCTTGACAAT 72 0 0 

CTCTGTTCCG CCCGTTCCAC GCGCTACTGA AATCAGCTCT GC CTTCTTC A ACTGGGACAA 7 2 60 

GATCTTTCTG ATAATGACAG GATTGACCCC G AC ACT AG C A CCCAGAAAAT CACTGGTCAC 7 3 20 

CTTGCTTTCC TTCCCCTCGA GGGCAATGAT TATCAGCATA TGAGTCGCAA TGGTAAATCT 7 3 30 

ACTTGGAATT TGCATCCTCT TCTCCTTTTT ACGAGGCTAC CCTGCCTCTA CTCTTCTTTT 7440 

TCTATTATTA TACCCTTTTT AGTTGTAATG TCAATCGTTA CCACTTTTCA ACCAGTCGTC 7 500 

T AACTCCCG A TCGCAGCCCT CTTTCTGAGC CAATTCTCTC AAAAATTCCT GATGATGAGT 7560 

ATGGTGGATC CCATTGACCA GACTTTCATA GTAAACCTCA AAATAGGGAA GTCTCAGGTC 7 62 0 

TTTACCCACC TGCAATTCAG CTGCTACATC GTAGTCTACC CGTCGGAAGT CCATATCTAC 7 680 

CAGGCCTTTG TCATCAAACT CCAAAATCAT ATACTGGGCC CGCAAGTCCT TCCGTAGCTG 77 40 

AGCGTCCAAA AAGAAAGGTT GGCCAATCGA ACCCGG ATTG ACAATCAATT GCCCACCAGT 7 800 

CCCGTAACGA AGCAACTGCT GGTCAATATC TCCATAAACA GCAATATCAC AGGGAGGATG 7 9 60 

AGTCACCAAG CGGTCAAACT CCTCTTGTTT GCCAGTATGA ATCAACTCTC GCCCCCAGTT 7920 

CTTATCAGGC AGATGATGGC TAATTCCCAC CGTCAAATCC CCAAACTGAC GATCAATTTC 7 93 0 

AAGAGGTTGA TTGTGGACCA CTTCAATTTC TTCTAGGGAA ATTTCCTCTA AAACATACTG 804 0 

GCACTGGCGC AAGAGATAGC GTTGACTGGG GCGAGTACTG TCCAATTCCT TACGGACACC 9100 

ATGC CAAAG A CTCTCTTCCC AGTTTCCCAA AACTCTAGCC GTAATCGGTA GTTGATCCAA 8160 

CAAGTCCAAA ATC CTTCT AC GCCCTGTCCC TGGCATGACA ATATCTCCCA AAAG C C ACT A 8 2 2 0 

TTCATCCACT CCTATCTGCC GAGCATCTGC CAAAACAGCC TCCAAGGCGG TGGTATTTCC 82 80 

ATGAATATCT GAAAGAAGAG CTATTTTCGT CATATCCATC TCCTCGTTTT TTCTCTTGCA 3 340 

ATAAGTATAA CATAAAAAGT CACAGCTAGA GAAATCTAGC TTTTTTTGAT AT AC T AG AT A 34 00 

AAGATATTAG ACAAGAGGAA ACGAATGACC CCAAACAAAG AAGACTATCT AAAATGTATT 84 60 

TATGAAATTG GCATAGACCT CCATAACATT ACCAACAAGG AAATTGCGGC TCGCATGCAA 8520 

GTCTCTCCCC CTGCCGTAAC TGAAATGATC AAACGAATGA AAAGTGAAAA TCTCATCCTA 3 5 80 

AAGGACAAGG AATGTGGCTA TCTACTGACT GACCTCGGTC TCAAACTGGT CTCTGAGCTC 864 0 
TATCGTAAGC ACCGCTTCAT TGAAGTTTTT CTAGTTCATC ATTTAGACTA TACAAGTGAC ' 8700 
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CAGATTCACG 


AGGAAGCTGA 


GGTCTTGGAA 


CACACTGTCT 


CTGACCTGTT 


CGTGGAAAGA 


876C 


CTAGATAAAC 


TG CT AGG TTT 


CCCTAAAACC 


TGCCCCCACG 


GGGGAACTAT 


TCCTGCCAAG 


8820 


GG AG AAC* AC 


TCGTTGAAAT 


CAATAACCTC 


CCACTAGCTG 


ATATCAAGGA 


AGCTGGCGCC 


8830 


TArCGCCTGA 

1 *w ^-J V- * vj r\ 


CTCGGGTGCA 


CGATAGTTTT 


GACATTCTCC 


ATTATCTGGA 


CAAGCACTCA 


8940 




GTGACCAGCT 


CCAAGTCAAG 


CAGTTTGATG 


GCTTCACCAA 


TACCTTCACT 


9000 




ACGACGAGGA 


TTT AC AAGTG 


AATATGGACA 


TTGCAAAACA 


ACTCTATGTC 


9060 


dAGAAAA^CA 


ACTAATTTCT 


CAAGTCCCCT 


ACCAACCCTG 


AAACTTTTAT 


TTTGGCTCTT 


9120 




AGTGGGTTGA 


AGTCAGCTAA 


GCTCGAGAAA 


GG AC AAATTT 


TGTCCTTTCT 


9180 




TGAGAGCGA' 7 * 


AAAAATCCGT 


TTTTTGAAGT 


TTTCAAAGTT 


CCGAAAACCA 


9240 




nCTTf.ATAAG 

VJV i I \Jf\ A ™\\J 


TTTGATGAGA 


TTATTGGTCG 


CTTCCAGTTT 


GGCATTAGAA 


9300 




G AAGGG CGTT 


G AC AAT CT TT 


TCTTTATCTT 


TGAGGAAGGT 


TTTAAAGACA 


9360 




TAGGATGAAC 


CTGCTTTAGA 


TTGTCCTCAA 


TGAGTCCGAA 


AAATTTCTCC 


9420 


GGTTTG' r *TAT 


TCTGAAAGTG 


AAAC AG C AAG 


AGTTGATAGA 


GCTGATAGTG 


GTGTTTCAAG 


9480 


Tr^TfiTflAAT 


AGf^CAAAAG 


CTT GTCT AAA 


ATCTCTTTAT 


TGGTT AAGTG 


CATACGAAAA 


9540 






ATCACTCAGT 


TTACGGCTAT 


CCTGTTGTAT 


GAGCTTCCAG 


9600 




TAGGCTTGTA 


TTCATGGGAT 


TTTCGATCCA 


ATTGGTTCAT 


AATTTGAACA 


9660 




TG AT AGG AGG 

1 I.A1 /\V*»— rtV—V^ 


GCTAAGATGT 


TGTACAATGT 


GAAAGCGATC 


CAACACGATT 


9720 


T^AfirAT"''*^ 
1 1 no^ n 1. - ^ -J 


GG AGTGAAAC 


AGTCTGGGAG 


ACTGTTTCAG 


CCTGAGCCTA 


GAAATTTGAA 


9730 


AGGGAAGGTG 


TTTAGCCAAG 


TCATAGTAAG 


GACTAAACAT 


ATCCATCGTA 


AT G ATTTT C A 


9840 


rTTr.ACAACfi 


AACGGCTCTA 


TCGTAGCGAA 


GAAAGTGATT 


TCGGATGACA 


GCTTGTGTTC 


9900 


TGGGTTCAAG 


AACAGTGATA 


ATATTAAGAT 


TATCAAAATC 


TTGCGCAATG 


AAACTCATCT 


9960 


TTCCCTTAGT 


G AAGGC AT AC 


TCATCCCAAG 


ACATAATCTT 


TGGAAGCCGA 


GAAAAATCAT 




GCTCAAAGTG 


AAAGTCATTG 


AGCTTGCGAA 


TGACAGTTGA 


AGTTGAAATC 


GCCAGCTGAT 


10080 


GGGCAATATC 


AGTCATAGAA 


ATTTTTTCAA 


TTAACTTTTG 


AGCAATyTTT 


TGGTTGATGA 


10140 


TACGAGGGAT 


TTGGTGATTT 


TTCTTTACCA 


GGGGAGTCTC 


AGCAACCATC 


ATTTTTGAAC 


10200 


AGTGATAGCA 


CTTGAAACGA 


CG CTTTCTAA 


GGAGAATTCT 


AGAAGGCATA 


CCAGTCGTTT 


10260 


CAAGATAAGG 


AATTTT AG AA 


GGTTTTTGAA 


AGTCATATTT 


CTTCAATTGG 


TTTCCGCACT 


10320 


CAGGGCAAGA 


TGGGGCGTCG 


TAGTCCAGTT 


TGGCGATGAT 


TTCCTTGTGT 


G T AT C CTT AT 


10380 


TGATGATGTC 


TAAAATCTGG 


ATATTAGGGT 


CTTTAATGTC 


7 AG T AATTTT 


GTGATAAAAT 


10440 
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GTAATTGTTC C AT AT G ATTC TTTCTAATGA GTTGTTTTGT CGCTTTTCAT TATAGGTCAT 10500 

ATGGGACTTT TTTTCTACAA TAAAATAGGC TCCATAATAT CTATAGTGGA TTTACCCACT 10 560 

ACAAATATTA TAGAACCGTA AAAATAGAAG GAGATAGCAG GTTTTCAAGC CTGCTATCTT 10 620 

TTTTTGATGA CATTCAGGCT GATACGAAAT CATAAGAGGT CTGAAACTAC TTTCAGAGTA 10680 

GTCTCTTCTA TAAAATATAG TAGATTGAAA TAAGATGTGA ACAACTCTAT CAGGAAAGTC 107 4 0 

AAATTAATTT ATAGAATTAT TTTAGCAGTC AAGGTGTACT GT TAT AG ATT CAATATATTA 10800 

TATGACTATT AACCTTGTCT TCTCCTAAAA TTGACTTTCT TGTTTTCTTA TCTTGTCCAC 10860 

TCGAAACAAG TATTGTAAGA ATTTGATTAT TTTTGAAAGT ACTTTTAATA TACTTGATAT 10920 

AGTTAAAAAA GATTTGAAAC TAAATTCCAA ATTAGAAAAA GACTTGAAAT ACTAAAAAAA 10980 

AAAAAGT AT A CTCTAATTGA AAACGGT AAC AAAACTAATT T AG AG AATG A AATATAGAGT 11040 

ATTTCTCTCT T AAAAGTTT T TGGTGAAACG AGATGTAGAA AGGAGATTTA GCCAAAGAGT 11100 

CTATTAGTGC TAGAATAATA GATTAGAATT ATTTTAGAAA AACGAAGTGA GCAGCTTATA 11160 

AATTCAAGTC CCCAAATAGA TTCATACTAG TATCTTTTGC AAAAAATAAA GGGCGACTTC 112 20 

CTTCATGAAT ATCAATTTCA TCTATAAGGA AGGTAGCTAA TTGAACTAAC TTATTTATTC 11280 

TGTTTGTCGC TAGAAAAATC AGACCTCCTT GTGAAGATTG AGGAGATACT TAATGAAAAT 113 40 

CAAAGAAGAA ACTAGCAAGC TAGTAGCAGA TTCCCCAAAA CACCGCTTTG AGGTTGTAGA 11400 

TAAGACTGAC CTATATAATC CAAGGTGAAG CGACTGTGGT TTGAAGAGAT TTTCAAAGAG 114 60 

TATAGGCTAG AGAGTAGTCT TTTTATGTCC TTCTAGTAGA AAATGCTAGA CAGAAGAATG 11520 

GGGAACTTGG ATAGGAAAAA TAGATTGAGA AAGGAGGTTA GAAGAGATGA TTATTACAAA 11580 

AATTAGCCCT TTAGGAACTT ATGTGGGAGT AAATCCACAT TTTGCAACAT TAATAGATTT 11640 

TCTAGAAAAA ACAGGACTAG AAAATTTAAC AGAAGGTTCG ATTGCTATCG ATGGTAATCG 11700 

ATTGTTTGCG AATTGC TTT A CTTATCTAGC AGATGGTCAA GCAGGGGCTT TCTTTGAAAC 117 60 

CC AC C AAAAA TATTTGGATA TTCATTTAGT TTTGGAAAAC GAAGAAGCCA TGGCTGTTAC 11820 

ATCGCCGGAA AATGTAAGCG TTACCCAAGA AT AT G A TG AA GAGAAAGATA TTGAATTATA 11880 

CACAGGGAAA GTGGAACAGT TGGTTCATTT GAGAGCTGGC GAATGCCTCA TCACTTTTCC 119 40 

AGAAGATTTA CATCAACCCA AGGTTCCTAT AAATGATGAA CCTGTGAAAA AAGTTGTCTT 12 000 
TAAAGTTGCG ATTTCTTAAT GTAGAAAGAG AAGAACGATG AAAAAAATGA CAAAGTTTTT . 12060 

ATGTCTAGCT GGAATTGCGC TAGCGGCTGT TGCCTTGGTA CCTTGTTCAG GAAAAAAAGA 12120 

AGCTACAACT AGTACTGAAC CACCAACAGA ATTATCTCGT GAG ATT AC AA TGTCGCACTC 12180 

CTTTACTCAA GGACCCCGTT TAGAAAGTAT- TCAAAAATCA GCAGATGCTT TCATGCAAAA 122 40 
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GCATCCAAAA ACGAAAATCA AGATTGAAAC ATTTTCTTCG AATCACTTCT ATACTAAATG 12 300 

GACTACAGGT TTAGCAAATG GAAATGTGCC AGATATCAGT ACAGCTCTTC CTAACCAAGT 12 360 

AATGGAAATG GTCAACTCAG ATG CTTTGGT TCCGCTAAAT GATTCTATCA AGCGTATTGG 12420 

AC AAG AT AAA TTTAACGAAA CTG CCTT AAA TGAAGCAAAA ATCGGAGATG ATT AC TACT C 124 80 

TGTTCCTCTT TATTCACATG CACAAGTCAT GTGGGTTAGA ACAGATTTCT TAAAAGAACA 12 540 

TAATATTGAG GTTCCTAAAA CTTGGGATCA ACT CT ATG AA GCTTCTAAAA AATTGAAAGA 12 600 

AGCTGGAGTT TATGGCTTGT CTGTTCCGTT TGGAACAAAT GACTTAATGG CAACACGTTT 12 560 

CTTGAACTTC TACGTACGTA rTGGTGGAGG AAGCCTCTTA ACAAAAGATC TTAAAGCACA 12720 

CTTGACAAGC CAACTTGCTC AAGATGGTAT TAAATACTGG GTTAAATTGT ATAAAGAAAT 12 780 

CTCACCTCAA GATTCTTTGA ACTTTAATGT CCTTCAACAA GCTACCTTGT TCTATCAAGG 12 840 

AAAAACAGCA TTTGACTTTA ACTCTGGCTT CCATATCGGA GGAATTAATG CCAACAGTCC 12 900 

TCAATTGATT GATTCGATTG ATGCTTATCC TATTCCAAAA ATCAAAGAGT CTGATAAAGA 129 60 

CCAAGGAATT GAAACCTCAA ACATTCCAAT GGTTGTTTGG AAAAATTCAA AACATCCAGA 13C2 0 

AGTTG CT AAA GCATTCTTAG AAGCACTTTA TAATGAAGAA GACTACGTTA AATTCCTTCA 13 080 

TTCAACTCCA GTACGTATGT TGCCAACTAT TAAGGGGATT AGCGATTCTG CAGCCTATAA 13 140 

AGAAAATGAA ACTCGTAAGA AATTTAAACA TGCTGAAGAA GTAATTACTG AAGCTGTTAA 13 2 00 

AAAAG G T ACT GCTATTGGTT ATGAAAATGG GCCAAGTGTA CAAGCTGGTA TGTTGACTAA 132 60 

CCAACACATT ATTCAACAAA TGTTCCAAGA TATCATTACA AATGGAACAG ATCCTATGAA 13 320 

AGCAGCAAAA GAAGC AGAAA AAC AATT AAA TGATTTATTT GAGGCTGTTC AGTAGATGTA 13 380 

AAAGACTAGA AAATAGGTGG GATAGTGAGC TGAAAAGCTC TAGCCCAATC TTGTAAAAGA 13 440 

AGGGAGAAGG AGAATCGTTA AAGAACGTAA TTTAACTCGC TGGATATTTG TTTTGCCACC 13 500 

TATGATTATC GTAGGATTAC TCTTTGTTTA TCCGTTTTTC TCGACTATTT TTTATAGCTT 13 560 

TACCAATAAG CATTTGATTA TGCCTAATTA TAAATTTGTT GGTTTGGCTA ACTATAAAGC 13 620 

TGTGCTATCA GATCCCAACT TCTTTAATGC GTTCTTTAAT T C AATT AAG T GGACCGTTTT 13 680 

CTCATTAGTT GGTCAAGTTT TAGTAGGGTT TGTATTGGCT TTAGCTCTTC AC AG AGT AC G 13740 

CCACTTCAAG AAATTATATA GG AC ATT ATT GATTGTTCCT TGGGCATTTC CTACCATCGT 13 800 

TATTGCCTTC TCTTGGCAGT GGATTCTAAA CGGGGT TT AT GGCTACTTAC CTAATCTAAT 13 860 

CGTAAAATTA GGTTTAATGG AACATACACC TGCATTTTTG ACAGATAGTA CATGGGCATT 13 920 

CCTATGTTTG GTGTTTATCA ACATTTGCTT TGGAGCACCA ATGATTATGG TTAATGTGCT 13 980 
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TTCACCTTTG 


CAAACAGTAC 


CAGAAGAACA 


ATTTGAGGCT 


GCTAAGATAG 


ATGGTGCTTC 


14040 


AAGTTGGCAG 


GTGTTCAAGT 


TTATCGTCTT 


TCCACATATT 


AAAGTGGTTG 


TAGGACTTCT 


14 100 


AGTTGTTTTG 


AGAACTGTAT 


GGATCTTTAA 


TAACTTTGAC 


ATTATCTACC 


TCATTACTGG 


14 160 


TGGTGGACCA 


GCCAATGCTA 


CAACGACGCT 


TCCAATTTTT 


GCTTACAACC 


TGGGCTGGGG 


14220 


AACTAAATTG 


TTGGGTCGTG 


CTTCAGCAGT 


TACAGTACTG 


CTCTTTATCT 


TCTTGGTGGC 


14280 


GATTTGCTTT 


ATCTACTTTG 


CTATCATCAG 


TAAGTGGGAA 


AAGGAGGGTA 


G AAAAT AATG 


14340 


AAGAAGAAAT 


CCAGTATTTA 


TTTAGATATT 


CTCTCACATG 


T ACTTTT ACT 


TGGTGCGACC 


14400 


ATCGTTGCAG 


TTTTCCCATT 


GG TAT GG ATT 


ATCATATCTT 


CTGTCAAAGG 


GAAAGGGGAA 


14460 


TTAACTCAGT 


ATCCAACACG 


ATTTTGGCCT 


GAACAGTTTA 


CATTAGATTA 


TTTCACTCAT 


14520 


GTTATCAACG 


AT TTGCACTT 


CATTGATAAC 


ATTCGAAACA 


GTTTAATCAT 


TGCCTTGGCT 


14580 


ACAACCCTTA 


TTGCGATTAT 


TATTTCTGCT 


ATGGCAGCCT 


ATGGTATTGT 


TCGATTCTTT 


14640 


CCTAAATTGG 


GAGCAATCAT 


GTCGAGACTA 


CTCGTCATTA 


CCTACATTTT 


CCCACCAATT 


14700 


TTGTTAGCAA 


TTCCCTATTC 


AATTGCCATT 


GCTAAAGTTG 


GGTTAACAAA 


TAGTTTATTT 


14760 


GCCTTGATCA 


TGGTTTATCT 


ATCTTTTAGT 


GTTCCATATG 


CAGTTTGGCT 


CTT AGTTGG A 


14820 


TTTTTCCAAA 


CACTTCCAAT 


TGGAATTGAA 


GAACCGGCTA 


GAATTGATGG 


TG C AAAT AAA 


14880 


TTTGTTACGT 


TTTATAAAGT 


TGTGCTACCG 


ATTGTAGCAC 


CAGGTATTGT 


AGCAACAGCT 


14940 


ATTTATACAT 


TTATCAATGC 


TTGGAATGAA 


TTCCTGTATG 


CCTTGATTTT 


GATTAACAAT 


15000 


ACAGGAAAGA 


TGACAGTAGC 


AGTAGCCCTT 


CGTTCACTTA 


ATGGTTCAGA 


AAT ACT AG AC 


15060 


TGGGGAGATA 


TGATGGCAGC 


GTCTGTTATT 


GTAGTTCTTC 


CATCAATTAT 


TTTCTTCTCT 


15120 


AT C ATCCAAA 


ATAAGATTGC 


AAGTGGATTA 


TCAGAAGCAT 


CTGTGAAGTA 


GACGAAAGAA 


15190 


GGAAAAAAAT 


GAATAAAAGA 


GCTCTTTATT 


CAAAACTAGG 


AATTTCCGTT 


GTAGGCATTA 


15240 


CTCTTTTAAT 


GGGAGTCCCC 


ACTTTGATTC 


ATGCGAATGA 


ATTAAACTAT 


GGTCAACTGT 


15300 


CCATATCTCC 


TATTTTTCAA 


GGAGGTTCAT 


ATCAACTGAA 


CAATAAGAGT 


ATAGATATCA 


15360 


GCTCTTTGTT 


ATTAGATAAA 


TTGTCTGGAG 


AGAGTCAGAC 


AGTAGTAATG 


AAATTTAAAG 


15420 


C AG AT AAAC C 


AAACTCTCTT 


CAAGCTTTGT 


TTGGCCTATC 


TAATAGTAAA 


GCAGGCTTTA 


15480 


AAAATAATTA 


CTTTTCAATT 


TTCATGAGAG 


ATTCTGGTGA 


GATAGGTGTA 


GAAATAAGAG 


15540 


ACGCCCAAAA 


GGGAATAAAT 


TATTTATTTT 


CCAGACCAGC 


TTCATTATGG 


GGAAAACATA 


15600 


AAGGACAGGC 


AGTTG AAAAT 


ACACTAGTAT 


TTGTATCTGA 


TTCTAAAGAT 


AAAAC AT AC A 


15660 


CAATGTATGT 


TAATGGAATA 


GAAGTGTTCT 


CTGAAACAGT 


TGATACATTT 


TTGCCAATTT 


15720 


CAAATATAAA 


TG GT AT AG AT 


AAGGCAACAC 


TAGGAGCTGT 


TAATCGTGAA 


GGTAAGGAAC 


15780 
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ATTACCTCGC AAAAGGAAGT ATTGATGAAA TCAGTCTATT TAACAAAGCA ATTAGTGATC 15840 

AGGAAGTTTC AACTATTCCC TTGTCAAATC CATTTCAGTT AATTTTCCAA TCAGGAGATT 15900 

CTACTCAAGC TAACTATTTT AGAATACCGA CACTATATAC ATTAAGTAGT GGAAGAGTTC 15 9 60 

TATCAAGTAT TGATGCACGT TATGGTGGGA CTCATGATTC TAAAAGTAAG ATTAATATTG 16020 

CCACTTCTTA TAGTGATGAT AATGGGAAAA CGTGGAGTGA GCCAATTTTT GCTATGAAGT 16080 

TTAATGACTA TGAGGAGCAG TTAGTTTACT GGCCACGAGA TAATAAATTA AAGAATAGTC 16140 

AAATTAGTGG AAGTGCTTCA TTCATAGATT CATCCATTGT TGAAGATAAA AAATCTGGGA 16200 

AAACGATATT ACTAGCTGAT GTTATGCCTG CGGGTATTGG AAATAATAAT GCAAATAAAG 16 260 

CCGACTCAGG TTTTAAAGAA ATAAATGGTC ATT ATT ATT T AAAACTAAAG AAGAATGGAG 16 320 

ATAACGATTT CCGTTATACA GTTAG AG AAA ATGGTGTCGT TTATAATGAA ACAACTAATA 16 380 

AACCTACAAA TT AT ACT ATA AATGATAAGT ATGAAGTTTT GGAGGGAGGA AACTCTTTAA 16 440 

CAGTCGAACA ATATTCGGTT GATTTTGATA GTGGCTCTTT AAGAGAAAGG CATAATGGAA 16500 

AACAGGTTCC TATGAATGTT TTCTACAAAG ATTCGTTATT TAAAGTGACT CCTACTAATT 16560 

ATATAGCAAT GACAACTAGT CAGAATAGAG GAGAGAGTTG GGAACAATTT AAGTTGTTCC 156 20 

CTCCGTTCTT AGGACAAAAA CATAATGGAA CTTACTTATG TCCCGGACAA GGTTTAGCAT 16680 

TAAAATCAAG TAACAGATTG ATTTTTGCAA CAT AT ACT AG TGGAGAACTA ACCTATCTCA 16740 

TTTCTGATGA TAGTGGTCAA ACATGGAAGA AATCCTCACC TTCAATTCCG TTTAAAAATG 16800 

CAACAGCAGA AGCACAAATG CTTCAACTGA GAGATGGTGT GATTAGAACA TTCTTTAGAA 16 860 

CCACTACAGG TAAGATAGCT TATATGACTA GTAGAGATTC TGGAGAAACA TGGTCGAAAG 16 920 

TTTCGTATAT TGATGGAATC CAACAAACTT CATATGGCAC ACAAGTATCT GCAATTAAAT 16980 

ACTCTCAATT AATTGATGGA AAAGAAGCAG TCATTTTGAG TACACCAAAT TCTAGAAGTG 17040 

GCCGCAAGGG AGGCCAATTA GTTGTCGGTT TAGTCAATAA AGAAGATGAT ACTATTCATT 17100 

GGAAATACCA CTATGATATT GATTTGCCTT CGTATGCTTA TGCCTATTCT GCGATTACAG 17160 

AATTGCCAAA TCATCACATA GGTGTACTGT TTGAAAAATA TGATTCGTGG T CG AG AAATG 172 20 

AATTGCATTT AAGCAATGTA GTTCAGTATA TAGATTTGGA AATTAATCAT TTAACAAAAT 17 2 30 

AAAGGAGAAA AACATGGTTA AATACGGTGT TGTTGGAACA GGGTATTTTG G'AGCTGAATT 17 340 

GGCTCGCTAC ATGCAAAAGA ATGATGGAGC AGAGATTACT CTTCTCTATG ATCCAGATAA 17400 

TGCAGAGGCG ATTGCAGAAG AATTGGGAGC AAAAGTAGCA AGTTCCTTAG ATGAGTTGGT 174 60 

TTCTAGCGAT GAAGTAGATT GTGTTATCGT CGCAACTCCA AATAATCTTC ATAAGGAACC 17 52 0 
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GGTTATTAAC 


GCTGCACAGC 


ATGGTAAAAA 


TGTTTTCTGT 


GAAAAACCAA 


TTGCGCTTTC 


17580 


TTATCAAGAT 


TGTCGCGAGA 


TGGTAGATGC 


GTGTAAAGAA 


AACAATGTAA 


CCTTTATGGC 


17640 


AGGACATATT 


ATGAATTTCT 


TTAATGGTGT 


TCATCATGCA 


AAAGAACTCA 


TTAATCAAGG 


17700 


AGTTATCGGA 


GACGTTCTAT 


ATTGTCATAC 


AGCTCGTAAT 


GGTTGGGAAG 


AACAACAACC 


17760 


GTCAGTATCA 


TGGAAAAAAA 


TTCGTGAAAA 


ATCAGGTGGT 


CACTTGTATC 


ACCACATCCA 


17820 


TGAATTGGAT 


TGCGTTCAAT 


TCCTTATGGG 


GGGCATGCCT 


GAAACTGTAA 


CCATGACAGG 


17880 


TGGAAATGTG 


GCCCATGAAG 


GTGAACATTT 


CGGTGATGAA 


GATGATATGA 


TTTTTGTCAA 


17940 


TATGGAATTT 


TCTAATAAGC 


GTTTTGCCTT 


GTTAGAATGG 


GGTTCAGCTT 


ATC GTTGGGG 


18000 


TGAACATTAT 


GTCTTAATCC 


AAGGAAGCAA 


AGGTGCCATC 


CGCTTAGACT 


TATTCAACTG 


13060 


TAAAGGAACT 


CTTAAGCTAG 


ATGCCCAAGA 


AAGCTATTTC 


TTGATTCACG 


AATCGCAAGA 


18120 


AGAAGATGAT 


GATCGGACTC 


GTATCTATCA 


TAGTACAGAG 


ATGGATGGAG 


CAATTCCTTA 


18180 


TGGTAAACCA 


GGTAAACGTA 


CTCCATTATG 


GCTATCATCT 


GTCATTGATA 


AAGAAATGCG 


18240 


CTATCTGCAT 


GAGATTATGG 


AAGGAGCTCC 


AGTATCAGAA 


GAATTTGCAA 


AACTTTTGAC 


18300 


AGGTGAAGCT 


GCCCTAGAAG 


CAATTGCTAC 


TGCAGATGCT 


TGTACCCAGT 


CTATGTTTGA 


18360 


AGAT CGC AAA 


GTAAAATTGT 


CAGAAATTGT 


AAAATAAATT 


TTGGTATTCT 


CCTATTTATA 


18420 


GGTCGACTTG 


CTCCTCTGAA 


AGTACTTTTA 


GAGGAGCTGT 


TTGACTTTGC 


TAGTTTTTGA 


18480 


AACTGAAATC 


T ATT AT ACT A 


CAAACTATTG 


AAAGCGTTTT 


AATTTTAAGC 


TATAATAATC 


18540 


TCATAGAAAT 


AAAGAAAAGG 


AGGAAAGAGG 


ATGCCACACA 


TTAGCAAAGA 


AGCCTTGATT 


19600 


GAGCAAATCA 


AAGATGGAAT 


CATCGTTTCT 


TGTCAGGCTC 


TTCCTCATGA 


ACCGCTTTAT 


18660 


ACAGAAGCGG 


GAGGGGTGAT 


TCCCTTGCTG 


GTCAAAGCGG 


CTGAGCAAGG 


TGGAGCAGTC 


13720 


GGTATCCGAG 


CAAACAGTGT 


TCGCGATATC 


AAGGAAATTA 


AGGAAGTCAC 


T AAACTT CCA 


18780 


ATCATTGGGA 


TTATCAAACG 


TG ATT AT C C A 


CCTCAGGAAC 


CCTTCATCAC 


GGCTACTATG 


18840 


AAAGAAGTTG 


ATGAATTGGC 


AGAACTGGAC 


ATCGAGGTGA 


TTGCTCTGGA 


TTGTACCAAG 


18900 


CGTGAACGCT 


ACGATGGTTT 


GGAAATTCAA 


GAGTTCATTC 


GTCAGGTTAA 


CGAGAAATAT 


18960 


CCTAATCACC 


TTTTGATGGC 


TGATACTAGT 


ATCTTCGAAG 


AAGGGCTAGC 


AGCTGTAGAA 


19020 


GCAGGAATTG 


ACTTTGTCGG 


AACAACCTTA 


TCAGGCTACA 


C ATCCT AC AG 


TCCAAAAGTA 


19080 


GACGGTCCAG 


ATTTTGAATT 


GATTAAGAAA 


CTCTGTGATG 


CTGGTGTAGA 


TGTCATTGCA 


19140 


GAAGGAAAAA 


TTCATACACC 


AGAACAAGCC 


AAACAAATCC 


TTGAATATGG 


AGTGCGAGGC 


19200 


ATCGTTGTTG 


GTGGCGCCAT 


TACTAGACCA 


AAAG AG ATT A 


CAGAACGCTT 


CGTTGCTAGT 


19260 


CTTAAATAAG 


ATGTGAGGGG 


GAGTTTTATG 


TTTAAAGTTT 


TACAAAAAGT 


TGGAAAAGCT 


19320 



WO 98/18931 



PCT/US97/19588 



169 

TTTATGTTAC CTATAGCTAT ACTTCCTGCA GCAGGTCTAC TTTTGGGGAT TGGTGGTGCA 19380 

CTTTCAAACC CAACCACGAT AGCAACTTAT CCAATACTAG ACAATAGTAT TTTTCAATCA 19440 

ATATTCCAAG TAATGAGCTC TGCAGGAGAG GTTGTATTCA GTAATTTGTC ACTACTTCTC 19 500 

TGTGTGGGAT TATGTATTGG CTTAGCGAAA CGAGATAAAG GAACCGCTGC GTTAGCAGGA 19 560 

GTAACTGGTT ACTTAGTTAT GACTGCAACG ATCAAAGCTT TGGTAAAACT TTTTATGGCA 19620 

GAAGGATCTG CAATTGATAC TGGAGTTATT GGAGCATTAG TTGTCGGAAT AGTTGCCGTA 19 630 

TATTTGCACA ACCGATATAA CAATATTCAA TTACCTTCCG CTTTAGGATT CTTTGGAGGT 19740 

TCACGCTTCG TTCCTATTGT TACATCGTTC TCTTCTATCT TGATTGCCTT TGTCTTCTTT 19800 

GTTATTTGGC CACCTTTCCA ACAACTTCTT GTTTCTACAG GTGGATATAT TTCTCAGGCG 198 60 

GGTCCAATTG GAACTTTTCT ATATGGATTT TTAATGAGAC TTTCTGGAGC AGTAGGCTTA 19920 

CATCATATAA TTTACCCTAT GTTTTGGTAT ACTG AACTTG GTGGTGTTCA AACTGTTGCA 19980 

GGACAAACAG TGGTTGGAGC TCAAAAAATA TTTTTTGCTC AATTAGCCGA TTTCGCCCAT 2004 0 

TCTGGATTAT TTACAGAAGG AACAAGGTTT TTTGCAGGTC GTTTCTCAAC AATGATGTTC 20100 

GGTTTACCGG CTGCCTGTTT AGCGATGTAC CATAGTGTTC CTAAAAATCG TCGTAAAAAA 20160 

TACGCGGGTT TGTTTTTTGG AGTTGCTTTA ACATCTTTTA TTACCGGTAT TACAGAACCA 2 02 20 

ATTGAATTTA TGTTTCTATT CGTCAGTCCG GTTCTATATG TTGTTCACGC ATTCCTTGAT 2Q280 

GGTGTTAGCT TCTTTATTGC AGACGTCTTA AATATTTCAA TAGGAAACAC ATTTTCAGGA 20340 

GGTGTAATCG ATTTCACTTT ATTTGGAATT TTGCAGGGGA ACGCTAAGAC GAATTGGGTT 204 00 

CTTCAGATTC CATTTGGACT TATTTGGAGT GTTTTCTATT ATATTATTTT TAG ATGGTTC 2 04 60 

ATT ACT C AAT TCAACGTTCT AACGCCAGGG CGAGGAGAAG AAGTAGATTC TAAAGAAATT 20520 

TCTGAATCCG CAGATTCAAC TT C AAAT ACT GCAGATTATT TAAAACAGGA TAGCCTACAA 20580 

ATT AT C AG AG CCTTGGGTGG ATC AAAT AAT ATAGAAGATG TAGATGCTTG TGTGACACGT 2 0640 

TTACGTGTAG CTGTAAAAGA AGTTAATCAA GTTGATAAAG CACTTTTAAA ACAAATTGGT 2 0700 

GCAGTTGATG TCTTAGAAGT GAAGGGTGGC ATTCAAGCAA TCTATGGAGC AAAAGCAATC 2 0760 

TTATATAAAA ATAGTATTAA TGAAATTTTA GGTGTAGATG ATTAAGTACT TACTGACTTA 2 0820 

ATAAAAAACA GAGGAGAGTG ATGGATGAGT AGGATGAAAT GAAATCGCAT ACAAGAAATA 2 0 880 

AAGAACTCAT TATCCAAGTT GGATACGCTT ATT AC AT AG G AG AAT AC AAA TGAAATTTAG 2 094 0 

AAAATTAGCT TGTACAGTAC TTGCGGGTGC TGCGGTTCTT GGTCTTGCTG CTTGTGGCAA 2 1000 

TTCTGGCGGA AGTAAAGATG CTGCCAAATC AGGTGGTGAC GGTGCCAAAA CAGAAATCAC 21060 
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TTGGTGGGCA TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG GAACTTATGA 21120 

AAAATCAATC A TCG AAGCGT TTGAAAAAGC AAACCCAGAT ATAAAAGTGA AATTGGAAAC 21180 

CATCGACTTC AAGTCAGGTC CTGAAAAAAT CACAACAGCC ATCGAAGCAG GAACAGCTCC 212 4 0 

AGACGTACTC TTTGATGCAC CAGGACGTAT CATC C AAT AC GGTAAAAACG GTAAATTGGC 213 00 

TGAGTTGAAT GACCTCTTCA CAGATGAATT TGTTAAAGAT GTCAACAATG AAAACATCGT 213 60 

ACAAGCAAGT AAAGCTGGAG ACAAGGCTTA TATGTATCCC ATTAGTTCTG CCCCATTCTA 214 20 

CATGGCAATG AACAAGAAAA TGTTACAAGA TGCTGGAGTA GCAAACCTTG TAAAAGAAGG 21480 

TTGGACAACT GATGATTTTG AAAAAGTATT GAAAGCACTT AAAGACAAGG GTTACACACC 21540 

ACGTTCATTG TTCAGTTCTG GTCAAGGGGG AGACCAAGGA ACACGTGCCT TTATCTCTAA. 21600 

CCTTTATAGC GGTTCTGTAA CAGATGAAAA AGTTAGCAAA TATACAACTG ATGATCCTAA 21660 

ATTCGTCAAA GGTCTTGAAA AAGCAACTAG CT GG ATT AAA GACAATTTGA TCAATAATGG 217 20 

TTCACAATTT GACGGTGGGG CAGATATCCA AAACTTTGCC AACGGTCAAA CATCTTACAC 21780 

AATCCTTTGG GCACCAGCTC AAAATGGTAT CCAAGCTAAA CTTTT AG AAG CAAGTAAGGT 21840 

AGAAGTGGTA GAAGTACCAT TCCCATCAGA CGAAGGTAAG CCAGCTCTTG AGTACCTTGT 21900 

AAA CG GGTTT GCAGTATTCA ACAATAAAGA CG AC AAG AAA GTCGCTGCAT CTAAGAAATT 219 60 

CATCCACTTT ATCGCAGATG ACAAGGAGTG CGGACCTAAA GACGTACTTC GTACACGTGC 22020 

TTTCCCAGTC CCTACTTCAT TTGGAAAACT TTATGAAGAC AAACGCATGG AAACAATCAG 22080 

CGGCTGGACT CAATACTACT CACCATACTA CAACACTATT GATGGATTTG CTGAAATGAG 22140 

AACACTTTGG TTCCCAATGT TGCAATCTGT ATCAAATGGT GACGAAAAAC CAGCAGATGC 22200 

T TTG AAAGCC TTCACTGAAA AAGCGAACGA AACAATCAAA AAAGCTATGA AACAATAGTC 2 22 60 

CTTAGTTATT CTATAAAAAG TAGTTTTTTA AAGAACCTAA GAGTGTATAC CCCCTTTTCC 22 32 0 

CT CT AC AC AG ATAGTGTAAG AAAAGGGGGC TTTTGTTTAA AATCTAAGAA ACTGTCACGA 22 3 80 

AATTAAAATG AAGTTCTTAC ATAAGCGAAT CATAAAAAAT TTCATTTTGA TTTTAAAACA 22440 

GTTCAAGAAA GTCAAAAAAT TATTCTATTT GAAAGAGAGG TGCCGACTCT GAAAGTCAAT 22 500 

AAAATCCGTA TGCGGGAAAC AGTGATTTCC TACGCTTTCC TAGCACCAGT ATTATTCTTC 22 560 

TTTGTCATCT TTGTGTTGGC TCCGATGGTG ATGGGCTTCA TTACAAGTTT CTTTAACTAC 22 620 

TCAATGACTA AATTTGAGT.T TGTAGGCTTG GATAACTATA TCCCTATGTT TAAAGATCCT 22 680 

GTCTTTACAA AATCTCTGAT TAACACAGTT ATTTTGGTTA TTGGATCTGT ACCACTTGTT 22740 

GTTCTATTCT CACTCTTTGT AGCATCTCAG ACCTATCATC AAAATGTCAT TGCCAGATCC 22800 

TTCTACCGTT TCGTCTTCTT CCTTCCTGTT GTAACGGGTA GTGTTGCCGT GACAGTTGTT 2 2860 
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TGGAAATGGA TTTATGACCC ACTATCAGGG ATTCTAAACT TTGTCCTTAA GTCCAGCCAC 22920 

ATCATCAGCC AAAACATTTC TTGGTTGGGA GATAAAAACT GGGCATTGAT GGCGATTATG 22980 

ATTATTCTCT TGACCACTTC AGTTGGTCAG CCCATCATCC TTTATATCGC TGCCATGGGG 2 3 040 

AATATTGACA ATTCACTGGT TGAAGCGGCG CGTGTTGATG GTGCAACTGA GTTTCAAGTT 23100 

TTTTGGAAGA TTAAATGGCC AAGCCTTCTT CCAACAACTC TTTATATTGC AATCATCACA 23160 

ACAATTAACT CATTCCAGTG TTTCGCCTTG ATTCACCTTT TGACATCTGG TGGTCCAAAC 23220 

TACTCAACAA GTACCTTGAT GTACTACCTT TACGAAAAAG CCTTCCAATT GACAGAATAC 23280 

GGCTATGCCA ACACAATTGG TGTCTTCTTG GCAGTCATGA TTGCTATCGT AAGCTTTGTT 23340 

CAATTTAAAG TACTTGGAAA CGACGTAGAA TACTAAAGAA AGGAGACAGC TATGCAATCT 23400 

ACAGAAAAAA AACCATTAAC AGCCTTTACT GTTATTTCAA CAATCATTTT GCTCTTGTTG 2 3460 

ACTGTGCTGT TCATCTTTCC ATTCTACTGG ATTTTGACAG GGGCATTCAA ATCACAACCT 23S20 

GATACAATTG TTATTCCTCC TCAGTGGTTC CCTAAAATGC CAACCATGGA AAACTTCCAA 23580 

CAACTCATGG TGCAGAACCC TGCCTTGCAA TGGATGTGGA ACTCAGTATT TATCTCATTG 2 3640 

GTAACCATGT TCTTAGTTTG TGCAACCTCA TCTCTAGCAG GTTATGTATT GGCTAAAAAA 2 3700 

CGTTTCTATG GTCAACGCAT TCTATTTGCT ATCTTTATCG CTGCTATGGC GCTTCCAAAA 237 60 

CAAGTTGTCC TTGTACCATT GGTACGTATC GTCAACTTCA TGGGAATCCA TGATACTCTC 23820 

TGGGCAGTTA TCTTGCCTTT GATTGGATGG CCATTCGGTG TCTTCCTCAT GAAACAGTTC 2 3 880 

AGTGAAAATA TCCCTACAGA GTTGCTTGAA TCAGCTAAAA TCGACCGTTG TGCTGAGATT 2 3940 

CGTACCTTCT GGAGTGTAGC CTTCCCGATT GTGAAACCAG GGTTTGCAGC CCTTGCAATC 24 000 

TTTACCTTCA TCAATACTTG GAATGACTAC TTCATGCAAT TGGTAATGTT CACTTCACGT 24060 
AACAATTTGA CCATCTCACT TGGGGTTGCG ACCATGCAGG CTGAAATGGC AACCAACTAT 24120 
GGTTTGATTA TGGCAGGAGC TGCCCTTGCT GCTGTTCCAA TCGTCACAGT CTTCCTAGTC 24180 
TTCCAAAAAT CCTTCACACA GGGTATTACT ATGGGAGCGG TCAAAGGATA ATACTCTGCG 24240 
AAAATCTCTT CAAACTACGT CAGCTTCACC TTGCCATACT TAAGTATTGC CTGCGGTTAG 24300 
CTTCCTAGTT TGTTCTTCAA TTTTCATTGA GTATAGGAAA ATCAATCTAT CAAGATACAG 24 360 
AAGTATATTT TATAGATTTA GAGAATATAG AGGTTATAAG TGTCTACAAA ATGGAGGGTA 24 4 20 
TGCAGTTACT TTATGAAGTT TTGTCAGACA CTTATAAACT TAAGAATGGT TTTAGTTAAC 24480 
TATCAGAAAC GAAGGAAAGA GTATGATTTT TGACGATTTG AAAAACATCA CCTTTTACAA 24 540 
AGGGATTCAT CCTAATTTAG ACAAGGCTAT CGACTATCTC TACCAACATC GTAAGGATTC 24 600 
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TTTCGAATTA 


GGAAACTATG 


ATATTGATGG 


AGATAAAGTC 


TTTCTAGTTG 


TTCAGGAAAA 


24660 


TGTCCTCAAT 


CAAGCTGAAA 


ATGATCAATT 


TGAGTATCAT 


AAGAACTATG 


CAGATTTGCA 


24720 


TTTGCTGGTA 


GAAGGACATG 


AATATTCGAG 


CTACGGTTCA 


CGTATCAAAG 


ACGAGGCAGT 


24780 


AGCATTCGAC 


GAAGCGAGTG 


ACATTGGCTT 


TGTTCATTGT 


CATGAACACT 


ACCCACTCTT 


24840 


GTTGGGTTAT 


CACAATTTTG 


CGATTTTCTT 


CCCAGGTGAG 


CCACATCAGC 


CAAATGGTTA 


249C0 


TGCAGGCATG 


GAAGAAAAGG 


TTCGAAAATA 


TCTCTTTAAA 


ATTTTGATTG 


ATTAAAAATA 


24960 


GGATGAATTG 


TTTTTTTGTA 


AAGCTTTGAT 


AATACTCTAC 


CATGAAATTG 


ATCTTTGTGA 


25020 


GGT AG AG AAA 


TGAGAATAAA 


ATATTTAAAA 


ATTGGTATCT 


TCTAAGTATG 


CTGCAAGAGC 


25080 


TAGTTTCTTA 


GATGGACAGG 


GGATTACAGT 


TGATGAGATG 


GCTTGGATAA 


TTAGGGGCAT 


25140 


TGTGAATGCA 


TTGATTGGTA 


GATACATAAA 


ATTAGGTACT 


TATCCGGCTA 


AGTATGGTAT 


25200 


TAGTATGGCA 


CGCTCGATCT 


TAAGTAGGGT 


AGCTGCAACT 


GC AG C AG C AA 


GAGTAGGATT 


25260 


ACTGACCAAG 


ATTTCTGGAT 


GGATTTTACG 


AG T AGCTGTG 


AATGTAGCTC 


ATGTATATGG 


25320 


TAATTTTGCC 


AACAATATTG 


CTGCAGCTTG 


GGATGCATAT 


G AT AAAATTC 


CTAACAATGG 


25380 


TCCTATAAAC 


TTTTAAAATG 


CGAGAATGAA 


AGCACTTTGT 


ATTTTTTTAT 


TGAATATGTT 


25440 


AGCTTGGACA 


GTGCTTGCAA 


TGATAATTCG 


TGGAGGGCTA 


GATGGATTTG 


ATAGGCATAC 


25500 


TTGGAGTACT 


ATTTTAATTG 


CGTCGCTGTT 


CGGGGTATAT 


GATTATAAGC 


CCATAGATAA 


25560 


AAATAGAAAA 


AAGTCCAAAA 


GAAAAAATAG 


ATTTGTTCAT 


GGTAGGGACT 


TATGAAAGCT 


25620 


TTACTGACAA 


AAAAGAAAAC 


AGTTTACAAA 


GAAAAATGAT 


GGAGGAGCAA 


ACATGGCACA 


25680 


AAAAGGAGTA 


AGCCTTATCA 


AG G C AGC ATT 


TGATACAGAT 


AACTTTCTCA 


TGCGTTTTAG 


25740 


TGAGAAGGTC 


TTGGACATCG 


TGACAGCCAA 


TCTTCTTTTT 


GTCGTCTCTT 


GTTTACCCAT 


25300 


CGTGACGATT 


GGAGTGGCTA 


AAATCAGCCT 


CTACGAGACC 


ATGTTCGAAG 


TT AAG AAG AG 


25860 


CAGACGGGTG 


CCTGTTTTTA 


AAATCTATCT 


AAG ATCTTT C 


AAGCAAAATC 


TGAAACTAGG 


25920 


TCTTCAGCTG 


GGTTTAATGG 


AGTTAGGAAT 


TGTGTTTCTT 


ACCCTTTCAG 


ATCTCTATCT 


2598.0 


TTTCTGGGGT 


CAAACACCTC 


TGCCCTTCCA 


ATTGCTGAAA 


GCCATTTGTT 


TAGGTATTCT 


26040 


GATTTTTCTT 


ACTATCGTGA 


TGCTGGCTAG 


TTACCCTATC 


GCGGCACGTT 


ATGACCTATC 


26100 


TTGGAAAGAA 


ATTCTTCAAA 


AAGGATTGAT 


GTTGGCTAGT 


TTTAACTTTC 


CTTGGTTCTT 


26160 


CCTCATGTTA 


GCCATTCTTG 


TCCTCATTGT 


GATGGTTCTT 


TATCTGTCCG 


CCTTCAGTCT 


26220 


ACTCTTAGGT 


GGCTCAGTCT 


TCCTACTTTT 


TGGGTTTGGA 


CTATTGGTCT 


T T AT CC AG AC 


26230 


TGGATTGATG 


GAGAAAATTT 


TCGCAAAATA 


CCAATAGGAG 


CTTTATTTCT 


GAAACTACTT 


26340 


TC AAAGGCT C 


CAAACGCTAT 


TCTATAAGCG 


AG AAAC T AAA 


ATCGG 




' 2 6385 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2 716 base pairs 
(3) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCTGCCCGCA TTGCCCTAGG CATTAAGTAA AC AT AT AAAA GCATGTGAGA GACTGTTGGA 60 

AAAGCGAGGA AATTTCCCCT CTTTTCCTCT ACTCTCTCCT TTCTTTTGCT GATTTTATTC 12 0 

AAAGAAAATG ATATAATAGT AGTTATGGAG AAAAAGAAAT TACGCATCAA TATGTTGAGT 180 

TCAAGTGAGA AAGTAGCAGG ACAGGGAGTT TCAGGTGCTT ACCGTGAATT AGTTCGTCTT 24 0 

CTTCACCGTG CTGCCAAGGA CCAATTGATT GTTACAGAAA ATCTTCCAA? CGAGGCAGAT 300 

GTGACTCACT TTCATACGAT TGATTTTCCC TATTATTTAT CAACCTTCCA AAAGAAACGC 3 60 

TCAGGGAGAA AGATTGGCTA TGTGCATTTC TTGCCAGCTA CACTTGAGGG AAG TTTG AAA 4 20 

ATTCCATTTT TCTTAAAGGG AATTGTGAAA CGCTATGTAT TTTCTTTTTA CAACCGGATG 4 80 . 

GAGCACTTCG TTGTGGTCAA TCCTATGTTT ATTGAGGATT TGGTAGCAGC TGGT ATTCC A 540 

CGTGAAAAAG TGACCTATAT TCCTAACTTT GTCAACAAGG AAAAATG GC A TCCTCTACCA 600 

C AAG AAG AGG TAGTCAGACT GCGCACAGAT CTTGGTCTTA GTGACAATCA GTTTATCGTA 6 60 

GTAGGTGCTG CGCAAGTTCA GAAACGTAAA GGGATTCATC ACTTTATCCG TCTGGCTGAG 7 20 

GAATTGCCTC AG ATT ACC TT TATCTGGGCT GGTGGCTTCT CTTTTGGTGG TATGACAGAT 7 80 

GGTTATGAAC ACTATAAGAA AATTATGGAA AATCCCCCTA AAAA TTTG AT TTTTCCAGGC 840 

ATTGTATCGC CAGAGCGGAT GCGCGAATTG TATGCTCTAG CGGATCTTTT CTTGTTGCCT 900 

AGTTACAATG AGCTCTTTCC TATGACTATT TTAGAACCTG CGACTTGTGA GGCTCCTATT 9 60 

ATGTTGCGTG ATTTAGATCT CTATAAGGTG ATTTTGGAGG GAAATTATCG GGCGACAGCG 1020 

GGTAGAGAAG AGATGAAAGA GGCTATTTTG GAATATCAAG CAAATCCTGC TGTCTTAAAA 10 3 0 

GATCTCAAAG AAAAGGCTAA GAATATTTCC AGAGAGTATT CTGAAGAGCA TCTGTTACAA 1140 

ATCTGGTTGG ACTTTTATGA GAAACAAGCC GCTTTAGGGA GAAAGTAAAA AGTGAGGTAA 12 00 

TCTATGCGAA TTGGTTTATT TACAGATACC TATTTTCCTC AGGTTTCTGG TGTTGCGACC 12 60 

AGTATTCGAA CCTTGAAAAC AGAACTTGAA AAGCAGGGAC ATGCTGTTTT TATCTTTACG 13 20 

ACGACAGATA AGGATGTCAA TCGCTACGAA GATTGGCAAA TTATCCGCAT TCCAAGTGTT 13 80 
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CCTTTCTTTG 


CTTTTAAGGA 


TCGTCGCTTT 


GCCTACCGAG 


GTTTTAGCAA 


GGCACTTGAA 


1440 


ATTGCTAAAC 


AGTATCAGCT 


AGATATTATC 


CATACTCAGA 


CAGAATTTTC 


TCTTGGCCTG 


1500 


TTGGGGATTT 


GGATTGCGCG 


TGAATTGAAA 


ATTCCAGTCA 


TCCATACCTA 


TCACACCCAG 


1560 


T AT G AAG ACT 


ATGTCCATTA 


TATTGCTAAG 


GGGATGTTGA 


TCCGGCCGAG 


TATGGTCAAG 


1620 


TATCTGGTTA 


GAGGTTTCCT 


GCATGATGTG 


GATGGGGTTA 


TTTGCCCTAG 


TGAGATTGTC 


16B0 


CGTGACTTGC 


TATCTGATTA 


TAAGGTCAAG 


GTTGAAAAAC 


GGGTCATTCC 


TACTGGGATT 


1740 


GAATTAGCCA 


AGTTTGAGCG 


TCCGGAAATC 


AAGCAGGAAA 


ATTTGAAAGA 


ACTGCGTAGT 


1800 


AAACTAGGGA 


TTCAAGATGG 


TGAAAAGACG 


TTGCTTAGTC 


TTTCGAGAAT 


CTCCTATGAA 


1860 


AAAAATATTC 


AAGCAGTTTT 


AGCAGCCTTT 


CCTGATGTTC 


TGAAAGAGGA 


AGACAAGGTT 


1920 


AAACT GGT AG 


TAG CTGGGG A 


TGGCCCTTAT 


CTGAATGACC 


TCAAAGAGCA 


AGCCCAGAAC 


1980 


CTAGAGATTC 


AAGACTCAGT 


CATCTTTACA 


GGGATGATTG 


CTCCTAGTGA 


GACGGCTCTT 


2040 


TACT AT AAAG 


CGGCGGATTT 


CTTCATTTCG 


GCATCGACAA 


GCGAAACGCA 


AGGTTTGACC 


2100 


7ACTTGGAAA 


GCTTAGCCAG 


TGGAACACCT 


GTCATTGCTC 


ACGGAAATCC 


TTATTTGAAC 


2150 


AACCTCATCA 


GTGATAAAAT 


GTTTGGAACC 


TTGTACTATG 


GAGAACATGA 


TTTGGCTGGT 


2220 


GCTATTTTCG 


AAGCCCTGAT 


TGCAACACCA 


GACATGAACG 


AGCATACCTT 


ATCAGAGAAA 


2280 


TTGTATGAGA 


TTTCAGCTGA 


GAACTTTGGG 


AAACGAGTGC 


ATGAGTTTTA 


TCTGGATGCC 


2 340 


ATTATTTCAA 


ATAACTTCCA 


GAAAGATTTG 


GCTAAAGATG 


ATACGGTCAG 


TCAGCGTATC 


2400 


TTT AAG AC AG 


TTTTGTATCT 


TCAGCAACAG 


GTCCTTCCTC 


T AC CT GT AAA 


AGGATCTAGA 


2460 


CGCATGTTGA 


AGGCTTCAAA 


AACACAGTTG 


ATCAGTATGA 


GAGACTATTG 


G AAAG ACC AT 


2520 


GAAGAATAGA 


AAGAGGAACA 


GCTATGAAAA 


AAACAATTAA 


TGAGAAGCGG 


TCGTGATAAA 


2530 


AAGATTGCGG 


GTGTTTGTGC 


TGGGGTGGCC 


CATTATCTGG 


ATATGGATCC 


GACTATCGTT 


2640 


CAAGTCATTT 


GGGGTGTTCT 


TACTTGCTGT 


TACGGAGCTG 


GAATTGTAGC 


TT AC ATT ATT 


2700 


TTATGGATTA 


TCGCGA 










2716 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13926 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC : 5: 
CTTTCGTTTT GCCTTATTCA AGACATGACC CCCATCACCA ATCA7CTCAA ACTGCGAATC 
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TGTTAACAGT 


CT ATGG AG AG 


CTTTCATAGA 


ACTAAGATTC 


GGTTTATCTT 


TGCTGCCACA 


120 


AATTACTAAG 


GTTGGATAAG 


GGTAAGTTCC 


TGCTATATCC 


GTTAAATCAA 


GTGTCTTCAA 


180 


CTCCTCAGAA 


ACTCCGACCA 


TAAGAGTCTT 


GTCTGCTCCC 


TGTTTTTCAA 


ATACTCTTTT 


240 


GGGAAGTAGT 


TTAAAAATCA 


GCAATTGAAG 


ATAAAATAGG 


ATATTCCCTG 


CTAATTTAAG 


300 


CGGGCATCCT 


GACAGAATCA 


AAGCTCGAAG 


ATTTGGTAAA 


TCGTAACTGG 


AAAGTTCT AG 


360 


TG7CAGGGCA 


GCACCTAAGG 


ACAATCCAAT 


CAAAACAAAA 


GGTTCTGTCT 


CTTGAGCTAG 


420 


GTGCTGATAA 


ACTCGCTCTT 


T AG CT TGTTG 


AT AGTT ACT A 


ACTCCAGAAG 


GAAATAACTC 


490 


GATAGCCTCA 


GAAGGATAAT 


CTGTCAGTAG 


ATTCCGAACT 


TCTTTCCAAG 


ACTCTGCTGA 


540 


CTGCCCTAAC 


CCATGCAAAA 


ATATTAATTT 


CATCTAGTTC 


TCCTCAAGGC 


TTAATTCATA 


600 


CAAGCCTCTC 


ACTGCATTAC 


AGCCGTAAAT 


AGCTTCTGCT 


TGGGTTAAAT 


CTGCCAAGGT 


660 


CAAGACTTTC 


TCTTCTACCT 


GTCCTGTTTC 


TAGCAAATGC 


TGACGGTAAA 


TTCCTGGCAA 


720 


GATTCCAAGT 


CGGATAGGCG 


GTGTGTAGAG 


TTTTCCAGCG 


ATTTTCAGAA 


CCAAATTTCC 


780 


TATAGAGGTT 


TC AAG C AGTT 


CTCCTCACTT 


ATTGTGGTAA 


ATCTTCTCTT 


GTTCTCCTAG 


840 


GCTCAAATGC 


GGTCGGTGAG 


TGGTTTTAAA 


GTAGGTAAAG 


GATTGATTCA 


AAGCAGCTTC 


900 


CTGAAGACAG 


ACTTGGGCCT 


GACAAAAGCT 


TGT ACTG AG A 


GGGGTTAATA 


CTTGACGATT 


960 


GACTTCTATC 


TCT C C AGATT 


TGCTAAGCCT 


GATTCGCAAG 


CGGTAATCTC 


GAT T AGCTTC 


1020 


ACAATCCTGA 


CACTCTTCCT 


CAATCTTGTG 


TCCCAAGTCT 


TCTGCATCAA 


AAGGAAAAGC 


1080 


AAAATAACGA 


CTAGCTTTTC 


TCAGCCTTTC 


CAGATGTTGT 


TCTTCAAACA 


TCAGTTGTTT 


1140 


TTGGCTGATT 


TTTCCAGTTG 


TAATTAATTG 


GAAGCGAGCT 


TGTTTACGAT 


AGAGAACTGC 


1200 


TGCCTTTTGA 


TGAACCTCTC 


GGTATTCAGA 


TTCCCATGTG 


CTATCCCAAG 


TAATCCCTCC 


1260 


GCCAACTCCA 


TAAATGGCTT 


GACCTTTGTG 


AAGTTGAATG 


GTACGAATGG 


C C AC ATT AAA 


13 2 0 


AATCCGTCGT 


CCATTTGGAA 


(jLAAUAoALL 




CAGTAGACTC 


CACGCGCTTG 


1380 


AGGCTCCAAG 


TCCTTGATAA 


TCTCCATTGT 


CGCAATTTTC 


GGTGCACCCG 


TTATGGAACC 


1440 


ACAAGGAAAG 


AGTGAGCGGA 


AGATTTCAAC 


AAGGTCCACA 


TCCTCTCGCA 


ACT G ACTCT T 


1500 


GATGGTCGAA 


GTCATCTGCC 


AAACAGTTGA 


ATACTGCTCT 


ACCTGACACA 


GACGCTCCAC 


1560 


GTGCTCGCTC 


CCAACTTCAG 


AAATACGGTT 


CATATCATTG 


CGCAAGAGGT 


CCACAATCAT 


1620 


CATATTTTCA 


GAGCGATTTT 


TGCGATCCTG 


TTCCAACCAA 


CTGGCCTGTT 


CAAGATCTTC 


1680 


TTGGTCAGTT 


ACCCCACGCT 


GAGTCGTCCC 


CTTCATTGGT 


CGTGTTGTCA 


ACTCGCGATC 


1740 


ATTTTGCTCA 


AAAAAGACCT 


CTGGGCTCAT 


GGAAATCACT 


GTCATCTCGT 


CATGTTCCAC 


1800 
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ATAGGCATTG TAGCCCGCCT CCTGCTCTAC CACCATACGA TTGTAGATGG C AAAAGG ATT 18 60 

CGCATT7AAC TTTTGCTTAA GTTGGACGGT GTAGTTGACC TGATAGGTAT CTCCCTGCCG 1920 

TAAATGATGG TGAATTTGGG CAATGGCCTT TTCATAGTCT GCTGCAGACG TTACTTCCTG 19 90 

CCAATTTGAG GGCAAATCAA TATCCTCATA AGTCAGAGGA ATAGGGGAAG TTTCTACGAT 2 04 0 

ATCATGAACA GT AAAGT AAA GCAGGTACTC TCCCAGTAGG GGATCCTTGT GAACTGCTAA 2100 

TTTTTCCTCA AAAGCAGGTG CAGCCTCGTA GCTGACATAC CCCACCACAT AATAACCTTG 2160 

CTCTTGGTAG CTTTCCACTT GTGCCAGCAA ATCTGCCACT T CTTCT AC AT TTCTCGTTTT 22 20 

CAACTCTTTA ATAGGCTGGG TAAAGGTATA TCTCTCCCTCC AAAGTCCTAA AATCAATCAC 2 2 30 

TGTTTTTCTA TGCATACCTT AAGTATAGCA TAAAATAAGA AAACCCTCAT CCGCAAAGCA 2 340 

GATGAGAGAT TTCAATTATT TAAAGATTGA ACTTTTAAAG CTATTTGTTT GTTGAAGAAC 2 4 00 

TTTCTTATAA AC AG CT7CTT TTAATTTAAC TGTATTATTC ATAGATACTG TTTTATTACC 2 4 60 

GTTTGCTTCT TGTTTAAGAG TTTCGGC ATC TTTTTT AAC A GCTTCTTTAA ACAATGTCAG 2 52 0 

TAAATCATCG TATGATGAAA CGGAAGAACC ATTTACTTCG AATGTTGTTA ATCCTTTCGT 2 530 

TCCTTTATCT TTAACTTCTT TGAAGTAACC TTTTTT AAAT TCTTCAATAG T ATT AAATGT 2640 

ATTGTTAGAT ATTTTCTTGA TAATATATTC ATCACTTAGA ACAGACTCAC CATCTGTTTT 2700 

AGATTGTTGT TTATATTTAT TTGAAGCATA ACCTAAGAAC CCATTTTCGT ATCCGTAGTA 27 60 

ACCCCATAAT CTAAAAGCAT TATGTTTGAA TGAAACAGCT CCAGGAGCAC CTTTACTAGT 2820 

ATTACCTCCC T AG AT AC CGG TCATCATTCT AACACCTACA TAAGGTGATT GATCGTTATA 28 80 

GCTAATTGCT TCGGGTTTAT AGATACCATT ACCTGGATTG CGATTAGTCA TTAATTGTTG 2940 

ATCAACTAAA TCATTAACAG ATTGAATATT TAATTCATTT TTCTCTTCTT GACTTAGATT 3 000 

TCGAATTTTA TCCCATTGAT TTAATTTATT GTTATCACGG TATTCTCTAT CTATTTTTTT 3060 

GAACCATGCA CTATTTAAAT CTTTATTTTG TTGAGAAATC ACAGATTCAG CCTCAATTTC 3120 

ATCAAGAAGA GTTAAAGTCT CATTATAACC CTTCATATAT CTATTAATAT CTTCTCGTGT 3180 

TTTTAGAGTT TTTGGATCTG TAATATACCA CTGATTCCCA TCATTTTTGC GTTTAAATAC 324 0 

CATATTAATA CCTAAAGAAC CAAACTCATC AAATCCACTA CCAGTAACAG GAGTTTGTAG 3 300 

CATACCCTGA GCATATGCTT CAGCATCAGT ACCTTCACGG TGTCCAAAGC CACCTAAGTA 3 3 60 

AATCGCACGG TCCTTGACGT GTGTTGTTTC ATGTGTGTAA ACTGAAATAC CGTATTCACC 3420 

AACCATTTCT AAATGAACAT ATTTT AC AT C AGTTCTAATA 7CATCAGAGT TAGGATATAT 34 80 

AGCAGCATAA GCTCCTGTTC CATTATAATT ATAATACTTA TCCATAGGAC CAAAGAATTC 3540 

TCTAAGAGGA GTATATACTT TGTCGGTATT ATAGCGGCCA TATTTTTCAA CCCATCCACC 3 600 
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AGGAGCCTTA TAACCTTCCC AAATAGGAAT AACAGCATCT CTTACTAGTC GTTGTTTAAC 3 660 

GTTATCAGAC GCTAGACGAT ACCAGAAATC ATAATAGTTT CTATAACCAT CTGCAGCTTT 3 720 

GTTAACGATA TCTTTAATAT CTTCTAATGA TTTTTT AC CT AATCCCTCTG CACTACCAAA 3 7 80 

GGCAATTGCA TTATAATTTG AAATTAAATA AAGATGTGCT TTATCAATAT TCAGTAGTGG 3 840 

GAGTATAGTA TTTCTAAGGT GACTTCGTTT TAAATTATCG AATGCACGAT GTTTAGAATT 3 900 

TTTAATTTCT TCGACCTCAG AAGCGCGTTC TGCGATGTAG ACATGGTCTT CTGTAGC AT C 3 9 60 

AATAAACCAA TCGTTCATAT TGTCTATATT TGTGAACAAT TGTCTATTAT AATTTAAAAA 4 02 0 

TGCATCTAAA TTACCTGATT TAGTATATTT AGCCAATACT TGACCGAATG CGTCGAATGT 4 080 

ACGTGAACCT TT AATGTTCT TCTCTTTAGA AC CG AT TT C A ATTAATCTGT CTAATACGCT 4 140 

AACTTTTTCA C CAT AG AAAT CTGGTTTGAA TAGCATTAAT TCTTTAATAT TAACATCACC 4 200 

AAATTTAACT CCATAGTAAC GATTTAGGTA AGTTAAACCT AGTAATAAAG CTGCTTTGTT 4 260 

TTTCTCGACT TTATCACGAA TCATTTGACG AGCAGCTGCA GAATCATTTA GTTGATGTTC 4 32 0 

TTCGTTTTCA ACTAATTTTG TGATTAGGTT TGTTAAGTTT TCTTTAACAT CTGTGAAGCT 4 380 

TTCTTCTAAA TATAAATCTT TGATTGCATT AACTCTATAG TCACCTAATC GATTTAGATG 4 4 40 

CTGATACATC GTTTGAGACT GAAGCTCTAC TGATTCTAAA ATAGATTTTA TATCATTAAC 4 500 

AAGAGTAGTG TTATCTTTTT GAACGATATT AGGTGT AT AT TTAATTCCTA AGTCAGTTAT 4 5 60 

AGTATATTCT TTTACATTAC TTAAACCTTC ACTGCTAGAA GACAAGTTAA AGTAATCTTT 4 620 
TGTACCCTCC GCATAGTGAA CAATAATTTT ATTACCTTCA TCT AGGTTTG TGATAAACTC 
ATTGTTGTTC ATCGCGGTAA CAGAAAGAAC TTCTTTAGTA TTTAGATGGT GTTCTTTATT 
TAATTTATTA CCTTGATATA CAATATAATC TTT ATTGT AG AATGGTATTA ATTTTTCAAG 
ATTTTTATAG GCTTGGTTAT ATTCAGCGTT ATAATCTTGA ATACTAGAAT AGGCTTTTTC 

TTCATTAAGT TTTGCAAGAG GAGATAGATC ACTTTCTAAT TTATCAGCAG TAATATTGAA 4 920 

AGTAGTAACT TTAGCATCAG CTTGTTCTTT AGTT AATTT A G T AAAT G TTT TAGATTTCCT 4 980 

AAATGATCTA TTACCTGACG AATATCCCTC TACCGCATAT AAATCTTTTA TATGAGCACT 504 0 
AGCATAATCA GAATCATCAA CGTCGTTAGA GCCGAATAAC TCCTCTCCAC GGATAATCTT 5100 
AGCATAGCTG ACAGAATTAC TTACCGTACC TACAGGCCAA GTCTTACTTG CTATTGCTCC 5160 
AACTTCT ACT GGATTTGAAA CATCTATTTT ACCTTTTACA ACCGACTCAG TTAGGAGAGC 52 2 0 

TTTTGTACCA ATAAGATGGT CTAGAGTTAA TCCATAATCT ACTTTAGGAA CTAACAAGCT 5280 
GGCGCGTGTT TTGTTTCCTG TAATAGTAGC ATCAACATAT GCTTTTCTAA CAATTCCTCT 5 34 0 



4680 
4740 
4800 
4860 
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ATAGTTTGTA CCTGCAATTC CCCCTGTATG AGAGCCATTT CCACTTGTAG AGTGTAGTTT 5 4 00 

GCCAAAGAAA CCAACATTTT CAATACGAGT TCCATCATTC ATATTATTTA CAAATCCAGC 54 60 

AAC ATT ATT A CGACCTGAAA GTGTGCCTGT AATTTTGACA TTTGTAATAA CTGAAGAACC 5520 

TTTCATAGTA TTGGCTAATG ATGCAATATT ATCTTGACCA GAACCTTCTA TCTCTACATT 5580 

TTCAAAATTC ACATTATTTA TCGTTCCCTT TGTTATCACA TTAAATAATG GATGTTCCAA 564 0 

TTCAGTAATA GCAAATTGTT TTCCTTCAGA ACTTAAAAGT TTTCCTG TG A ATTCTTTAGT 5700 

GATATATGAT TTTCCATTAG GAACAACATT TCTAGCGCTC ATTGATTGTC CCAGACGATA 57 60 

TTCTTTTGAA GGATCGTTTT GAATAGCTTC CACTAATTCT TTGAAATTAT AATATACATT 5820 

ATCTTCGTGG ACTTTAGGTT TTTCAATATA CTGAACGTAT TCTTCTTCAA ATTTATTATC 58 80 

AGCAGTTCTA GAG ACT AAA T TGTCTGCGAT TGCTGTAACT TT AT AT AC AG CTGTTCCGTT 5940 

AACCGTACTT TCTTCTATAT TTTTAACAGC TAGTAATGTA GTTTTCTGAT TATTTGAAGT 6 000 

TATTTTTAAA TAATAATTGC TCTTATCATC AGGAATAGTT CTTATCAGTG ATTCATTAGT 6060 

TTCTTTTCCA TTTTCGTATT TGATTAAATC TGTACGTTTA ATATTTTTAA GCTCAACTTT 6120 

TTTAAGATCT AATTG AAT AT TTTGATTTTC TAGAGTTTCA GTTTCTTC AC CGTTACCTCT 6180 

GTCGTAAATC ATAGTTGTAG ATAGGGTGTA TTCTTTGTAG TACTCTAGGT TCTTAAATGC 62 4 0 

AGCGCTTATA GTTTCTGTTG TTACCTTGTC ATCTGTAAGG ACTACAGTAT TAATAACTTC 6300 

TTCTCCTTTT TTCAATTCAC CTGTGATTGA TTTGATTTTT GTTTTGTTTT GATTTTCTAG 6 3 60 

AGTATACTTA GCAACAGCTT CACGTTCCAA TATTTTCTTA 7CGCTACTAG TCAATGTTAA 64 20 

TATTGGCTTT TCAGATAATT CAACCAATTT TTCAATACTT GCAGTTAATT TTTCAACAGC 6 4 80 

TTCGTTAACT TCACTTTGTT TAGCATCTCT ATTAGCTGCA ACTTTTTCAG CCTTTGTAAC 6 54 0 

TTCAGTTTGG AGGTTTTGCC AACTTCTATC ACTGTAATGT TCTTTTACCT TTGTTTTTGC 6 600 

ATCTGCAATC GTATTGTTTA ATTCAGTTTT ATCAACGTTT AGAGCGTCAA TACCCCTTTT 6 6 60 

AAGTTTATTT GTCTCGCTAT TTACCTCAGG CTGTTTTACA GGCTCTCAAG CATAGACACC 67 20 

TTTTGCAGTT TCTAAAACAG GTCCAAGAGC ATTGTAACTT GCTGTAGAAT AATCAGTAGG i7 80 

AGAAACTGAA CTAGCTTTAT CAATTTGATT ATTTAACTCA CTTTTATCAA CTGGTTCTTT 68 4 0 

AGTACCAATA CCCTTTATTT TATCTTCTCG TTTCGGTGTT TCCTCTACAG CCTTCTCTTC 6900 

TTCAGGAACT TCTGGTTGCT TTTCTGGCTC AACTGGTGCC GTTGGTGCCT CTTCGTCTTC 6 9 60 

TCTTGGCGCG ACTGGTTCAC CTGCTTGTTC AACTTTTGGT TCCTCTGTTG GTTCTGTTTG 7 0 20 

TTTTTCTACA GCAGGCGTTT CAACTTTTGG TTGTTCAATA GATTGATTAA CAGTCTCCTC 7 080 

TTTTGGTTCT ACAGTTTCTT CAGCCTTGGT ATCTGGAGTT GACTCTTCTT GTTTCGGTGT 7 140 
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TTCCTCTACA GCCTTCTCTT CTTCAGGAGC TTCTGGTTGC TTTTCTGGCT CGACTGGTGC 7 200 

CTTTTCGTCT TCTCTTGGCG CGACTGGTTC ACCTGCTTGT TCAACTTTTG ATTCCTCAGC 7 2 60 

TGCTTTGTCT GATGGTTGAC TTTCTGGCTT AACTG CT ACT TTTTCCTCTG GTTTTGACTC 7 3 20 

AACTTCTCCA CCTACTTCTT CAACTGGAGC TGGTTCTGCT GAATCTTCTT TCCCCTCTTC 7 3 80 

TACTTTAGGA AGGGTGTCGT CAGTAGCTTT TACCTCCGAT TTTGGTTCTT CCTTTGGACT 74 4 0 

TTCTTCTGTT TTAGGTCCTT CTTCTTTTGG AGCTTCCTCT GTCTCTACTA CTTGGTTTTC 7 500 

TGTCCTAGCT TGCTCCTGAT TTGTT ATTG A TTGAGGAGTC TCAACTTCGA CCACAGTCAC 7 5 60 

CTCTCCACGT TTTGCTGAGG TTTCTTCTAA AACAGTGTCC AAGCCAAGCG TTTTGAGGAT 7 62 0 

GTCACCTGAT AGATAACCAA CATAGCGATA GCCCTCCATT TCAACAACAC CCTCTCGACT 7 68 0 

AGCCAGCGCT AGGGTCGCAA CTGGGTCTAC AGCCCCTGCA CTAGGAAGAA CTACCAATCC 774 0 

CATAGCTCCA ACTAGAAACA CGCTAGCAAT TTTCTTTCTC TTGTAGATTA AAAGCAAGCT 7 800 

« CCCAACAGTC AGCAAACCAA AAGCTGTCAA AACAGATGCT TCTGT CCCTG TTTGAGGCAA 7 8 60 

CTGATCTTTT TCATACACCA AACCATATAC AACTTCATTC CTGTCAGGCT TTCCTGTCTG 79 2 0 

AATTAAATCT TTAGCTTCTT GTGAAATAAT CTCTTTATTT ACATAGTGAT AGGTGGCTGC 79 8 0 

GTCCACTACA GAAGGAGCCA TCAAAACGCT TCCAACAAAT ACAGAGCCTA CAACTCCCTT 8040 

AATCTTACGA ATTGAAAAAC GGTCTTTTTT AAACACTTTT ATCTCCTTTA TTCATTCTCA 8100 

AAACTT CCT A ATAGCATCTT GCGGATAGTG CGCACGCGCA CCTCCGATTA ATTTTGGACG 8160 

ACTAGCCAGT GCCCTTACAT GGGCATGACC AATCTCTCTC AAAATAGGGC GAATCGGAAC 82 20 

CTGAACATGC TTGACATGCA TGCCAATTGC AGTGTCTCCG ATATCCAATC CAGCATGAGC 82 80 

CTTGATAAAT TCAACCTCAA CTGGATCCTG CATAAACTTA AAGGCTGC C A ACTGCCCCGA 8340 

ACCTCCTGCA TGAAGAGTAG GATGGACACT GACAATTTCC AGACCAAACT GCTCTGCCAC 84 00 

CTGACGTTCA ACAACGAGAG CCCGATTGAC ATGCTCACAA CCTTG AACTG CTAAATGGAT 84 60 

ACCTCTACTA CCTAGAATAT CCAAGATAGT CTCCACTATC AGCTCACCAA TCTCTTGACT 852 0 

GCATTCTTTC CCAATATGAC CACCTAGCAC CTCACTAGAA GATAGACCTA AAACAAAAAG 8 580 

GGCCCCCTGC TTCAAATTGG TCTTTTCTAA AACATCTTCC ACTACCTGAC GTGTTTCTCT 8 64 0 

TTGAATCTGT GTCTCGTTCA TCTCTGTTAC CTCTGTTGTC ACTCTTCTAT CATACCGTTT 87 00 

TTTCTTGTTT TTAGCAAGAT AG AC AACCT A GAAAGTTTGC CCAATTACGC ATAAAACTCC 87 60 

CAGAATTGAC TGGGAGTTAG CTAGTTTCTA TTCTATTTAT ATATATTTCA ACTTTCGTCC 8 82 0 

CTTTTTGGGG TCTAGAATCA ATCTTCATAT GGTAATTGGC TCCAAAATGA AGTTTGAGCC 8 8 80 
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GTTGATCGAC ATTTTCAAGA CCAACTCCCC CACGTTTGAG TTefte^TTGA CTACTATCAC 894 0 

CAGCATCTTG GAAGCCAACG CCATCATCCT CAATACGGAT GACCAATCCC GAATCCTGTT 9000 

TCTGGACAGA AAGTTTAATA TGGCCCTGAC CTTCCTTTTC CTTAATGCCA TGGTAAAGAG 9060 

CATTTTCTAC AAGGGGTTGT AGGACCAGCT TGGGTAAGAC TAAATTATCA AAGGCAACAT 9120 

TTTCATTAAT TTCGTATTCC AGCTTATCTC CATAGCGTTG TTTCTGGATA AAGAGATACT 9180 

GGCGGACATG ATTGATTTCG TCAGAGAGAC AAAT CAAGTC CTTGCCTTGA TTGAGCGCCA 924 0 

AGCGGAAATA GGTTGCCAAG GACTTGGTCA CCTGCACCAC TCGCTGACTA TCATGAAATT 9 300 

CAGCCATCCA GATGATGGTG TCCAAAGTGT TATAGAGGAA ATGTGGATTA ATCTGGCTCG 9 3 60 

AAAGCGCTTG AACTTGCTAC TGACGGGTCG TTTCTTCCTG GCTACGAATA CCTACCATCA 9420 

ACTGATCAAT CTGATCCAAC ATAGCATTAA ATTGGCGAGT TACTTCTCTC AGTTCATAGG 94 80 

CACCAACTTC CTTGG C AC G A AGATTTTGAG CACCAGAAGC AATTTCCAAC ATGGTTTCTC 954 0 

TCAAATCCTT CAAAGGAGCA ATCCAGCGTT TAAGACTGAA CCACACTAAG C AG AG AC AG A 9 600 

CAAGAAGAGA TGTCACACTG GCCCCAAGCA AGGTCCACAA GAGCTGACTC CGAACCTGGT 9 6 60 

CTAACTTTTC CAATGATGAC ACGCCAAGCA CCGTCCAATC AGTT CCTGC A ATCTTCTCTT 972 0 

GACTGACGTA GCATTTGTGA CCAGGAGTAT AACCCTGACC TGTATCGATG TAGCGTTTCA 97 80 

TAGCCTCCAT TTTGCTAGAC GAACTATAAA CTCTGTGTTG AGGATGGTAG ACAAATTCAT 984 0 

GGTTTTCATT GATAATGAAG GCAAAGCCCT GCTGCCCCAA CTGGAGTTGA TTGAGATAGG 9 900 

CTTCCAGAGT TTCATAAGAA ATATCCAAAC GAAGCACACC AAGATTGGCT CCCTTTGCAT 9 960 

CAACAAGTTC TTGAGTGACA GAAATGACCC ACTGACTATC TGATTTACGA CCTCGAGTCA 10020 

AAAC AGG CAT AGCTCCCTGA TGAATGGCCT TTTGGTACCA ATCCTCAGCC ATCATATCAG 10080 

AGGAAGTTTT CAT CTG C ACA CTCTCATCTG TAGAAATGAC CTG AC C AG AT TTGGTCACCA 10140 

GCACAACAGT TTTCAAGTCC TT AT CTG ACT TCAAGATGGT CAAAAACAAA TCTCGCATTC 102 00 

CCTCGACCTT GTCTTGACTG GGATTCTCAG CATAGGCCAG AACATCCGTC TGCTGGGTCA 102 60 

AACCAGTCGA GGTGGTTTCT AGTTTTTTGA TATAAGACTG AATAAAGTGG CTAGTCTGGC 10320 

TGATGGTCGT TTGGCTGTTG CCCTCAATGG TGGCCTCAAT GGCTGAAGAA CTTGATTGAT 10 38C 

AGTAGAAAGT TCCAACCAGA GCTAGGAGAA TGAGAAAGAC CAGAAAGATG GAAATAACCA 10440 

TTCTAACTAA AAGAGAAGAA CGCTTCATCG GTCTTCTCCC TTCTTAAACT GACGAGGTGT 10500 

CACACCTGCA ATCTGCTTAA AACGTTGGGT AAAATAGTTC ATATCTTCAA AACCAACCTT 105 6 C 

CTCTGCGATC TCATAAATCT TCAGATCTGT AGTTAAAAGC AAGAGCTTGG CTTGTTTAAC 10620 

ACGTTCTCTC ACCAGATAAT CCTGAAAAGG CAAGCCCAAC TCTTTCTTAA TCAAGGAACT 10680 
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CAGATAGGTC 


GGACTAAAAC 


CTAAGTCACT 


GGCTAAAGAC 


TTTAAACTAA 


ATTGGCTATC 


10740 


AGCCAGATGA 


GACTGGATTT 


TCTGGGCCAT 


GTTTCCTTCA 


AACCTATTAG 


TCAATAAATC 


L0800 


TTGTAACTGC 


^C^TCT^CT 


CTTCCTTGTC 


TAGTTTTTGT 


TTGATTTTCC 


CCAACATTTC 


10860 


CTC AAT AT CC 


TG ACG AG AAA 


AGGGTTTGAG 


CAGGTAGTCG 


TCCACACCTA 


GTTTGACAGC 


10920 


AGACAAGGCA 


TAATCAAAAT 


CATCGTAACC 


TGTTAAAAAG 


ACCAAATGAA 


CCTGAGGATA 


10980 


GGTT T CT C GT 


ACCAGACTGG 


CCAACTGGAT 


GCCATTTAGA 


TGAGGCATGT 


TGATATCGGT 


11040 


TAAAATG AT A 


TCTGGCACCT 


GCTTTTGGAT 


CAATTCCCAA 


GCCTGCCTTC 


CATTTTCAGC 


11100 


CT'GACCGATG 


ATTTCCATAT 


CGTAGGCTGC 


TACATTGACC 


AGTTTAGTCA 


AACCTTGTCT 


11 160 


T AC C AG AT AT 


TCATCTTCTA 


CGATTAAGAT 


TGTGTAGGTC 


ATGCTCTGCT 


CCTTTACCAC 


11220 


TT ACT AGT AT 


CACTATAGCA 


AAATTCTCCT 


CTAACTGCTT 


AGGAAAGACC 


TCTTATACTC 


11280 


AATAAAAATC 


AAAAAGTAAA 


CTAGGAAGAT 


AGCCACAGGT 


TTCTCAAAGT 


ACCGCTTTGA 


11340 


nCTTGTAAA T ' 


AAAACTGACG 


AAGTCGACTC 


AAAGTATAGC 


TTTGAGGTTG 


TAGATAAAAC 


11400 




GATAACCCTA 


CATACGGTAA 


GGCGACGCTG 


ACGTGGTTTG 


AAGAGATTTT 


11460 


C C A AC AGT AT 


TAATCAACAT 


AATCTAGTAA 


ATAAGCGTAc 


CTTTTTCTTC 


CATTTGGTCT 


11520 




ACCGGATAGA 


GAGGCTATTG 


ATACAGTAAC 


GTAAGCCGCC 


CTTGTCCTGT 


11580 


GGACCATCCG 


TAAAGACATG 


CCCAAGGTGA 


GAATCTCCTA 


CTCGGCTCCG 


CACTTCCATA 


11640 




TGTAGGACTT 


ATCTTCCTTG 


TAGGTCACAA 


CATCTGGACT 


GATGCGTTGG 


11700 


GTAAAACTAG 


GCCAGCCACA 


ACCAGACTCA 


AATTTGTCTT 


TTCATGAAAA 


GAGAGGTTCC 


11760 


CCAGT^GCTA 


TATCCACATA 


GATACCGGAT 


TCAAATTTAT 


CCCAGTAACG 


GTTTGAGAAA 


11820 


GC^CGTTCTG 


TTTGATTTTC 


CTGGGTAACT 


G CAT ACT CCT 


CAGGTGACAG 


GGTCTTTTTC 


11880 


AATTCCTCAT 


CACTTGGTTT 


TGGATATTTG 


CTGGCATCAA 


TGACAGGATA 


GGCCGCCTGA 


11940 


TTAACATTGA 


TATGGv^AOl A 




TTTTTCTTGA 


GATAGTCTTG 


ATGGTAATCC 


12000 


TCAGCCACCA 


CAAAATTCTT 


CAAGTTTTCC 


TTTTCAACTG 


CTAGAGGTTG 


ATCGTATTTC 


12060 


TTAGCCACCT 


CATCAAAGAC 


TTGGTTAATC 


ACTTCCAAAT 


CCTTGTCATC 


TGTGTAATAA 


12 120 


ACACCAGTAC 


GGTACTGGGT 


CCCCACATCA 


TTTCCTTCTT 


TATTTTTGCT 


GGTTGGATTG 


12130 


ATAATGCGGA 


AATAGTGAAG 


CAGGATTTCC 


TTGAGAGAAA 


TTTGCTTGGC 


ATCATAGGTG 


12240 


ACATGGACGG 


TTTCTGCATG 


ACCTGTTTGG 


TTAATCAATT 


CGTACTTGGT 


TGTTTCTCCT 


12300 


CTACCATTTG 


CATAGCCTGA 


AACGGCATCC 


GTCACCCCGG 


GAACACGTGA 


GAAATATTCC 


12360 


TCCACTCCCC 


AGAAACAACC 


TCCAGCTAGA 


TAAATTTCGT 


GCAAGTCTGC 


GTCTTTACTA 


124 2 0 
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ATTTCTGTTT 


TTTTCACTGC 


TTTTCCTCCT 


TGGCTAACTG 


CCGCCTTTTC 


AATTTGCGAG 


12480 


GCATCTCTCT 


GCCCTGCATT 


TCGTATCAAT 


AGAACATAGA 


AACCGGTTAT 


GGCTAGAAAA 


12540 


AATACTCCTA 


GCAACAAGAA 


GATTTTTAAC 


TTATCATTCA 


TAAGACGCCT 


CCTAGGCTAA 


12600 


TTCCTTCAAA 


GTTTGCAAAA 


TTGCATCTTT 


TTCCATGAAT 


CCTGGATGTG 


TTTTGACCAG 


12660 


CTTGCCTTCT 


TTGTCTATAA 


AGGCTTGGGT 


TGGGTAAGAA 


CGGACACCAT 


AAGTTTCCAA 


12720 


AAGTTTGCCT 


GATGGGTCAA 


CTAGGACTGG 


GAGATTTTTA 


TAATCCAATC 


CCTTATACCA 


12780 


ATTCTTAAAG 


TCCGCTTCAG 


ATTGCTCTCC 


CTTATGTCCT 


GGTGACACTA 


CTGTCAAGAC 


12840 


CACATAGTCA 


TCACCAGCTT 


CTTTAGCAAT 


CTCATCCGTA 


TCTGGAAGAC 


TAGCCAGACA 


12900 


GATGGAACAC 


CAAGAAGCCC 


AG AATTTG AG 


ATAGACTTTC 


TTGCCCTTGT 


AATCAGATAA 


12960 


ACCG7AGGTC 


TTGCCATCTA 


CTCCCATCAA 


TTCAAAATCA 


GCCACCTCTT 


TCCCTTTAGC 


13020 


TGCGCTTGTT 


TTACTAGCTG 


TCTGCTCCGT 


CTTCATTTCA 


TCTTTCGTTT 


GG TGTTC ACT 


13080 


AGTCACGGAC 


TTGCCTGAAC 


AAGCCGTCAA 


ACAAAGGAGC 


GAACCTGCTC 


CAAGAACACA 


13140 


TGTTTGCCAT 


TTTTTC AT AT 


TGATATTCCT 


TTCCATTTTA 


TTCAAATAAT 


TGACTTAAAA 


13200 


TTGAAGCATT 


TCCAAACAGA 


ACCAAGAAGC 


CCATCACAAT 


AATGAGAAAA 


CCACCCACTT 


13260 


TTTTGAGGAT 


TCCGAGATAG 


GGATGAAGTT 


TTCGGAAATG 


TTTCAAAACA 


TAACTAGAGG 


13320 


TCAGAGCTAG 


AAGCAAGAAT 


GGTAGCGCCA 


AGCCCAGCGT 


ATACACCAAC 


ATGAGACCAG 


13380 


CTCCCTGCCA 


AGCTCCTGAA 


CCACCTGAAG 


CCGCCAAGGC 


CAAAACAGAC 


CCCAGAACCG 


13440 


GCCCCACGCA 


AGGCGTCCAA 


GCAAAACTAA 


AGGTCAAGCC 


CAATAAAAAT 


GCCTGACTAT 


13500 


AGCCCTTACC 


ATTTTGCCCC 


TGTCCTTGCA 


GTTGTAGCCT 


CTTTTCCTTA 


TAAAGCCCCT 


13560 


TAAAGTGTAG 


AATCTCCATT 


TGGTGCAAAC 


CAAGAAGGAT 


AATAATTGCC 


CCAGTAAGAT 


13620 


ATTGGAACCA 


AGAAGCATAA 


AGCAAATCGC 


CTAAAAAACC 


AGCTCCATAG 


CCCAACAAAA 


13680 


TAAATATAAA 


GGAAATTCCT 


GCTATAAAGG 


CCAGAGTTCG 


TAATAAACTA 


GTAACTGAGA 


13740 


TTGAAAATTT 


GCCGCTACAA 


GCCTGAGCAC 


CATCCTTATC 


ATCTAGTAAC 


ACTCCTGTAT 


13800 


AGACCGGTAA 


CAAAGGTAAG 


ATACAAGGAG 


AAAAG AAGG A 


TAGAATCCCT 


GCCAAAAAGA 


L3860 


CACTTAGAAA 


AAAGAAAATA 


TGACCCATAA 


AGTTCCTCCT 


ATCATTTTAT 


TCATAGATTT 


13920 


ATTATA 












13926 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A> LENGTH: 20199 base pairs 
(8) TYPE: nucleic acid 
(C> STRANOEDNESS : double 
(D) TOPOLOGY: linear 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 6: 



C v_ v_ A <j l_ A Vj AA 


AAAiOOt— Al 1 


tgc: ag at aa t 


GGAAATCGTA 


AAAAAACTAT 


CTTTGAGAAA 


60 


ATAACCT TUT 


I I A I HjIOA I 


1 A a V- A i vjv— 1 /A 


GTAGCAAGTT 


TATTGGGAAT 


TTTTGCAACT 


120 


GCAATTGCil G 


L.CC 1 I- AO 1 nn 


TPTATAAAAT 


AGATTCAAGA 


AAATTTAGTG 


ACTGGGATTT 


180 


CCCAGCCC 1 I 


1111 AAAvj 1 vj 


ap AAG AAAT A 


ATGAGTATGT 


TTTTAGATAC 


AGCTAAGATT 


240 


AAGGTCAAGG 


C 1 u(j 1 AA 1 OO 


1 V-TVjC Vj/\ 1 uu 1 


ATGCT^GCCT 


TTCGTCGTGA 


AAAAT ATGTC 


300 


CCvAATOoAb 


OCCt- 1 IVjVjvjO 




GGTCGTGGAG 


GCAATGTGGT 


CTTCGTTGTA 


360 


GACGAAGOAC 


I ALu 1 Al_L. I 1 




CGCTACAATC 


GTCATTTCAA 


GGCTG ATTCT 


420 


GGTGAAAAAG 


GGATGACCAA 






CTGAGGACCT 


TAGAGTTCGA 


480 


GTAC C AC AAG 


GTACGAC TO 1 






ACGTTTTAAC 


AGATTTGATT 


510 


GAACATGGGC 


AAG AA I I I A 1 




flGTGGTC GTG 


GTGGACGTGG 


AAATATTCGT 


600 


TTCGCGACAC 


/"* A A A A ft aTl^P* 

CAAAAAAIv-L 


TrrArrr.rAA 


ATCTCTCAAA 


ATGGAGAACC 


AGGTCAGGAA 


660 


CGTG AGTT AC 


•I * ip^rr a a *'"P 


nAAArt l L i lO 


GCAGA7GTCG 


GTTTAGTAGG 


ATTCCCATCT 


720 


GTAGGGAAGT 


C AAL AC i 1 1 * 


AAVj IIjI 1 1 1 


AfCTCAG CT A 


AG CCT AAAAT 


TGGTGCCTAC 


780 


CACTTTACCA 


CTATTG 4 ACL 


ft a ATTTir.rT 

AAA 111 AOu 1 


ATGGTTCGCA 


CCCAATCAGG 


TGAATCCTTT 


840 


GCAGTAGCCG 


ACTTGCCAGG 


1 1 Ibnl 1 OAA 




AAG GTGTTGG 


TTTGGGAACT 


900 


C AGTT CC T C C 


r^T»/^ ft f" ft Tf*f~ A. 
U Ia_ACA1v_OA 




GTTATCCTTC 


ACATCATTGA 


TATGTCAGCT 


960 


AGCvjAGGGCC 


r^Tf^ ft Tff ft T*ft 
L» 1 L>A 1 LLA L A 


Tf! Afin APT AC* 


CTAGCTATCA 


AT AAA G AGCT 


GGAGTCTTAC 


1020 


AATCTTCGCC 


TCATGGAGCG 


TCCACAGATT 


ATTGTAGCTA 


ATAAGATGGA 


CATGCCTGAG 


1080 


AGTCAGGAAA 


ATCTTGAAGA 


CTTTAAGAAA 


AAATTGG CTG 


AAAATT ATG A 


TGAATTTGAA 


1140 


G AGTT AC C AG 


CTATCTTCCC 


AATTTCTGGA 


TTGACCAAGC 


AAGGTCTGGC 


AACACTTTTA 


1200 


GATGCTAC AG 


CTGAATTGTT 


AGACAAGACA 


CCAGAATTTT 


TGCTCTACGA 


CGAGTCCGAT 


1260 


ATGGAAGAAG 


AAGCTTACTA 


TGGATTTGAC 


GAAGAAGAAA 


AAGCCTTTGA 


AATTAGTCGT 


1320 


GATGACGATG 


CGACATGGGT 


ACTTTCTGGT 


GAAAAACTCA 


TGAAACTCTT 


TAATATGACC 


1380 


AACTTTGATC 


GTGATGAATC 


TGTCATGAAA 


TTTGCCCGTC 


AGCTTCGTGG 


TATGGGGGTT 


1440 


GATGAAGCCC 


TTCGTGCGCG 


TGGAGCTAAA 


GATGGGGATT 


TGGTCCGCAT 


TGGTAAATTT 


1500 


GAGTTTGAAT 


TTGTAGACTA 


GGAGACTGGT 


ATGGGAGATA 


AACCGATATC 


TTTCCGAGAT 


1560 


GCGGATGGTA 


ATTTTGTTTC 


CGCCGCAGAC 


GTTTGGAATG 


AAAAGAAATT 


GGAAGAACTA 


1620 
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TTTAATCCTC 


TCAATCCAAA 


TCGTGCCTTG 


AGATTGGCAC 


CAACTAAAAA 


GGAAAATCCA 


1680 


TCTCAGTAAA 


GAAGCTAAAA 


AATCCCGTGC 


CTCATCAGAC 


ACGGGATTTT 


GTGGTACGAC 


1740 


AGGCATGTAT 


AGCAAACTGA 


ATCTGGAATA 


GCACAGCATA 


TCTTCTAAAA 


TATAGTAAAA 


1800 


TGAAATGAGA 


ACAGGACAAA 


TCGATCAGGA 


CAGTAAAATC 


GATTTCTAAC 


AATGTTTTAT 


i860 


AAGCAGAGAT 


GTACTATTCT 


AGTTTCAATC 


AACTATATTG 


TTATAAATTG 


ATTTGAATTT 


1920 


CAAAATTAAA 


TTGTTTCATT 


CTTATTTCAA 


TTTGTTATAG 


TATATCTGAT 


GTCAAAGTTC 


L980 


TCGGCGAGTC 


AAATAGCGAT 


TCCCAAGCCT 


GACTATCGTG 


AGGTAGCGGA 


TTAAAATGGT 


2040 


CTGGGGATAG 


ACCGTTTTAA 


GTCTGACGCT 


GGAAATAAGA 


ATTGTCAGAA 


GAAGGGATAG 


2100 


CGAAATCGTG 


GCTCTACGAA 


CAGGAACGTG 


ATAATAAGGC 


GTATATAGCG 


GATAAGAGGG 


2150 


CATCAAACTC 


TAAAGTCCAA 


AAAGGTAGTC 


GTAACCTATA 


TGCGTAAATC 


ACGAGAGTAA 


2220 


TTGAATTCGT 


ACTAAGATTT 


TCTATTTTCA 


CTGTAACCTT 


TTAACGCCCT 


TATATCTTGT 


2280 


ATACACGAGG 


AAAGATGTAC 


GACTTATCCC 


GTGAGGTCTA 


TCACTATAAA 


GAGAAAACGA 


2340 


CAGATAGAAG 


TGATCCTGAG 


TCACGGTTAT 


CTGTCTGATA 


GGACGGTATG 


TAT AAAACG C 


2400 


TTCTGTGAAC 


TGAGAGAAGG 


GGGAGAACTT 


CTTGCTAAAA 


TTTAGTTGAA 


CAGCCGTATT 


2460 


CCGATACTTA 


GATAAGAGAT 


CTAGTCTTAG 


CTCCTACTCA 


GTTTTAGGGG 


ATAAAAAAGG 


2520 


GGCAATAGCG 


ATTCGAGAAA 


GATTATACTC 


TTCGAAAATC 


TCTTCAAATC 


ACGTCAATAT 


2580 


CGCCTTGTCG 


T ATGTGT AGG 


ATACTGACTA 


CGTCAGTTCC 


ATCTACAACC 


TCAAAACAGT 


2640 


GTTTTGAGCA 


ACcTGCGGCT 


AGTTTCCTAG 


TTTGATCTTT 


GATTTTCATT 


GACTATTAG7 


2-700 


AATTCAGTTA 


CTAACTCGTC 


AACTCTGATT 


TATCCAATAA 


AATTG AAAAG 


GATGGAAAAA 


2760 


AGGATAAATT 


TATGATATAC 


TTTATTTTGA 


AGACCTTATT 


AGAAATCTTG 


AAAGAGTATT 


2820 


GAAAACTTAG 


AATGAGAAAA 


ATTGTTATCA 


ATGGTGG ATT 


ACCACTGCAA 


GGTGAAATCA 


2880 


CTATTAGTGG 


TGCTAAAAAT 


AGTGTCGTTG 


CCTTAATTCC 


AGCTATTATC 


TTGGCTGATC 


2940 


ATGTGGTCAC 


TTTGGATTGC 


GTTCCAGATA 


TTTCGGATGT 


AGCCAGTCTT 


GTCGAAATCA 


3000 


TGGAATTGAT 


GGGAGCTACT 


GTTAAGCGTT 


ATGACGATGT 


ATTGGAGATT 


GACCCAAGAG 


3060 


GTGTTCAAAA 


TATTCCAATC 


CCTTATGGTA 


AAATTAACAG 


TCTTCGTGCA 


TC TT ACT ATT 


3120 


TTTATGGGAG 


CCTCTTAGGC 


CGTTTTGGTG 


AAGCG A C AG T 


TGGTCTACCG 


GGAGGATGTG 


3180 


AT CTTGGTCC 


TCGTCCGATT 


GACTTACACC 


TTAAGGCGTT 


TGAAGCTATG 


GGTGCCACTG 


3240 


CTAGCTACGA 


GGGAGATAAC 


AT G AAGTT AT 


CTGCTAAAGA 


TACAGGACTT 


CATGGTGCAA 


3 300 


GTATTTACAT 


GGATACGGTT 


AGTGTGGGAG 


CAACGATTAA 


TACGATGATT 


GCTGCGGTTA 


3360 


AAGCAAATGG 


TCGTACTATT 


ATTGAAAATG 


CAGCCCGTGA 


ACCTGAGATT 


ATTGATGTAG 


3420 
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CTACTCTCTT GAATAATATG GGTGCCCATA TCCGTGGGGC AGGAACTAAT ATCATCATTA 3 4 80 

TTGATCCTGT TGAAAGATTA CATGGGACAC GTCATCAGGT CATTCCAGAC CGCATTGAAG 3 54 0 

CTGGAACATA TATATCTTTA GCTGCTGCAG TTGGTAAAGG AATTCGTATA AATAATGTTC 3 60O 

TTTACGAACA CCTGGAAGGG TTTATTGCTA AGTTGGAAGA AATGGGAGTG ACAATGACTG 3 660 

TATCTGAAGA CACCATTTTT GTCGAGGAAC ACT CT AATTT GAAAGCAATC AATATTAAGA 3*720 

CAGCTCCTTA CCCAGGCTTT GCAACTGATT TGC AAC AACC CCTTACCCCT CTTTTACTAA 3 730 

GAGCGAATGG TCGTGGTACA ATTGTCGATA CGATTTACGA AAAACGTGTA AATCATGTTT 3 84 0 

TTGAACTAGC AAAGATGGAT GCGGATATTT CGACAACAAA TGGTCATATT TTGTACACGG 3 900 

GTGGACGTGA TTTACGTGGG GCCAGTGTTA AAGCGACCGA CTTAAGAGCT GGGGCTGCAC 3 960 

TAGTCATTGC T GGGCTT ATG GCTGAAGGTA AAACTGAAAT TACCAATATC GAGTTTATCT 4 020 

TACCTGCTTA TTCTGATATT ATCGAAAAAT TACGT AATTT AGGAGCGGAT ATTAGACTTG 4 080 

TTGAGGATTA AACCGTAGAG GTGTTTATGA ATATTTGGAC CAAATTAGCA ATCTTTTCTT 4140 

TTTTTGAAAC GGATCGCTTG TATTTCCGTC CTTTCTTTTT TAGTGATAGT CAGGACTTCC 4 200 

GCGAGATAGC TTCAAATCCA GAAAATCTTC AATTT ATTTT CCCAACGCAG GCAAGTCTGG 4 260 

AAGAAAGTCA ATATGCACTG GCCAATTACT TTATGAAGTC CCCTTTGGGA GTGTGGGCAA 4 32 0 

TTTGTGACCA GAAAAATCAA CAAATGATTG GTTCTATTAA ATTTGAGAAG TTAGATGAAA 4 3 80 

TCAAAAAAGA AGCTGAGCTT GGCTATTTTT TGACAAAAGA TGCTTGGTCG CAAGGATTTA 44 4 0 

TGACAGAGGT TGTTAGAAAA ATTTGTCAGC TTTCTTTTGA GCAATTTGGC TTAAAACAAT 4 5 00 

TATTTATCAT TACCCACCTT GAAAATAAAG CTAGCCAAAG AGTTGCTCTT AAGTCTGGAT 4 560 

TTAGTTTGTT CCGTCAGTTT AAGGGAAGTG ATCGTTACAC AAGAAAAATC CGGGATTATC 4 620 

TTGAATTTCG GTATGTAAAA GGAGAGTTCA ATGAGTAAGC ATCAGGAAAT TCTAAGCTAT 4 680 

TTGGAGGAAT TACCAGTAGG TAAAAGGGTC AGTGTTCGTA GCATTTCGAA TCATCTAGCA 4 740 

GTTAGTGATG GAACAGCCTA TCGGGCTATT AAAGAAGCTG AAAACCGTGG AATTGTCGAG 4 800 

ACCCGTCCTA GAAGTGGAAC AATTCCTGTT AAATCCCAGA AAGTTGCTAT AGAGAGATTA 4 360 

ACGTTTGCTG AAATTGCAGA AGTGACTTCT TCTGACCTTC TGGCTGGGCA AGAAGGTTTA 4 920 

GAGAGAGAAT TTAGTAAGTT TTCAATTGCT GCCATGACTG AACAAAATAT CTTGTCTTAC 4 9 80 

CTTCATGATG GGGGGCTCTT GATTGTCGGA GACCGAACCC GTATTCAGTT GCTAGCCTTG 5040 

GAAAATGAAA ATGCAGTTCT GGTTACAGGG GGATTTCAGG TTCATGATGA TGTGCTTAAA 5100 

CTGCCCAATC AAAAAGGGAT TCCTGTTCTA AGAACTAAGC ATGATACCTT TACCGTCGCG 5160 
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ACCATGATCA ATAAAGCCTT GTCAAATGTC CAAATCAAGA CTGATATTCT CACAGTTGAG 522 0 

AAACTTT AT C GCCCTAGTCA TGAGTATGGT TTTCTGAGAC AGACAGATAC AGTTAAAGAT 52 8 0 

TATTTGGACT TGGTTCGTAA GAATCGTAGC AGCCGTTTCC CTGTTATCAA TCAACATCAG 5340 

GTCGTTGTTG GTGTTGTAAC CATGAGAGAC GCTGGTGATA AATCACCAAG CACGACAATT 5400 

GATAAGGTTA TGTCTCGTAG TCTATTTTTG GTTGGATTAT CGACAAATAT TGCCAATGTG 54 60 

AGTCAACGGA TGATCGCAGA AGACTTTCAA ATGGT AC C AG TTGTTCGAAG CAATCAAACT 5520 

TTGCTTGGCG TTGTGACGCG ACGAGATGTC ATGGAGAAGA TGAGCCGTTC CCAAGTTTCG 55 80 

GCTCTACCAA CTTTTTCTGA GCAGATTGGA CAAAAGCTCT CTTATCACCA TGATGAAGTA 564 0 

GTCATTACAG TGGAACCCTT TATGCTAGAA AAAAATGGAG TTTTGGCTAA TGGTGTATTG 57 00 

GCAGAAATTC TGACCCACAT GACCCGATTT AGTTGTT AAT AGTGGTCGCA ATCTCATTAT 57 60 

CGAGCAGATG CTGATCTACT TTTTGCAGGC TGTTCAGATA CATGATATAT TGCGCATTCA 5820 

GGCACGGATT ATTCATCATA CGAGACGGTC AGCTATAATT GATTACGATA TTTATCATGG 58 80 

TCACCAGATT GTTTCAAAAG CAAATGTGAC TGTTAAAATT AATTAGAAAC TAGGAGAAAA 59 4 0 

GATGATAACA TTAAAATCAG CTCGTGAAAT CGAAGCTATG GACAAGCCTC GTGATTTTCT 6000 

AGCAAGTATT CATATAGGCT TACCTCATTT GATTAAGCCA GGCGTAGATA TGTGGGAAGT 6060 

TGAAGAATAT GTCCGCCGTC GTTGTAAAGA AGAAAATTTC CTTCCACTTC AG ATT GGGGT 6120 

TGACGGTGCC ATGATGGACT ATCCTTATGC TACCTGTTGC TCTCTTAACG ATGAAGTGGC 6180 

TCACGCTTTC CCTCGTCATT ATATCTTGAA AGATGGTGAT TTGCTCAAAC TTCAT ATGGT 62 40 

TTTGGGAGGT CCCATTGCTA AATCTGACCT AAATGTCTCA AAATTAAACT TCAACAATGT 63 00 

TGAACAAATG AAAAAATACA CTCAGAGCTA TTCTGGTCCT TTACCAGACT CATCTTGGGC 6 3 60 

TTATGCTGTT GGTACACCGT CCGAAGAAGT CAAAAACTTG ATGGATGTAA CCAAAGAAGC 64 20 

TATGTACAAG GGTATTGAGC AAGCTGTTGT TGGAAATCGT ATCGGTGATA TCGGTGCGGC 64 80 

TATTCAAGAA TACGCTGAAA GTCGTGGTTA CGGTGTAGTG CCTGATTTCG TTGGTCATGG 654 0 

TGTTGGCCCA ACTATGCACG AAGAACCAAT GGTTCCTAAC T ATGGT ATTG CAGGTCGTGG 6 600 

ACTCCGTCTT CGTGAAGGAA TGGTCTTAAC CATTGAACCA ATGATCAATA C AG G CG ATTG 6 6 60 

GGAAATTGAT ACAGATATGA AAACTGGTTG GGCGCATAAG ACCATTGACG GTGGATTGTC 672 0 

ATGTCAGTAT GAACACCAAT TTGTCATTAC GAAAGATGGA CCTGTTATCT TGACTAGCCA 67 80 

AGGTGAAGAA GGAACTTATT AATAAAAAGT GAAAAGACTA CTGGAAGTTT ATTTTGATAA 68 4 0 

AAAATCCAGT AGATCTTTTC ATAATAAAAC GCATTGTATC AAGTGTTAGG GGCTGATATC 6 900 

ATGCGTTTTT CTGCTTTTAA GATTTTTTCC AACTCTGTTT . GTAAGCGCAT CAT AA C AAAG 6960 
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GGTCTAGGAT TCAGGGCTCT CCTCCTATAT ACT ATT AGT A AAGTAAAACT AAGGGAGGAT 7 020 

ATTTTAGTGT CGCAGTCTAT TGTTCCTGTA GAG AT T C C AC AATATTGTCG TTTTGATTCT 7 080 

AAAAAGAGAA ATGGAA7TCT GTTTAATGTT CGTATTGCCA ATCTT AAATT TACTTTTTTA 7140 

TATTATACTT CCTGCGAAAC AAAATATGGT ATAGTAGTTC TATGAATGAT GAAGCAAGTA 7 2 00 

AACAACTAAC TGATGCACGA TTTAAGCGTC TTGTTGGTGT TCAGCGTACC ACTTTTGAAG 72 60 

AGATGTTAGC TGTATTAAAA ACAGCTTATC AACTTAAACA CGCAAAAGGT GGACGAAAAC 7 3 20 

CT AAATT AAG CCTAGAAGAC CTTCTTATGC CCACTCTTCA ATAGTGCGAG AATATCGAAC 7 3 80 

TTATGAAGAA ATTGCGGCTG ATTTTGGTAT TCACGAAAGC AACTTTATCC GTCGGAGCCA 7 440 

ATGGGTTGAA ATAACTCTTG TTCAAAGTGG TTTTACGGTT TCAAGAACTC CTCTCAGTTC 7 500 

TGAGGACACG GTAATGATTG ATGCGACGGA AGTAAAAATC AATCGCCCTA AAAAAACAAT 7 5 60 

TAGCGAATGA TTCTGGTAAA AAGAAATTTC ACGCTATGAA GGCTCAAGCG ATTGTCACAA 7 620 

GTCAAGGGAG AATTCTTTCT TTGGATATCG CTGTGAACTA TAGTCATGAT ATGAAGTTGT 7 680 

TCAAAATGAG TCGTAGAAAT ATCCAACAAG CTGGTAAAAT CTTGGCTGAC AGTGGTTATC 77 4 0 

AAGGGCTCAT GAAGATATAT CCTCAAGCAC AAACTCCACG TAAATCCAGC AAACTCAAGC 7 8 00 

CGCTAACAGC TGAAGATAAA GCCTATAACC ATGCGCTATC TAAGGAAAGA AG C AAGGTT G 7 8 60 

AG AAC ATCTT TGCCAAAGTA AAAACGTTTA AAATATTTTC AACAACCTAT C G AAAT CAT C 7 9 20 

GTAAACGCTT CGGATTACGA ATCAATTTGA GTGCTGGTAT TATCAATCAT GAACTAGGAT 7 980 

TCTAGTTTTG CAGGAAGTCT ATTGAGGTAT TGAGCTACTT TATGAAAAAA TTGGCTGAAA 804 0 

AGTCGAGTGT TTTAGAAACC CACAGTGTAC TATTCTAGTT TCAATCCACT ATATTTTGCT 8 100 

ACTCCCCGTA AAGTTTCTAT TTTCCCTGAT TTCTGATATA ATAGAAATAT TGACTTCAAG 8160 

AGTAAGGAAG AGAAGATGAA CGCATTATTA AATGGAATGA ATCACCGTCA GGCTGAGGCG 8 220 

GTGCAAACGA C AG AAGGTC C CTTCCTAATC ATGGCAGGGG CTGGTTCTGG AAAGACTCGT 8 230 

GTTTTGACCC ACCGTATCGC TTATTTGATT GATGAAAAGC TGGTCAATCC TTGGAATATC 8 34 0 

TTGGCCATTA CCTTTACCAA CAAGGCTGCG CGTGAGATGA AAGAGCGTGC TTATAGCCTC 84 00 

AATCCAGCGA CTCAGGACTG TCTGATTGCG ACCTTCCACT CCATGTGTGT GCGTATTTTG 84 60 

CGTCGCGATG CGGACCATAT TGGCTACAAT CGTAATTTTA CAATTGTGGA TCCTGGTGAA 85 2 0 

CAGCGAACGC TCATGAAACG TATTCTCAAA CAGTTGAACT TGGACCCTAA AAAATGGAAT 8 5 80 

GAACGAACTA TTTTGGGGAC CATTTCCAAT GCTAAGAATG ATTTGATTGA TGATGTTGCT 8640 

TATGCTGCCC AAGCTGGCGA- TATGTATACG CAAATTGTGG CCCAGTGTTA TACAGCCTAT 87 00 
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CAAAAAGAAC TTCGTCAGTC TGAATCCGTT GACTTTGATG ATTTGATTAT GCTGACCTTG 87 60 

CGTCTCTTTG ATCAAAATCC TGATGTTTTG ACCTACTACC AGC AAAAATT CCAATACATC 8 820 

CACGTTGATG AGTACCAAGA TACCAACCAC GCTCAGTACC AATTGGTCAA ACTCTTGGCT 8 880 

TCCCGTTTTA AAAATATCTG TGTGGTTGGG GATGCGGACC AGTCTATCTA CGGTTGGCGT 8940 

GGTGCTGATA TGCAGAATAT CTTGGACTTT GAAAAGGATT ACCCCAAAGC CAAGGTTGTT 9000 

TTGTTGGAGG AAAATTACCG CTCAACCAAA ACCATTCTCC AAGCGGCCAA CGAGGTTATT 9060 

AAAAATAATA AAAATCGCCG TCCTAAAAAT CTCTGGACTC AAAACCCTGA TGGGGACCAA 9120 

ATCGTTTACT ATCGTGCCGA TGATGAGCTG GATGAGGCTG TATTTGTAGC CAGAACCATC 9180 

GATGAACTTA GTCGCAGTCA AAACTTCCTT CATAAGGATT TTGCAGTTCT CTATCGGACT 9 24 0 

AATGCCCAGT CCCGTACAAT TGAGGAAGCC CTGCTCAAGT CTAACATTCC TTATACCATG 9 3 00 

GTTGGCGGAA CCAAATTCTA CAGCCGTAAG GAAATTCGCG ATATTATTGC TTATCTCAAC 9 3 60 

CTTATTGCTA ATTTGAGTGA CAATATTAGT TTTGAGCGTA TTATCAACGA GCCTAAACGT 9 4 20 

GGAATTCGTC T AGGT AC ACT TGAGAAAATC CGTGATTTTG CAAATTTGCA AAATATGTCT 94 80 

ATGCTGGATG CTTCTGCTAA T ATT ATGTTG TCTGGTATCA AGGGTAAGGC AGCCCAATCT 9 54 0 

ATCTGGGATT TTGCCAATAT GATGCTTCAT TTGCGGGAGC AGCTAGACCA CTTAAGCATT 9600 

ACAGAGTTGG TTGAGTCCGT CCTAGAAAAA ACAGGTTATG TCG AT ATTCT TAACTCCCAA 9 6 60 

GCGACTCTAG AAAGCAAGGC ACGGGTTGAA AATATCGAAG AGTTTCTTTC TGTTACGAAG 97 20 

AACTTTGATG ACACCACGGA TGTGACAGAA GAGGAAACTG GTCTGGACAA ACTGAGTCGT 97 80 

TTCTTAAATG ACTTGGCTTT G ATTGCC G AC ACAGATTCAG GTAGTCAGGA GACATCAGAA 9840 

GTGACCTTGA TGACCCTGCA TGCTGCCAAA GGTCTCGAAT TTCCAGTTCT CTTTTTGATT 9900 

GGGATGGAAG AAAATGTCTT TCCACTTAGT CGTGCGACTC AAGATTCAGA TGAATTAGAA 996 0 

GAAGAGCGCC GTCTAGCCTA TGTAGGTATC ACGCGTG C AG AG AAAATT CT CTATCTGACC 10020 

AATGCCAACT CACGCTTGCT TTTTGGTCGT ACCAATTATA ACCGTCCGAC TCGTTTTATT 10080 

AACGAAATCA GTTCAGACTT GCTTGAGTAT CAAGGTCTGG CTCGTCCTGC AAATACAAGC 1014 0 

TTTAAGGCAT CAT AT AG C AG TGGTAGTATT TCCTTTGGTC AAGGTATGAG TTTGGCTCAC 1C200 

GCTCTTCAAG ACCGTAAACG CGGTGCTGCC CCAAAATCAA TCCAGTCAAG CGCTCTTCCA 102 60 

TTTGGTCAAT TTACAGCTGG CGCAAAACCA GCATCTAGCG AGGCAAATTG GTCCATTGGT 103 2 0 

GATATTGCTC TCCACAACAA ATGGGGAGAG GGAACCGTTC TGGAAGTTTC AGGTAGCGGT 103 80 

GCTAGGCAGG AATTGAAAAT CAATTTCCCA GAAGTAGGTT TGAAAAAACT TTTAGCCAGT 10440 

GTGGCTCCAA TTGAGAAAAA AATCTAATTT TCCATCCTTC TCACGAATAA TAAAGTGAGG 10 50C 



WO 98/18931 



PCI7US97/19588 



189 

AGGATTTTTA TGTACAGTAT TTCATTCCAA GAAGATTCAC TATTACCAAG AGAAAGGCTG 10560 

GCCAAGGAAG GAGTTGAAGC GCTTAGTAAC CAAGAGTTGC TAGCTATTTT ACTCAGGACA 10 620 

GGAACACGTC AAGCTAGCCT TTTTGAAATT GCCCAAAAAG TCTTGAACAA TCTTTCAAGC 10 680 

CTAACGGATT TGAAAAAAAT GACCCTGCAG GAATTCCACA GTTTGTCTGG TATTGGGCGT 10740 

GTTAAGGCCA TAGAATTACA AGCTATGATT GAACTGGGGC ATCGTATTCA CAAACACGAG 10800 

ACTCTTGAAA TGGAAAGTAT TCTCAGCAGT CAAAAGTTGG CCAAGAAGAT GCAGCAGGAA 10860 

TTAGGGGATA AAAAACAAGA GCACCTGGTG GCACTCTATC TCAATACTCA AAATCAAATC 10920 

ATCCATCACC AGACCATTTT TATCGGGTCT GTAACTCGTA GTATCCCTGA ACCGCGAGAG 10980 

ATTCTTCACT ATGCAATCAA GCATATGGCG ACTTCTCTTA TCTTGGTCCA CAATCATCCT 11040 

TCAGGAGCGG TAGCGCCTAG CCAAAATGAT GATCATGTCA CTAAACTTGT TAAAGAAGCC 11100 

TGCGAATTGA TGGCGATTCT TCTCTTGGAC CATTTGATTG TCTCTCATTC TAATTACTTT 11160 

AGTT ATCGTG AAAAGACAGA TTTAATCTAA AGTTCATTAA CGACATAGTC AAAGAGTTTT 11220 

TTATCTTTGG GACGATTTTC AAAAAGAAGT TCTGGATGCC ATTGGACACC GAGAAAGGCG 112 80 

ACATCATCCG TACTCATGAC AGCCTCAATG ATACCATCTT TACGATCATG AGCCACAACT 11340 

TTTAAATTTG GTGCTAAGTC CTTGATGCTC TGGTGGTGGA AGGAGTTGAT ATGAGAGATT 11400 

TCTCCATAGA TTTCTTGGAG AACGGTATCT GGTTCTGTTA CCAAGCGTTG AGTTGTGTAC 11460 

TCAACAGAAC AATCCTGCCA ATGGTCTT CG ATATCTTCGT ACAAAGTTCC ACCCATGCCA 11520 

ACGTTAAAGA GTTGCGTACC ACGGCAGACA GAGAAAATGG CCTTTTTCTG TTTAATAGCT 11580 

TCCTTGATGA GGGCCAGTTC GAAGATATCT CTTTGAAGCT GATAGTCATC ACTATCAATG 11640 

GTTTTGGGTT CGCCATAAAA TTTTGGATCG ACATTTTGCC CACCTGTCAA GATCACCTTC 11700 

TCAATCAAAC TGATATAGTG GCAGGCCATT TCTTGATCAC CAATCCGTAC GATGATGGGA 11760 

ATCCCTCCAG CATCTTTAAC GCCTTCAACA AAGCCTTTTG CTGCGTAGCT CATCATGATG 11820 

TCATCATCTG GATGAGTTTT TTCGTTTCCT GTAATCCCAA TAACTGGT— TTTCATAAAA 11880 
TGATTTTCGC TTTCTAATCC TCTTTTCGCA TGAAGTAGAG GAGGGTTTGG AGTTCACTTG 11940 
TCAAATCGAC ATACTGAACG ACCACGTCTT TTGGTAAATG CAGATGGACT GGTGAAAAAC 12O00 
TGAGAATTCC TTTCACACCA GCATCAACCA AGAGATTAGC AAC CTCTTGT GACTTGACGC 120 60 

TGGGAACAGT TAGGATAGCA GTCTTCACAT CAGCATCCTT GATTTTATCC TTGATCTGAG 12120 
AAATCCCGTA AATGGGAATC • CCGTCAGGAG TTTGGGTACC GACTTCAGGA TGGTCGTCTA 12130 
GGTCAAAGGC CATGATAATC TTCATCTTGT T AC GTTCGTG GAAGCGGTAG TGGAGAAGGG 12240 
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CATGGCCCAT ATTTCCAATA CCAACCAGCA TGACATTGGT AATAGAGTTG TCATTGAGCA 123 00 

AATCGGCAAA AAATGTCATT AGTTTTTTGA CATC AT AG CC AAAACCACGA CGACCAAGTT 123 60 

CACCAAAATA GGAAAAATCA CGACGTACGG TCGCTGAATC AATACCGATA GCCTCTGCAA 124 20 

TTTGCTT AG A GTTGGCACGT TCAATC TTTT CTGCATGAAA TCTCTTAAAA ATTCGATAGT 124 80 

AGAGAGAGAG TCTTTTTGCT GTAGCTTTTG G AAT AGC AAA CTGTTTATCT TTCACAAAAT 1254 0 

CACAACCTTT CTATTCTTCT ATTTTATAGA AAC ATTGTG A AAAAATCAAC AAAAATAAGA 12600 

AAAAACTAAG AAAAATCTTA GTTTTGATGT AAAAAATCTG CATGAGATAG AAAACGGTAG 12 660 

AGGTCTCCGA CCAGCCCCTG ATAAACTTTT TTGCCCCTAA AAGT C AG AG A AGTCACATAA 127 20 

AGTGTATCTG GTAAGGTTAC ACATCCTGAC AAAGTCAACA TGAGAGCCTC ATGATCCTCA 127 80 

TACTTGAGAC TACGCTCTAC ATGATAGCAG TCCTTATACC TCAGTTCAAA CATTTTGGCT 12840 

CTATCTTTCC GATTTTGTAA AGACACCACG TTCTACCAAG CTATCCATGA GGAAGTAGAA 12900 

TTTTTCCTGA TGAATATGGT GGTCTTCTGA TTTGAAAATA TCAACTAGAC GAAGGCCAAA 12960 

CTTGTCAGTG ATATTGATTT TAGCCCCTGT AAGTTCCTTG TTAATGATCA TTTTGAGTTG 1302 0 

CAAGCCTTCA CCGCTGTTTG GCACTTTTTC CAAAAGGCGA GTCAGTTCAT AGTTACCAAC 13080 

CTTAGTTTCA AAAAAGGTGT TATCTTTGAG GGTGAATTTT TTAACAGAAG GGCTAAGAGT 13140 

GTAATCGTAA CGACAATTTT TTAACTGAAT GATTTTTTCA AATGCCATAT GGCTAACCTC 13200 

CGATAATTTC TTTTAAGGTT TTTGCGAGGG TTTGTAGGTC TTCAACGGTA TTTTGTGGCG 132 60 

ACAAACTGAT GCGAAGGGAT TCCTTCAAGC GTTCTGAATT TGCGCCATAC ATGGCTTCAA 13 3 20 

GAACATGGCT GGATTGGACA ACCCCTGCAG TACACGCTGA GCCAGTAGAG ATTGAAATTC 13 3 80 

CAGCTAAATC TAGCCGAAGG AGTAAGAGGT CATTTTTCTC ACCAGGAAAT C C AAT AT TG A 134 40 

GAACATAAGG GAGATGATGT TTTCCTCTAT TCAGGTAATA CTGAATGCCC TCCAGCTCTG 13 500 

CCAGAAAGGC AGTTTCTAGA TTTTGTACAT GTTGAAAATG TTCTTCTTGT TTTTCTAGGT 13 560 

CTTCTTTTAG GGCTGCAACC ATGCCTACAA TGGCAGGCAG ATTTTCAGTT CCTGCACGTT 13 620 

TTTTCTGTTC CTGGTCTCCG CCATGTAGAT AGGAATCAAA GTCCATGCTA GATGCGTAGA 13 680 

GAAAACCGAT TCCCTTAGGA CCATGGAATT TGTGGGCAGA AGCAGTGAGA AAATCAATGC 137 40 

CCAATTCTTC TGAATGAATT GGGATTTTAC CAATAGCCTG AACTGCATCA ACATGATAGG 13 800 

CAGCAGGGTG TTGCTTGACT ATTTGGCCAA TTTCAGCGAT GGGCAGTAGG TTTCCTGTCT 13 8 60 

C ATT ATT G AC AAAC ATGGT A GAAACCAAAA TCGTATCGTC AC GT AAAGCC TTTTGAATTT 13 920 

GCTCGGCTGT G ATTTCTTG A TTTTCTGGCT GGATAATGGT TGCTTCAAAC CCAAAGTGTT 13980 

G AAC C AAGT A ATCAATTGTT TCAAGGACAG CATGGTCCTC GATGGCAGTT GTGATGATAT 14 040 
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GTTTTCCTTG 


TTCTTGGTGA 


CGAACACAGT 


AGCCAATGAT 


GGTAGTATTA 


TTGCCTTCAG 


14100 


TCCCACCAGA 


AGTGAAAAAG 


AT ATGTT GAG 


GTTTTGTCCT 


TAGTAACTGG 


GCTAGTTCCT 


14160 


GACGGGCTTC 


TCGCAAGAGT 


TTGCCAGCTT 


GACGACCATG 


ACCATGAATA 


CTAGAAGGAT 


14220 


TTCCGTGGGT 


TTCTTGCATA 


ACCTTGGTCA 


TAGCTGAAAT 


AGCAACTGCT 


GACATAGGAG 


14280 


TCGTTGCAGC 


ATTGTCCAAA 


T AAATC AAAG 


AATCACCTTA 


TTTCTTTTTA 


TTGTAGGCAA 


14340 


AGAGTGCGCT 


GACTGGTTTT 


CTTTCGTGAA 


TACGGACGAT 


AGCATCACCA 


ATTAACTCAC 


14400 


TAGCAGTGAT 


GTAGCATACA 


TTTTTAGGAG 


TTTTTTCTTT 


TGTTGCTACT 


GAATCAGTCA 


14460 


CAAGAATTTC 


TTTAATATTA 


GTATTGTCAA 


GAAGCTCAGC 


AGCTCCCTCG 


ACG AAG AG AC 


14520 


CGTGGCTAGA 


AACAGCATAA 


ATTTCTGTAG 


CTCCTTCACG 


TTCAACGATT 


TTAGAAGCTT 


14580 


CAGAGAACGT 


ACGTCCTGTA 


TTTAAAATAT 


CATCAATCAA 


GATAGCTTTC 


TTACCTTCAA 


14640 


CATCACCAAT 


AATATAACCT 


TCGTTACGAG 


TTGCATCGTC 


TTG AGGGT AG 


TCGATAATGG 


14700 


CGATAGGAGC 


AT C AAG AT AT 


TCAGCCAGGC 


TACGCGCACG 


TTTGACACCT 


GAATTTTTAG 


14760 


GGCTAACGAC 


AACAACATCT 


GAACCAAGCA 


ATCCTTTATC 


GCAGTAATGT 


TTTGCGAATA 


14320 


GGGGAACAGT 


GAAAAGATTA 


TCCACTGGAA 


TATCAAAGAA 


ACCTTGAACC 


TGAACGGCAT 


14880 


GCAAATCAAG 


AGTCAGGATA 


CGATCAACTC 


CAGCCTTAAC 


CAGCATATTG 


GCAACTAGTT 


14940 


TTGCTGTAAG 


TGGCTCACGA 


GGACAAGCAA 


TGCGGTCTTG 


ACGTGCATAG 


CCAAAATATG 


15000 


GAAGGACAAC 


GTTGATACTG 


TGGGCACTTG 


CACGCACACA 


AGCATCGACC 


ATGATTAACA 


15060 


ATTCCATTAG 


GTGGTTGTTG 


ACAGGGAAAC 


TTGTTGATTG 


CATGATGTAA 


ACATCATAAC 


15120 


CACGGACACT 


TTCTTCGATA 


TTTACTTGGA 


TTTCTCCGTC 


TGAAAATTGA 


CGTGATG AT A 


15180 


GTTTTCCAAG 


TGGGACACCA 


ACAGCTTGGG 


CAATTTTTTG 


TGCAATCTCT 


TGGTTAGAGT 


15240 


TGAGTGCGAA 


AAGTTTCATG 


' i"I Wf ** * I^C*** AT 


CTGACATTA? 


AGACCGTCCT 


CTGTAAACTT 


15300 


1A1 /w\ 1 ^.V— I 


AflTTATATTT 

r\ k_> link i * i 


ACCTTACATA 


TATGAACTGG 


GATTTGTGTA 


T rn TTTATCTT 


15360 


TTCTATTTTA 


CCAAAAAATG 


GAGATTATTT 


CAGCTATTTT 


TCATACTTTT 


GACAAATCGA 


15420 


AC C AATTTTG 


AAGGAGCTTT 


TTGATAGGAA 


ATCTGATTTT 


TCTCTAAAAA 


TTGTCGAAAA 


15480 


TCCTGTTTGC 


CTTGCTCATG 


ATTTTCCACT 


TCAAGCTCCA 


ATTCGTAATC 


TGTTATATCA 


15540 


AAGTATCGGC 


TCTGATCCAG 


TGCCATGAGA 


CCAATAGCTG 


TTTTCATTTC 


ATAGCGAAGC 


15600 


GTTGTTAGAC 


AACCAAGAAC 


CTGCCAGTTC 


TTACTTTGGA 


T AC C ATGTTT 


CGCCAATTCA 


15660 


TCCAGTACTA 


GCCCTTGAGG 


AAGTTCTTCC 


TTACTCAGAT 


AGTTCTCAGC 


ATCTTTTAGT 


15720 


TGCAATTTTT 


GGTTGTATTC 


CATGTTTCCA 


ACACTCTGCG 


GGACTTTGAG 


; TGTCAACTCA 


15780 
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GCCCAGTCTT CAAAGGTTCG AATCCGCATA GCGACTTTCT TTTe^K. ,-CAG TTCAAAATCA 15340 

GGCGTGTCGA TGTAGTAATT TGTTTGAAGA ACAGGAGTGA CACCTGTGAA CTGGTCTTTT 15900 

AGACGATTGT ATTCATCTTT TTTCAATAGT GTTTTCAATT CAATTTCTAA ATCTTTCATT 15960 

TTTCTTACCT TTTTTTATCG TTGAAAGCGG ATTTATGGTA TAATAAGCAT TGTATTTATT 16020 

GTATATGAAT CTGGAGAAAA AATCAAAGAT ATTTTTGACG GATAATATGA GAACAAGGGA 16080 

GAATATATGA CCTTAGAATG GGAAGAATTT CTAGATCCTT ACATTCAAGC TGTTCGTGAG 1614 0 

TTAAAGATTA AACTTCGTGG TATTCGTAAG CAATATCGTA AGCAAAATAA GCATTCTCCA 16 200 

ATTGAGTTTG TGACCGGTCG AGTCAAGCCA ATTGAGAGCA TCAAAGAAAA AATGGCTCGT 16260 

CGTGGCATTA CTTATGCGAC CTTGGAACAC GATTTGCAGG ATATTGCTGG CTTACGTGTG 16 320 

ATGGTTCAGT TTGTAGATGA CGTCAAGGAA GTAGTGGATA TTTTGCACAA GCGTCAGGAT 163 80 

ATGCGAATCA TACAGGAGCG AGATTACATT ACTCATAGAA AAGCATCAGG CTATCGTTCC 164 40 

TATCATGTGG TAGTAGAATA TACGCTTGAT ACCATCAATG GAGCTAAGAC TATTTTGGCA 16500 

GAAATTCAAA TTCGTACTTT GGCCATGAAT TTCTGGGCAA CGATAGAACA TTCTCTCAAC 16560 

TACAAGTACC AAGGGGATTT CCCAGATGAG ATTAAGAAGC GACTGCAAAT TACAGCTAGA 16620 

ATCGCCCATC AGTTGGATGA AGAAATGGCT GAAATTCGTG ATGATATCCA AGAAGCCCAG 156 80 

GCACTTTTTG ATCCTTTGAC TAGAAAATTA AATGACGGTG TAGGAAACAG TGACGATACA 167 40 

GATGAAGAAT ACAGGTAAAC GAATTGATCT GATAGCCAAT AGAAAACCGC AGACTCAAAG 16 800 

GGTTTTGTAT GAATTGCGAG ATCGTTTGAA GAGAAATCAG TTTATACTCA ATGATACCAA 16 860 

TCCGGATATT GTCATTTCCA TTGGC GGGG A TGGTATGCTC TTGTCGGCCT TTCATAAGTA 16920 

CGAAAATCAG CTTGACAAGG TCCGCTTTAT CGGTCTTCAT ACTGGACATT TGGGCTTCTA 169 80 

TACAGATTAT CCTGATTTTG AGTTGGACAA GCTAGTGACT AATTTGCAGC TAGATACTGG 17 040 

GGCAAGGGTT TCTTACCCTG TTCTGAATGT GAACGTCTTT CTTGAAAATG GTGAAGTTAA 17100 

GATTTTCAGA GCACTCAACG AAGCCAGCAT CCGCAGGTCT GATCGAACCA TGGTGGCAGA 17 160 

TATTGTAATA AATGGTGTTC CCTTTGAACG TTTTCGTGGA GACGGGCTAA CAGTTTCGAC 17 2 20 

ACCGACTGGT AGTACTGCCT ATAACAAGTC TCTTGG CGGT GCTGTTTTAC ACCCTACCAT 172 80 

TGAAGCTTTG CAATTAACGG AAATTGCCAG CCTTAATAAT CGTGTCTATC GAACACTGGG 17 340 

CTCTTCCATT ATTCTGCCTA AGAAGGATAA GATTGAACTT ATTCCAACAA GAAACGATTA 174 00 

TCATACTATT TCGGTTGACA AT AG CGTTT A TTCTTTCCGT AATATTGAGC GTATTGAGTA 174 60 

TCAAATCGAC CATCATAAGA T T C ACTTTGT CGCGACTCCT AGCCATACCA GTTTCTGGAA 17 5 20 

CCGTGTTAAG GACGCCTTTA TCGGCGAGGT GGATGAATGA GGTTTGAATT TATCGCAGAT 17 580 
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GAACATGTCA 


AGGTTAAGAC 


CTTCTTAAAA 


AAGCACCAGG 


TTTCTAACGG 


ATTGCTGGCC 


17640 


AAGATTAAGT 


TTCGAGGTGG 


AGCTATTCTG 


GTCAATAATC 


AACCGCAAAA 


TGCAACGTAT 


17-700 


CTATTGCACG 


TTGGAGACTA 


CGTTACCATT 


GACATTCCCG 


CTGAGAAAGG 


CTTTGAAACC 


17760 


TTGGAGGCTA 


TTGAGCTTCC 


ATTAGATATT 


CTCTATGAGG 


ATGACCACTT 


TCTAGTCTTG 


17820 


AATAAACCCT 


ATGGAGTGGC 


TTCTATTCCT 


AGTGTCAATC 


ACTCTAATAC 


CATTGCCAAT 


17880 


TTTATCAAGG 


GTT ACT ATGT 


CAAGCAAAAT 


TATGAAAATC 


AGCAGGTTCA 


CATTGTTACC 


17940 


AGACTAGATA 


GGGATACTTC 


TGGCTTGATG 


CTCTTTGCCA 


AGCACGGTTA 


TGCCCATGCA 


18000 


CG ATT AG AC A 


AG C AGTTG C A 


GAAGAAATCT 


ATCGAGAAAC 


GCTACTTTGC 


TTTGGTTAAG 


18060 


GGAGATGGAC 


ATTTGGAGCC 


AGAAGGGGAA 


ATTATTCCTC 


CGATTGCGCG 


TGATGAAGAT 


13120 


TCCATTATTA 


CCAGACGAGT 


GGCTAAAGGC 


GGAAAGTATG 


CCCATACTTC 


ATACAAGATT 


18130 


GTAGCTTCTT 


ATGGAAATAT 


TCACTTGGTC 


TATATTCACC 


TGCACACTGG 


TCGAACCCAT 


13240 


CAAATCCGAG 


TCCATTTTTC 


TCATATCGGT 


TTTCCTTTGC 


TGGGAGATGA 


TTTGTATCGT 


18300 


GGTAGTCTCG 


AAGATGGTAT 


TCAACGTCAG 


GCTCTGCATT 


GCCATTACCT 


ATCCTTTTAT 


18360 


CATCCATTTT 


TAGAGCAAGA 


CTTGCAGTTA 


GAAAGTCCCT 


TGCCGGATGA 


TTTTAGTAAC 


13420 


CTTATTACCC 


AGTTATCAAC 


TAATACTCTA 


TAAAAACTGT 


CTCAGAGTAT 


AATTATTATC 


13480 


TTAAAGGAGA 


AAACTCATGG 


AAGTTTTTGA 


AACTCTC AAA 


GCCAACCTTG 


TTGGTAAAAA 


18540 


TGCTCGTATC 


GTTCTCCCTG 


AAGGGGAAGA 


GCCTCGTATT 


CTTCAAGCAA 


CAAAACGCTT 


18600 


AGTAAAAGAA 


ACAGAAGTGA 


TTC-TGTTTT 


GCTTGGAAAT 


CCTGAAAAAA 


TTAAAATTTA 


18660 


TCTTGAAATT 


GAAGCAATCA 


TGGATGGTTA 


TGAGGTCATC 


GACCCTCAAC 


ATTATCCTCA 


18720 


ATTTGAAGAA 


ATGGTTTCTG 


CCTTGGTGGA 


GCGTCGCAAG 


GGCAAAATGA 


CTGAAGAAGA 


13780 


TGTACGCAAG 


GTTTTGGTTG 


AAGATGTCAA 


CTACTTTGGT 


GTGATGTTGG 


TTTACTTGGG 


18840 


TTT^GTTCAT 


GGAATGGTGT 


CAGGAGCGAT 


TCACTCAACA 


CCTTCAACAC 


TTCGCCCAGC 


18900 


TCTACAAATC 


ATCAAAACTC 


GTCCAAATGT 


AACTCGTACT 


TCAGGAGCCT 


TCCTCATGGT 


18960 


TCGTGGTACG 


GAACGTTACC 


TATTTGGAGA 


CTGTGCCATT 


AACATCAATC 


CAGATGCAGA 


19020 


AGCCTTGGCT 


GAAATTGCCA 


TCAACTCAGC 


AATCACACCT 


AAGATGTTTG 


GCATCGAACC 


19080 


TAAAATTGCC 


ATGTTG AG CT 


ATTCTACTAA 


AGGTTCAGGG 


TTTGGTGAAA 


GCGTTGATAA 


19140 


GGTCGTTGAA 


GC AACTAAAA 


TTGCTCACGA 


CTTGCGTCCT 


GACCTTGAAA 


TCGATGGTGA 


19200 


GTTGCAATTT 


GATGCAGCCT 


TTGTTCCTGA 


AACTGCACC7 


CTGAAAGCTC 


CTGGAAGTAC 


19250 


' GGTAGCTGGT 


CAAGCAAATG 


TCTTCATCTT 


CCCAGGTATC 


GAGGCAGGAA 


ATATTGGTTA 


19320 



WO 98/18931 



PCT/US97/19588 



194 

CAAGATGGCT GAACGCCTGG GTGGCTTTGC GGCTGTAGGA CCTGTTTTGC AAGGTTTAAA L93 80 

CAAGCCAGTT AATGATCTTT CTCGTGGATG T AATG C AG AT GATGTTTACA AGTTGACCCT 19440 

CATCACAGCA GCTCAAGCAG TTCATCAATA GTGAAAACTA TAAAGTGATA TACTATGCTA 195 00 

TACTGTAGTT ATGAAACTAT GTACGAAAAG CACTGCCATT AATTCCTGAG AACTAAATTA 195 60 

CTGATTGGTG TCAAAAAGGA AAACTTCCAA GCGATGATAT CCTGTCTATA CACGACCTAT 19620 

AGAAATCTGT AATATACATA TCCGTAAAAC GATAAATTCC CTTTTTGATT TTAAATGAGT 19680 

ATGAAAAGAG AATTTTTTGG CTCTTTGTCA ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC 19740 

GAGAAAGGAC AAATTTCATC CTTTCTTTTT TGATATTCAG AGCGATAAAA ATCCGTTTTT 198 00 

TGAAGTTTTC AAAGTTCCGA AAACCAAAGG CATTGCGCTT GATAACTTTC ATGAGATTAT 198 60 

TGGTCGCTTC CAGTTTGGCG TTAGAATAGT GTAGTTGAAG GGCGTTGATA ATCTTTTCTT 19 920 

TATCTTTGAG GAAGGTTTTA AAGACAGTCT GAAAAATAGG ATGAACCTGC TTAACATTGT 19980 

CCTCAATAAG TCCG AAAAAT TTCTCTGGTT CCTTATTCTC GAACTGAAAA AGCAAGAGTT 2 0040 

GATAGAGCTG ATAGTGGTGT TTCAAGTCTT CCGAATAGCT C AAAAGCTT G TTTAAAATCT 2 0100 

CTTTATTGCT TAAGTGCATA CGAAAAATAG GACGATAAAA TCGCTTATCA CTCAGTTTAC 20160 

GGCTATCCTG TTGAATGAGT TTCCAGTAGC GCTTGATAG 20199 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACCCGATGTA TCAGCGGATA TTTACTCTAT TTTTCAAACG ATGTTATACC CACAATAAAA 60 

GAAAAAAGAC CCTAAGGTCT CCTTTGCTTT T ATT ATT AAA CGCGTTCAAC TTTACCTCA7 120 

TTCAAAGCAC GAGCTGAAGC CCAAACTTTT TTAGCTTTAC CATCGATAAG AACAGTAACT 18 0 

TTTTGAAGGT TTGCTTTTAC GGCACGTTTT GTTTGGTTCA TCGCGTGTGA ACGGTTGTTT 2 40 

CCTGATACAG TCTTACGACC TGTAAAGTAA CATACTTTAG CCATTGTGTT TTCCTCCTAT 3 00 

TAGATCTAAT ATAGCGGATG TGCTAGCACC ACATACCGTA CTATGTTATC ACATTTTCTT 3 60 

GTTTTTTGCA AGGGAATTGG AAGATTTTTT ATTTGTGTCT TAAATCAGCT CTTGCGTGAC 4 20 

ATTTcTGCTC TCCACATGCC ATCGTTGATT AACAGAACAC CAGAATTAAA ATTATGTGTA 4 80 

TAAAAATCAT CTCTAACTGC AG CT AAGGGT ATAGCCGTCA AGTCCAAATC CCACAGCTCA 54 0 
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TCTATCGATT 


TTCTTACAAC 


AATATCTGAA 


TCCAAATACA 


GTACACGAGA 


CTCGCTTACA 


600 


TACTTTGGAA 


TAAAATACCT 


AAAAAAGCCG 


CATATGAAAG 


TCCCTCAAAG 


GGGAGACGAT 


660 


AACCTTTCAG 


AATATTACTG 


TCAATCTAAA 


CATTCACAAT 


CTCACTATTC 


AAAGTCTCTA 


720 


GTCTTTTTTC 


CATCAATTGG 


AACCATTCTC 


GCGGAAGGTC 


ATCATTAAAA 


ACATAAAACT 


780 


TAAGATTATA 


ATGATGAACA 


CAAAGAGATT 


TTATTGTTGT 


TTCAACTTTA 


TCCATATAAG 


840 


CATTATCTGC 


ACCTAAGACA 


ATCGCTTTTT 


TCTCTTCTTT 


C ACT TTTT AT 


CTCATTTCTT 


900 


TTTATTCCCA 


TCATATTATT 


CCCATCATAT 


GTTTCCCATC 


ATATGTTTCT 


ACGTAACCAT 


960 


TATTTTCGCC 


TATTCGTTCG 


TAAAACCATA 


C C AGTGG AG A 


TTTTAGATGA 


AGTCCCATTA 


1020 


CGGTTTACAA 


TTTTTACATT 


ACGACACGGA 


GTTTTACAAA 


TCGATTTCAT 


TTGCCAAACG 


1080 


TAGTTAGTGA 


GGCAGTTAGC 


TAGTTCGCCA 


AATAGCGACT 


AGCGTCCAAC 


AATTTGGAAC 


1140 


TTTACTTCCA 


ATTCTTGGTA 


CTGAGTCACA 


TCTTCTCCTC 


T AA CT C T AC G 


TCTGGATACT 


1200 


TGTCCGCAAA 


CCAGCGGAGG 


GCAAAGTCAT 


TTTCAAAGAG 


AAAGACTGGT 


TGGTCAAAAC 


1260 


GGTCTTTGGC 


TAAGATATTG 


CGACTTGACG 


ACATCCGTTC 


ATCCAAGTCC 


TCAGGCTTGA 


1320 


TCCAACGAAC 


GGTCTTTTTA 


CCCATTGGGT 


TCATAACTAC 


TTCCGCATTG 


TACTCGCCTT 


1330 


CCATGCGGTG 


TTTAAAGACT 


TCAAACTGGA 


CTTGACCTAC 


AGCCCCTAGC 


ATGTACTCAC 


1440 


CTGTTTGGTA 


ATTCTT AT AA 


AGCTGAACGG 


CTCCTTCTTG 


CACCAATTCC 


TCAATCCCCT 


1500 


TGTGCAAGCA 


TTTTTGCTTC 


ATAACATTCT 


TAGCAGAAAC 


TTTCATGAAA 


ATCTCAGGTG 


1560. 


TAAAGGTTGG 


CACGCGTTCA 


AATTCAAACT 


TGTTTTTTCC 


AACCGTCAAG 


GTATCCCCAA 


1620 


CCTCATAACT 


ACCGGTATCG 


TAAACCCCGA 


TAATATCACC 


TGCCACGCCA 


TTCGTCACAT 


1680 


TCTCACGACT 


CTCCGCCATA 


AACTGGGT AA 


C ATT AG AT AG 




TTAT^AGTAC 


1740 


GACGGAGATT 


GACACTCATG 


CCCCGCTCAA 


ATTCGCCAGA 


TACGATACGG 


ACAAAGGCAA 


1800 


TACGGTCACG 


GTGACGACjCjO 


r<->/-— * rpprnmp /» 


Li l ^j<jr\. .ill 


AAAGACAAAG 


CCTGAGAAAT 


1860 


CCTTGTCATA 


AGGATCCACA 


ATTTCACCGT 


CTGTTTTCTT 


GTGACCATGT 


GGTTCTGGAG 


1920 


C AAACTTG AG 


GAAGGTTTCA 


AGGAAGGTCT 


GCACACCAAA 


GTTTGTCAGG 


GCTGAACCGA 


1980 


AAAAGACAGG 


CGTCAATTCT 


CCAGCCAGAA 


TAGCTTCCTC 


TGAAAACTCA 


TTCCCGGCTT 


2040 


CATTTAAAAG 


CTC AATGTC A 


TCCTTGACTT 


GCTCGTAGAA 


AGGATTGCTA 


CCAAAGAGTT 


2100 


TGTCCCCGTC 


TTCTAGACTG 


GCAAAACGCT 


CATCCCCTTT 


GTAAAGCTCT 


AAACGTTGGT 


2160 


TATAGAGGTC 


ATACAAGCCC 


TCAAAGGCTT 


TCCCCATCCC 


GATAGGCCAG 


TTCATAGGGT 


2220 


AGCTAGCAAT 


GCCCAAGATT 


TCTTCCAATT 


CTTGCAAGAG 


ATCCAAAGGC 


TCACGACCGT 


2230 
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CACGGTCCAG CTTGTTCATA AAGGTAAAGA CTGGAATGCC ACGATGTTTC ACAACCTCAA 2 34 0 

ACAATTTCTT GGTTTGAGCC TCGATCCCCT TGCCAGAGTC CACGACCATG ACCGCAGCAT 2400 

CCACCGCCAT CAAGGTACGA TAGGTATCTT CTGAGAAGTC CTCGTGCCCT GGCGTGTCTA 24 60 

AGATATTCAC GCGCTTGCCG TCGTAGTCAA ATTGCATAAC AGATGAAGTA ACAGAAATCC 2 52 0 

CACGTTGCTT CTCGATATCC ATCCAGTCAG ATTTAGCAAA AGTCCCTGTT TTCTTCCCTT 2580 

TTACCGTACC AGCCTCACGA ATCTCACCCC CAAAGTAGAG TAACTGCTCA GTGATGGTTG 2 64 0 

TTTTCCCCCC GTCCGGGTGG GAGATAATCG CAAAGGTACG ACGTTTCTTA ATTTCTTCTT 2 7 00 

GAATATTCAT AAGTTCTCTT TCTTTGATTC TCTATTTTTC TTGTTTCAAT AGCTGAGAAT 27 60 

CATTTTTACA TTGGATTTTA CC ATTCCTTT CAACACTCCA TT AT AT CGG A TTTTAGCATT 2820 

TTTTTCAATT TCTATTTCTT TTCACTTCCC CCTCCCTTAT T T AT AGG AAA ATATGGTAAA 2 880 

ATAGAACAGA CTAAAAATCA TCATTTCACG AAAGGATGCA AGATGAAAAT TACGCAAGAA 2 94 0 

GAGGTAACAC ACGTTGCCAA TCTTTCAAAA TTAAGATTCT CTGAAGAAGA AACTGCTGCC 3000 

TTTGCGACCA CCTTGTCTAA GATTGTTGAC ATGGTTGAAT TGCTGGGCGA AG TTG AC AC A 3060 

ACTGGTGTCG CACCTACTAC GACTATGGCT GACCGCAAGA CTGTACTCCG CCCTGATGTC 3120 

GCCGAAGAAG GAATAGACCG TG AT CGCTTG TTTAAAAACG TACCTGAAAA AGACAACTAC 3180 

TATATCAAGG TGCCAGCTAT CCTAGACAAT GG AGG AG ATG CCTAATGACT TTTAACAATA 3 2 40 

AAACTATTGA AG AGTTGC AC AATCTCCTTG TCTCTAAGGA AATTTC TGC A ACAGAATTGA 3 300 

CCCAAGCAAC ACTTGAAAAT ATCAAGTCTC GTGACGAACC CCTCAATTCA TTTGTCACCA 3 3 60 

TCGCTGAGGA GCAAGCTCTT GTTCAAGCTA AAGCCATTGA TGAAGCTGGA ATTGATGCTG 3 4 20 

ACAATGTCCT TTCAGGAATT CCACTTGCTG TTAAGGATAA CATCTCTACA GACGGTATTC 34 80 

TCACAACTGC TGCCTCAAAA ATGCTCTACA ACT ATG AG CC AATCTTTCAT GCGACAGCTG 3 540 

TTGCCAATGC AAAAACCAAG GCCATGATTG TCGTTGGAAA GACCAACATG GACGAATTTG 3 600 

CTATGGGTGG TTCAGGTGAA ACTTCACACT ACGGAGCAAC TAAAAACGCT TGGAACCACA 3 6 60 

GCAAGGTTCC TGGTGGGTCA TCAAGTGGTT CTGCCGCAGC TGTAGCCTCA GGACAAGTTC 3 720 

GCTTGTCACT TGGTTCTGAT ACTGGTGGTT CCATCCGCCA ACCTGCTGCC TTCAACGGAA 37 80 

TCGTTGGTCT CAAACCAACC TACGGAACAG TTTCACGTTT CGGTCTCATT GCCTTTGGTA 3 84 0 

GCTCATTAGA CCAGATTGGA CCTTTTGCTC CTACTGTTAA GGAAAATGCC CTCTTGCTCA 3 900 

ACGCTATTGC CAGCGAAGAT GCTAAAGACT CTACTTCTGC TCCTGTCCGC ATCGCCGACT 3 9 60 

TTACTTCAAA AATCGGCCAA GACATCAAGG GTATGAAAAT CGCTTTGCCT AAGGAATACC 4 02 0 

TAGGCGAAGG AATTG AT CCA GAGGTTAAGG AAACAATCTT AAACGCGGCC AAACACTTTG 4 0 80 
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AAAAATTGGG 


TGCTATCGTC 


GAAGAAGTCA 


GCCTTCCTCA 


CTCTAAATAC 


GGTGTTGCCG 


4140 


TTTATTACAT 


CATCGCTTCA 


TCAGAAGCTT 


CATC AAA CTT 


GCAACGCTTC 


GACGGTATCC 


4200 


GTTACGGCTA 


TCGCGCAGAA 


GATGCAACCA 


ACCTTGATGA 


AATCTATGTA 


AACAGCCGAA 


4260 


GCCAAGGTTT 


TGGTGAAGAG 


GTAAAACGTC 


GTATCATGCT 


GGGTACTTTC 


AGTCTTTCAT 


4320 


CAGGTTACTA 


TGATGCCTAC 


TACAAAAAGG 


CTGGTCAAGT 


CCGTACCCTC 


ATCATTCAAG 


4380 


ATTTCGAAAA 


AGTCTTCGCG 


GATTACGATT 


TGATTTTGGG 


TCCAACTGCT 


CCAAGTGTTG 


4440 


CCTATGACTT 


GGATTCTCTC 


AACCATGACC 


CAGTTGCCAT 


GTACTTAGCC 


GACCTATTGA 


4 500 


CCATACCTGT 


AAACTTGGCA 


GGACTGCCTG 


GAATTTCGAT 


TCCTGCTGGA 


TTCTCTCAAG 


4560 


GTCTACCTGT 


CGGACTCCAA 


TTGATTGGTC 


CCAAGTACTC 


TGAGGAAACC 


ATTTACCAAG 


4620 


CTGCTGCTGC 


TTTTG AAGC A 


ACAACAGACT 


ACCACAAACA 


ACAACCCGTG 


ATTTTTGGAG 


4680 


GTGACAACTA 


ATGAACTTTG 


AAACAGTCAT 


CGGACTTGAA 


GTCCACGTAG 


AGCTCAACAC 


4740 


CAATTCAAAA 


ATCTTCTCAC 


CTACTTCTGC 


CCACTTTGGA 


AATGACCAAA 


ATGCCAACAC 


4300 


TAACGTGATT 


G ACTG GTCTT 


TCCCACGACT 


TCTACCAGTT 


CTCAATAAAG 


GGGTTGTTGA 


4860 


TGCCGGTATC 


AAGGCTGCTC 


TTGCCCTCAA 


CATGGACATC 


CACAAAAAGA 


TGCACTTTGA 


4920 


CCGCAAGAAC 


TACTTCTATC 


CTGATAACCC 


CAAAGCCTAC 


CAAATTTCTC 


AGTTTGATGA 


4980 


ACCAATCGGA 


TATAATGGCT 


GGATTGAAGT 


CAAACTAGAA 


GACGGTACGA 


CCAAGAAAAT 


5040 


CCGTATCGAA 


CGTGCCCACC 


TAGAGGAAGA 


CGCTGGTAAA 


AACACCCATG 


GTACAGATGG 


S100 


CTACTCTTA? 


GTTGACCTCA 


ACCGCCAAGG 


GGTTCCCTTG 


ATTGAGATTG 


TATCTGAGGC 


5160 


AGATATGCG? 


TCTCCTGAAG 


AAGCCTATGC 


TTATCTGACA 


GCCCTCAAGG 


AAGTTATCCA 


5220 


GTACCCTGGC 


ATTTCTGACG 


TTAAGATGGA 


GGAAGGTTCG 


ATGCGTGTGG 


ATGCCAACAT 


5280 


CTCCCTTCGT 


CCTTATGGTC 


AAGAGAAATT 


CGGTACCAAG 


ACTGAATTGA 


AGAACCTCAA 


5340 


CTCCTTCTCA 


AACGTTCGTA 


/\J\Kj\j L i 1 vj/V 




TAACGCCACG 


CTGAAATTCT 


5400 


TCGCTCAGGT 


GGTCAAATCC 


GCCAAGAAAC 


ACCCCGTTAC 


GATGAAGCGA 


ATAAAGCAAC 


5460 


CATCCTCATG 


CCTCTCAAGG 


AAGGGGCTGC 


TGACTACCGC 


T ACTTCCC AG 


AACCAGACCT 


5520 


ACCCCTCTTT 


GAAATTTCTG 


ACGAGTGGAT 


TGAGGAAATG 


CGGACTCAGT 


TGCCAGAGTT 


5580 


TCCAAAAGAA 


CGTCGTGCGC 


GTTATGTATC 


TGACCTTGGT 


TTATCAGACT 


ACGATGCTAG 


5640 


T C AG TTG ACT 


GCTAATAAAG 


TCACTTCTGA 


CTTCTTTGAA 


AAAGCTGTTG 


CCCTAGGTGG 


5700 


TGATGCCAAA 


CAAGTCTCTA 


ACTGGCTCCA 


AGGGGAAGTC 


GCTCAGTTCT 


TGAATGCTGA 


5760 


AG GT AAAAC A 


CTGG AAC AAA 


TCGAATTGAC 


ACCAGAAAAC 


TTGGTTCAAA 


TGATTCCCAT 


5820 
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CATCGAACAC CCTACTATTT CATCTAAGAT TCCCAAGAAA GTCTTTGTCC ATCTAGCTAA 
AAATGGCGGT GGCGCGCGTG AATACGTGGA AAAAGCAGGT ATGGTTCAAA TTTCAGATCC 
AGCTATCTTG ATCCCAATCA TCCACCAAGT CTTTGCCGAT AACGAAGCTG CTGTTGCCGA 
CTTCAAGTCA GGCAAACGTA ACGCCGACAA GGCtTTACAG CATTCCTTAT GAAGGCAACC 
AAAGGCC AAG CCAACCCACA AGTTGCCCTT AAACTACTTG CACAGGAATT GGCGAAGTTG 
AAAGAAAACT AGACAGAACA AAACCAGCCC T AAGGTTGGT TTTTTCTTCT CTACCAACTC 
CCAATAACTA TTTTGGCTTT ATTTCCAGAG TATTTTATGG TAAAATGAAG AGTAATAATA 
TTTATTAAAG AGGTAAAAAC ATG ATT G AAG CAACTACCTT AAAAGCTGGT ATGACCTTTG 
AAACAGCTGA CGGCAAATTG ATTCGCGTTT TGGAAGCTAG TCACCACAAA CCAGGTAAAG 63 60 

GAAACACGAT CATGCGTATG AAATTGCGTG ATGTCCGTAC TGGTTCTACA TTTGACACAA 
GCTACCGTCC AGAGGAAAAA TTTGAACAAG CTATTATCGA GACTGTCCCA GCTCAATACT 
TGTACAAAAT GGATGACACA GCATACTTCA TG AAT AC AG A AACTT ATG AC CAATACGAAA 
TCCCTGTAGT CAATGTTGAA AACGAATTGC TTTACATCCT TGAAAACTCT GATGTGAAAA 
TCCAATTCTA CGGAACTGAA GTGATCGGTG TCACCGTTCC TACTACTGTT GAGTTGACAG 
TTGCTGAAAC TCAACCATCT ATCAAAGCTC CT ACTGTT AC AGGTTCTGG T AAACCAGCAA 
CGATGGAAAC TGG ACTTGT C GTAAACGTTC CAGACTTCAT CGAAGCAGGA CAAAAACTCG 
TTATCAACAC TGCAGAAGGA ACTTACGTTT CTCGTGCCTA ATCTCTAGAA AGAGGTCATT 6340 
CTATGGGAAT TGAAGAACAA CTTGGCGAAA TCGTTATCGC CCCACGTCTA CTTGAAAAAA 
TCATTGCTAT CGCTACTGCA AAGGTAGAGG CTCTTCACTC TTTTT C AAAC AGATCAGTGT 
CTGATACCCT TTCAAAACTT TCACTCGGCC GTGGCATTTA TCTTAAAAAC GTGGACGAAG 
AACTCACAGC AGATATCTAT CTCTACCTTG AGTACGGAGT AAAAGTTCCT AACGTAGCGG 
TTGCTATCCA GAAAGCTGTC AAAGATGCCG TCCGTAATAT GGCTGATGTA GAACTCGCTG 
CTATCAATAT TCACGTTGCA GGTATCGTCC C AG AT AAAAC ACCAAAACCA GAATTGAAAG 7200 
ATCTATTTGA CGAGGACTTC CTCAATGACT AGTCCACTAT TAGAATCTAG ACGCCAACTC 
CGTAAATGCG CTTTTCAAGC TCTCATGAGC CTTGAGTTCG GTACGGATGT CGAAACTGCT 
TGTCGTTTCG CCTATACTCA TGATCGTGAA GATACGGATG TACAACTTCC AGCCTTTTTG 7330 
ATAGACCTCG TTTCTGGTGT TCAAGCTAAA AAGGAAGAAC TAGATAAGCA AATCACTCAG 74 4 0 

CATTTAAAAG CAGGTTGGAC CATTGAACGC TTAACGCTCG TGG AG AG AAA CCTCCTTCGC 7 500 

TTGGGAGTCT TTGAAATCAC TTCATTTGAC ACTCCTCAGC TGGTTGCTGT TAATGAAGCT 7560 
ATCGAGCTTG CAAAGGACTT CTCCGATCAA AAATCTGCCC GTTTTATCAA TGGACTGCTC 



5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 



6420 
6480 
6540 
6600 
6660 
6720 
6780 



6900 
6960 
7020 
7080 
7140 



7260 
7320 



7620 
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AGCCAGTTTG 


TAACAGAAGA ACAATAAGGC 


TCTTTGTCAA 


CTGTAGTGGG 


TTGAAAAAAA 


7680 


GCTAAGCTCG 


AG AAAGG AC A 


AATTTCGTCC 


TTTCTTTTTT 


GATGTTCAAA 


GCGATAAAAA 


7740 


TCCGTTTTTT 


GAAGTTTTCA 


AAGTTTCGAA 


AAC CAAAGG C 


ATTGCGCTTG 


ATAACTTTCA 


7300 


T GAG ATT ATT 


GGTCGCTTCC 


AGTTTGGCAT 


TAGAATAGTG 


TAG TTG AAGG 


GCGTTGACAA 


7360 




ATCTTTGAGG 


AAGGTTTTAA 


AGACAGTCTG 


AAAAATAGGA 


TGAGCCTGCT 


7920 


TAAGATTGTC 


CTCAATAAGT 


CCGAAAAATT 


TCTCTGGTTC 


CTTATTCTGG 


AAGTGAAACA 


7990 


GCAAGAGCTG 


ATAGAGCTGA 


TAGTGGTGTT 


TCAAGTCTTG 


TGAATGGCTC 


AAAAGCTTGT 


8040 


CTAAAATCTC 


TTTATTGGTT 


AAGTGCATAC 


GAAAAGTAGG 


ACGATAAAAT 


CGCTTATCAC 


8100 


TCAGTCTACG 


GCTATCCTGT 


TGAATGAGTT 


TCCAGTAGCG 


CTTGATATCC 


TTGTATTCAT 


3160 


GGGATTTTCG 


ATGAAACTGA 


TTCATG ATTT 


GGACACGCAC 


ACGACTCATG 


GCACGGCTAA 


8220 


GATGTTGTAC 


AATGTGAAAG 


CGATCAAGAA 


CGATTTTAGC 


ATTCGGGAGT 


GAAACAGTCT 


8280 


GGGAGACTGT 


TTCAGCCTGA 


GCCTAGGAAT 


TTGAAAGCGA 


AGCTGTTTAG 


CCAAGTCATA 


8340 


GTAAGGGCTA 


AACATATCCA 


TAGTAATAAT 


TTTGACGCGA 


CATCGGACAA 


CTCTATCGTA 


8400 


GCGAAGAAAG 


TGATTTCGAA 


TGATAGCTTG 


TGTTCT AC C C 


TCAAGAACAG 


TGATGATATT 


8460 


GAGATTGTTA 


AAATCTTGCG 


CAATGAAGCT 


CATCTTTCCC 


TTTGTAAAAG 


CATACTCATC 


8520 


CCAAGACATA 


ATCTCAGGAA 


GACAACAAAA 


ATCATGTTTA 


AAGTGAAAAT 


CATTGAGCTT 


8580 


ACGAATAACA 


GTTGAAGTTG 


AGATGGAAAG 


CTGATGGGCA 


ATATCAGTCA 


TAGAAATCTT 


8640 


TTCAATCAAC 


TTTTGAGCAA 


TCTTTTGGTT 


GATGATACGA 


GGGATTTGGT 


GATTTTTCTT 


8700 


GACGATAGAA 


GTTTCAGCGA 


CCATCATTTT 


TGAACAGTGA 


TAGCACTTGA 


ATCGACGCTT 


8760 


TCTAAGGAGA 


ATTCTAGTAG 


GCATACCAGT 


CGTTTCAAGA 


TAAGGAATTT 


TAGAAGGTTT 


8820 


TTCAAAGTCA 


TATTTCTTCA 


ATTGGTTTCC 


GCACTCAGGG 


CAAGATGGGG 


CGTCGTAGTC 


8880 


CAGTTTGGCG 


ATGATTTCCT 


TGTGTGTATC 


v_ «. 1 rt 1 1 Kjt\ » \j 




; LTGGATATT 


39 4 0 


AGGGTCTTTA 


ATGTCTAGTA 


ATTTTGTGAT 


AAAATGTAAT 


TGTTCCATAT 


GAATCTTTCT 


9000 


AATGAGTTGT 


TTTGTCGCTT 


TTCATTATAG 


GTCATATGGG 


ACTTTTTTTC 


TACAATAAAA 


9060 


T AGG CTCC AT 


AATATCTATA 


GGGGATTTAC 


CCACTACAAA 


TATTATAGAG 


CCAACAATAA 


9120 


AAAGAAAAAG 


TGTTTGATAG 


ATATCAAACA 


CTTTTTTCTT 


TGCCTCCCAC 


TATCTAAAAA 


9180 


AATGATAATA 


GATATAATTG 


TAAACAAAAA 


TCCAGATAGG 


TTTTGCATGA 


TTGAGAAAGT 


9240 


TAAAAAAACT 


ATGGCAGAGA 


ATCGTTAATC 


TCAGATTGTC 


GGTAGAACGA 


TAAACAAGGG 


9300 


CAAAAAAGAA 


ACCAATCAGA 


CTATAATATA 


ATAAACTAAT 


TGGATCTCTG 


TGAGATAGTA 


9360 
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TCAAATGGCT 


AATCCCAAAG 


ATGATAGCAG 


ATAGGATAAC 


ATCCAAATAG 


TACTTGGACT 


9420 


AGGGAAAGAA 


GGTATTCATA 


AAATACCCTC 


TATCAAGAGT 


CTCCTCAAAA 


ACAGGACCGA 


9480 


TG ATT AC AG G 


CAGGACAAAA 


GATAAGATAG 


TCGATAAAAA 


GGTTGGTTGT 


CCATTTGAAA 


9540 


AAAGCACGGT 


AAAAT ACT C A 


TCATGAATAT 


TCCTATGATT 


AATCAAATGA 


GCATAGCGTG 


9600 


CCCAAAAATT 


ACCGAGAATC 


TGATAAACCA 


C AT AAGTTG C 


AAATAAGTAG 


AAGACAAATG 


9660 


ACCAGTTCCA 


GCTCTTTTTC 


TCAAAGATAA 


AGAGCATCTT 


TTTCTTTTTT 


AACCTCCAAA 


9720 


TTAATAGAAG 


GAAACTTCCC 


ACTAATCCCA 


TTGTT AAAAT 


AAGAGAATAG 


ACATCAGCTC 


9780 


CTAACCCTAA 


AATGATCGTC 


ACATACAATC 


CAATTGTTTG 


TGGT AAAT AG 


GTAGATAGTA 


9840 


AAATAATAAG 


CAAAAATATT 


CCAAATTGTC 


TTAGTTTTTT 


TGTGTTTCTC 


ATCGTACTTT 


9900 


TTTGAAAGAT 


TACCCTGCTC 


GGAAGCCGTA 


CTTCCAAGCA 


TCTATATAAG 


AATT AAGTG C 


9960 


CCCTTGCCTC 


ATATAGGGAG 


CAAATTCTCT 


ATAATATAAC 


CATCTACTAT 


ATCCATCTTC 


10020 


CCAAACAGCA 


AGACCACCTG 


AAGTTTGCTC 


CAAGTCCTCA 


GTTGAAAGAA 


CTGT AAATGT 


10080 


ATTTGTACCT 


GTCATTGCAA 


GTACCTTCTT 


AAAAT AG ATT 


GTTGTAGGCT 


CACATTTATA 


10140 


GTATATTT CT 


TTTTTTGTCT 


ATTTTATAGC 


CCATCTCCTC 


AACTGGCAAT 


TTTTCGACCT 


10200 


GAATTACATT 


TTT C C AT AAA 


AAATG AG AC C 


TTTCTAGTCT 


CATTTAGTCA 


TTCTT AGT AT 


10260 


TTTCTAAATC 


GTTGATAGCG 


TTCTTCCAGC 


AACTCTTCTA 


GCGGTTTTTG 


TGAAAGTCTA 


10320 


GCCAGCTCCG 


TTTGGAGTTC 


TTTTTTGACA 


CTCTTAATCA 


GTTCTTTACT 


AG AAAGTC CT 


10380 


ATTTCAGAAA 


TCACCTTATC 


CACCACGTCC 


ATTTCTAACA 


GTTCATGCGA 


AGTGATTTTC 


10440 


ATCAGTTCTG 


CTGCTTCCAT 


AGCGCGAGTA 


CCGTCCTTCC 


ATAAAATGGA 


AGCAAAGCCT 


10500 


TCTGGACTGA 


GAATCGCATA 


GAT AGAATTT 


TCCAGCATCC 


AGACACGGTC 


CGCGACAGCT 


10560 


AGAGCCAGAG 


CCCCGCCTGA 


ACCACCTTCA 


CCGATAATAA 


TGGCGATAAT 


AGGAACTTTC 


10620 


AG GTC AC T C A 


TTTCCATGAG 


ATTGCGAGCG 


ATAGCTTCCC 


CTTGACCACG 


TTCTTCCGCT 


10680 


CCGACACCAG 


GATAAGCACC 


TGCTGTATTG 


AT AAAGGT C A 


CAACTGGACG 


GCCAAATTTC 


10740 


TCAGCCTGTT 


TCATCAACCG 


CAGTGCCTTT 


CGGTAGCCTT 


CTGGATGTGG 


TTGGCCAAAA 


10800 


TTCCCTTTGA 


GGTTGTCTTG 


CAAACTCTTG 


CCTTTTTGGA 


TACCAACCAC 


TGTTACAGCT 


10360 


TGGTCTCCAA 


GCCAACCAAT 


ACCACCAACA 


ACTGCACCAT 


CATCACGAAA 


AGAACGGTCA 


10920 


CCATGTAATT 


GGATAAATTC 


AT C AAAAATG 


CCTGTCGCAA 


AGTCCAAGGT 


TGTCAAGCGA 


10980 


CTCTGCTCAC 


GCGCTTCTCT 


GACTATTTTT 


GCAATATTCA 


TCTAGGACTC 


CCTCCATGCA 


11040 


ATCTGACTAG 


GCTAGCAATC 


GTATCTGGTA 


AGTCTCTTCT 


TTTGACAATA 


GCATCCACAA 


11100 


AGCCATGTTC 


TAATAGGAAT 


TCTGCCTTTT 


GGAAATCCTC 


AGGCAAGCTT 


TCACGAACCG 


11160 
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TATTTTCAAT CACACGACGC CCAGCAAAAC CAACCAAGCT CTGTGGTTCA GCCAGAATGA 1122 0 

TATCGCCTTC CATAGCGAAA GAAGCTGTCA CACCACCAGT CGTTGGATCT GTCAAAATGG 11280 

TCAGGTAAAA GAGACCAGCA TTTGAATGCC GTTTAACCGC CGCAGAGATC TTAGCCATCT 11340 

GC AT GAG ACT CATGATTCCT TCCTGCATAC GGGCTCCACC ACAGGCTGTG AATAGGACAA 11400 

CTGCCAATTT TTCGACAGTC GCATACTCAA ACAAACGAGT GATTTTTTCA CCTACAACCG 114 60 

TACCCATAGA AGCCATGATA AAGTTAGAAT CCATAATCCC AAGAGCCACA GTCTGACCTT 11520 

TAATAAGAGC AGTTCCTGTC ACAACGGCTT CATGCAGACC TGTTTTTTCA CGCATAGATG 11580 

CCAGTTTCTT TTGGTAACCA GGGAAATGCA AGGGATCCTT GCTTTCAATC CCTGTAAACA 11640 

ATTCTTTGAA GCTTCCCATA TCAATCGTCA AAGCCAAGCG TTCTTGCGCA GAAATACGAA 11700 

AGGTATAGCT ACAGTGCGGA CAGATACGTT CACTTCCCAG ATCCTTCTGA TAG ATGGT AT 117 60 

GCTTACAGCC TGGACACTGG GAAAATAATT CATCTGGAAC CTCTGGCTTA GCTTGAGGTT 11820 

TTTCCCTAAC CGAACGATTG GGATTGATTC GAATATACTT ATCTTTTTTA CTAAATAGAG 11880 

CCATTGATTC CCCTTTTCGG TTTAAACTCT TAAAGTCATT TTATTCTTTT TCTTG AT ATT 11940 

TAGGTAAGAA GGTTTCCATC AAGAAGGAAG TATCATAATC CCC AGCAATG ACATTGCGAT 12000 

CTGAAATGAG GTCAAGCTGG AAATCTGCAT TGGTCTGCAC TCCTTCAATT TCTAATTCAT 12 060 

AGAGGGCACG TTGCATTTTC ATCAAGGCGT CAAAACGATT TTCGCC3TGT ACTATGATTT 12 120 

TGGCAATCAT ACTATCATAA TAAGGCGGAA TGGTATAACC TCGATAAACT GCTGAATCCA 12 190 

CGCGCAAGCC AACTCCACCA CTTGGCAGAT AG AG ATT AG T AATCTTACCT GGACTTGGAG 122 4 0 

CAAAGTTAAA GGCTGGGTTT TCTGCATTGA TACGACACTC GATGGCATGA CCGCGTAGGA 12 300 

CAATATCTTC TTGCTTAACA GACAAAGGCT GACCTGCCCC AATGCAAATC TGTTCCTTAA 12 3 60 

CGATATCAAC ACCTGAAACA AACTCTGTTA CTGGATCTTC TACCTGAACA CGAGTATTCA 12 42 0 

TCTCCATGAA ATAGAAATTG CTACTTGCTT CATCAAGAAG AAATTC AATG GTTCCTGCAT 12 4 80 

TCTCATAGCC AACAAACTCT GCCGCTCGAA CAGCAGCAGC ACCTATTTCA TGACCCAGCG 12 540 

TTTTTCCGAT TGCAATCGAG GGACTTTCTT CCAAAACCTT TTGGTTATTC CTTTGAAGAG 12 600 

AACAATCCCG TTCACCCAAG TGAATCACAT GTCCATGCTC ATCACCTAGG ATTTGAACCT 12 660 

CAATGTGCCG ACCTGGATAG ATAACCCGTT CTATGTACAT GGCACCATTG CCATAATTGG 12 720 

CCTTGGCCTC ACTAGAGGCA GTTTCAAAGG C AG AAA CG AG GTCATCTGGT TTTTCAACCT 12780 

TACGAATCCC TTTACCACCT CCACCTGCTG AAGCCTTGAG CATAACAGGA TAGCCAATTT 12 34C 

TTTCAGCAAC AATCAAAGCT TCTTCAGAGT TATCCACTTC TCCATCTGAA CCTGGTATAA , 12 900 
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CAGGCACACC 


TGCTTTAATC 


ATCTGAGCAC 


GCCCATTGAT 


CTTATCCCCC 


ATCATATCCA 


12960 


TAACATCACC 


AGATGGACCG 


AT AAAC TTG A 


TACCTACTTC 


TTCACACATG 


GTCGCAAATT 


13020 


TCGAATTTTC 


ACTGAGAAAT 


CCAAAACCAG 


GGTGAATAGC 


TTCTGCCTCA 


GTCAAGACTG 


13C80 


CAGCTGATAG 


AACTGCATTA 


AT ATTG AG AT 


AAGACTCTGT 


TGCCTTGCCA 


GGACCAATAC 


13140 


AAACTGCTTC 


ATCTGCCAAA 


AGCGTATGAA 


GAGCTTCCTT 


ATCAGCAGTT 


GAATAAACCG 


13200 


CTACCGTCGC 


AATCCCCAAT 


TCACGTGCCG 


CACGGATAAT 


ACGAACCGCA 


ATTTCACCAC 


13260 


GATTGGCAAT 


TAAAATTTTT 


CGAAACATGG 


AGAACCTCCT 


TAGTTCCCAA 


TTGCAAAAGT 


13320 


AAGGGTACCA 


CTGGCTGCAA 


GCTTGCCATC 


CACTTCAGCC 


TTTGCTTCAA 


CCACAGCTAT 


13380 


GGTGCCACCA 


CGTTTTACAA 


AAGTCGCTGT 


CATAACCAAT 


TGGTCGCCTG 


GTACAACTTG 


13440 


CTTCTTGAAC 


TTAACCTTGT 


CCATACCAGC 


GTAAAAGACC 


AGTTTTCCTT 


TATTTTCAGG 


13500 


TTTTGATAAC 


TCCAACACAC 


CGGCAGTTTG 


CGCCAAGGCT 


TCCATAATCA 


CAACACCTGG 


13560 


CATAACTGGG 


TATTGAGGAA 


AGTGGCCGTT 


AAAGAAAGGC 


TCGTTGATGG 


TCACATTTTT 


13620 


GATAGCAACA 


ATGGTATCCT 


CGCTCACTTC 


CAAGACACCG 


TCCACTAGAA 


CCATAGGATA 


13680 


ACGGTGGGGA 


AGAGCTTCTT 


TGATTCCTTG 


AATATCGATC 


ATTTGATACG 


TACCAATCCT 


13740 


TTACCAAACT 


CAACCATTTC 


TTCGTTAGAG 


ACGAGAATTT 


CCGTTACCAC 


AC CATC CTT A 


13800 


GGACCTGGGA 


TTTCATTCAT 


CACTTTCATG 


GCTTCGATAA 


TTACCAATGT 


TTGACCTTTT 


13860 


TTGACACTAT 


C AC C AACTGT 


AACGAAGGCA 


GGTTTATCTG 


GTCCAGCAGC 


CAAGTAAACC 


13920 


ACTCCAACAA 


GTGGACTCTC 


TACAAGATTT 


CCCTCAGTAG 


CCACACTTGC 


TTCAGCTGGA 


13980 


GCTGGAACTT 


CTTCTGCTAC 


AGTCTCTCCT 


GGAGCAGATG 


TAGGAGCTAC 


TGGACTCGGT 


14 040 


GTTGCTAGAA 


CGGGTGCTGG 


AGCGACTTGA 


GTTGCAACTT 


CAGCCACAGG 


TCTTGCTTCA 


14100 


TTCTTGCTAA 


ACTGCAACTC 


ATCCGTCCCA 


TTTTTATAAG 


AAAATTCTCT 


CAAACTTGAC 


14160 


TGC7CAAATT 


GAGTCATCAA 


GTCTTTAATA 


TCGTTTAAAT 


TCATACTTAT 


CTATTCTCCC 


14220 


AACCTTTGAA 


AGCAAGAACT 


GCATTGTGGC 


CTCCAAAACC 


AAAAGTATTT 


GAAATAGCGT 


14280 


ATGGAATTTC 


TTTCTCCAAG 


CCTTGTCCAT 


AAACG A C ATT 


AG CTTCG ATA 


TAATCTCATA 


14340 


CTTCACTTGT 


CCCAGCTGTC 


ATTGGTACAA 


AGTTATGACG 


CAT AGCTTCG 


ATGGTGACGA 


14400 


TAGCTTCTAC 


TGCACCCGCA 


GCCCCCAGCA 


AATGTCCTGT 


AAAAGACTTG 


GTTGATGATA 


14460 


CAGGTACTTC 


CTTACCAAGA 


ACAGCTACGA 


T AGC AC C ACT 


TTCTCCTTTT 


TCATTGGCAG 


14520 


GAGTTGACGT 


TCCGTGAGCA 


TTGACATAGG 


CTACTTGCTC 


TGGAGAAATC 


TCAGCTTCTT 


14580 


CCAAGGCTAG 


TTTGATGGCC 


TTGATAGCTC 


CCTGACCTTC 


TGGATGTGGA 


GAAGTCATGT 


14640 


GGTAGGCATC 


ACAAGTATTT 


CCGTAACCAA 


CCACTTCAGC 


CAGGATAGTA 


GCTCCACGTT 


* 14 70.0 



WO 98/18931 



PCT/US97/19588 



203 

TTTCAGCGTG TTCAAGACTT TCTAGAACCA ACATCCCTGA ACCTTCACCC ATAACAAACC 14 7 60 

CATTGCGATC CTTATCAAAT GGGATCGAAG CACGAGTTGG ATCCTCTGTA GTAGAGAGAG 14 320 

CTGTTAAGGC TTGGAAACCA GCGATGGCAA AAGGTGTGAT AGAAGCTTCT GTTCCTCCCA 14 880 

CCAACATCAC ATCTTGGAAA CCAAACTTAA TGGAGCGGAA GGCATCCCCA ATCGCATCAT 14940 

TTGATGAAGA GCAGGCAGTA TTGATAGATT TACAAACACC GTTTGCACCA AAACGCATGG 15000 

CTACATTCCC AG AAGC CAT A TTTGGTAAAG CTTTTGGAAG AGTCATTGGT TTGACACGTT 150 60 

TGGGTCCTTT TTCATGAAGG CCAAGTACCT GATCTTCAAT TTCCTTGATT CCACCAATAC 1512 0 

CAGATGCAAC G AT AAC ACC A AAACGATCCC TATTAAGAGC C T CT AC AT C A AGATTGGCAT 15130 

GATTTACAGC CTCTTGGGCT GCATACAAGG CAT AT AAAG A AT AGTT AT C A AAACGGTTGG 15240 

T AT CTTTTTT TACAAAGTAT TTATCGAACG GAAAATCTTG GATTTCTGCC GCATTATGCA 15 300 

CATCAAAGTC ACTATGATCA AATTTTGTAA TGCCACCAAT GCCGATTTTC CCAGTTGCTA 15 360 

AACTATTCCA AAATTCTTCT GGTGTATTTC C G ATTGG AG A TGTTACTCCA T AAC CTGTT A 15420 

CCACTACTCG ATTTAGTTTC ATTCTTTTCA CCTCTAGCTT TCGCTACATA CTTAAGCCAC 154 80 

CAT C AATGC C AACCACTTGT CCAGTTAGAT AATCTTGGCC TGCTAAAAAT ACTGTCAAAT 15 540 

CTGCAACCTG CTCTGCCTGC CCAAATTCTT TCATCGGAAT CTGAGCTAGT GTAGCTTCCT 15600 

TAATCTTATC TGACAGGATA GCGGTCATAT CAGACTCAAT CATTCCTGGA GCAATCACAT 15660 

TGACTCGTAT ATTCCGACTA GCGACCTCGC GTGCCACAGA CTTGGTAAAG CCAATCAAGC 157 20 

CAGCCTTAGA AGCAGCATAA TTAGCTTGAC CAATATTCCC CATCAAACCA ACAACACTAC 157 30 

ACATATTAAT GATAGCACCT TCTCTGCCTT TCATCATCGG TTTCAAGACT GATTGTGTCA 15340 

TATTAAAGGC ACCAGTCAGA TTGACCTTGA GCACTTTTTC AAAATCTGCT TCTGTCATCT 15900 

TGAGCATAAG AGTATCTTCC GTAATCCCTG CATTGTTGAC CAAAACATCT ACTGAACCCA 15 960 

GTTCTGCAAT AGCTTGATCA ATCATACGCT TAGCGTCTGC AAAAT CTG AT ACATCTCCTG 16020 

AAATGGGAAC CACCTTGATA CCATAGTTTG AAAACTCAGC GAGCAATTCT TCTGAGATTG 16080 

CCCCACGACT GTTTAAGACA ATGTTGGCTC CTGCTTGAGC AAACTTCTGC GCGATGGCAA 16140 

GACCAATTCC ACGACTCGAA CCTGTAATAA AGATATTTTT ATGTTCTAGT TTCATTTTTT 162 00 

TCCTTTCAAA ACTTCTACTT ATTTTAGTCT ATTTTTCTAA AAGTGCTACT AAACTCGCTT 162 60 

GATCTTCCAC ATGAGCTAAG TGAGCAGTTT GATCAATTTT TTTAACAAAA CCTGACAAGA 163 20 

CTTTCCCCGG TCCAATCTCG ATAAAGTTGC TTATGCCTGC TTCTTGCATG ACCCCAATAC 163 80 

TTTCATAGAA ACGAACGGGT TCCTTCACCT GACGCGTCAA GAGCTGAGCA ATGTCCTCTT L644 0 
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TTTGCATCAC 


AGCAGCTTCT 


GTATTGCCGA 


CTAGGGGACA 


AGTAAAATCT 


GAAAAACTTA 


16500 


CCTGACCTAC 


AGTTTCAGCT 


AGTTTCTGGC 


TAG C AG G TT C 


AAGGAGAGCG 


GTGTGAAAGG 


16560 


G ACCTG AC AC 


CTTAAGAGGA 


ATCAAGCGTT 


TGGCACCTGC 


TTCTTGCAAA 


AGTTCAACCG 


16620 


CTCGATCAAC 


TGCAACCACT 


TCTCCAGCAA 


TGACGATTTG 


TGCAGGTGTG 


TTATAGTTGC 


16680 


CTGGAGTAAC 


CACTCCAAGT 


TCAGAAGCTT 


TTTGACAGGC 


TTCTTCAATG 


ACCTCTACTG 


16740 


GCGTATTGAG 


AACTGCTACC 


ATCTTGCCAG 


AGTCAGCAGG 


AGCCGCTTCT 


TCCATATAGG 


16800 


CTCCACGCTT 


AGCTACCAAG 


GCAACCGCAT 


CTTCAAAATC 


CAAGGCGCCA 


CTTGCCACCA 


16860 


AGGCAGAGTA 


TTCTCCAAGA 


GACAAACCAG 


CAACCATATC 


AGGCTGATAG 


CCCTTTTCTT 


16920 


GCAATAAACG 


GTAGATAGCA 


ACCGAAGTCG 


CTAGAATGGC 


TGGTTGCGTA 


TAGCGGGTCT 


16980 


GATTGAGTTT 


GTCTTCTTCC 


GTATCGATGA 


GATAACGCAA 


ATCATAACCG 


AGCACCTGGC 


17040 


TCGCTCGATC 


AATCGTTTCT 


TTAACAATCG 


GATACTGATC 


ATAGAAATCC 


CGTCCCATCC 


17100 


CTAGATACTG 


GGCACCTTGA 


CCAGCAAATA 


AAAAGGCTGT 


TTTAGTCATT 


TCTTACAACT 


17160 


CCTGTCCAGC 


GAGAGGCTTC 


TTCTTGAATT 


TTCTTAGCGG 


CTCCGTAATA 


CAAATCTTTT 


17220 


AGGATTTCTT 


CAGCTGTTTC 


TTCTTTAGAA 


ACAAGCCCTG 


CGATTTGACC 


TGCCATAACA 


17280 


GACCCACCAT 


CCACATCACC 


GTGAACAACT 


GCTTTGGCTA 


GAGCACCTGC 


TCCCATTTGT 


17340 


TCAAAGATTT 


CTAAATCAGG 


ATCTTCTTGC 


TTAAAGGCAT 


CTTTTTCAGC 


CAGTTCAAAA 


17400 


TCTCT AGT C A 


ACTGATTTTT 


AATAGCACGA 


ACAGCATGAC 


CAAAGTGCTG 


AGCTGAAATC 


17460 


GTAGTATCAA 


TATCCCTTGC 


TTTTAAAATT 


TTCTCCTTCT 


AGTTTGGATG 


GGCATTCGAC 


17520 


TCTTTTGCAA 


CTACAAACCG 


TGTCCCCACC 


TGTACAGCCT 


CTGCACCTAG 


CATAAAGCCA 


17580 


GCCGCAGCAC 


CTTCACCATC 


CGCAATTCCT 


CCTGCAGCAA 


TAACAGGAAT 


AG AT AT AG CT 


17640 


GTGGCTACCT 


GTCGCACCAA 


GGTCATGCTT 


GTTAATTTAC 


CGATATGCCC 


CCCAGCTTCC 


17700 


ATTCCTTCTG 


CAATAACAGC 


GTCTGCACCG 


ATTTTTTCCA 


TGCGTTTAGC 


TAAAGCGACA 


17760 


CTAGGAACAA 


CAGGAATAAC 


GATTATCCCA 


GCTTCATGGA 


AACGTTCCAT 


ATACTTGCTT 


17820 


GGATTTCCTG 


CTCCTGTTGT 


GACAACTTTA 


ACACCTTCTT 


CAATAACGAG 


ATCCACGATG 


17880 


TCTTCCACAA 


AG GG AG AT AA 


GAGCATCATG 


TTGACCCCAA 


AGGGTTTATC 


AGTCAATGAT 


1794C 


TTGATTTTAT 


CAATATTGGC 


CT T G AC AA CT 


TCTTTCGGGG 


CATTTCCCCC 


ACCGATAATT 


1800C 


CCTAATCCTC 


CAGCCTTGGA 


AACAGCCCCT 


GCCAAATCAC 


CATCAGCAAC 


CCAGGCCATC 


18060 


CCTCCTTGGA 


AAATAGGATA 


ATCAATCTTC 


AATAATTCTC 


TAATACGCCT 


TTTCATAGTG 


18120 


CCTCCAACCT 


TCCTTGCTTA 


CGTAATAGTT 


CGATTTCACC 


ATAATTTCAC 


AGTCAAACTA 


18180 


TTACCTAAAC 


AAGAGGGAGT 


GGCTTTCTCC 


CTACTCCTTC 


TACTAATATT 


CTGCTT ATTT 


18240 
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TGCTTCCTCT 


TCAACGTAAG 


CAACCAAGTC 


ACCAACTGTT 


TTCAAGTCAT 


TTTCTGCTTC 


18300 


GATTTGGATA 


TCAAAAGCAT 


CTTCCATTTC 


TGAGATTACT 


TGGAACAAGT 


CCAATGAATC 


13360 


TGCGTCCAAA 


TCATCAAAAG 


TTGATTCAAG 


TGTTACTTC7 


GATGCCTCTT 


TTCCAAGTTC 


19420 


TTCAACGATA 


ATTTCTTGTA 


CTTTTTCAAA 


TACTGCCATG 


ATAGGACTCC 


TTTAAAATAA 


18480 


ATAGTTTTTT 


TATAACAATG 


TGTTCACCAC 


ATGATTACCT 


AAATTGTAAG 


AATGAGCGTG 


18540 


CCCCAGGTCA 


AGCCTCCACC 


GAAGCCTGAT 


AGAAGAACAG 


TCTGGCTACC 


ATCTAAAGGG 


18600 


ATGAGACCTT 


GTTCTACACA 


CTCTGAAAGT 


AAAATCGGGA 


TACTGGCTGC 


ACTGGTATTG 


18660 


CCATATTCCA 


TCATATTGGC 


TGGAAGTTTG 


GCTCGGTCAA 


CACCAATTTT 


TCTAGCCATC 


18720 


TTATCCAAAA 


TACGGTCATT 


GGCTTGATGA 


AGTAGCAGAT 


AATCCAAGTC 


TGTCACCTCT 


18780 


ATAGGAGATT 


CATCAATAGT 


CTGCTTGATA 


GACTTGGCTA 


CATCTCGAAT 


GGCAAAATCA 


18840 


AAGACTGTGC 


GTCCATCCAT 


CTTCAAAAAC 


GAATCTGCAC 


TTTCTTGATC 


TGAAAATGGA 


18900 


G AATGT AAAC 


CTGAATGCCC 


ATAAGTTAAA 


CACTCGCTGC 


GACTTCCATC 


GCTATTGACA 


13960 


CTCTCAGCTA 


AGAAATGCTC 


TTGCTCGCTA 


GCTTCTAACA 


AGACACCACC 


AGCACCATCT 


19020 


CCAAACAACA 


CAGCTGTTGA 


TCGATCCGAC 


CAATCGACTC 


CCTTAGAGAG 


GCTTTCACTA 


19080 


CCAATCACCA 


AGCCTTTTTG 


AAAGCGACCA 


GAAGCGATAA 


ACTTTTCAGC 


AGTTGAAAGA 


19140 


GCAAATACAA 


ATCCACTGCA 


AGCCGCGGTT 


AAGTCAAAAG 


CAAAGGCTTT 


ATTAGCACCA 


19200 


ATATTAGCTT 


GAACACGAGC 


AG CTGT AG AG 


GGCATCATCG 


AATCTGGAGT 


AATGGTAGCT 


19260 


AGGATGATAA 


AATCCAGTTC 


TTCTCCTGTT 


ATTCCAGCTT 


TTGCCATCAG 


TTTCTTAGCA 


19320 


ACCTCTGTAG 


CCAAATCACT 


CGTAGATTCT 




TATGCCTTTG 


TCGTATTCCC 


19380 


GTTCGACTTG 


AAATCCACTC 


ATCATTGGTA 


TCCATAATCT 


GAGCCAAGTC 


GTG ATTTGT A 


19440 


ACCACTTGCT 


CTGGCACATA 


ATGAGCAACC 


TGACTTATTT 


TTGCAAAAGC 


CATTATTTCA 


19 500 


AATCCTCCAA 


AAATTGGTAA 


AGATTAGTCA 


AACCTTTACC 


CATGACAGCA 


ATTTCTTCCT 


19560 


CGCTCATGCC 


ATCAATAATT 


TTTTCTACCA 


TGGCCTTGTG 


GAACCCTTTA 


TGCAGTCTAT 


19620 


GAATCAAGCG 


ACCCTTCTTT 


GTCAAATGCA 


GATGCACCAC 


ACGACGATCC 


TCTTCTGACC 


19680 


GAACTCGCTC 


AATGTAGCCC 


GG 








19702 


(2) INFORMATION FOR SEQ ID NO: 8 











(i> SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 6211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAAAATTTCC TCTCTTCTCT TGAAAAATTT TGAAAAAATG GTATGATAGT AACAAGTTAT 60 
TTTTAAGAGG AAAGAAAGGG GAATAATGGA GAAAATCAGT TTAGAATCTC CTAAGACGGG 120 
GTCGGACCTA GTTTTGGAAA CACTTCGTGA TTTAGGAGTT GATACCATCT TTGGTTATCC 180 
TGGTGGTGCG GTTTTGCCTT TTTATGATGC GATATATAAT TTTAAAGGCA TTCGCCACAT 24 C 

TCTAGGGCGC CATGAGCAAG GTTGTTTGCA TGAAGCTGAA GGTTATGCCA AATCAACTGG 300 
AAAGTTGGCT GTTCCCGTCG TCACTAGTGG ACCAGGAGCA ACAAATGCCA TTACAGGGAT 360 
TGCGCATGCC ATGAGCGATA GCGTTCCCCT TTTGGTCTTT ACAGGTCAGG TGGCCCGAGC 420 
AGGGATTGCG AAGGATGCCT TTCAGGAGGC AGACATCGTG GGAATTACCA TGCCAATCAC 4 80 

TAAGTACAAT TACCAAGTTC GTCAGACAGC TGATATTCCG CGTATCATTA CGGAAGCTGT 54 0 

CCATATCGCA ACTACAGGCC GTCCAGGGCC AGTTGTAATT GACCTACCAA AAGACATATC 600 
TGCTTTAGAA ACAGACTTCA TTTATTCACC AGAAGTGAAT TTACCAAGTT ATCAGCCGAC 6 60 

TCTTGAGCCG AATGATATGC AAATCAAGAA AATCTTGAAG CAATTGTCCA AGGCTAAAAA 7 20 

G C C AGTCTTG TTAGCTGGTG GTGGAATTAG TTATGCTGAG GCTGCTACGG AACTAAATGA 7 90 

ATTTGCAGAA CGCTATCAAA TTCCAGTGGT AACCAGTCTT TTGGGACAAG GAACGATTGC 84 0 

AACGAGTCAC CCACTCTTTC TTGGAATGGG AGGCATGCAC GGGTCATTCG CAGCAAATAT 900 
7GCTATGACC GAAGCGGACT TTATGATTAC TATTGGTTCT CGTTTCGATG ACCGTTTGAC 9 60 

CCGGAATCC? AAGACTTTCG CTAAGAATGC TAAGGTTGCC CACATTGATA TTGACCCAGC 132 0 

TGAGATTGGC AAGATTATCA GTGCAGACAT TCCTGTAGTT GGAGATGCTA AGAAGGCCTT 1080 

GCAAATGTTG CTAGCAGAAC CAACAGTTCA CAACAACACT GAAAAGTGGA TTGAGAAAGT 114 0 

CACTAAAGAC AAGAATCGTG TTCCTTCTTA TGATAAGAAA GAGCGTGTGG TTCAACCGCA 12 00 

AGCAGTTATT GAACGAATTG GTGAATTGAC GAATGGAGAT GCCATTGTGG TAACAGACCT 1260 

TGGTCAACAC C AAATGTG G A CAGCTCAGTA TTATCCCTAC CAAAATGAAC GTCAGTTAGT 1320 

G ACTTC AGGT GGTTTGGGAA CAATGGGCTT TGGAATTCCA GCAGCAATCG GTGCTAAAAT 138C 

TGCTAACCCA GATAAGGAAG TAGTCTTGTT TGTTGGGGAT GGTGGTTTCC AAATGACCAA 1440 

CCAGGAGTTG GCTATTTTGA ATATTTACAA GGTGCCAATC AAGGTGGTTA TGCTGAACAA 1500 

TCATTCACTT GGAATGGTTC GCCAGTGGCA GGAATCCTTC TATGAAGGCA GAACATCAGA 15 60 

GTCGGTCTTT GATACCCTTC CTGATTTCCA ATTGATGGCG CAGGCTTATG GTATTAAAAA 1620 

CTATAAGTTT GACAATCCTG AGACCTTGGC TCAAGACCTT GAAGTCATCA CTGAGGATGT 1680 
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TCCTATGCTA ATTGAGGTAG ATATTTCTCG TAAGGAACAG GTGTTACCAA TGGTACCGGC 17 40 

TGGTAAGAGT AATCATGAGA TGTTGGGGCT GCAGTTCCAT GCGTAGAATG TTAACAGCAA 1800 

AACTACAAAA TCGTTCAGGA GTCCTCAATC GCTTTACAGG TGTCCTATCT CGTCGTCAGG 13 50 

TTAATATTGA AAGCATCTCT GTTGGAGCAA CAGAAGATCC GAATGTATCG CGTATCACTA 1920 

TTATTATTGA TGTTGCTTCT CATGATGAAG TGGAGCAAAT CATCAAACAG CTCAATCGTC 1980 

AGATTGATGT GATTCGCATT CGAGATATTA CAGACAAGCC TCATTTGGAG CGCGAGGTGA 204 0 

TTTTGGTTAA GATGTCAGCG CCAGCTGAGA AGAGAGCTGA GATTTTAGCG ATTATTCAAC 2100 

CTTTCCGTGC AACAGTAGTA GACGTAGCGC CAAGCTCGAT TACCATTCAG ATGACGGGAA 2160 

ATCCAGAAAA GAGCGAAGCC CTATTGCGAG TCATTCGCCC ATACGGTATT CGCAATATTG 2 2 20 

CTCGAACCGG TGCAACTGGA TTTACCCGCG ATTAAAAATC CAACTTAAAT TTATTAAACC 2 2 80 

AG C CTAAAAG GCAATAAATA ATAGAAAAGA GAGAAAAGCT ATGACAGTTC AAATGGAATA 2 3 40 

TGAAAAAGAT GTTAAAGTAG CAGCACTTGA CGGTAAAAAA ATCGCCGTTA TCGGTTATGG 2 4 00 

TTCACAAGGG CATGCGCATG CTCAAAACTT GCGTGATTCA GGTCGTGACG TTATTATCCG 2 4 60 

TCTACGTCCA GGT AAATCTT TTGATAAAGC AAAAGAAGAT GGATTTGATA CTTACACAGT 2 520 

AGCAGAAGCT ACT AAGTT GG CTCATGTTAT CATGATCTTG GCGCCAGACG AAATTCAACA 2S80 

AG AATTG T AC GAAGCAGAAA TCGCTCCAAA CTTGGAAGCT GGAAACGCAG TTGGATTTGC 2 64 0 

CCATGGTTTC AACATCCACT TTGAATTTAT CAAAGTTCCT GCGGATGTAG ATGTCTTCAT 2 700 

GTCTGCTCCT AAAGGACCAG GACACTTGGT ACGTCGTACT TACGAAGAAG GATTTGGTGT 2 7 60 

TCCAGCTCTT TATGCAGTAT ACCAAGATGC AACAGGAAAT GCTAAAAACA TTGCTATGGA 2 820 

CTGGTGTAAA GGTGTTGGAG CGGCTCGTGT AGGTCTTCTT GAAACAACTT AC AAAG AAG A 2 880 

AACTGAAGAA GATTTGTTTG GTCAACAAGC TGTACTTTGT GGTGGTTTGA CTCCCCTTAT 2 940 

CGAAGCAGGT TTCGAAGTCT TGACAGAAGC AGGTTACGCT CCAGAATTGG CTTACTTTGA 3 000 

AGTTCTT C AC GAAATGAAAT TGATCGTTGA CTTGATCTAC GAAGGTGGAT TCAAGAAAAT 3060 

GCCTCAATCT ATTTCAAACA CTGCTGAATA CGGTG ACT AT GTATCAGGTC CACGTGTAAT 3120 

CACTGAACAA GTTAAAGAAA ATATGAAGGC TGTCTTGGCA GACATCCAAA ATGGT AAATT 3 180 

TGCAAATGAC TTTGTAAATG ACT AT AAAG C TGGACGTCCA AAATTGACTG CTTACCGTGA 3 2 40 

ACAAGCAGCT AACCTTGAAA TTGAAAAAGT TGGTGCAGAA TTGCGTAAAG CAATGCCATT 3 3 00 

CGTTGGT AAA AACGACGATG ATGCATTCAA AATCTATAAC TAATTAGAAA TATATAGCGC 3 3 60 

TGGAGATGAT TTTATCAAAA AGATTATGAG AAAAATTGCA TCGTTATTAT TGCTTCTAGT 3 4 20 
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TGTATAATGT 


AATTACACCG 


TCGGTAATAC 


TGCTAGCAGA 


CCAAAATAAA 


GC AGATTGGT 


3490 


CGTATGATGA 


AAATGCTGTA 


ATTAACATTT 


ATGATGATGC 


TAATTTTGAA 


GATGGTAGGT 


3540 


TGCATATGAA 


CTTTGAACAA 


TTCTTCAAAT 


TGGC ACAAAT 


AGCTAGAGAA 


GAAGGTCTTG 


3600 


AAATTCATTC 


TCCGTTTGAG 


AGAGCTGGTG 


CGACTAAATC 


TGCTCGTTAT 


ATAGCGAAAT 


3660 


GGATTTTGAG 


AAATAAAAAA 


CATTAACAAA 


TATAGTTGGT 


AAATCATTAG 


GACCTAAATC 


3720 


AGCTGTTAGA 


TTCGGAGAAG 


CTTTATCCTA 


TATTGAAGGT 


CCTCTTCGCA 


GAATAAATGA 


3780 


GACGATAGAT 


GGCGGTTTAT 


ATCAAATAGA 


GCAAATTATT 


GCATCTGGAT 


TGAAAGAATC 


3840 


GGGTTTAAAT 


GACTGGACTG 


CGAAAACTTT 


AGCTTCAGCT 


ATTCGTGGGA 


TATTAGATGT 


3900 


ACTTATTTAG 


GGGTTGAAAT 


CATATGAATA 


TTACCAATTT 


GTTTTCTATC 


AAGACAGGAT 


3960 


GTGATGAAAC 


TGATAGGCAA 


CTGCAAAAAC 


TATTTTTTCA 


GTTGGATTTA 


C AATTGGG AG 


4C20 


AATTGACAGA 


TCAACTAAGA 


AAATTAGATT 


CTAATTTTGT 


TCCTCGTAGT 


CAATTTGTAG 


4080 


AC AC GTTG G A 


TTTGAATGAT 


GTAGAATATA 


AAGAAATTTT 


AAACTATTTT 


ATCTTCCATC 


4140 


GTAATGATAG 


TGAAGAAAGT 


TTGGTAGAAT 


GGTTATATGA 


TTGGATTTCC 


ACAAATCGTT 


4 2 00 


ATGAACTTCC 


TAAAGAGTTT 


TCGATTCGTA 


TGGCTCATAA 


ATACCATGAA 


AGTGTTACTG 


4260 


AAGTTTTCGG 


AGATGAATAA 


CTAAAAAACA 


GTCATTACTC 


ACTGTTTTTT 


ATAGAAAAAG 


4320 


AGGTTTTATA 


TGTTAAGTTC 


AAAAG AT AT A 


ATCAAGGCTC 


ACAAGGTCTT 


GAACGGTGTG 


4380 


GTTGTGAATA 


CTCCACTGGA 


TTACGATCAT 


TATTTATCGG 


AGAAGTATGG 


TGCTAAGATT 


4440 


TATTTGAAAA 


AAGAAAATGC 


CCAGCGTGTT 


CGCTCCTTTA 


AAATTCG TGG 


TGCCTATTAT 


4500 


GCCATTTCCC 


AGCTCAGCAA 


GGAAGAACGT 


GAACGTGGGG 


TAGTCTGCGC 


TTCTGCGGGA 


4560 


AATCATGCGC 


AGGGAGTAGC 


C T AT ACT TGT 


AATGAAATGA 


AAATTCCTGC 


T ACT AT CTTT 


4620 


ATGCCCATTA 


CTACGCCACA 


ACAAAAGATT 


GGTCAGGTTC 


GCTTT TTTGG 


TGGGGATTTT 


4680 


GTAACTATTA 


AACTAGTTGG 


AGATACCTTT 


GATCCCTCAG 


CCAAAGCAGC 


TCAAGAATTT 


4740 


ACAGTCTCTG 


AAAATCGTAC 


CTTTATTGAT 


CCTTTTGATG 


ATGCTCATGT 


TCAAGCAGGT 


4800 


CAAGGAACAG 


TTCCTTATGA 


GATTTTAGAA 


GAAGCTCGAA 


AAGAATCGAT 


TG ATTTTG AT 


4860 


GCTGTCTTGG 


TTCCTGTTGG 


TGGTGGCGGT 


CTCATTGCCG 


GGGTTTCTAC 


CTATATCAAG 


4920 


GAAACAAGTC 


CAGAGATTGA 


GGTTATCGGA 


GTAGAGGCGA 


ATGGAGCGCG 


TTCCATGAAA 


4980 


GCTGCCTTTG 


AGGCTGGAGG 


TCCAGTAAAA 


CTCAAGGAAA 


TTG AT AAATT 


TGCTGATGGG 


5040 


ATTGCTGTGC 


AAAAGGTAGG 


TCAGTTGACC 


TATGAAGCAA 


CTCGTCAACA 


TATTAAAACT 


5100 


TTGGTAGGTG 


TCGATGACGG 


ATTGATTTCT 


GAAACCTTGA 


TTGACCTTTA 


CTCTAAGCAA 


5160 


GGGATAGTCG 


CAGAACCTGC 


TGGAGCGGCT 


AGTATCGCCT 


CTTTAGAGGT 


TTTAGCTGAA 


5220 
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TATATTAAGG GGAAAACCAT TTGTTGTATC ATTTCTGCAC CAAATAATCA TATCAACCGT 5 280 

ATGCCAGAAA TGGAAGAGCG TGCCTTGATT TATCATGGTA TCAAACATTA CTTTGTGGTC 5 3 40 

AATTTCCCAC AACGTCCAGG AGCTTTGCGT GAGTTTGTAA ATGATATCCT GGGGCCAAAT 5 400 

GATGATATCA CACGTTTTGA GTATATCAAA CGACCTAGCA AGGGAACAGG CCCAGTATTA 5 4 60 

ATTGGGATCG CTTTAGCAGA TAAGCATGAT TATGCAGGTT TGATTCGTAG AATCGAAGCT 5 52 0 

TTTGATCCAG CTTATATTAA CTTAAATGGT AATGAAACGC TTTATAATAT GCTTCTCTGA 5S80 

GGACTAATAA AAAAATATCA TACCTTCATT TTGATTTCCT ATCTATTGAC AAGCATAGTC 5 64 0 

ACACTGTCTT TAATACTCTT CGAAAATCTC TTCAAACCAC GTTAGCTCTA TCTGCAACCT 57 00 

CAAAACAGTG TTTTGAGCAA CTTGCCCCTA GCTTCCTAGT TTGCTCTTTG ATTTTCATTG 57 60 

AGTATAAGGT ATGATTTGAT TTCTTTTTCT TGACAAATAT ACT AT ATT AA AAACATATAT 5 3 20 

AAGTAATTAA CTGAGCTTAT CTGTCTTGTC ATCTCTATTA AGG AT GGTTT AGATAATCGC 5 880 

GTGTCTGCTT CTAGGCTAGC ACCTCAATAT CCAAAGGACT GATCAATTTC AAGGACATAA 59 4 0 

GGAATACCTA TCTCTCAGAT GATTTATTCA GGAAGAAAGA TAGGAGTTTT TGAGCTAGTG 6000 

AAGGCTTGGA TTTCTAAAGG TTAGAACTAT CATCTTCAGT TCTTAAATCC AAGAAATAAG 6060 

CTATCTTACG GAAATAGAGA AGCATTTTTT AAGAACTTGA ATAATTTCGC ACCTTAAGAG 6120 

GGTAATAATA CAGTATTTTT ATTAGCAAAT ATTTATGGTG TAGAGGCTAG CAAAACCTAT 6L8 0 

ATATTATCGG ATTTAAAAAG GAAGTAAGAA A 6211 



(2> INFORMATION FOR SEQ ID NO: 9: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES3 : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCGGACTCCC CACGATTCTT - CAAAATAACT GAGTATATTT CTATCTTGAT TTTCAGATAT 6 0 

AAATTCTTCC TTCTGTGGCC TCTTCTTACG CTTGAGAAGA GCTTCTCCGA CATGGCTTCT 120 

TCCTTACTGA GCAAAACCTT GAGCATAGAT AAGTTTGACT GGCAAGCGTG CTCTTGTATA 130 

TTTGGCTCCC TTCCCACTAT TGTGGATAGC GAGGCGTCTT CTCATATCAG TCGTATAGCC 24 0 

T AT AT AGT AG GATCCATCAC GACACTCCAG AACGTACATA TAAGCCTTAT GATCCATAAT 3 00 

AAATCTCTTC GATTTCGGGC GTATAAGAGC CATCATCATT GTGGACAATC AAAGGAGGTA 3 60 
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AGACCTTAAA GCCACTTGTT GAGCCATCCT TGATCGCCTC AATCAAAAGC ATATTGGCTT 420 

CCTTTTCTCT TTTTGGATAA ACAAACTGCA GGCGCTTAGG GGCTAGATTA TGTCCTTTTA 4 80 

ACGTATCCAA AATATCCAGA AGTCGATCAG GACGATGAAC CArGGCCAAA CGCCCATTAG 54 0 

ACTTG AG AAT ACTCTGGGCA CTACGACAGA TTTCTTCCAA ATTAGTCGTG ATTTCGTGTC 600 

GAGCCAAGAG ATAATGTTCA CTCTCGTTCA G ATT AG AAT A AGGATTCACC TTGAAATAGG 660 

GTGGATTACA CAAAATCATA TCCACCTTAC TCCCCTGAAT GTGAGCAGGC ATATTTTTCA 720 

AATCATCGCA GATGACCTGC ATTTGCTCCT CTAATCCATT CAAACGG AC A GAGCGTTCAG 780 

CCATATCCGC CAAACGCTCC TGAATCTCAA C AG AC AAT AT CTGTGCTTGA GTACGACTGC 340 

TAGCAAAAAG CCCCACTGCT CCATTCCCAG CACAGAAATC CACAATCAAC CCCTTCTTAG 900 

GAAAACGTGG AAATCGTGAT AAGAGAACAC TATCCACCGA ATAGCTAAAA ACCTCTCTAT 960 

TTTGAATGAT TTTGATATCT GTCGAAAAGA GCTGGTTAAT GCCCTCTCCT GATTTTAATA 1020 

ATTGTTCTTC TTCCATGGTC CTATTATAGC AAATTCATAT TAACATTACA AAAAATATAA 1080 

AACTCTAAAC TACTTCTTCT TTTTTAAATG GTGCAGGGCT TCTCCAGTCC AGATTGGTAG 1140 

CATTCGTCGA AAGGGAGCAA AGCCGTAGTT AAAGCGGTCG CTTGAAAAGC GTCTCCGTCT 12 00 

AGGAAACTGG TACTTTTCTT CCTCCAAAGT GCGGATAGAA AGACTGGCTT TCCCTGTAAA 1260 

TTCATCTAAA TCCACTACCT GAACTTGAAC CTCTTCATCG ACTTTCAAGG TTTCATGAAT 1320 

ATTTTCAATA AATCCTGTCC GAATCTCTGA AATCTGAATC AGCCCCGTAT CACCCGTCTC 1380 

TAACTCAACA AAGGCACCGT AGGGCTGAAT CCCTGTAATA CGCCCCTTTA GCTTATCACC 144 0 

GATTTTCATC TTAGTCCTCG ATTTC AAT AG TTTCAATTAC AACATCTTCA ACTGGCTTGT 15C0 

CCATAGCTCC TGTCTCAACA GCAGCAATGG CATCCAAGAC AGCCTAAGAT GCTTCATCAG 1560 

CTAACTGACC AAAAACCGTG TGACGGCGGT CTAGGTGAGG TGTCCCACCT TGATTGGCAT 1620 

AGATTTCTGC AATCGGTTCT GGCCAACCAC CACGAGTAAT TTCTTTCTTA GAATAAGGTA i680 

GGTGTTGGTT TTGCACGATA AAGAACTGGC TGCCGTTGGT ATTTGGACCA GCATTTGCCA 174 0 

TGGAAAGAGC ACCACGGATA TTGTAAAGCT CTTCTGAGAA TTCATCCTCA AAAGATTCCC 1800 

CGTAGATTGA CTCGCCACCC ATACCAGTTC CAGTTGGGTC TCCACCTTGG ATCATAAAGT 1860 

CCTTGATAAT ACGGTGGAAA ATGACACCAT CATAGTAGCC ATCTTTTGAA AGAGATACAA 1920 

AGTTAGCCAC TCTTTTAGGA GCATGTTCAG GGAAAAGCTT GATACGTAAG TCTCCGTGAT 19 80 

TGGTCTTAAT AGTCGCAAGA GGACCTTCTA CTGTTTCAAT GTCTACTTGT GGAAAATGCA 2040 

ATTCTTTTTC TACCATACCA AATACTTCTA AGGCAGCAAA AATGCCATCT TCTTCTAATG 2100 

TTTTTGTAAT ATAATCTGCT TTTTCTTTGA TTTTATCATG AGAAATTCCC ATGGCAACGC 2160 
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TGATTCCAGC ATAATCAAAG AGTTCCAAGT CGTTGAGACC ATCTCCAAAA ACCATCACCT 2220 

TCTCTGGTTT CAAGCCAAGG TGTTCCACAA CCTTTTCCAC CCCCGTCGCT TTGGAGCCTG 2280 

AAATCGGCAC AATATCAGAC GAATGTTGAT GCCAACGAAC CATGCGAAGT TTGTCTGAGA 2 34 0 

GACTGTCAGG CAAGTGCAAG TCATCTCCCT TATCTTCAAA AGTCCACATC TGATAGATAT 2400 

CTTCTTTTTC ATGGAAATCG GGATCTACAT CTAAGTCGGG ATAAATTGGA TTGATAGCTT 2 4 60 

CACTCATCAT ATCGGTGCGA GTCGACAACT TGGCATCATG ACTCCCAACC AAGCCATACT 2520 

CAATTCCTTC TTGCTTAGCC CAAGAGATAT ACTCCTCAAC AT CT G ACTTT TCAATCTGAT 25 80 

GCTGATAAAT GACCTGACCT TTTTTATCTT CGATATAAGC CCCATTCAAA CTTACAAAAA 2 64 0 

AGTCAGGCTT GAGATCACGA ATCTCTGGAA CAACACCAAA AATGCCACGT CCAGAGGCGA 2700 

TTCCTGTTAA AATTCCTTTT TCACGCAACT GTTTAAAAAC AGTGGGAATT GTAGTTGGAA 27 60 

TAAACCCTGT CTTTGAATTC CGCAATGTAT CATCAATATC AAAAAAGACA ATCTTGATCT 2 32 0 

TCTTTGCCTT GTATCTTAAT TTCGCGTCCA TCTCACTACC TCTTTCAATC TAACTCTTTC 2 880 

CATTATATCA TAAAGTAGGC AAATCCCCTA TTTTCAAAAA GTTTATCATT TTTATTTTAA 2 940 

TTTCTTGGAT GAGAAAAGAG AC AT ATT TAT GAAAAAGCTC CATCGTGCTT TTAATGTGTT 3 000 

CTCTTGTTTT CAAACTCGTA AAAAGGGAGC CACTGATCCT AACTCGCTCT CTCATTTCAA 3 0 60 

AGCTTGTGAA AAAAGACCCG TTGGGGTCTT AATTCGCTTT CTT GTTTTC A AGCTCATGAA 3120 

AAAGAGACCC AACTGGGTCT TTTCTTTAAT CTTCGTTT AC GAAAGGCATC AAAGC C ATT A 3180 

CGCGAGCGCG TTTGATAGCT GTTCTTACTT TACGTTGGTT TTTAGCTGAA CTTCCTGTTA 3 2 40 

CACGACGAGG AAGGATTTTC CCACGTTCTG AAACGAAACG GCTAAGAAGC TCAGTATCTT 3 300 

TGTAATCAAC ATATTCAATT TTGTTTG CT G CGATGTAATC AACTTTTTTA CGGCGTTTGA 3 3 60 

ATCCGCCACG ACGTTGTTGA GCCATGTTTT TTCTCCTTTA TAAGTTTAGT TGTCCATTAG 3 42 0 

AATGGTAAAT CATCATCTGA AATATCCAAT GCCTTTCTTC CTCCAAATGG ATTTTCATTA 3 4 80 

CGTGAAAAGT CTGGTACTGA ATTTGTAGGT GCTGAATAGT TTGCAGTTGG TGCAGAGTAA 3 54 0 

GCTCCACCTG TGTGACCCTC ACGCACACTA CGGCTTTCC A ACATTTGGAA ATTCTCAGCC 3 6 00 

ACGACCTCTG T C AC GT AG AC ACGTTGTCCT TGCTGGTTAT CGTAACTACG AGTCTGGATA 3 6 60 

CG ACCTGTC A CCCCGATAAG TGAGCCTTTT TTAGCCCAGT TAGCAAGATT TTCAGCCTGT 37 2 0 

TGGCGCCACA T AACG AC ATT GATAAAATCA GCCTCACGTT CACCATTTTG ACTCTTAAAT 37 80 
GTACGGTTTA CTGCAAGAGT AAAAGTCGCA ACTGCTACAT TTGATGGGGT AT AACG C AAC . 3 84 0 
TCAGCGTCAC GTGTCATACG CCCTACAAGT ACAACATTCT T AATC AT ACT TTACCTTCTT 3900 
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ACGCGTCAAT TTTGACGATC ATGTGACGAA GAATGTCAGC GTTGATTTTT G AAAG ACGGT 3 960 

CAAACTCTTT AAGAGCTCCA TCGTCATTTG CTTCAACGTT AACGATGTGG TAAAGTCCTT 4 0 20 

CACGGAAATC TTGGATTTCG TATGCAAGAC GACGTTTTTC CCAAGTTTTT GATTCAACAA 408 0 

CAGTTGCACC GTTGTCAGTC AAAATAGAGT CAAAACGTGC TACCAAAGCG TTTTT AGCTT 4140 

CTTCTTCAAT GTTTGGACGA ATGATATAAA GAATTTCGTA TTTAGCCATT GATATGTTCC 4 200 

TCCTTTTGGT CTAATGACCC CAAGACTTTG CAAGGGGTAA GTGAGGTTCG CTCACAATAA 42 6 0 

ACT ATT AT AC TAGAAAAAAT TTTTTTACGC AAGTAAAAAC ACTAGAATTC GAAAAAACGC 4 3 20 

CACATGGGCG TTTTCCTGTT CTTATGGTTT GATACGGTGC AACATACGTG GGAATGGAAT 4 380 

AGCTTCACGG ATATGTTTTG TTCCTCCTGC GAAGGTTACC ATACGTTCGA TACCGATACC 4 44 0 

AAATCCTCCG TGTGGAACTG TACCGTATTT ACGAAGGTCA AGGTAGAATT CATATTCTGT 4 500 

ACGATCCATG CCAAGTTCAT CCATCTTAGC GACAAGGGCA TCGTAATCTT CCTCACGCAT 4560 

AGACCCACCG ATAATTTCTC CATAGCCTTC TGGAGCAAGC AAGTCTGCAC AAAGCACGCG 4 62 0 

CTCTGGATTT CCAGGAACTG GTTTCATGTA GAAGGCCTTG ATGGCTGCTG GATAGTTCAT 4680 

GACAAATGTT GGCACACCAA AGTGGTTTGA AATCCAAGTT TCGTGTGGTG ACCCAAAGTC 474 0 

ATCACCATGC TCAAGATGCT CGTAGTCAGC ATCTTCATCA TTTTCATGCT CTTGCAAGAG 4 8 00 

GTCAATGGCT TGATCGTAAG TGATACGTTT GAATGGCTCT GCAATGTAGC GTTTCAAGAG 4 8 60 

TTCTGTAT C A CGTTCCAAGG TTTCCAAGGC TTGAGGCGCG CGGTCAAGAA CACCTTGTAG 4 920 

AAGAGCTTTC ACATAAGCTT CTTGCAAGTC AAGCGACTCA TCATGTGTCA AGTATGAGTA 4 9 80 

CTCAGCATCC ATCATCCAGA ACTCAGTCAA CTGACCGCGT CTTTTTCATT TTTCAGCACG 504 0 

GAAAACTGGA CCAAAGTCAA AGACACGACC AAGACCCATA GCCCCTGCTT CTAGGTAAAG 5100 

CTGACCTGAT TGGCTCAAGT AGGCTGGCGT TCCGAAGTAG TCAGTTTCAA AGAGTTCTGT 5160 

AGAATCTTCT GCCGCATTTC CTGAAAGAAT TGGGCTGTCA AACTTCATAA AACCGTTCTT 52 20 

GTCAAAGAAC TCATAAGTTG C AT AG AT AAT AGCGTTACGG ATTTGCAACA CAGCTACTTG 52 80 

CTTACGAGAG CGTAgCCACA AGTGACGGTT ATCCATCAAA AAGTCTGTTC CGTGTTCTTT 5340 

TGGTGTGATT GGGTAGTCTT GAGATTCACC GATCACTTCG ATGTCTGTGA TGTCCAACTC 54 00 

ATAGCCAAAT TTAGAACGTT CGTCCTCTTT GACAATACCT GTCACATAAA CAGACGTTTC 54 60 

TTGGC TCAAG CGTTTGATAA CATCAAACTT CTCAAGTCCC ACTTCTTCAC CAAATTTTTC 5 520 

GACAAAGTTT GGTTTAAAAG CCACACCTTG AAAGAAGGCT GTTCCATCAC GCAATTGTAA 55 8 0 

GAAAGCGATT TTTCCTTTTC CTGATTTGTT GGCAACCCAA GCGCCAATCC TCACTTCCTG 564 0 

ACCAACATAG TCTTTTACGT CAATAATCGT TACACGTTTT GTCATTATTT TTCCTTTTCT 57 00 
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TTTTTATTCT TTATGGCAAA CCACCTCTAT ATTGTTCCCA TCCAGGTCAA TCATAAAAGC 57 60 

AGCATAGTAA ATCGGATGCT CACTTCGATA ACCACGAGCC CCATTGTCTC GCCCACCTGC 58 20 

CTCTAAGCCA GCCTCATAAC AAGCCTGAAC TTCTTCCTTA TTTTCTCCTA AAAAAGCAAA 53 90 

ATGAACAGGA TCTT GTGTTC CCTGAGTCAG CCAAAAATCA CCACCAGGAT GACGGCTGTT 594 0 

CGGGGATAGA AAACTAATTA GAGAACTAGT CTTAAAAGCC AATTTATAGT CCAAAGGAGC 6000 

GAGAAAACTC CTATAAAATC CTTATGAAAT TTGTAAATCC TTTACCTTAA TCTCAAAATG 6060 

ATCAATCATT CTCACTACCC AT AAATG CTT TCAAGCGTTC GACTGCTTCT TTAAGCGTGT 612 0 

CTAGGTCTGT CGCATACCTG AGGCGGACAT TTTCTGGTGC TCCAAATCCA GCTC CTGTT A 618 0 

CCAAGGCCAC TTCGGCTTCT TCTAAGATAA CAGTTGTAAA GTCTGTCACA TCCGTGTAGC 624 0 

CTTTCATCTC CATCGCCTTT TTGACATTTC CGAAGAGATA GAAGGCCCCT TGCGGTTTGA 6 300 

CCACTTCAAA TCCTGGTACC TCTGCAAGGA GGGGATAGAT GGTATTAAGA CGTTCCTCAA 6 3 60 

AGGCCTGACG CAT G CTT T CT ACAGTATCTT GCTCACCTGA TAGAGCCTCA ACTGCTGCAT 64 2 0 

ATTGGGCTAC TGCTGACGGA TTCGAAGTTG TTTGACCTGC AATCTTGGAC ATGGCAGCGA 64 80 

TAATGTCTGC TTCTCCAACG GCATAACCAA TCCGCCAACC AGTCATGGCA T AAGTTTT AG 654 0 

ACACACCATT GATGACCACT CTTTGCTTGC GAATCGCTTC CGATAGGCTA GAAATCGGTG 6 600 

TGAACTCATG ACCATTATAA ACCAAGCGGC CATAGATATC GTCTGCTAGG ATGAGAATAT 6660 

CATTTTCTAC AGCCCAGTTT CCAATTGCCA AGAGTTCCTC ACGGCTGTAA ATCATACCTG 67 2 0 

TGGGATTAGA TGGCGAATTC AGCACCAAAA CCTTGGTCTT CTCACTGCCA GCTGCTTCTA 6780 

ACTGCTCTAC GGTCACCTTA AAGTGATTGT CTTCCTTACC AG AAAC AAAG ACGCGAACGC 6 84 0 

CTTCTGCCAT CTTGACCTGA TCTCCATAGC TAACCCAGTA TGGGCTTGGG ATGATGACTT 6 900 

CATCACCTGG ATTGACCACA G CC AT AAAG A AGGTATAGAG AGAATATTTG GCTCCCGCAG 69 60 

CGACTGTCAC TTGATTTGAC GCTACAGAAT AGCCGTAAAA GCGCTCAAAG TAGCTATTGA 7 02 0 

CCGCCGCCTT AAGCTCTGGC AGACCTGAGG TTACTGTATA AAAACAAGCA CGCCCATCTC 70 80 

GAATCGATGC AATGGCGGCA TCTTGGATAT TTTTGGGAGT AGTGAAATCT GGCTCACCCA 7140 

AGGT T AG AG A CAAAATATCT CTACCCTCAG CCTTCAGTGC TTTGGCACGG GCTCCACCAC 72 00 

CCAAAGTCAC ACTTTCTTCC ATTTCTAAAA CACGGTTGGA TAGTTTCATA GGCCCTCCTT 72 60 

GTTGACCAAT GCTCCTGTTT CAAAATCTAC TAGATAAAAA TCAGATCCTG ACTTAACTTC 73 20 

CCAGATTGGC TTATCTTGAT AACGG CC AAA GGTTATCTTC TCAATCTCGC CAGCTCCCTT 7 3 80 

TTCCTTAGAA ACCGTTTCTG CTTTTTCTTG TGAAACACCC TCATTTAGCT GATAAACGTA 74 4 0 
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AATCTTATGG TCATCTTTAC CAATCAGGAC AGCAAGCGCT TCTTGCTGTT TGTTACGACC 7500 

AAGAACGCTG TAATAAGATT CCAAGCCATT GTATAAATCA ACCTGATCAG CCTGCTCTAA 75 60 

TCCTGCATAC TGCTGAGCTA ATTTTTCTCC TTCACTTTTA GCTGTTTGAT AGGGTTTCAT 76 2 0 

GCTAAGAGAA ACCATATACA GAAAGGAACC ACTGATAACC ACAAACAAAA TCGTCATCCC 7 680 

TAGACCATAC TGCCACAGTA GATTATTTTT TCCTTTGTTT TGTCTTTTTT TCACTCGTCT 77 4 0 

ATTTTACCAT CTATTAAGCT TTATTACAAG TGAATATAAG AATACTCTTC GAAAATCTCT 7 800 

TCAAACCACC TCAGCTTTAT CTGCAGACCT CAAAGCTGTG CTTTGAGCAA CCAATTCTAT 7 8 60 

TTCTCCCTTC AAACAAAACC GATTTTGAAA GTGAAACAGT TCTTACTTTT TCAGTCACAA 7 9 20 

ATGATTAGAG TTTGCCGGG 79 3 9 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9397 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCGCTCTACC GTCAAATAAT TACCATTTTG TTTAATACCG AAATTTTTAT CTACTGAAAA 60 

TTCAGTTGGT CTGTTGGTAC GATCGTCGTA TACAGTACCA TTCTCACGAA TAGTATAATT 12 0 

GTAATCAGTA TCACCTTGTT TCCTTAATTT AAGGTAATAA TTACCATCAA TTTCTTTATA 180 

ACCTGAATCT TTTCTAGTTG CTTCTCTAAA ACTTACTCCA GCAGGCATCA CATCAGCAAA 2 40 

CATGAGTACT TGTTTGTTCT TTTTTTCAAC AATAACAGAG TCAATATAGG TTGCACCACC 300 

GCTGATTTGT AAGTCACGTC CACCAACTTC ACGAGGCCAT TCTAATGGTA CTGGCGCAAA 3 60 

ATCATCGAAT GCCAATGTTA ATTTTGGTTT AGTCCATGTC TTACCATTAT CATCACTATA 4 20 

ACT TGT AG C A AT ATT AATTT TATTCAAGAA ATCATGAGTT CCACCGTAAC GAGCGTCAAT 4 80 

GCTTGAAAAT ACCCGACCAT TGCTAAAAGT ATACAGAACT GGAATACGGA AATAGTTAGA 54 0 

ACCTGTTGTA TCATTAGCCG TATAAATTAA ATGTCCAGTA ACAGCGTTTG TTGTCATCTT 600 

TTTAACAGTT TCTTCATCCA AT GC ACT ATT AAAGAATTTG ATATTTTCTA GTGTTCCGTT 6 60 

AAAACCAAAC GCCGTTTTTC CTGCACGTTT CACTCCCCCA AGCATATAGT AATCAATACC 7 20 

TTTAATATCC TTGATGTTTA GGAAATTATC CACTTTCTTT TCTACTACTT TTGTACCATT 730 

TGCGTATAAA GAATATGTTT TTTTGACTGA ATCTCCTACT ACTGCAACAG TGTTAGTCAC 84 0 

AGCCTCTTGT TTGTACTTAC CCCAAACTGA AGCAGGTCTG GAT ACT AGGT TATTTTTATT 900 
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GGAAGAAGTA TCACGCGCTT 
ATAACTATTT TGTTGACCGG 
CTCACTTGAC TGATTGTACT 
TAACTCTTTA CTAACATTTT 
AGGAGTTTCT TCCGCTGTAG 
TACAGTAACT TCCGAAGAGT 
TGCAACAGGT TGCACCAACT 
AACTGAGTTA GCAACAAATG 
AATATTTTTT TTCATTTTAT 
TTTC AT C ATT GCAATGAATC 
TCCCTGGAAG CAATTCAACA 
CTTCGCT AAA AGGTACACGT 
TATTTTCAAC AACAATATGA 
GAACTCCATC AGCTAGATAA 
TTTTTCCATC AACTAAATCT 
AATATTTATT GACAGAAGCA 
TTTCTCTTTT CTATTAGTGA 
CATGAATCGT TGTATATTTA 
AACCTGAAAC TACTGATACA 
TTGGAGGAAG GAACTATTTA 
AAGCACCTTG TTTATTTGCT 
GACCAAGTAC AAGTCCCATG 
GAGCCATGAC AATGGAAACA 
CAATTTTCGT ACGACGATTG 
TCCATGAAGT TGCAACAGAG 
TCGCTGCCAA GATCAAACCT 
AGATTTGGTC TTGAGGGATA 
CGTACAAGCC TGTACCAATC 
TTGTGAACAA CATCTTATTA 



215 

CCATCCCCAA CTCACCATTG TCTCT AAGG A ACACATCTAC 9 60 

GTTTGGAATT AGATATTCCA AAC AG AGCTT GTAAGCCTTT 102 0 

TAATCACTAC AGTAAAGTCA CCGCTAGTAA ATTTATCCTT 108 0 

CTCCGCCCCC TGTTAAAGTA AC ATT ATT TT TTTCTAAGAC 114 0 

AAGATGGATC CTTAACAGTA GTTTCAACTG TTCGAGGTTG 1200 

TATCCGATGT AGGTTGTACT TCCGAAATCG GAGTCGTTGG 12 60 

TTGGTGTTCA TACTTCAGAA GTTTCAGTCT CCTGAGCTGC 13 20 

CTGATAATAC C AC T AC AG T A CCT AAGGTTA CATATTGTTT 13 80 

TTTTCCTCGT TTAAAACTTT GATAACAAGT TTTTTAACAG 14 40 

TTTGGTTGGT GAAGATCTTC TTCAAAAGTC ACCAACATAT 1500 

ATTTG AT AGT CTTTGCTATC GTAAAAAGCA ATATCCTTCT 15 60 

GACTGGGCAC GAACTGGGGA AGTTACTGCC ATTTTTTCAG 162C 

ATATCTAAAT ATTTCTTATG AGTTTCAAAA ATATCTCCTG 1630 

GTCATACAAT TTGCAAAAAC ATTTTCCCCG TCAATATCAA 174 0 

GTCAAATTTG TATTTTCTAA AAAATCACAG ACTTTTGAAA 1800 

TATCGTTTAA AATCAGATTG TTCAGAAATA ATCATATTAT 18 60 

CGAACTTCCC AACTTGAATC CGCTTTAATT TCTGTAATAT 1920 

GGTGCAGATA CTTTATTTCC AGT AAC AAC A GATACAATAT 19 80 

CAGATTGAAA TCAATGAATA TGCCCAGTAG CTAACAGCTG 20 '10 

ATAAATACCA TGACGATGGT TGATACAATC AGCGCTGCAT 2100 

TTTTTAGAAA CAAATCCAAG AATAAATACA CCACCAAGTA 2160 

AAACTATTGA ACCATTCGTA TGCAGATTTA AT ATC TG AGT 2 2 20 

CCAATTGAGA ATAAACCTAC TGCTAGAGAT AC G AATTGTG 2 2 80 

TCTGACATAT TTTTAGAAAT GACATCTTGA AT ATC C AA TG 2 3 40 

TTCAAACCTG TTGAAATAGT TGATTGAGAT GCTGCATAAA 2 4 00 

GTGATACCTA CTGGTAACTG GTATGCAATA AAGTACATAA 24 60 

TTGCTAGCTG CACTATCTGC ATTTTGTACT TGATAGAATA 2520 

AAGT AAAAG A CTGTTCCAGT TGCAAGTGAC AAAACACCGT 2 580 

AGTTTCTTAA TATTTTCTCT TGTAGTAAAA CGTTG AACC A 2 54 0 
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AATCTTGAGA 
AGATGGAGTT 
CGTTTGC7AA 
ATAAAGCTAA 
CGGATTTTAG 
AAATATTGAT 
TAGACATACG 
T AG AATT AAA 
TAGGTAAGAT 
ATAAAATCCA 
GCATTGTGGC 
AGAACTCTTT 
GTAAATAAAC 
ATCTCCATAT 
CTGCTTGTTT 
AACCTAAATC 
CCTTACCTCA 
TATCTAAATC 
CATAAGTACC 
CTGGACCATT 
GTACAGGCAT 
ACAAACTACC 
TATTTGACGC 
TGAAATAAAT 
CCAATTCGAT 
CTTTAGCAAC 
T AC AT TC ACC 
GTACCAGAGA 
ATGCAGGGAT 
TTTTTTATTT 



TGAAGCATAG 
TGAAAGCAAG 
TGTTTCTGCT 
AACGACACCA 
ACCACCAGTA 
CTCAATTCCT 
TCCCAATTGA 
ACGTTTATCC 
AAAACGAATT 
GCTACCTGCA 
AAAAATGGAT 
TCCTTTCATC 
AATCAAGATA 
TGATTTTATT 
TGCAACTTCC 
AAGTTTTTCA 
TATCATCTTA 
7CGTTCTTGA 
ACCAATACCA 
GAATACAATG 
AGAAGAATTT 
AGTCAACGCA 
AGCTTCACTC 
AGGTGCGATA 
ACT AT CTTTC 
TTCCATAACA 
TGAAGAACCA 
TTTTACACGA 
AACGCCTTTG 
GATGACATTA 



GAAGACAAGA 
TTAGGATCGA 
ACTGCACCAA 
CTAATCAGAA 
TAAGAATAAA 
GTCAATACTG 
TAAATAATAA 
AAGTAATCAT 
GTCAGTGGAA 
TAAGAGCTAC 
ACCGAAGTAA 
TCTTTTTTAG 
ATTAAGTCAA 
TATTATAAAA 
AAGTCACCTT 
TTTAGACGCA 
TAGATAACTT 
ATCAAACTTT 
GC^TCTGCTC 
TAATCTTCTC 
TTAACTCCAA 
ACCCCTGCCA 
ATTGCATTCC 
GCTGCAATAG 
GTGTTATTAC 
GCTTCAATAA 
TTTACATAGA 
TCTTCGCTAA 
TATTTAGTTA 
ATAAATCGCT 



216 
TTGTAAAGCC 

AAAGTTTTTC 

AGCC AC CTTT 

TCACACCTTG 

CAATTCCAAC 

ATAAACCAGC 

ACAAGAGTGC 

ATGCCGT AT C 

TAGCTACTAC 

CAGCGAGTCC 

CATACCAAGG 

AGAAATAGAT 

TTATTGTAAA 

ATTCTTTTCG 

CTGCCAATGC 

AAACTT CTTT 

CATTGATAGC 

CCAATTTCAA 

CCATCAACCC 

CACCTGCAGC 

TCACACGAGG 

ATTGTGGAAT 

AATATGCTGC 

CATCGACTCC 

ATGCAATATG 

TTTGTTTACG 

TACCTTTTAC 

TTTCACCATT 

AATCTTTCAT 

GAGCAATTTC. 



TGAACCCATC 
ATTTGCAGCA 
AATATTAGCA 
AATAAAGTCT 
TACACCCATC 
TGATGGGAGG 
TGAAATAATA 
GATGTCTATC 
CATCCCTAAT 
CAAGAAGGAA 
AACCGAACCA 
ACCTGCAACC 
TCCTGTTGTC 
TGCTTGTTGA 
TTCTAAAGGT 
TGCTACAGCA 
ATATTGAAGT 
CAACAAATCT 
ACCACCAAGA 
TACAAACATT 
ATTTTGACCC 
ATTATAGATA 
GATTGAATAC 
AACACTTTCT 
GTTGATAACT 
ATCTTCTACA 
ACCTTTGTCA 
TTCATCATAG 
CAGATTTCTC 
TTTTGGACGT 



ACAATTAAAA 
AGGAATTTCC 
ATCAGTACAA 
GTCCATAATA 
AAAATAATCA 
TACATAATGA 
CGAAGTGCTT 
CGTGCAAAGA 
TGAGCAAACC 
ATCGGACTGA 
TCTCCTTTAA 
AACACCCCAA 
CCCATAACAT 
ATAAGTTCTG 
TGACGAACAG 
TACATATTTC 
TTTTTAGCTG 
CGCATAACCC 
TATTGTTCAT 
TGAATATCTT 
ATTGTTGCAT 
ATAAAATCTG 
TCTGGCAATT 
GAATGTTTTG 
GTTAATTTAC 
CTTTCGTAAA 
ATGAAATATT 
C AAG C AT AAA 
CTTTATATTG 
GTAATCGCTC 



2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3130 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3790 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440- 
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CACCAATGAC TACACTGGTA ACACCTAAAC TATAAGCTTT TTTTAATTGT TCTGGATAAT 4 500 

CAATTTTTCt TCGGCAATTA CCGGAATATT AAAATCAGCC AATTTTTT C A TTAGTTCAAA 4 5 60 

ATCAGGCTCA TCTGATTGTA CACTTGTACT TGTGTAACCT GATAATGTTG TACCAACAAA 4 62 0 

ATCAACGCCT GATTTAAATG CATAGAGACC TTCATCTAAA TTACTTACAT CCGCCATCAC 4 680 

CAATTGATTC GG AT ATTTTT CTTTTATTTT TTTGATAAAT TCACTGACAA CTAAGCCATC 4 740 

ATATCTTGGT CTTAAAGTTG CATCAAATGC AATGACTGTT GTTCCGCATT CTACAAGTTC 4 800 

ATCTACTTCT TTCATCGTAC CAGTAATATA TGGTTCTTGA GGTGGATAAT CCCTTTTGAT 4 8 60 

AATTCCAATT ATTGGTAAAT CTACTACTTT CTGAATTGC7 TTAATATCAC GCACAGAATT 4 92 0 

TCCGCGAATG CCCACTGCTC CTGCCTCTAA AGCTGCTTTA GCCATAAAAG GCATCAAGCT 4 9 SO 

AAATTCTTCA TTATAAAGGG CTTCACCACG TAAAGCTTGA CAAGAAACAA TGACTCCACC 5 04 0 

TTGAACTTGG CTTATAAATT TTTCTTTAGT CCAAATTTGG CTCATTTTAT TATTCCTCCT 5100 

TATGGATAAT AGTTTGATTG TAATAATATT GTCTCTCTGG ACTTTCCAGA T AATT AG AG A 5L60 

ATAAGCAGTC TGT AATT AAA AGTATTGGAA ACTGAGGTGA TATGCGATTG CC AT AC GAG A 522 0 

GATGATCGGT CGAAGCTAAT AACAATAGTT CATCAAAGAA ACAATCTTCT TCGTCAAATT 52 80 

TTCTTGTAGT CATTAAAACT GTTTTAGCGC CTTTATCTGC AGCTTTTTGT AGACCTTCTA 53 4 0 

GTACAATATC AGTTTGACCT GAAATGGATG CTCCAATGAC AAGGCAATTT TCATTAAGTA 5400 

GTAAGCTACT CCACAAAATC ATATCCTCGT CTGATAATAC TTCACCAATC ACTCCGAGAC 5 4 60 

GCATAAATCT CATCTTCATT TCTTGTAAAG CAAGAACAGA ACTTCCTTTA CCGTAGAGAT 55 20 

ATACACGCTC AGCAGTTTCT ATCATCTCAG CAATACGCTC AAGTTGAACT TCATCAAGAA 55 30 

CCGTGTAAGT TTTTCTCAAC ATTTCCTCAT AGTCGGATAA AACTTTTTCT CTTGCCTCTG 5640 

T AT AT AATGC CAACTTTTCT TTCTCATGAA TCATCTCTTG GTATTTGAAA ATGAATTGTC 57 0 0 

TAAAACCTTT AAAACCACAT TTTTTCCCAA ATCCAGTCAA TGTTGCTTTG GAT AC ATT AA 57 60 

GGTATTCGCA CAATGCTTTA GATGAATAAT CATTCAGAGG TTGCTCTTTT AAGAAGAATT 58 20 

TAGCAATGTC TTTTTCAGCA TATGCCATAT TTGGTAACTT AGCTTCTATC ATTGGAATTA 5880 

GTTCTTTTTG CAGTAACATA TGAGCTCCTT AGTTGAAGTA AACGTTTACA TTCTTTATTT 5 94 0 

TAACACTTTT TTTTTTTTTC AATATTTTTC AT AAA TT AG A AACTAGTTTC CAATTTCTTT 6000 

CGTTTCATAA CAGAACAACA AACATAAAAA TATAATAGTT TTTATTCTTT TTATCGTAAT 60 60 

TATATGTATT GTAAGAACGT TTATCACTAA TAATATGTTC ATATTAAAAT ATTTTAGTAA 6120 

TATTTTATTT TGCTTTTATT ATTTCTTTTC GGAATTTCTA TATAATATTT T ATTTCT AAA 6190 
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AAAATTGAAA 


AAATATTTCT 


AG TTTCTTT A 


TTTTATATAG 


f.TAATATATT 


TTATTTCTAA 


5 240 


ATT AAAAG AG 


AATCCCATAA 


AAACTACAGA 


TTTATGAGAT 


AAATCAGGTC 


ACCTATTTTA 


6300 


AAAAAGCAGC 


AAACT AT AAA 


CTAAAAAGTT 


CCACACCAAA 


TGTAACCCCA 


TACTTCCCCA 


6360 


TAACTCAGAT 


TTATAGCGCA 


CC AT AC CT AA 


AAACATTCCA 


AGTGAAACGT 


ACAGACACCA 


6420 


AGCTAGAATG 


GTTCCTGGAT 


GATGTACTAA 


GGCAAATAAA 


ACACTTGTCA 


AAGCAACTCG 


6480 


AATATCTAAT 


TTTCTAACCA 


AGTTCCATAA 


AATTTCACGA 


TACAGAAATT 


CTTCAACCAT 


6540 


ACTCGCATTG 


ATTAAGAACA 


ATAAAAATGA 


AAACCAAGGA 


ACTTG ATGTT 


GAAGGCCAAT 


6600 


TAAATTTGTT 


TGATTCGTGC 


TTCCTTGAGC 


ATGAATCAGG 


CTAAAACATA 


GACTTATAAT 


6660 


CAGTAGACTA 


GCTAGTCCAA 


TACCAAGGCA 


TTTCATCCTA 


GTTTTCATAT 


TGACCTTGAC 


67 20 


CACTTGTTTT 


CGTTGACCAT 


ACATCCATAA 


AAAAGAAAAA 


AGAGACGCAC 


CATAGAGAAC 


6780 


CTGTAGTATA 


GTTAACTCAC 


CGATACAAAG 


AAATTTCAAT 


AAGTATAGAG 


ATACCAATAG 


6840 


GACATTTACT 


TGTTGGAATA 


TAT AAA CTGG 


AATTATTCTT 


TTCATACTTA 


CCTCCGAAAT 


6900 


AAATCTTCAT 


AATCTAAATC 


TAATATCTGC 


ACAATCCTTT 


CTACCCATGG 


ACTTTGAGGC 


6960 


ATTCGTTGTT 


CCATCTTGTA 


GTGGCGAATC 


TTTTGATATA 


AACGATTCAA 


TTCACTTGGA 


7020 


TAGTGAAACT 


CTCCCGCAAA 


CATTTTTCTG 


GTTAACTCAA 


TCCAGCTGAT 


ATTTCTTTCA 


7030 


GCCAAAATAA 


TGGACAAGTT 


CTCCCAAAAT 


CGTTCAGCCA 


TATTrCTTCT 


CCTTTAGTTA 


7140 


GATAAATAAT 


GTGTTTGyGC 


CATGTAAATC 


AATTGTTTCG 


TATCTCTTGG 


CAATAGAGCT 


7200 


CTAGCCTCTT 


CCAAATTCAG 


ACTTGGATAA 


ACCCGCTTAT 


TTGAAACCAC 


AAAAGG AAGT 


7260 


CCGATGGTTA 


GTTCAGGATT 


TTTTAAAATT 


ATCTCAACGA 


AATCCG7TAA 


TCTTAGATTG 


7320 


TCACGGTTCT 


TAAATGGTAA 


TAAATTGGGA 


GATAAAAACT 


CAAAACAATC 


TGAAGAATAG 


7 380 


CTCATCATCT 


CAATTAATTT 


GTCCTTTGTC 


ATTTCAGAAA 


CTCAATGACA 


AGATACCTCA 


7440 


ATGCCATAGT 


TTTGGAAGAA 


GT CTAAAAG A 


AGTTGATTTC 


TTTGGCTATT 


TTTACTTAGA 


7500 


TAGAGATCAA 


TCATGGGAGA 


CCTCCAACAA 


ATTTGCTTCC 


ATTTGATATT 


CTGAGACGAT 


7560 


TAAGGAATCT 


AAC AACTTTG 


AGAAGTTAAT 


CGATTTCTTG 


TCTTCATCAT 


AAGCTTTTAC 


7620 


AGTTACTTGG 


GTTGTAAGTA 


TCCCCTCTTT 


TCCCTCGGCT 


CG AT AGTCTT 


GTCAATATAA 


7680 


AACAAAAACA 


AGATTCTGAT 


TATCATCTAC 


AAAGGCATTA 


ACTCCGTTCT 


TTATATCCTG 


7740 


ACTTTCAAGG 


AATTCCATAA 


CGTTTTGAAG 


ATAGGATTCA 


TAAAATAGTG 


GGTAATTATG 


7800 


TTTTTTATGG 


TAATCATCTA 


AAAATGTTAC 


CTCAAACTCA 


CATGGATAAT 


TCCCCATCAA 


7960 


AAATATTTGT 


TCATCCAGCT 


GTTTGATTTC 


TGCATCATCT 


AATTCTGTTT 


CTAATTCATC 


7920 


ACAATCTAGT 


ATTGATTCTT 


TATTTAATGC 


TTTTATCTTT 


TTCCTCTATT 


TCTTTTAATT 


7980 
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TCTTTGCGAT TGCGGCAATC ACAGGAACGG T T AC ACT ATT ACCAACTTGT TTATAGAGCT 3040 

GACTATTAAT AGAGACTTTT CTAGCAGCTT CAAAAGCCTA ATCAGGAAAG CCATGCAATC 8 LOO 

GAAAACACTC TTTAGGAGTG ATTCGTCGTA TTCTCAAACG GTAAAATTGT C C ATCT ATTA 8160 

AAACACCAGC TACTTGGTAA ACTTGTTTAT CTTCTCCTTC ATAGCTAGCC ACTACTACTC 82 2 0 

CCATTTGACC ACTAGTTGTT AACGTATTAG CTATACCTTT TCCAACTCTA CCACGACGAT 82 80 

ACTGAGAACT TGGTCTTTCT AAATTGATTG AATCCCCAAT CTCTGCTTGA GCATATCCTT 834 0 

TTTTCGTTGC TTCCCGTACT TTTAGAAATT GGATTGCTTC TGGAATTAGT ATTTTGGGGA 84 00 

7TTTATCTCC TCCTTGCATC GTAGTCAGTG TTGG AG AT AA GCCCTCAC77 CCATAGACAC 84 60 

GACCTGTCTC CTTAAAGCTA GTCGGTAAAT CTCCAACAAC GACAATGCCA TAACGATCCT 8520 

GAGTATTTAA AGTAAACATC GGCTCTTGAT TTTCCTTAAA GCGTCTCCCA TTTTGTCTCT 85 80 

TGTCTAATCT ATCTGGTGTC ATACAAGGAA TCGCAACTTT AAATCCTTCT CCTTTACCAC 8640 

GAACTAAGGT TGGCGCAAGA CCTTCTGAAT AATAGACTTT ACCGCTCATT CCACTTCTTG 3700 

ATGGATTCAA ATTTCCTAGT GCTTTCAAAG TCTCAGAGTT AGTTGCTTGA CCTTCTCGTC 87 60 

TGAAAGGAAA TAAGAGTCTG GTACCTTTCT TTCTAGAATG TCCGATAATA AACACCCTCT 3820 

CTCTGTTTTT GGGAACGCCA AAATCCTTAC TCTTAAGCAC CTGCCACTCA ACATCAAACC 8 880 

CCAACTCATC AAGTGTGGTA AGTATTGTGG TGAACGTCCG TCCCTTATCG TGATTGAGTA 8 940 

GGCCTTTAAC ATTTTCAACA AAAAGAAAAC GTGGTTGGAT TTGTTTCCCC GCCCGAGCAA 9000 

TTTCAAAGAA CAAAGTTCCT CTAGTATCTT CAAATCCCAA TCGTCTTCCT G CG ATT G AAA 9060 

ATGCTTGACA AGGGAATCCC CCACAGATGA CATCGACTTT CCCTCTAAGT TTTTTAAATT 912 0 

CCTCATCTGA AACATCTCCT ATGTCATGAA ATTCTATTTC TCCTTCCGTT TGAAAAATGG 9180 

ACTTATAAGA TTTCCTAGCA AATTTATCAA TCTCACAAAA TCCCAAGCAC TCATGCCCTT 9240 

GAGCTTCCAT TCCCATCCTA AAGCCTCCTA TCCCAGCAAA TAAATCTAAA ACCCAAATCA 9 300 

TTCATACCTC TCTCAACTAG ATGTAACTTA CAAAACCCCT GACCTCATGA GCCACTTTCT 9 3 60 

TCCTCCTCAT GAGGTCAGTT TTACTTTCTG CTGTTCCAGT ATCGTTTTTC CTCCCTAGAT 94 20 

TTCCTCAAAA GGGCAGACTC CTCCCTTGGT TCGTCACACG ATTTTTTCAT CTCGACTGTT 94 80 

CTTTAATGCA TCATTAACGA CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC 954 0 

AGGTTGACTT TTCTAATCCT AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA 9600 

CTAGACAATT TGAGGAGCTG CTTGCGTCCT GTTCGAACAC ATTTTCCTAC CACGTGAAGA 9660 

AAAAGATGGC GGAAGCGTTT GATTGTTAAA GTTTGGAAGT CACCTCCAGC TAGATGTTTG 97 20 
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AG AAAAAG AT AGAGATTGTA GGCGATACAG CTCATCATCA TACGAACTCG TTTTTGATTA 9780 

AGGTTGAACT AT C CGTTTT A TCGCCAAAAA ATCCCTCCTT CATCTCCTTG ATGAAATTCT 9 84 0 

CGGCTTGACC ACGTCC AC G A TAAAGCTGAA AC TGGTCTTG GCTTGTTCCG GTACCGA 9897 
(2) INFORMATION FOR SEQ ID NO: II: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: il: 

CCGTGGAACA AGCCAAGACC AGTTTCAGCT TTATCGTGGA CGTGGTCAAG CCGAGAATTT 60 

CATCAAGGAG ATGAAGGAGG GATTTTTTGG CG AT AAAACG GATAGTTCAA CCTTAATCAA 120 

AAACGAAGTT CGTATGATGA TGAGCTGTAT CGCCTACAAT CTCTATCTTT TTCTCAAACA 180 

TCTAGCTGGA GGTGACTTCC AAACTTTAAC AATCAAACGC TTCCGCCATC TTTTTCTTCA 240 

CGTGGTAGGA AAATGTGTTC GAACAGGACG CAAGCAGCTC CTCAAATTGT CTAGTCTCTA 3 00 

TCCCTATTCC GAATTGTTTT CAGCACTTTA TT CT AGG ATT AGAAAAGTCA ACCTGAATCT 3 60 

TCCTGTTCCT TATGAACCAC CTAGAAGAAA AGCG TCGTT A ATGATGCATT AAAGAACAGT 4 20 

CGAGATGAAA AAATCGTGTG ACGAACCAAG GGAGGAGTCT GCCCTTTTGA GGAAATCTAG 4 80 

CGAGGAAAAA CGATACTGGA ACAGCAGAAA GTAAAACTGA CCTCATGAGG AGGAAGAAAG 5 4C 

TGGCTCATGA GGTCAGGGGT TTTGTAAGTT ACATCTAGTT GAGAGAGGTA TGAATGATTT 6 00 

GGGTAAATAC AATGAGCTTG AAAGAAGTAG CAAACTCACC AAGCGCCAAT TCTTTGAGAA 6 60 

TCAGATGCTG G ATT AT AC C A TCATTGCGCA TGAGACTTTT GAAATCATCC GTCATTCTGT 7 20 

CTACCAGACA GATGATCGTG AAGTGGAAAA TGCTCTGGCT TTTGAAGTGA AAAATGATGA 780 

AACAGACAAG CTGATTCTGT TATTAAGCGA GGATATTGGT GTAGGTGAAA AATTGTGCCT 840 

CGTTGACGGA ACAAAAATGC GTGGAAAATG TTTAGTATAT GATAAAATAA ATGAGAGAAT 9 00 

GATTCGCTTG CAGTGCTAGA AATAGGCATT TTGAATAGTG AATATGTTAT AATAAGTATT 9 60 

AGTAGGAGG7 GTTTTAGATT GGAGAAGAAA CTGACCATAA AAGACATTGC CGAAATGGCT 1C2 0 

CAGACCTCGA AAACAACCGT GTCATTTTAC CTAAACGGGA AATATGAAAA AATGTCCCAA 1080 

GAGACACGTG AAAAGATTGA AAAAGTTATT CATGAAACAA ATTACAAACC GAGCATTGTT 1140 

GCGCGTAGCT TAAACTCCAA ACGAACAAAA TTAATCGGTG TTTTGATTGG TG AT ATT AC C 12 00 

AACAGTTTCT CAAACCAAAT TGTTAAGGGA ATTGAGGATA TCGCCAGCCA GAATGGCTAC 12 6 0 
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CAGGTAATGA TAGGAAATAG T AATT AC AG C CAAGAGAGTG AGGACCGGTA TATTGAAAGC 13 20 

ATCCTTCTCT TGGGAGTAGA CGGCTTTATT ATTCAGCCGA CCTCTAATTT CCGAAAATAT 1380 

TCTCGTATCA TCGATGAGAA AAAGAAGAAA ATGGTCTTTT TTGATAGTCA CCTCTATGAA 1440 

CACCGGACTA GCTGGGTTAA AACCAATAAC TATGATGCCG TTTATGACAT CACCCAGTCC 1SC0 

TGTATCGAAA AAGGTTATGA ACATTTTCTC TTGATTACAG CGGATACGAG TCGTTTGAGT 1560 

ACTCGGATTG AGCGGGCAAG TGGTTTTGTG GATGCTTTAA CAGATGCTAA TATGCGTCAC 1620 

GCCAGTCTAA CCATTGAAGA TAAGCATACG AATTTGGAAC AAATTAAGGA ATTTTTACAA 1680 

AAAGAAATCG ATCCCGATGA AAAAACTCTG GTATTTATCC CTAACTGTTG CGCCCTACCT 17 4 0 

CTAGTCTTTA CCGTTATCAA AGAGTTGAAT TATAACTTGC CACAAGTTGG GTTGATTGGT 1300 

TTTGACAATA CGGAGTGGAC TTGCTTTTCT TCTCCAAGTG TTTCGACGCT GGTTCAGCCC I8 60 

TCCTTTGACG AAGGACAACA GGCTACAAAG ATTTTGATTG ACCAGATTGA AGGTCGCAAT 1920 

CAAGAAGAAA GGCAACAAGT CTTGGATTGT AG TGTG AATT GGAAAGAGTC GACTTTCTAA 19 80 

AATGAAGGAA AATGACTTGC AATCTCTGTT AAGAAATAAA ATAATCCCAC CTAGAACAAG 204 0 

CTAGGTGGGA TTATTTGCCT ATGAAATGAG AAATTATGGG AGCAAGCTCC TAAATCAACT 2100 

CTTTTTCATC tACTTCTTTA ACTACTTGAT AAAAGTTATA GAAGTAGGCC AAACTTGAAA 2160 

TGATGGTTAC GACTAGGAAT ATTGAAAATT TCCATTGGAC AGGGTTGGTT AAAAGTTGTG 22 20 

GAAAGGATAT GAGGAGAAAG AAGAGGGCTG CCTTGACGAC AGGTATCCGT TTTGATTGTA 2 2 80 

TTTTCTCAAG TCCTTTATTG AGCGCAGCAA GAAAGAGGAG TAGGAGTAGT AAAACTGTAT 2 3 40 

GAGAAATAGC TCCTGAAGTA AGGCCCAAGA AAAGGAAAAT ACTGATAAAA ACATGAATGA 2 4 00 

TCAGTAGTCT AGCTAGTGAT TTCATAAGGC ACCTCCTAAT CCTGGTCTTT TTTAGCTCTT 24 60 

GCAATACGAA GTGAGTCCAC AATATGTATC ATCACTCCGA AAAAGAAAGC TCCCAGTATA 2520 

GTTTTAAAAA TATGTTTTGT ATTTAGAAGA GAACTGATAA AATTTGGATT TTCACTTGTT 2580 

AGGGTATCAA TGAGTGGAAT TATAAAAAAT ATCACTGTTC CATAAATCGA ACCTGCTTTC 2 6 40 

AGACCAGGAT AACGTAACTG TTTCTTTTCT TTTTTCATGA GTTTCCTCCT AATCCTCATC 2 7 00 

TTGATTTTTC TTAGTTTTTG CAATGCGACG GGAGATGAGG AACTGTATGC TCGCTCCGAA 27 60 

GAAAATAGAA CCGAGAATAC TTGATACACC ATTTCTT AT A GTGAGAAGAG AATGAAAATA 2 82 0 

GTCCTGACCT TCATCTATGA GTATCCTGAG AAGAGGAGTT ATAAAAAACA TCCATAGACC 2 880 

AAAGAACAAA CCTGCTTTCA GACCTGGGTA GTGTAGTTGC TTGCTTT CTT TCTCATTCAG 294 0 

CATATCTGGT TCAATGACTG TGATGCCTGT TTTTTTCATT TGGTAGGTGA CATAGCCAGA 3 000 
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AGCGATGAGG GCAATCACTA AAATCAGAGG AGGATAGATT AGAGCCACTT CTTGAGGGTA 3060 

TTTATAGGCC AGAAGGAGTG GAATAAGATT TCCGAAAATC ATCAGATAAA AG AGG AT G AT 3120 

AAAGACTTGG TTCCCAATAC TATCGGCCTC ACCCCGTTTC TATTCGTCAA GGGGACCAGA 3180 

AATACCGTAT GTGCGTTTGA TCAGTTTTTC AGTGAAGGTT T CTTTTTTC A TGAGTTTGCT 3240 

CCTTTTTTAA aaatcTTCCT CCCAAAAGAG ACTGTTGAGG TCAGTTTGGA GGCTGCGGGC 3 300 

GAGATTGAGA CAGAGTTCCA AGGTTGG ATT GTACTTGTCG TTTTCAATCA TATTGATACT 3 3 60 

CTGTCTCGAG ACACCGATAT CCTTGGCGAG TTCGAGCTGG GAAATACCCA ATTCCTTGCG 342 0 

AAATTCTTTC ACACGATTCA TCTGTTCTCC TTTCTGATTT ATGTCGT ATA TATTTGACTA 34 80 

TATTATAGTC TTTTAAACAT AAAGTGTCAA GTATTTTTGA CATATTTTTT GAAGAAATAG 3 54 0 

TAGTCTCCTT GTCCTATTTG TCTGACAAGT GCAAGCTGGT CGGATTTGTG GTAAAATAGA 3 600 

TAAGATATGA CAAAAGAATT TCATCATGTA ACGGTCTTAC TCCACGAAAC GATTGATATG 3 660 

CTTGACGTAA AGCCTGATGG TATCTACGTT GATGCGACTT TGGGCGGAGC AGGACATAGC 37 20 

GAGTATTTAT TAACTAAATT AAGTGAAAAA GGCCATCTCT ATGCCTTTGA CCAGGATCAC 37 80 

AATGCCATTG ACAATGCGCA AAAACGCTTG GCACCTTACA TTGAGAACCG AATCGTGACC 3 840 

TTTATCAACG ACAACTTCCG TCATTTACAG GCATGTTTGC GCGAAGCTGG TGTTCAGGAA 3900 

ATTGATGGAA TTTGTTATGA CTTGGGAGTG TCTAGTCCTC AATTAGACCA GCGTGAGCCT 3960 

GGTTTTTCTT AT AAAAAGG A TGCGCCACTG GACATGCGGA TGAATCAGGA TGCTAGCCTG 4 02 0 

ACAGCCTATG AAGTGGTGAA CAATTATGAC TATCATGACT TGGTTCGTAT TTTCTTCAAG 4 080 

TATGGAGAGG ACAAATTCTC TAAAC AG ATT GCGCGTAAGA TTGAGCAAGC GCGTGAAGTC 4140 

AAGCCGATTG AGACAACGAC TGAGTTAGCA GAGATTATCA AGTTGGTCAA ACCTGCCAAG 4 200 

GAACTCAAGA AGAAGGGGCA TCCTGCTAAG CAGATTTTCC AGGCTATTCG AATTGAAGTC 42 60 

AATGATGAAC TGGGAGCGGC AGATGAGTCC ATCCAGCAGG CTATGGATAT GTTGGCTCTG 4 3 20 

GATGGTAGAA TTTCAGTGAT TACCTTTCAT TCCTTAGAAG ACCGCTTGAC CAAGCAATVG 4 3 80 

TTCAAGGAAG CTTCAACAGT TGAAGTTCCA AAAGGCTTGC CTTTCATCCC AGATGATCTC 444 0 

AAGCCCAAGA TGGAATTGGT GTCCCGTAAG CCAATCTTGC CAAGTGCGGA AGAGTTAGAA 4 500 

GCCAATAACC GCTCGCACTC AGCCAAGTTG CGCGTGGTCA GAAAAATTCA CAAGTAAGAG 4 560 

GGAAAAAGAT GGCAGAAAAA ATGCAAAAAA CAGGTC AAAT ACTACAGATC CAACTTAAAC 4 620 

GGTTTTCGCG TGTGGAAAAA GCTTTTTACT TTTCCATTGC TGTAACCACT CTTATTGTAG 4 68 0 

CCATTAGTAT TATTTTTATG CAGACCAAGC 7CTTGCAAGT GCAGAATGAT TTGACAAAAA 47 4 0 

TCAATGCGCA G AT AG AG G AA AAGAAGACCG AATTGGACGA TGCCAAGCAA GAGGTCAATG 4 80C 
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AACTATTACG 


TGCAGAACGT 


TTGAAAGAAA 


TTGCCAATTC 


ACACGATTTG 


CAATTAAACA 


4860 


ATG AAAAT AT 


TAGAATAGCG 


GAGTAAGATA 


TGAAGTGGAC 


AAAAAGAGTA 


ATCCGTTATG 


4920 


CGACCAAAAA 


TCGGAAATCG 


CCGGCTGAAA 


ACAGACGCAG 


AGTTGGAAAA 


AGTCTGAGTT 


4990 


TATTATCTGT 


CTTTGTTTTT 


GCCATTTTTT 


TAGTCAATTT 


TGCGGTCATT 


ATTGGGACAG 


5040 


GCACTCGCTT 


TGGAACAGAT 


TTAGCGAAGG 


AAGCTAAGAA 


GGTTCATCAA 


ACCACCCGTA 


5100 


CAGTTCCTGC 


CAAACGTGGG 


ACTATTTATG 


ACCGAAATGG 


AGTCCCCATT 


GCTGAGGATG 


5150 


CAACCTCCTA 


TAATGTCTAT 


GCGGTCATTG 


ATGAGAACTA 


TAAGTCAGCA 


ACGGGTAAGA 


5220 


TTCTTTACGT 


AGAAAAAACA 


CAATTTAACA 


AGGTTGCAGA 


GGTCTTTCAT 


AAGTATCTCG 


5280 


ACATGGAAGA 


ATCCTATGTA 


AGAGAGCAAC 


TCTCGCAACC 


TAATCTCAAG 


CAAGTTTCCT 


5340 


TTGGAGCAAA 


GGGAAATGGG 


ATTACCTATG 


CCAATATGAT 


GTCTATCAAA 


AAAGAATTGG 


5400 


AAG CTG C AG A 


GGTCAAGGGG 


ATTGATTTTA 


CAACCAGTCC 


CAATCGTAGT 


TACCCAAACG 


5460 


GACAATTTGC 


TTCTAGTTTT 


ATCGGTCTAG 


CTCAGCTCCA 


TGAAAATGAA 


GATGGAAGCA 


5520 


AGAGCTTGCT 


GGGAACCTCT 


GGAATGGAGA 


GTTCCTTGAA 


CAGTATTCTT 


GCAGGGACAG 


5580 


ACGGCATTAT 


TACCTATGAA 


AAGG AT CGTC 


TGGGTAATAT 


TGTACCCGGA 


AC AG AAC AAG 


5640 


TTTCCCAACG 


AACGATGGAC 


GGTAAGGATG 


TTTATACAAC 


CATTTCCAGC 


CCCCTCCAGT 


5700 


CCTTTATGGA 


AACCCAGATG 


GATGCTTTTC 


AAGAGAAGGT 


AAAAGGAAAG 


TACATGACAG 


5760 


CGACTTTGGT 


CACTG CT AAA 


ACAGGGGAAA 


TTCTGGCAAC 


AACGCAACGA 


CCGACCTTTG 


5820 


ATGCAGATAC 


AAAAGAAGGC 


ATT AC AG AG G 


ACT TTGTTTG 


GCGTGATATC 


CTTTACCAAA 


58S0 


GTAACTATGA 


GCCAGGTTCC 


ACTATGAAAG 


TCATGATGTT 


GGCTGCTGCT 


ATTGATAATA 


5940 


ATACCTTTCC 


AGGAGGAGAA 


GTCTTTAATA 


GTAGTGAGTT 


AAAAATTGCA 


GATGCCACGA 


6000 


TTCGAGATTG 


GGACGTTAAT 


GAAGGATTGA 


CTGGTGGCAG 


AACGATGACT 


TTTTCTCAAG 


6060 


GTTTTGCACA 


CTCAAGTAAC 


GTTGGGATGA 






GGAGATGCTA 


6120 


CCTGGCTTGA 


TTATCTTAAT 


CGTTTTAAAT 


TTGGAGTTCC 


GACCCGTTTC 


GGTTTGACGG 


6180 


ATCAGTATGC 


TGGTCAGCTT 


CCTGCGGATA 


ATATTGTCAA 


CATTGCGCAA 


AG CTC AT TTG 


6240 


G AC AAGGG AT 


TTCAGTGACC 


CAGACGCAAA 


TGATTCGTGC 


CTTTACAGCT 


ATTGCTAATG 


6300 


ACGCTGTCAT 


GCTGGAGCCT 


AAATTT AT T A 


GTGCCATTTA 


TGATCCAAAT 


GATCAAACTG 


6360 


CTCGGAAATC 


TCAAAAAGAA 


ATTGTGGGAA 


ATCCTGTTTC 


TAAAGATGCA 


GCTAGTCTAA 


6420 


CTCGGACTAA 


CATGGTTTTG 


GTAGGGACGG 


AT C CGGTT T A 


TGGAACCATG 


TATAACCACA 


6480 


GCACAGGCAA 


GCCAACTGTA 


ACTGTTCCTG 


GG C AAAAT G T 


AGCCCTCAAG 


TCTGGTACGG 


6540 
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CTCAGATTCC TGACGAGAAA AATCGTGGTT ATCTAGTCGG GTTAACCGAC TATATTTTCT 6 600 

CGGCTGTATC GATGAGTCCG GCTGAAAATC CTGATTTTAT CTTGTATGTG ACGGTCCAAC 6 660 

AACCTGAACA TTATTCAGGT ATTCAGTTGG GAGAATTTGC CAATCCTATC TTGGAGCGGG 67 20 

CTTCAGCTAT GAAAGACTCT CTCAATCTTC AAACAACAGC TAAGGCTTTA GAGCAAGTAA 6780 

GTCAACAAAG TCCTTATCCT ATGCCTAGTG TCAAGGATAT TTCACCTGGT GATTTAGCAG 684 0 

AAGAATTGCG TCCCAATCTT GTACAACCCA TCGTTGTGGG AACAGGAACG AAGATTAAAA 6900 

ACAGTTCTGC TGAAGAAGGG AAGAATCTTG CCCCGAACCA GCAAGTCCTT ATCTT ATCTG 69 60 

ATAAAGCAGA GGAGGTTCCA GATATGTATG GTTGGACAAA GGAGACTGCT GAGACCCTTG 702 0 

CTAAGTGGCT CAATATAGAA CTTGAATTTC AAGGTTCGGG CTCTACTGTG CAGAAGCAAG 7080 

ATGTTCGTGC TAACACAGCT ATCAAGGACA TTAAAAAAAT TACATTAACT TTAGGAGACT 714 0 

AAT ATGTTT A TTTCCATCAG TGCTGGAATT GTGACATTTT TACTAACTTT AGTAGAAATT 72 00 

CCGGCCTTTA TCCAATTTTA TAGAAAGGCG CAAATTACAG GCCAGCAGAT GCATGAGGAT 72 60 

GTCAAACAGC ATCAGGCAAA AGCTGGGACT CCTACAATGG GAGGTTTGGT TTTCTTGATT 7 3 20 

ACTTCTGTTT TGGTTGCTTT CTTTTTCGCC CTATTTACTA GCCAATTCAG CAATAATGTG 73 8 0 

GG AAT G AT TT TGTTCATCTT GGTCTTGTAT GGCTTGGTCG GATTTTTAGA TGACTTTCTC 7 4 40 

AAGCTCTTTC GTAAAATCAA TGAGGGGCTT AATCCTAAGC AAAAATTAGC TCTTCAGCTT 7500 

CTAGGTGGAG TTATCTTCTA TCTTTTCTAT GAGCGCGGTG GCGATATCCT GTCTGTCTTT 7 5 60 

GGTTATCCAG TTCATTTGGG ATTTTTCTAT ATTTTCTTCG CTCTTTTCTG GCTAGTCGGT 7 620 

TTTTCAAACG CAGTAAACTT GACAGACGGT GTTCACCCTT TAGCTACTAT TTCCGTTGTG 7 6 30 

ATTAGTTTGT CTGCCTATGG AGTTATTGCC TATGTGCAAG GTCAGATGGA TATTCTTCTA 77 4 0 

GTGATTCTTG CCATGATTGG TGGTTTGCTC GGTTTCTTCA TCTTTAACCA TAAGCCTGCC 7800 

AAGGTCTTTA TGGGTGATGT GGGAAGTTTG GCCCTAGGTG GGATGCTGGC AGCTATCTCT 7360 

ATGGCTCTCC ACCAAGAATG GACTCTCTTG ATTATCGGAA TTGTGTATGT TTTTGAAACA 7920 

AC TTCTGTT A TCATGCAAGT CAGTTATTTC AAACTGACAG GTGGTAAACG TATTTTCCGT 7 980 

ATGACGCCTG TACATCACCA TTTTGAGCTT GGGGGATTGT CTGGTAAAGG AAATCCTTGG 804 0 

AG CG AGTGG A AGGTTGACTT CTTCTTTTGG GGAGTGGGAC TTCTAGCAAG TCTCCTGACC 8100 

CTAGCAATTT TATATTTGAT GTAAGAATGG CACCCTGATG TTTCAGGG 3148 
(2) INFORMATION FOR SEQ ID NO: 12: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 909 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 
{D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12: 
TACTCCACCC TTAATATCCG TTCCTGTAAA TACTTTACCC CTTTTAAGTT CATAGAATTG 60 
AACTTTTAAA TGCTTGTCTT CAAGCATCTT TTCCATCCAA . TTTTTAGGAG TTTGACCAGC 120 
TTT AAAT AAA AACCTTGCTG GGGTGATTAG TAT AG AT TT A TCTGCGATTT TATAAGCTTC 180 
ATCAATAAAA TAGTGATATA TCGGCTCATC TCTGGCTTCT CCTGTTTCCT GATACGGAGG 2 40 

ATTTCCTATC ACGACATCAA ATTTCATTTC ACTTTCCTCC CTAGATAGGC GCTCAAAACC 300 
TATCATTCTA TTCTTTTTCC AGTCTTTGAT ATGGGTTTTA GATTCTTCTA CTTCTTGGAC 3 60 

TTCTAGCTCA TCCGCAAACA AACTCAATTG TTGAGATTGC TTTTGTTTAG CTGAATAAGG 420 
ACT AC TTTTT TTCAATCCAT CCATCTGAAA GACATTGTAA GAG AT AA TAG TCGCAATTTC 4 80 

TTTCTTTTGC TCTAATGTTG GTTGATTTCC AGTCTTAGCT AGATAATAGT CCTCAAAAGT 54 0 

TGCCAAAAGA TTCTCACGCG CCAAAAGGAG AGAATCTCCT TGATACTCAT AACCATACGA 600 
AGCATGATAA GCATCTTTTA CAAGTTTATA AAATGTGACT TCATCTGAAA CCTCACGACT 660 
AATCCGTTGC AGTTTTCTAT CAACAAAACC AACTCGCTCA GATAATGGAA TTTCCTCACC 72 0 

AGTTACGGTA TCATATCTCG TTACCATATA AGGTGCTTCA CCACAAGTTA CCTCTAACCA 7 80 

TCGTAAGTCC ACATACTCCT CAAGACTTAA CGAGCCTAAT TTCGATTCTA CATATCCATT 8 40 

TTGCTTTGCG ACCAACCACG TTG GTGT AAA CACTTCTGCC CTTATTTTTG TCCGATCTT7 900 
TTGTTCATAT TTG G ATT TTT CAGATCTGGG CTGAATCAAG TTGGCAAAGT TTCCAGTAAC 9 60 

CTTACTTGGA TTGATGCGAT CACTTGGACC AAATCCCTTT CCTAACAATT CATAAGAATG 1020 

CGTAnCCCAA ACAATTGATT TCTTTGTCGT TCGATCTTTT AAAAGAATTT TTAATAAGTC 1080 

AGCCGATTCT TTAGCCAAAC TTTCTTCACT AATATCTATT GTCATCAGCA ACCTCTCTTA 114 0 

TATTGTAAGC CCTATTATAT CATATTTTAA AGAATGAAAA TTT ACT TG AA AAAAGTAATT 12 00 

CAATAAATAT CTCTCCGATG ACCAACTTCT AGAGTAGCAA CGACTAATTC ATCATCTACA 12 60 

ATTTGTACGA TAACTCGATA ATTACCAATT CTATACCGCC ATTGACCAAC GCGATTACCA 13 20 

ACCAAAGCCT TTCCGTGTCG TCTTGGGTCT TCCAAAACAT TGGTTTGTAA ATAGTTTGTA 13 80 

ATTAGCTTCT GCGTATAACG GTCCAATTTT TTCAATTGCT TGATAAAACG TCTTGTTGGA 1440 

ACTAATTTAT ACAAATTATT CATCCTTCAA GCCTAAATCA TGCATCATTT CTTCCCAAGT 1500 

AATGGGTTCA ACTCCTTTTT CCAAGTCTTC TAAATACTCT TGATAGGCTA AATCTCCCAC 1560 



WO 98/18931 



PCT/US97/19588 



226 



ACGAGCATCG 


7ATTCATCTT 


CTAGGGCTTC 


AAGAGTTTTG 


GTGCGAATAA 


GTTCCGAAAG 


1620 


GG AAACT CCT 


TCAAACTTAG 


CCATTGCTTT 


CATAAATGTT 


TTATCAGCTT 


CAGAAACTTT 


1630 


TAATGTAATA 


GTAGTCATCT 


TTTGTGCTCC 


CTTTTTTAAT 


GGTAACACCA 


TTGTATTACT 


1740 


TTTTACGTGT 


TCAGTCAATA 


TAAAAAGAAC 


ACCTTCTCAG 


CGTTCTTTCT 


AT ATCT CTGT 


1900 


CAATGGTGTT 


GCGGTATCTG 


GTGAGGTATC 


ATAAACCTTA 


AAGTCTACTC 


CGACTCCCAG 


1360 


ATCAGCTTGA 


GCCAGCTGAT 


TGACCATGGT 


CATATGAGCC 


AGTTCCTTGA 


TATTGTTTTC 


1920 


CTTAGATAAA 


TGCCCAAGGT 


AAATCTTCTT 


AGTACGATTT 


CCTAGCGTCC 


GAATCATAGC 


1930 


TTCAGCACCG 


TCCTCGTTAG 


AAAGGTG AC C 


AAGGTC AG AT 


AGGATTCGTT 


GTTTGAGTCG 


2C40 


CCAAGCGTAA 


GAACCTGATC 


GCAAAATCTC 


TACATCATGG 


TTGGCCTCGA 


TAACATAACC 


2100 


ATCCGCATTT 


TCGACAATGC 


CCGCCATACG 


GTCACTGACA 


TAACCTGTAT 


CTGTCAAGAG 


2160 


GACAAAACTC 


TTATCATCCT 


TCATAAAGCG 


ATAGAACTGC 


GGTGCGACTG 


CATCATGGCT 


2220 


TACACCAAAA 


CTCTCGATGT 


CGATATCTCC 


AAAGCTTTTC 


GTTTTACCCA 


TTTCAAAAAT 


2280 


ATGCTTTTGC 


GAAGAATCCA 


CCTTGCCAAG 


ATATTTACTA 


TTTTCCATAG 


CTTGCCAGGT 


2340 


CTTTTCATTG 


GCATAAAGAT 


CCATACCATA 


CTTCCGAGCC 


AAAACGCCTA 


CTCCATGGAT 


240Q 


ATCATCTGAA 


TGCTCATGGG 


TAATCAAGAT 


GGCATCCAGG 


TCTTCTGGCT 


TACGGTTAAT 


2460 


TTCAGCTACC 


AGACTGGTAA 


TTTTCTTGCC 


AGACAAGCCT 


GCATCTACTA 


AAAGCTTCTT 


2520 


TTTTGAGGTT 


TCCAGATAAA 


AAGAATTTCC 


ACTGGAACCC 


GACGCTAAAA 


TACTGTATTT 


2580 


AAA G CCT ATT 


TCACTC ATTC 


TAGTCTTCTA 


CTTCATCCTC 


CCATACTTCT 


TCTTTCACTG 


2640 


CATCCTTATC 


ATAAGGGAGT 


ACAATGGTAA 


AGGTTGAACC 


CTTGCCGTAT 


TCACTCTTGG 


2700 


CCCAAATAAA 


CCCCTTATGT 


TGTTTGATAA 


TTTCTTT AGC 


GAT AG AC AGT 


CCTAGACCTG 


2760 


TACCACCTTG 


TGCACGACTT 


CTAGCACGAT 


CCACACGATA 


GAAACGGTCA 


AAGATACCTG 


2320 


GTAAATCCTG 


CTTAGGAATC 


CCCAAACCGT 


GGTC AG AAA T 


GGATAAAATC 


ATCTGGTCTT 


2830 


CACTTGTCTT 


CATTCTGACA 


GTGATTTTAC 


CCCCATCTGG 


CGAATACTTA 


ATAGCATTAT 


2940 


TTAAAATATT 


CTCGACAACC 


TGCGTCATCT 


TATCTGTATC 


AATTTCCATC 


CAGATAGAAT 


3000 


TGATGGGATA 


ATCTCTCACC 


AACTCATATT 


TTTTCTCCTT 


TTCCTGTCCT 


TTCATCTTGT 


3 060 


CAAAACGATT 


GAGGATAAAG 


GTAATAAAAG 


CAGTGAAGTT 


AATCAGTTCC 


ACATCTAGGT 


3120 


GACTGGTAGC 


ATTATCAATA 


CGTGAAAGAT 


GGAGGAGATC 


CGTCACCATG 


CGCATCATAC 


3130 


GGTTCGTCTC 


ATCAAGAGAA 


ACCTTGATAA 


AGTCTGGTGC 


TACAGTTTCA 


CACAAAGCCC 


3240 


CCTCATCCAA 


GGCTTCAAGA 


TAGGATTTTA 


CGCTAGTCAG 


AGGAGTCCGT 


AACTCATGGC 


3300 


TAACATTGGA 


AACAAAGAGT 


CTTCGTTCGC 


CTTCTTCCTT 


CTCCTGCTCC 


GTCGTATCAT 


3360 
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GCAAAACAGC 


CACCAAACCT 


GAAATAAAGC 


CAGACTCTCG 


ACGTATCAAG 


CCAAAfirGAA 




CTCGAAGGTT 


CAAATATTCG 


CCATTGATAT 


CTTGGGAATC 


TAGCAACAAT 


TCTGGAfT^T 
i>- l uunv. 1 i I 


T d tin 


GGGTAATCAA 


ATCACGCAAT 


TCATAGTTTT 


CTTCTATCTT 


G AGC AAT T C C 


AAAATGCTTC 

AAAA 1 V-J\_ 1 1 l_ 


J JtU 


TATTCAGAAC 


ATCTTCCTTA 


ACCAACCCCA 


GTTGCTTCTT 


GGCTGTATm 


TTAATPATf-A 


JOUU 


TAATCTGACC 


CCGACGGTTA 


GTCGCAAGAA 


CCCCATCTGT 


TATATAAAAP 


AriAATAfTAT 
AVjAA 1 1 A 1 




TTAGCCTCTT 


ACTCTCTTGT 


TCTAGATTTT 


CCTGAGTGAG 




I <_v_LaA^. AAO I 




CAT TCAAAT T 


ATTGGTAATA 


TT GGTG ATTT 


CACIACCC ACC 


TTrraTiTr a 

I LVjV,nlnlUn 


AtjAA\_v. i lOu 


J 'HO 


AATAATCTCC 


TGCAATCAAA 


TCTTTAACCT 


TTTGATTGAC 




t: a at a tt a t 




CACGTCTATT 


TTCCAGTAAT 


AAGAGGGTCA 


CAACAAGGAT 


r.AAArrTAAr 1 


AAAA TP A (T" A 


3 9 00 


TAAAGATAAA 


ATCTCTGGTA 


AAAATGGTTT 


CT^TCAHTAA 

v_i i k i v. Aw 1 nn 


ATPAA^rATT 
r\ 1 LnnuLA I I 


aIi 1 1_ I ^_ A 1 vj 


3 9 60 


TAATACCCTA 


CACCACGCCG 


CGTCAAGATA 


T A CTCTCIC, T C* 






4 020 


TTCTCACGCA 


GACGTCGTAC 


AGTCACATCA 


ACTGTACGGA 




a * A*j 1 ^.Ai AA 


4080 


CCCCAGACAG 


TCTCAAGCAA 


GT GTTCG CG C 


GTGATGAPTT 


fZ A f"f"*mT A TP 


COA I 1 AAA 


4 14 0 


TGATACAAAA 


GCTCAAATTC 


ACGATGGGTT 


AACTC rn AG TT 


* i LULv.n 1 A 


TTTTTT AGCC 


4 200 


ACGTAGGCGT 


CTGGAACAAT 


TTCTAAATCC 


CC AATTTGG A 


TAGGTTCAfin 


til nV_ 1 Alv. 1 




CCTTCCTGAC 


CATCTACTGG 


CATAGGTTGA 


GAACGACCPA 


GAAfiA^PTTT 
onAvjnVjv I • 1 


AAl_A^(j*_ljV_t_ 


** J Z u 


TGCAACTCAC 


GATTGGAGAA 


GGGTTTTCTT 


ACATAGTPAT 






4 3 80 


ATAACCTTAT 


CAAATTCACT 


ATCTTTGCCT 


GAAAGCATAA 






4 4 40 


TTACGAATGG 


TCTTAGCAAC 


^TCTAAACCA 


» » * it. i\j 


VjAJMjW A i V- AA 


ATmfi AT 1 A 
A i V_ LnbnA i A 


4 500 


ATAATATCTG 


GTTCCTCTGC 


TTCAAATTGC 


TCTACCGCTT 


CACGACCATT 


AAAAGrAfiT" 1 


4 560 


ACAACTTCGT 


AACCTTCCTT 


GGTCATATTA 


AA CTTG AT AA 


TATCCCACAT 

1 J* 1 V- ^ t\ VJ «\ i 






TCATCTACAA 


TTAGTATTTT 


TTTCATATGT 


TCACCTTTTT 


CTCTACTATT 


ATACCAAAAA 


4680 


AATAGTCAGA 


AGACACAATA 


GCTAGTCTTG 


GCTACTGTCT 


AAGTTGGCTT 


GTGCATAAAC 


4740 


CTGCCAGATT 


TTTTGTTGGG 


GTTTGGCAAG 


TGGGTAATTC 


TTGAATTCTT 


CTGGTGAAAG 


4800 


CCAGCGAACT 


TCCCTATCTG 


AAAAATCATG 


GAAGTCACTC 


ACCTGACCTG 


CTACAATCTG 


4860 


TACATGCCAT 


TTTCCATGAC 


TAAAAACATG 


CTGGACTGTA 


TCAAAACAAA 


CATCAAGCCA 


4920 


ATCAACATCT 


AGGTCATAGT 


CCTGCTGGAA 


ACTCTCTTCT 


GGACTGGGAC 


CAAAGTTCAC 


4980 


ACTTTCTTCC 


GCAACCTGAT 


GAAAGAGGTC 


AAACTGCTCT 


TCTTGCGAAA 


AG TT AT C AAC 


5040 


TTCTATAAAG 


GGGAAATGCC . 


AAAAACCTGC 


CAAGAGCTTT 


TCGCTTTCAT 


TTTTTTCAAG 


5100 
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TAAAAATTG? CCTTGAGAAT TTTTCACAAC TAAGGCTTTA AGATAAATAG GAACCGGCTT 5160 

TTTCTTAGGA GATTTAATTG GATAACCGTC CATGGTTCCA TTCTGATATG CCGCACTAAA 5220 

GTCCTTGACT GGGCTTTCTT CAGGTCTGGG ATTTACAGGA GACTCAATAT CAGACCCTAA 52 80 

GTCCATCAAG GCTTGATTAA AATCACCCGG ACGATCCCGA TTAATCAAGA TCTCCATCAT 534 0 

TGCCTGAAAA ATTTTTC GAT TACTTGGAAT CCCAATATCG TGGTTGACTT CAAACAGACG 54 00 

CGCCAAGACC CGCATGACAT TACCATCTAC AGCTGGCTCA GGC AAGTTAA AAGCAATACT 54 60 

GGAAATGGCT CCTGCTGTGT AAGGTCCAAT CCCTTTCAAG CTGGAAATTC CTTCATAGGT 5 5 20 

ATTTGGAAAT TGGCCACCAA AGTCAGTCAT AATCTGCTGG GCTGCAGCCT GCATATTGCG 5 580 

AACTCG AG AA TAATAGCCCA ACCCCTCCCA AGCTTTCAGT AAACTCTCCT CAGGCGCAGT 5 6 40 

TG C C AGACTT TCGACAGTTG GAAACCAGTC CAAAAATCTT TCGTAGTAAG GGATAACTGT 57 00 

ATCCACCCTG GTCTGCTGAA GCATGATTTC AGATACCCAG ATGTGATAAG CATTTTTACT 57 60 

TCTCCTCCAA GGCAAATCTC TTTTGTTTTC ATCATACCAA GCCAGAAGTT TCTCACGGAA 58 20 

AGAAATGACT TTCTCCTCCG GCCACATGAC G AT ACCGT AT TCTTTCAAAT CTAACATATC 5 880 

TCTAGTATAA CACAGAAGGT TTCACCTGTC TTTGTATCTG ATTTATAATA TTTTCAATAG 5 94 0 

ATAGTATATA ACTTTTCTAT CTACTTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG 6000 

CTAGCCGCAG GTTGCTCAAA ACACTGTTTT GAGGTTGTGG ATAGAACTGA CAGAGTCAGT 60 60 

ATCAT AT Ac T ACGGCAAGGT GAAGCTGACG TAGTTTGAAG AGATTTTCGA AGAGTATAAA 6120 

TCTTATTGAT GAACTGCTTG CAGTCTGAGA AAAAATGAGC TTGGATATTA TTTCCAAACT 6 180 

CACTTAAAGT CAATTTCAAT CCACTAGAAC AAGCCTAGTA CAGTTCCATC CCTTTCAACA 6 240 

TCCATGTTGA GAGCTGCTGG ACGTTTTGGA AGACCTGGCA TGGTCATAAC ATCACCAGTT 6 3 00 

AAGGCAACGA TGAAGCCTGC ACCTAATTTT GGTACCAATT CACGAATGGT AATTTC AAAG 6 3 60 

TTTTCTGGTG CTCCAAGCGC ATTTGGATTG TCTGAGAAAC TCTATTGAGT TTTAGCCATA 6 4 20 

CAGATTGGCA ATTTGTCCCA ACCGTTTTGA AC G ATTTG AG CAATTTGTGT TTGAGCTTTC 64 80 

TTCTCAAAGT TCACTTTGCT ACCACGATAG ATTTCAGTGA CAATTTTTTC AATCTTTTCT 654 0 

TG G AC AG AAA G GTCATT AT C ATACAAACGT TTATAGTTAG CTGGATTTTC AGCAATTGTC 6600 

TTAACAACTG TTTCGGCAAG TGCTACTCCA CCTTCTGCTC CATCAGCCCA GACACTAGCC 66 60 

AATTCAACTG GTACATCGAT TGAGGCACAG AGTTCTTTTA AGCCTGCAAT TTCACCTTCT 6720 

GTATCAGATA CAAATTCGTT AATAGCTACA ACTGCTGCAA TACCGAACTT ACCCATATTT 67 80 

TCAACGTGGC GTTTCAACTT AGCAAAACCT GCACGAACTG C CTCT AC ATT TTCTTCAGTC 6840 

AGAGCGTCTT TAGCCACACC ACCATTCATC TTAAGGGCAC GAAGGGTTGC GACAATAACA 690 0 
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ACTGCATCTG GAGATGTTGG CAAGTTTGGT GTCTTGATAT CAAGGAATTT CTCACCACCA 6 960 

AGGTCCGCAC CAAAACCACC TTCAGTAACA GTGTAATCAG CCAAGTGAAG CGCTGTTGTC 7020 

GTCGCCAAAA CAGACTTACA GCCATGAGCG ATATTGGCAA ATGGACCACC GTGTACAAAG 7 080 

GCAGGTGTAC CGTAAATTCT CTGAACCAAG TTTGGCTTAA TAGCATCCTT CAAAATCAAA 714 0 

GCCAAGGCAC CCTCAACCTG CAAATCACCT ACAGAAACAG GCGTACGGTC ATAGCGATAA 7 2 CO 

CCAATAACGA TATTCGCCAA ACGACGTTTC AAGTCCTCGA TGTCCGTTGC CAAGCAAAGA 7 260 

ATTGCCATGA TTTCTGAAGC AACTGTAATA TCAAAACCAT CCTCACGTGG AATACCGTTT 7 3 20 

AGAGGACCAC CAAGACCAAC AGTCACATGG CGGAGCGTAC GGTCGTTCAA GTCCACAACG 7 3 80 

CCTTTCCAGA GGATACGACG TTGATCAATT CCCAGCTCAT TCCCTTGGTG CAAGTGGTTG 7 44 0 

TCAATCAAGG CAGAAAGGGC ATTGTTGGCA G TTGT AAT AG CATGCATATC TCCAGTAAAG 7 5 00 

TGGAGGTTGA TGTCTTCCAT TGGCAGAACT TGTGCATACC CACCACCAGC AGCACCACCC 7 5 60 

TTGATCCCCA TCACTCCACC AAGAGACGGT TCGCGGATAG CAATCATGGT TTTCTTGCCA 7 62 0' 

ATCTTGTTCA AGGCATCCGC AAGACCAATG GTAAGCGTCG ACTTTCCTTC ACCTGCAGGT 7 630 

GTTGCGTTGA TGGCAGTAAC CAAGATCAAT TTACCGACTG GATTGCTCTC AACTGCACGA 7 7 40 

ATTTTATCAA AGCTGAGTTT AGCCTTGTAC TTTCCGTACA ACTCCAAATC GTCATAAGAA 7 300 

ATACCAAGTT TCTCTACAAC ATCAACAATT GCCTTCAACT CAATACTCTG TGCGATTTCA 7 8 60 

ATATCTGTTT TCATTCAAAA TTCCTCTAAC CTCTTATATC ATAATTCATT AT A TC AC AAA 792 0 

ACAAGATTTT TAACATCCTA AAACTCTCTA AACGTTCGTA AATATCTCTC TTTTTAAGAC 7 9 80 

TTTTAGAGTC CTTTCTTAAA TTTTATATGG CTTTATAGTT TGAAACTATA ATAAATCTTC 80 4 0 

GTTTTTACCA AAAATTTATC ACTTTCATTT TACTTACCGC TTATTTTTGT GTACAATAGT 8100 

GCTATGAAAA TTTTACTTAC ATCGGGCGGT ACCAGTGAAG CTATCGATAG CGTCCGCTCT 3160 

ATCACTAACC ATTCTACAGG TCACTTGGGG AAAATTATCA CAGAGACTTT GCTTTCTGCA 3 2 20 

GGGTATGAAG TTTGTTTAAT TACGACAAAA CCAGCTCTGA AGCCAGAGCC TCATCCTAAC 8 280 

CTAAGTATTC GAGAAATTAC CAATACCAAG GACCTTCTAA TAGAAATGCA AGAACGTGTT 834 0 

CAGGATTATC ACGTCTTGAT CCACTCAATG GCTGTTTCTG ACTACACTCC TGTTTATATG 84 00 

ACAGGGCTTG AGGAAGTTCA GGCTAGCTCC AATCTAAAAG AATTTTTAAG CAAGCAAAAT 8460 

CATCAGGCCA ACATTTCTTC AACTGATGAG GTTCAGGTTT TGTTCCTTAA AAAGACACCC 3520 

AAAATCATAT CCCTAGTCAA GGAATGGAAT CCTACTATTC ATCTGATTGG TTTCAAACTG 3 580 

CTGGTTGATG TTACCGAAGA TCATCTGGTT GACATTGCAC GAAAAAGTCT TATCAAGAAT 364 0 
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CAAGCAGATT TAATCATCGC GAATGACCTG ACTCAAATTT CAGCACATCA GCACCGAGCT 3700 

ATATTTGTTG AG AAAAAT C A GCTTCAAACA GTCCAGACTA AAGAAGAAAT TGCAGAACTC 87 60 

CTCCTTGAAA AAATTCAAGC CTATCATTCT TAGAAAGGAA AACTATGGCA AACATTCTCT 88 2 0 

TGGCTGTAAC GGGTTCAATC GCCTCTTATA AGTCGGCAGA TTTAGTCAGT TCTCTAAAAA 888 0 

AACAAGGCCA TCAAGTCACT GTCTTAATGA CTCAGGCTGC TACACAGTTT ATCCAACCTT 8 94 0 

TGACACTACA GGTACTCTCA CAG AATCCTG T CC ACTTGG A TGTCATGAAG GAACCCTATC 9000 

CTGATCAGGT CAATCATATC GAACTTGGAA AAAAAG CAG A TTTATTTATC GTGGTACCTG 9060 

CAACTGCTAA CACTATTGCA AAACT AG CTC AC GG ATT TG C GGACAACATG GTAACCAGTA 912 0 

CAGCTCTAGC CCTACCAAGT CATATTCCCA AACTAATACC TCCTGCTATG AATACAAAAA 9180 

TGTATGACCA TCCAGTAACT CAGAATAATC TGAAAACATT AG AAACT ACG GCTATCAGCT 92 4 0 

G ATT GCTCCT AAGGAATCCC TACTAGCTTG TGGAGACCAC GGACGAGGAG CTTT AGCTG A 9 3 00 

CCTCACAATT ATTTTAGAAA GAATAAAGCA AACTATCGAT GAAAAAACGC TCTAATATTG 93 60 

CACCCATTCC TATCTTTTTT GCTACCATGC TCGTGATACA CTTTCTGAGC TCACTTATCT 94 2 0 

TTAACCTTTT TCCATTTCCA ATCAAACCGA CCATTGTTCA TATTCCTGTC ATT ATTGC C A 94 80 

GCATTATTTA TGGTCCACGA GTTGGGGTTA CACTTGGATT TTTGATGGCA TTACTTAGCT 9 54 0 

TGACGGTTAA C ACG ATT ACG ATTCTACCGA CAAGCTACCT CTTCTCTCCC TTCGTACCAA 9600 

AC GG AAAC AT CTACTCAGCT ATCATTGCCA TCGTCCCACG TATTTTGATT GGTTTAACTC 9 660 

CTTACTTAGT CTATAAACTG ATG AAAAAC A AGACTGGTCT GATTTTAGCT GGAGCCCTTG 972 0 

GTTCcTTGAC AAATACTATC TTTGTCCTTG GAGGAATCTT CTTCCTATTT GGAAATGTTT 9 7 8C 

ATAATGGAAA TATCCAACTT CTTCTCGCAA CCGTTATCTC AACAAATTCA ATTGCTGAAT 984 0 

TGGTCATTTC TGCAATTCTA ACCCTAGCCA TTGTTCCACG ACTACAAACC TTGAAAAAAT 9 900 

AAAAAC AGG 9 909 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 112 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TAATTTTCAT ATAATAGTAA AATAGAATGT GTGATTCAAT AATCACCTCA AATAGAAAGG 60 
AAATTCTATG TCAAATCTAT CTGTTAATGC AATTCGTTTT CTAGGTATTG ACG C C ATT AA 120 
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TAAAGCCAAC TCAGGTCATC CAGGTGTGGT TATGGGAGCG GCTCCGATGG CTTACAGCCT 130 

CTTTACAAAA CAACTTCATA TCAATCCAGC TCAACCAAAC TGGATTAACC GCGACCGCTT 240 

TATTCTTTCA GCAGGTCATG GTTCAATGCT CCTTTATGCT CTTCTTCACC TTTCTGGTTT 300 

TGAAGATGTC AGCATGGATG AGATTAAGAG TTTCCGTCAA TGGGGTTCAA AAACACCAGG 3 60 

TCACCCAGAA TTTGGTCATA CGGCAGGGAT TGATGCTACG ACACGTCCTC TAGGGCAAGG 4 20 

GATTTCAACT GCTACTGGTT TTGCCCAAGC AGAACGTTTC TTGGCAGCCA AAT AT AACCG 4 80 

TGAAGGTTAC AATATCTTTG ACCACTATAC TTACGTTATC TGTGGAGACG GAGACTTGAT 540 

GGAAGGTGTC TCAAGCGAGG CAGCTTCATA CGCAGGCTTG CAAAAACTTG ATAAGTTGGT 600 

TCTTCTTTAT GATTCAAATG ATATCAACTT GGATGGTGAG ACAAAGGATT CCTTTACAGA 66 0 

AAGTGTTCGT GACCGTTACA ATGCCTACGG TTGGCATACT GCCTTGGTTG AAAATGGAAC 72 0 

AGACTTGGAA GCCATCCATG CTGCTATCGA AACAGCAAAA GCTTCACCCA AGCCATCTTT 7 80 

GATTGAAGTC AAGACGGTTA TTGGATACGG TTCTCCAAAC AAACAAGGAA CTAATCCTGT 84 0 

ACACCCCCCC CCTCTTGGAG CACATGAAAC TGCATCAACT CGTCAAGCCC TCGGTTGGGA 900 

CTACGAACCA TTTGAAATTC CAGAACAAGT ATATGCTGAT TTCAAAGAAC ATGTTCCAGA 9 60 

CCGTGGCGCA TCAGCTTATC AAGCTTGGAC TAAATTAGTT GCAGATTATA AAGAAGCTCA 10 2 0 

TCCAGAACTG GCTGCAGAAG TAGAAGCCAT CATCGACGGA CGTGATCCAG TCGAAGTGAC 1080 

TCCAGCAGAC TTCCCAGCTT TAGAAAATGG TTTTtCTCAA GCAACT 1-2 6 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCGGCAACAA AAAAGAAAAA ATCAACAGTT AAAAAAAATC TAGTCATCGT GGAGTCGCCT 60 
GCTAAGCCAA GACGATTGAA AAATATCTAG GCAGAAACTA CAAGGTTTTA GCCAGTGTCG 120 
GGCATATCCG TGATTTGAAG AAATCCAGTA TGTCCGTCGA TATTGAAAAT AATTATGAAC 180 
CGCAATATAT TAATATCCGA GGAAAAGGCC CTCTTATCAA TGACTTGAAA AAAGAAGCTA 2 40 

AAAAAGCTAA TAAAGTTTTT CTCGCGAGTG ACCCGGACCG TGAAGGAGAA GCGATTTCTT 300 
GGCATTTGGC CCATATTCTC AACTTGGATG AAAATGATGC CAACCGTGTG GTCTTCAATG 3 60 
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AAATCACCAA GGATGCAGTC AAAAATGCTT TTAAAGAACC TCGTAAGATC GATATGGACT 4 20 

TGGTCGATGC CCAACAAGCT CGTCGGATCT TGG AT CGCTT GGT AGGGT AT TCGATTTCGC 4 80 

CTATTTTGTG GAAGAAGGTC AAGAAGGGCT TGTCAGCAGG TCGCGTTCAG TCCATTGCCC 54 0 

TTAAACTCAT CATTGACCGT GAAAATGAAA TCAATGCCTT C C AG CC AG AA GAATACTGGA 600 
CAGTTGATGC TGTCTTTAAA AAGGGAACCA AACAATTTCA TGCTTCCTTC TATGGAGTAG 660 
ATGGTAAAAA GATGAAACTG ACCAGCAATA ACGAAGTCAA GGAACTCTTG TCTCGTCTGA 7 20 

CGAGTAAAGA CTTTTCAGTA GATCAGGTGG AT AAG AAAG A GCGCAAGCGC AATGCTCCTT 7 80 

TACCCTATAC CACTTCATCT ATGCAGATGG ATGCTGCCAA TAAAATCAAT TTCCGTACTC 8 40 

GAAAAACCAT GATGGTTGCC CAACAGCTCT ATGAAGGAAT TAATATCGGT TCTGGTGTTC 900 
AAGGTTTGAT TACCTATATG CGTACCGATT CGACTCGTAT CAGTCCTGTA GCGCAAAATG 960 

AGGCGGCAAG CTTCATTACG GATCGTTTTG GTAGCAAGTA TTCTAAGCAC GGTAGCAAGG L02 0 

TCAAAAACGC ATCAGGTGCT CAGGATGCCC ATGAGGCTAT TCGTCCCTCA AGTGTCTTTA L080 

ATACACCAGA AAGCATCGCT AAGTATCTGG ACAACGATCA GCTT AAGCT A TAT AC CCTT A 114 0 

TCTGGAATCG TTTTGTGGCT AGCCAGATGA CAGCGGCCGT TTTTGATACC ATGGCTGTTA 12 00 

AATTGTCTCA AAAAGGGGTT CAATTTCCTG CCAATGGTAC TCAGCTTAAC TTTGATGGTT 12 60 

ATCTTGCCAT TTATAATGAT TCTGACAAGA ATAAGATGTT ACCGGACATG GTTGTTGGAG 13 20 

ATGTGGTCAA ACAGGTCAAT AGCAAACCAG AGCAACATTT CACCCAACCC CCTGCCCGTT 13 80 

ATTCTGAAGC AACACTGATT AAAACCTTAG AGGAAAATGG GGTTGGACGT CCATCAACCT 144 0 

ACGCGCCAAC CATTGAAACC ATTCAGAAAC GTTATTATGT TCGCCTGGCA GCCAAACGTT 1500 

TTGAACCGAC AGAGTTGGGA GAAATTGTCA ATAAGCTCAT CGTTG AAT AT TT C C C AG AT A 15 60 

TCGTAAACGT GACCTTCACA GCTGAAATCG AAGGTAAACT GGATGATGTC G AAGTTGG AA 1G20 

AAGAGCAGTG GCGACGGGTC ATTGATGCCT TTTACAAACC ATTCTCTAAA GAAGTTGCCA 1680 

AGGCTGAAGA AGAAATGGAA AAAATCCAGA TTAAGGATGA ACCAGCTGGA TTTGACTGTG 1740 

AAGTGTGTGG CAGTCCAATG GTCATTAAAC TTGGTCGTTT TGGTAAATTC TACGCTTGTA 1300 

GCAATTTCCC AGATTGCCGT C AT AC CC AAG CAATCGTGAA AGAGATTGGT GTTGAGTGTC 13 60 

CAAGCTGTCA TCAGGGACAA ATTATTGAGC GAAAAACCAA GCGTAATCGC CTATTCTATG 1920 

GTTGCAATCG CTATCCAGAA TGTGAATTTA CCTCTTGGGA CAAGCCTGTT GGTCGTGACT 19 80 

GTCCAAAATG TGGCAACTTC CTCATGGAGA AAAAAGTCCG TGGTGGTGGC AAGCAGGTTG 204 0 

TTTGTAGCAA AGGCG AC T AC GAGGAAGAAA AGATGGCTCT TTGTCAACTG TAGTGGGTTG 2100 

AAGTCAGCTA AG CT CG AG AA AGGACAAATT TTGTCCTTTC TTTTTTGATA TTCAGAGCGA 2160 
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TAAAAATCCG TTTTTTGAAG TTTTCAAAGT TCCGAAAACC AAAGGCATTG CGCTTGATAA 222 0 

CTTTGATGAG ATTATTGGTC GCTTCCAATT TGGCGTTAGA ATAGTGTAGT TGAAGGGCGT 2280 

TGACGATTTT CTCTTTGTCC TTTAGAAAGG TTTTAAACAC AGTCTG AAAA AGAGGATGAA 2340 

CCTGCTTTAG ATTGTCCTCA ATGAGTCCGA AAAATTTCTC CGGTTCCTTA TTCTGAAAGT 2 4 00 

GAAACAGCAA GAGTTGATAG AGCTGATAGT GATGTTTCAA GTCTTGTGAA T AG CT C AAAA 2 4 60 

GCTTGTTTAA AATCTCTTTA TTGGTTAAAT GCATACGAAA AGTAGGGCGA TAAAAATGTT 2 520 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A> LENGTH: L0993 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TTTTCTCGAT AATAACTTCC ACCTTATTAT TTGGGATACC CTCCTCTTCT TCACCACCAC 60 

GTTCATAGTA GTCATCGCCA TAGAGAAAAG CTACGATATC AGCGTCCTGC TCAATAGACC 120 

CAGATTCACG AATATCAGAC AAGACCGGTC TCTTGTCCTG ACGTTGTTCT ACACCACGAG 180 

AAAGCTGACT CAGAGCGATT ACTGGAACCT TCAATTCCTT GGCTAGTATT TTCAACTGAC 24 0 

G AG AAATTT C AGAAACTTCT TGTTGACGAT TTTCTCGACC AGTTCCCGTG ATAACTTGCA 300 

AATAGTCTAT CAAAATCAAA CCAAGATTTC CAGTTTCTTG AGCCAATTTA CGAGAACGAG 3 60 

AACGAATCTC TGTAATCCGA ATACCTGGCG TATCATCGAT ATAGATACTG GCGTTAGcTA 42 0 

GATTACCCTG AGCAATAGTA TATTTTTGCC ACTCCTCATC TGTCAATTGC CCTGTACGGA 4 80 

TAGAATGTGA CTCCACTAAG CCTTCTGCAG CTAACATACG ATCTACCAAG CTTTCCGCAC 54 0 

CCATTTCGAG TGAAAAAATA GCAACCGTTT TGTCCAACTT AGTCCCAATG TTCTGAGCGA 6 00 

TATTCAAGGC AAATGCTGTC TTACCAACTG CTGGACGAGC TGCTAAGATA ATCAACTCCT 66C 

CCTCATGAAG TCCTGTTGTC ATATGATCCA AATCACGATA ACCTGTCGCA AT ACCTGT AA 72 0 

TATCGGTCGT TTGTTGCGAG CGAGCTTCCA GATTTCCAAA GTTGAGATTC AACACATCTC 7 80 

GAATGTTCTT AAACCCGCTT CGATTTGCAT TTTCACTGAC ATCAATCAAC CCTTTTTCTG 84 0 

CCTGAGCAAT AATTTCATCA GCTGGTTGTG ACGCTTCGTA AGCTTGGTTG ACAGACTCTG 900 

TCAACTTGGC AATTAAACGA CGTAGCATTG CTTTTTCTGC AACAATCTTA GCATAATACT 9 60 

CCGC ATT AG C AG AAGTTGG C AC AG AATT AA CAATCTCAAC CAAGTAAGAC AAGCCACCAA 1020 
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CACCACCATC GGATAAATCG ACCATGGCTT GGAAAATCAA ACGATGGGCA TACTTAAAAA 114 0 

AGTCCCGAGA CTCAATGTAT TCTCGCACAA AAACAAGTTT ACTCTCATCA ATAAAGATAG 1200 
CCCCTAAAAC CCATTGCTCA GCTAAGATAT CTTGAGGTTG TACTCGTAAC TCTTCTACTT 12 60 

CTGCCATCAG ACTTCCCTTC CTTTTACAAT CTTGTCAAGA AGGTGTAAAC TTATCCTTCT 132 0 

TTCACACGAA GATTGATTAC ACTTGTGATA T CTTG AT AG A TTTT C ACTGG CACATCAATC 13 80 

AAACCAACCG CTCGAATCCG AGCTTGTACT TGAATATGAC GTTTATCAAT CTTAATTCCA 14 4 0 

AATTGCTTTT GCAATTCTTC TGCAATCTTC TTATTGGTAA T AG AACC AAA GGTACGACCA 1500 
TCTGGACCAA CTTTTTCAAC AAATTCTACA AC AGTTT CTT CTGCTTCAAG TTCTGCTTTA 1560 
A TTGCTTT T C CTTCTGCAAT CATCTCAGCG TCAGCTTTTT CTTCCGATTT TTGTTTACCA 1620 
CGAAGTTCAC CTACACCTTC AGCAGTCGCT TCTTTGGCTA GATTCTTTTT GATAAGAAAG 1680 
TTTTGCGCAT ACCCTGTTGG TACTTCCTTA ATTTCGCCTT TTTTACCTTT TCCTTTAACA 17 40 

TCTGCTAAAA AGATTACTTT CATTCTTCTT TCTCCTTTTC CTTCATTTCA TTTAATACAA 18 00 

TTTCTGTCAG TTTTTCACCT GCTTCTGACA AGGTTACATC TTTAATTTGA GCTGCTGCCA I860 

AATTAAAGTG GCCTCCACCG CCTAACTCTT CCATAATCCC TTGTACATTC AGTTT ACT AC 19 2 0 

GACTTCGAGC TGAGATAGAG ATAAATCCTT GTGTATTCTT CGCAAGAACA AAACTCG CTT 1980 

CAATACCTGA CATGG CT AAC ATGGCATCTG CTGCCTTACT AATAACAACT GTATCATAGC 204 0 

ATTTCATGTC CTTAGCCTCT GCTATTAGTA CATCTGAACC TAATTTACGC CCCTGTAAAA 2100 

TAAGTTCATT GACCTCACGA TATTCTTCAA AATCTGTCGC AGCGATTTCC TGGATAGCAA 216 0 

T AC TAT C ACT TCCGCGCGTT CTGAGATAGC TAGCAACATC AAATGTCCGA CTAGTTACTC 2 2 20 

GCGAGGTGAA ATTTTTAGTA TCCAACATCA TACCAGCCAT CAAGACACTT GCTTGCATAC 2 28 0 

GACTCAAACG ATTTTTCTTA GAATTCTGGA ACTGAATCAA TTCCGTTACC AACTCACTGG 2 3 40 

CACTACTTGC ACCACTTTCG ATATAAGTAA TAACCGCATT ATCTGGAAAA TCCTGATCCC 2 4 00 

TTCTATGGTG GTCAATAACA ATGGTTTGGG TAAATAAATC ATAAAATTCT TTTGATAATC 2 4 60 

TTAAGGCTGT CTTTGAATGG TCTACAAGAA TCAACAAAGA ACGATTGGTC ACCATCCCCA 2 520 

TTGCATCCTT AACAGACAAC AACTTCGTAA CTCCTTCTTT TTCTATGAAT GAAACAGCTC 2 580 

GTTCAATATC TGGAGACATT TGTTCTTCAT CATAAAGAGC ATAGCTATTT TCAATCACAT 2 64 0 

TGCTGGCGAA CAACTGCATA CCTACAGCAG AGCCCAAAGC ATCCATGTCT AAATTTTTGT 2 7 CO 

GACCGACTAC AAAAACC TG A TCTACACTCC GAATCTTATC TGAAATAGCT GTCATCATAG 27 60 

CGCGCGTACG ACTCCGTGTA CGCTTCATTG AAGCAGCAGA CCCACCACCA AAATAAACTG 2 8 20 
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GATTTTTCGT TTCGTCGTTT TCCTTAACAA CCACCTGGTC GCCACCACGT ACTTCAGCCA 2 880 

AGTTCAAATT GAGCAAAGCA ACTTTCCCTA TCTCATCATG ATTTCCATCG CCATAAGAAA 294 0 

ATCCCATACT TAAGGTCAAG GGCAACTCTC TCTGTTTCGA CTCTTCTCTG AAAGCATCAA 3 000 

TAACAGAAAA TTTATCATTC ATCAAGCCCT CAAGCACCGT GTAGTCAGTA AATAGATAAA 3 0 60 

ATCGATCCAT ACTTACCCGA CGAGAAAACA TCATGTGTTT TTCTGAAAAC TCTGATATAA 3120 

AATTAGCTAC AAAACTATTG ATTTGACTAA TATCTGACTC AGAAGTTTCA TCCTCCAAAT 3180 

CATCATAATT ATCCACAGAG ACAATCCCAA TCACTGGTCT ACTTGTTACC AATTCATCTG 32 40 

TTATGGCTTG TTCCCTGGAT ACATCTACAA AATACAAAAC ACCGGAAGAA GCATCCATAT 3 300 

GAACAGCATA ACGCTTCTCA CCAAGCTTGG CATAAGTAGA CCGATTTCCT ACTGAAGCCT 3 360 

TGATAATCGT TTGAACAGCT TCTAAATCAA AATCACCATC TTCCTTGGTC AAAATCAATT 3 42 0 

CAGCATAGGG ATTAAACCAC TCAACCTCTC CACAAGATAA ATTCAATTTC ATAACACCTA 34 80 

CAGGCATCTG TTCCAATAGA GCTGTCAAAC TTTCTTCCGC TTGGTGGTTT ACATACTGTA 3 54 0 

TCTGTTCTAC ATCACTCCTT CTATAATGCA CTCTCAGTTT CTTAAATAAA AAAACATAGC 3 600 

CTC CT AC AAA AAGAAACAAA ATTAAAACCG TCAACAGATT ATTATTAACA AAAATAATGA 3 660 

AAGTGGATAA GACTCCAAAC GCAATCAATC CTACTAGAAT AGGAAAAATT GGACTTACAT 3720 

AAAATTTTTT CATTCAAAAC CTCTTGGCAC CCATTATACC ATAATACCCC TCAAAAAGCC 3780 

ACTTTTTAAA AGTGTAATCA GTAATTCTAT CAATTATAAG AAAAAGGTAG TTTACAATTC 3 84 0 

AGTAAACCTA CCTTTACACA TATTGAAATT AAGATTCTTT AACCTCTAAC AAACCAATTT 3 900 

CGCCATCCTC ACGACGATAA ATCACATTGG TTCTCTGATC TTCAACATCC AC AT AG AT AA 39 60 

AGAAATCATG CCCCAATAAA TCCATTTGTA GAATTGCTTC TTCCAAATCC ATTGGTTTTA 4 020 

AATCAATTTG TTTTGAACGA AC AACTTT AG ACTGGACAAT ATTTGAATCT TCCACCAAAG 4 080 

CATCTGTAAA TAATTGACCA GTTGCTACCT TATTTTTATT TTTACCCTCC ATTTT TGTTT 414 0 

TATTTTTACG AATCTGACGT T C AATTTT AT CAGTTACAAG GTCAATTGAA C CAT AC AT AT 4200 

CTTG AG AT AC ATCTTCTGCG CGGAGAGTAA TAGATCCAAG CGGAATCGTT ACTTCCACTT 4 260 

TAGCCCTTTT TTCACGATAA A CTTTT AAGT TAATTCGGGC ATCCAACTCT TGTTCTGGTT 4 3 20 

GGAAGTACTT TTCGATCTTT TCGAGTTTAG AAACTACATA ATCACGAATT GCTTCTGTTA 4 3 80 

CTTCTAGGTT TTCACCACGG ATACTATATT TAATCATATG AGTACCTTCT TTCTAAACAT 4 440 

TTTTCTTTTT ATGATTTTAT TATAACGCTT TCATTCTATT TTTGCAAATT TTTTCCTCAT 4 500 

CTTACAAGGG AAAATGTTTT TACATCCTTA GC ACCAGCTT CTTCCAACAG TTTCTTAACA 4 560 
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CGATTTATAG TTGCTCCTGT AGTATAGATA TC AT CT AT AA GTAGGATTTT TTTAGGAATA 4 620 

GTGACTCCAC TTTTAATAAA GAAAGGAAGT TCTGTCCCCA AGCGCTCTGA ACGATTTTTA 4 680 

GAAGAACTGG CTCTCTCTTC TCTTTTCTCT AATAAATCCA GATACTCAAA GCCTGCTGCC 4 7 40 

TCTACCAAGC CCTCAACCTG ATTAAATCCT CTATTAGCAT ATCTATCAGG ACTTAGGGGA 4 800 

ATT ACAAC AA ATTGATACTC TTTGTACTTT TTCAACTCCT CACTTAAAAA TGAAGCGAAA 4860 

ACTTTTCTTA ACAGGAAGTC TCCATCAAAC TTATACCGAC TGAAAAAATC CTTCATACCT 4 920 

TGATTGTAAG TAAAAATCGC TCTATGACTG ACTTCAACTC CCTCTTTACA C C AAAGTTG A 4 9 80 

CAATCTTGAC ACTTTGTTGA C AACTC TGTT TTCATACAAT TTGGACAGTT CTCTTCCCCA 504 0 

ATTCTTTCAA AAGTAGAATC ACACTCTCAA CAAAGACAAG AGT C AT C ATT CCTCAGAAGT 5 100 

AAGAGACTAC TAAAAGTTAA AAC AGTCTT C ATAGTCTGCC CACATAACAA GCACTTCATA 5160 

CACCACCCTC CTTATTCATC ATCTGAATTT CCTTAATCGC CTTCTTGATT GAAGCATTTA 52 20 

ACCCATCATG GAAGAAAAGC AAATCTCCTG TCGGTCTATC CATGCTTCGT CCAACTCGTC 52 80 

• CACCAATCTG AATC AAACT A GACTTGGTAA ACAAACGATG ATTGGCCTCT ACTACGAAAA 5340 

CATCCACACA AGGGAAGGTA ACTCCGCGCT CCAAGATTGT CGTACTGATA AGTATTGTCA 5400 

GTTCTCCATC TCGAAAAGCT TGTACTTGCT CTAATCGATC CTCTGTTACA CAAGATACAA 54 6 0 

AGCCAATTTT CTCATTTGGA AATTGCTCCT GTAAGATTTC TGCTAACTGC TCCCCTTTCT 552 0 

T AATTTCT G A AGCAAAAATG AGTAACGGAT AAGCTGTCTT TCTCTGCTTC TCAATATACG 5580 

ACTTTAACTT TGGTGACAAA CGATTCTTGT CTAAGTAGCG ATTAAAATCC GATAACCAAA 564 0 

TTGGTTTTGG AAT AATC AAC GGATTTCCAT GAAACCGTCT CGGTAAATTC AGTCTTTTTA 5 700 

CTTCTCCTAA ACGGACCTTT TTATCTAACT CATTGGTCGA AGTCGCTGTT AAAAAGATTC 57 60 

T C AATC C ATT CTCCTTTACA CTATTCTTGA CAGCGTGGTA AAGCATGGGA TTATCAACAT 5 32 0 

AAGGAAAAGC ATCTACTTCA TCCACTATCA GCAAATCAAA AGCTTGATAA AACTTCAATA 5 8 80 

ACTGATGGGT TGTTGCAACA ACTAGTGGTG TTCGAAAATA AGGTTCCCAT TCTCCATGTA 5940 
GCAAAGCTAT CCCGCAAGAA AAATCCTGTT GCAGGCGCTT GTACAGCTCC AAACAAACAT 6 000 

CTATGCGAGG ACTAGCCAAA CACACTGCAC CACCCGCATT GATCACTTTA GCCACTACTT 60 60 

GATAAATCAT TTCTGTCTTT CCAGCTCCTG TTACCGCATG AACTAAGGTT GGCTTTTGCT 6120 
TGTCTACTAC TTGAAGCAAT CCCTCTGACA CCTTCTCTTG AAAAGGAGTT AATTGGCCGC 6180 
GCCATTTGAG AACATCTTGC TTTGGAAAAT CCTCCTCCGG AAAATAGTAT AAAGTTTGAT 6 240 

CACTTCTGAC TCGCTTCATC AGCAAGCACT CT CG A C AAT A GTAAGCACCG ATGGGCAAAT 6 300 

ACCATTCTTC TAG AAT AGT A CTATTACAGC GTTGACAGAA AAGTTTCCCC TTCTCCTTTC 6 3 60 
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TCATTGCTGG AAGTTTCTCC GCCAACTGAC 
ATAAACGACC GAGATAATCT AAATTTACTT 
TTTAGATGAT TTTTTAGTAC AATTAAATCA 
AAGTCCAAGA AG AAAT C AAA AAATCTCGCT 
AAGAAGAGGC TCGTGACTTC ATTACTGCCA 
ACTGCTCTGC CTTCATTATT GGAGAACCTA 
AGCCTAGTGG TACTGCTGGT GTTCCCATGC 
ATGTCTGTGT GGTCGTGACA CGCTACTTTG 
TTCGTGCTTA CGCCGGCAGT GTCGCCTTAG 
AAGAAGAGGC TGGCATTGCT ATTC AAATGT 
TCCTTAAAGA ACATGGTCTC ATGGAGCTGG 
TGATTTATGT TGATAAAGAA GAAAAAGAAA 
AT GG AAAAGT CACTTTAACT GACCAAGGTT 
TGTAAACAAT GAATAATACA GCGTTTCGTT 
AT AAAAAG AG GCGTACCAAA ATATACTAGA 
CGTTTTCCTT CACACCTATT TACTAGAATT 
CTTTGATCTA TGATATATAG AAATGGTATG 
AACAGGTATT CATATGTCTA TTTATAACAA 
TGTTAAACTT AACAACATCG TGCCAGAAGG 
ATTTAATCCT GGTTCATCTG TAAAAGACCG 
ACAAGATGGT ATTCTGAAAC CTGGTTCTAC 
TATTGGACTT TCATGGGTAG GTGCTGCTAA 
AACTATGAGT GTAGAACGAC GTAAAATTAT 
TCCTGGTAGC GAGGGAATGA AAGGTGCTAT 
TGATGGTTTC CTTCCTCTTC AATTTGACAA 
AACAGGAGCT GAGATACTAG CTGCTTTCGG 
AGTAGGT ACT GGTGGAACGA TTTCTGGTGT 
CATTCAAGTT TTTGCAGTAG AAGCAGATGA 
TCCTCACAAA ATTCAAGGTA TCTCAGCTGG 
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GTTCTTCTTC TG TTAATTCA 
TCATACTTCT TTATTCGTAA 
TGGAATTTAG G AC AATT AAA 
TTATCTGCCA TGCCAAGCGT 
TCAAAAAAGA A C ACT AC AAA 
GTGAAATTAA ACGTACAAGT 
TTGGGGTACT AGAAAATCAC 
GTGGTATTAA ACTAGGCGCT 
CTGTCAAAGA AATTGGTATT 
CTTATGCTCA GTACCAAGAG 
AT AC AAAC TT TACAGATCAA 
CTATTAAAGC TGCACTTGTG 
TACGAGAGGT TGAAGTTCCT 
G AC ATTCT C A CAACTACTTT 
AAATGAAGCA ATTCAAACGA 
AGCTGAACGC AATCACTTGA 
GATAGCGTTA TACT AAAC AT 
CATTACTGAA TTAATCGGTC 
TGCTGCAGAC GTCTATATAA 
TATTGCCCTT AGCATGATTG 
TATTGTTGAA GCAACAAGTG 
AGGGTATAAA GTCGTCATCG 
C C AAGCTT AT GGTGCTGAAC 
TGCTAAGGCT CAAGAAATCG 
TCCAGCTAAT CCAGAAGTAC 
T AAAGATG G A TTAGATGCCT 
TTCTCATGCA CTCAAATCAG 
ATCTGCTATT CTATCTGGTG 
ATTTATTCCT CATACACTTG 



TTCTCAGTAA 64 2 0 

AAACTAGCAC 64 80 

GAGGACGGTC 654 0 

GTTTATAGCG 6600 

GCG AC AC AT A 6660 

GATGATGGTG 67 2 0 

AATCTCACC A 67 80 

GG AG G ACT AA 684 0 

ATT G AAAT AA 6900 

TACAGTAACT 69 60 

GTCGATACGA 7 020 

GAGTTTTTTA 7 080 

GT AAACTTAG 714 0 

AGCGAGCAAA 7 200 

AACCTGATAT 72 60 

AAATT AATG A 7 320 

ATCTTATACA 7 380 

AAAC AC CG AT 7440 

AGCTTGAAGC 7500 

AAAAAG CT^ A 7 5 50 

GAAACACCGG 7 620 

TTATGCCTGA 7 680 

TCGTCCTAAC 774 0 

CTGCTGAACG 7 800 

ACGAAAGAAC 7860 

TTGTTGCTGG 7 920 

AAAATTCTAA 7980 

AAAAACCTGG 804 0 

ATACTAAAGC 8100 
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CTATGATGGT ATCGTTCGTG TAACATCAGA TGACGCTCTT GCACTCGGAC GTGAAATTGG 3160 

TGGAAAAGAA GGCTTCCTTG TAGGGATTTC CTCAGCTCCA GCTATCTACG GAGCCATCGA 822 0 

GGTTGCCAAA AAATTAGGTA CAGGTAAAAA AGTCCTTGCC CTAGCACCAG ATAACGGTGA 82 80 

ACGTTATCTC TCTACAGCAC TTTATGAATT GTAACCGTCC AATAACGAAG TCTATTGAAA 8 34 0 

AATCTCCAGA CTAGAGAACT CACGGATAGT TCCTAATCTG GAOATTl'CTT ATTTGCACTT 8400 

TTCTTGTACA ACTTTAGTCC ATGGTAAATA GGCCTCTAAA ACCTCTTTGT TTACGAGAGT 84 60 

TTCCACGTTT GGAAGACATT CTAGAAGATA GG AT AG AT AT TTCTCACTAT TTATAATGGA 8520 

TTGAAATAAG ATATGAACAA ATCGATTAGA ACATGATGGT AAAGCGTAAT CCCTTGTTTC 8580 

TCAGCTTTCC CAGACAAAAA AGTCCAATAG TAAGTCAGCT GACTATCACT CTCTAGCACC 86 40 

CTATAAGAAG TTTCATCCCC ATGAAGTAAG GGCTGAGTCA ATAGTCTCTC TCGCAAGAGG 8700 

TTATAAAGGG GCTCCAAATA GTATTGACTC GTCTTGATAT GCCAATTAGA GATTTCCTTA 37 60 

CGTGTGATTG GTAAACCCAT CCTAGCCCAA TCTTCTTCTT CGCGATAATT GGGTACCTTC 88 20 

AGATTAAACT TCTGATGGAT GGTGTGAGCC ATAATAGAAG CTGAGCCAAA GTTATGCGCT 8 880 

AAAGGGGCTT TAGGAATAGG AG CTTTCACA AGCTTATCCA GATGATTATC TTTTACTCGT 894 0 

TATGGACAAT GCTATATGGC ATAAATCAAG TACCTTAAAG ATTCCGACTA ATATTGGCTT 9000 

TGCATTTATT CCTCCATACA CACCAGAGAT GAACCCCATT GAACAAGTGT GGAAAGAGAT 9060 

TCGTAAACGT GGATTTAAGA ATAAAGCCTT TCGAACTTTG GAAGATGTCA TACAAGCACT 912 0 

GGAGAAGGAG GTGATAAAGT CCATCGTTAA TCGG AG A CGG ACTAGAATGC TTTTTGAAAA 9130 

CAGATGAGTA TAAAAAGAAA GTCCTCATTT CAATAGAAAT CACGACTTTC TGATGAATTT 924 0 

ATAGTAAAAT GAAATAAGAA CAGGATAGTC AAATCGATTT CT AAC AATGT TTTAGAAGCA 9300 

GAGGTGTACT ATTCTAGTTT AAATC C ACT A TATTTGGGGA GTGATAGAAA AGC CCTTC AT 9 3 60 

CAGCCAATCT ACTTGTTCAG GTGCGAGAGC TTTGACATCC TTTTCTGTAC TGGACCAAGT 94 20 

CAGTTTTCCG TTCTCAAAGC GTTTATATAA TATCCAAAAT CCTTGACCAT CCCAGTAAAG 94 80 

AACTTTAAAG CGGTCTTTAC GTCCACCACA AAAG AG AAAG ACTTGATCGG AGAAAGGATC 9540 

CAATTCAAAG TGGGTTTTAA CTACATAGGC TAATGAGTCT ATTCCCTGCC TCATATCTGT 9 600 

CTTGCCACAA ACAAGGTGAA CTTGACCTAA ATCACTTAGT TGAATTATCA TAGTACAATA 9 6 60 

CCTTTCCTCC GATAATTATT TTTTATCTGG TATACTGGAA GTTGGGGAAT TAGGATAGAT 97 2 0 

ACCTTGTTAT GACGCGCTTA CTATGAATTT GAAGTATAGT CTCCTAAATG CACTTAGCCC 9 7 30 

TTATTATAGG GCTTTTTGTT TTAATTATTC TAATCGAGTG AGACTGGGGA AAAAACAATT 9 84 0 

TCAGGAAAAA ' TCTAAGCCCT ATACAAAAAA GGAAGCAATT TCCTTCCTTT CT ATT ATT AG 99 00 
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TTATTCAAGG CTGCTGCCAT TGTAGCTGCA ACTTCAGCTT CGAAGTCGTT TGCAGCTTTC 99 60 

TCGATACCTT CACCAACTTC AAAGCGAGCA AACTCAACTA CCGAAGCGTT AACTGATTCA 10020 

ACCTATGCTT CAACTGTCTT GCTGTCATCC ATGATGTAAA CTTGTGCAAG AAGTGTGTAA 10080 

GCTTGGTCAA CTTTAGTGTT ATCAAGCATG AAGCGATCCA TTTTACCTGG AATAATTTTG 10140 

TCCCAGATTT TTTCTGGTTT GCCTTCTGCA GCCAATTCAG CTTTGATGTC AGCTTCAGCT 10200 

TGAGCAATAA CATCATCAGT TAATTGAGCT TTTGATCCAT ACTTCAAGTG TGGAAGAGCT 10260 

GGTTTATTAA CCATTGCACG GCTTTCCTTG TCTTGGTCGA TAACGTGATT CAATTGTGCC 10320 

AACTCATCTT TAACGAATTG CTCATCCAAT TCTTTGTAAG AAAGAACTGT TGGTTTCATC 10380 

GCTGCGATGT GCATTGACAA TTGTTTAGCA AGTGCTTCGT CTCCACCTTC AACAACTGAA 10440 

ATAACACCGA TACGTCCACC GTTATGTTGG TATGCTCCAA AGTGTTGTGC GTCTGTTTTT 10500 

TCAATCAATG CAAAGCGACG GAATGAGATT TTCTCTCCGA TAGTTGCTGT TGCAGATACG 10560 

TATGCAGCTT CAAGAGTTTC ACCTGAAGGC ATTATCAAAG CAAGAGCTTC TTCGTTGTTA 10620 

GCAGGTTTTC CTTCAGCAAT GACTTTAGCT GTAGTATTTA CCAATTCAAC GAATTGAGCG 10680 

TTTTTTGCAA CGAAGTCAGT TTCAGCGTTT ACTTCAATAA CTGCTGCAAC ATTACCGTTA 10740 

ACATAAACAC CAGTCAAACC TTCTGCAGCA ACACGGTCAC CTTTCTTAGC TGCCTTAGCC 10 800 

ATACCTTTTT CACGAAGCAA TTCAATCGCT TTTTCGATGT CACCGTCTGT TTCTACAAGC 10 860 

GCTTTTTTAG CGTCCATAAC ACCGGCACCA GATTTTTCAC GCAACTCTTT TACAAGTTTA 109 20 

GCTGTAATTT CTGCCATTTT AATTCTCCTA TATTTTTTGA AAATAGGAGA GCCCGGCTAA 10980 

GCCCCGCCTC CGG 10993 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CGACGGGGAG GTTTGGCACC TCGATGTCGG CTCGTCGCAT CCTGGGGCTG TAGTCGGTCC 60 

CAAGGGTTCG GCTGTTCGCC CATTAAAGCG GCACGCGAGC TGGGTTCAGA ACGTCGTGAG 120 

ACAGTTCGGT CCCTATCCGT CGCGGGCGTA GGAAATTTGA GAGGATCTGC TCCTAGTACG 180 

AGAGGACCAG AGTGGACTTA CCGCTGGTGT ACCAGTTGTC TTGCCAAAGG CATCGCTGGG 2 40 
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TAGCTATGTA GGGAAGGGAT AAACGCTGAA AGCATCTAAG TGTGAAACCC ACCTCAAGAT 300 

GAGATTTCCC ATGATTATAT ATCAGTAAGA GCCCTGAGAG ATGATCAGGT AGAT AGGTT A 3 60 

GAAGTGGAAG TGTGGCGACA CATGTAGCGG ACT AAT ACT A ATAGCTCGAG GACTTATCCA 42 0 

AAGTAACTGA GAATATGAAA GCGAACGGTT TTCTTAAATT G AAT AGAT AT TCAATTTTGA 4 80 

GTAGGTATTA CTCAGAGTTA AGTGACGATA GCCTAGGAGA TACACCTGTA CCCATGCCGA 54 0 

ACACAGAAGT TAAGCCCTAG AACGCCGGAA GTAGTTGGGG GTTGCCCCCT GTGAGATAGG 600 

GAAGTCGCTT AGCTTTAATC CGCCATAGCT CAGTTGGTAG TAGCCCATGA CTGTTAATCA 660 

TGATGTCGTA GGTTCGAGTC CTACTGGCGG AGTAATtGAT AAAAGGGaAC ACAGCTGTGT 72 0 

TCCTCTTTTT GTATCAATTT GTATCACCAA G C ATTTTC AT AAGGAAGTCT GTTATTTCTT 730 

GAGAACTTTC TTTTTTTCCA TGTGCAATCC AAGTTTGGCA GACACCAAAA AGTGCATGAG 84 0 

TT AGAT AGAT GCTACTATAT TCTAATTCAG TGGTATTTAG ATTCAGTTGC ATAAATCGCT 900 

TTTGTAAATC TGTACTAAGC ATGATATGAA GTTTATTTCG TAAGAAATTT TGGATTTCTT 960 

TAGTCCCATT TTCAGAAAGA AGGGCAGCCA GAAGTGCTTC TG ACT CT AG A TATTCAAAAA 1020 

CTTCTAAAAT ACCGTCTCTT TTGTGATGAG CATGTTTTTG AAAAATATAT TCAAATGTAT 1080 

GGAATAGCTT GCTTTGATAG TGCTCAATCA TATCATACTT AT CCTT AT AG TGAGTATAGA 1140 

AGCTGGAACG ACT AATTC CG GCTTTT T CTA CT AATTTG AC AGTAGAAATT TTATCAAATG 1200 

GCTGTTCCAT CAGTAATTGT ACCATAGCAT TTTCAATAGT T CG CTTTGTT TTTAAGCGTT 1260 

TGTTACTTTC TTGCATATTT CCTCCTTGTA AACAAATTAG AC T AT ATGTC TAAAAATAGA 1320 

TTTTTTATCT TGTAATTTAG ATTTTTTAAT GTATAATCTA T7ATATCAAA ATTTTAGACA 1380 

ATATGTTTAA AAAAGCACAA ACTAACTTTA A A G AAT ~ 7\> ACCAATTTAA AAAAAACCAA 14 40 

CCTTTATTAT TGTCATGATC GGGATTTCTC TTATTCCAGA TCTGTACAAT ATCATATTTT 1500 

TGTCATCAAT GTGGGATCCA TATGGGCAAT TGTCTGACTT ACCTGTCGCA GTTGTAAATA 1560 

ATGAT AAAG A GGCTTCCTAT AATGGTAATA CTATGGCAAT AGG AAAAG AC ATGGTGTCCA 1620 

ATTTAAAAGA AAATAAAACC TTGGATTTTC ATTTTGTAGA TGAAGAGGAA GGAAAGAAGG 1680 

GATTGGAAGA TGGCGATTAC TATATGGTAG TGACTTTACC AAGTGATTTA TCTGAAAAAA 17 4 0 

CAACTACATT ATCCAATATT CAATCGACAG CAGCTTATCA ATCATTGACA AGTGAGCAAC 1800 

AAACTGAGAT AAGTGATTCT GTATCTCAAA ATTCAACTGA TAGTATTCAA TCGGCTCAGT 1860 

CAATTGTAGC TTTAGTACAA GATTTACAGC GAAGTTTAGA AAACTTACAA AATCAATCTT 19 20 

CTAATCTTTC GACTTTAAAA AATCAATCTA ATCAAGTATC ACCTATTACT TCTACTTCTT 1980 

TGATAGGATT GTCAAGTGGA TTAACAGAGA TACAAGGAGA TGTTACTAGC AAATTAGTTC 204 0 
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C7CCCAGTCA GTCGATTGCA TCAGGTGTAA ACGCATATAC TACAGGTGTT GATAAAGTTT 2100 

CTCAGGGCGC AACTCAAC7A AGTGAAAAAA ATGCCACCTT GACAGGTAGT TTGGATAAAC 2160 

TACTTTCAGG CTCAAACACC TTGACACAAA AATCTTCTAG ATTGACAGCA GGAGTTGGTT 2 2 20 

AATTACAATC AGG AT CTGGG CAATTAGCAG AC AAATC C AG TCAGTTACTT TCAGGTGCTT 2 2 80 

CTCCATTAGA GAATAGAGCT AATAAATTGG CAGATGGATC TGGGAAACTA GCAGAACGTG 2 3 40 

GAACAAAGTT AACTTCTGGA TTGGAAGATT TACAGACAGG ACTTCCTTCT TTAGGACAAG 2 4 00 

GACTAGGTAA TGCTAGTGAT CAACTCAAAT CAGTATCAAC AGAATCTAAA AATGCAGAGA 2 4 60 

TTTTGTCAAA TCCACTCAAT CTTTCAAAAA CAGACAATGA TCAAGTTCCT GTAAATGGAA 2 520 

TCGCAATAGC TCCTTATATG ATATCAGTTG CTCTTTTTTT GCAGCAATAT CAACAAATAT 2 580 

GATATTTGCG AAATTGCCTT CAGGACGTCA TCCAGAGAGC CGTTGGGCTT GGTTG AAATC 2 640 

TTCAGCTGAA ATAAATGGTA TTATAGCTGT TTTGG C AGG A ATTTTGGTAT ATGGAGGAGT 2700 

TCAGCTTATT GGTTTAACTG CTAATCATGA GATGAGAATA TTTATTCTCA TCATCCTAAC 2 7 60 

AAGTTTAGTA TTCATGTCTA TGGTGACCAC TTTAGCAACG TGGAATAGCC CTATAGGAGC 2 820 

TTTTTTCTCA CTTATTTTGC TTTTACTACA GTTAGCATCA AGTGCAGGTA CTTATCCACT 2880 

TGCTTTGACA AATGATTTCT TTAGATCTAT TAATCCCTGG TTACCAATGA GCTATTCAGT 2 9 40 

TTCGGGATTA CGACAAACAA TCTCTATCAA CAAGTCATTT TCCTAGCTGT CATACTAGTT 3000 

CTATTTACTA GTTTAGGTAT GCTAGCCTAT CAACATAAGA AAATGGAAGA AGATTAAAAA 3060 

AATCGACCGA TTAACTGCTC GATTTTTTAT GCCTTAGATG ACTTTCGTCT GTGATTATAG 3120 

ATTCCAAATA GTAAGAGAGA AGTAAAGGAA CAGATTGCTC CAGTAATAAA ACCATTGGGA 3190 

TCAGCTTGTT TAATTTCTAT TTTCTTACCA TCTTGGTAGG CAGACCAACC TTTGTCATAA 3 300 

GGAATGGTGA AGAAAATAGA TGTATCTTGT TGGACATCAT ATGTAGCAAA AACCTTCTTT 3 3 60 

TTAGAAGTTG ATACTGTGAC AGGTTGTTCT TTAATTTTTT GAATTGCCTC GGTGAAAGTT 3 4 20 

TTGGT AT CT A AACGATAGAA GGTAGGAGAT TCAAATGATA CTTGTGAATT TCCAGGGAAA 3 4 80 

CTAACATTGA TATTGAAAGT TTTTTTCTCT TTAGTATATC C T AG ATT AAA GAAGGAGAAG 3 540 

ACATTATCAG TTGTAAAAGT CTTTTTTTCA C C ATT T AC AA GGATGTCAAC CTTCTTTTGT 3 600 

TT AT CGTT AG AAAAGTGAAG GTTTATGAAA GAG AG AT AAA CTTGGCTGTT TTCTGGAACT 3 660 

TCAATTTGAT ACTGGATTGC TGCATCTTCA TTTGAAGAAC TTGTG AC ACT AATCAAATCA 3 720 

TTAGTATTTT CTATTTTTTC TGTTTTTTCA TAAGGTATTG GAGAAAAATA ATCAAAATTG 3 7 80 
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ACGTTAGCAA GTTGATTTAA AAATGAGGCC TG ATT AT CCA AGGHAX~TTC ATTGAACTTG 3840 

ACATCATTGT AAAC AG ATTG ACTCGCAACT GCAATCGGAA GAGAGTATTG ATTTTCATAT 3 900 

AGGGTAAGAT TATCTTTTTG ATAGATATCT TTAAAGCCAT ACTTATCAAT AGGACTG TCT 3960 

GAGATATTGT ACTGG AT AC C AAATAAACTA TCAGCCAAAA TACTATTATT TGCAT AT CGG 4 020 

AGATTGAGAT TAGTCCCAGA GGATTTAAAA CCAAGTTTAT CTAAAGTAGA GCTTGATGAA 4 080 

CGATTTCGAA CAGATGAAAA TTGAGAGATT CCATTGTAGT TGAATTTCAT ACTGTCATTT 4140 

CCTGTCTGAG TTTGTAGTTT TTCAGTACGA GTAAATTGAT TTCCAATATA TG TTG AG AAA 4 200 

G ATTC CAT AG CTGGGATATC TCGACTATAA GCACTTCGAG AAGCAAATCC CCATTCCTTA 4 2 60 

GCAATTCCGT CCATTTGAGA TGAAGCATTT AAACTCATTT CAACCAGTAT AAATAAAGAG 4 3 2C 

ATTAGAATGG CAAATAGATT CACAGATATA AACTTTTTGA TAACTGCAAG GAGTAAAAGA 4 380 

GAATAGACAA CCAAAAATTC AAGAGTAAGC AGAATATTCA AATCTGTTAA AAAAGAATAA 4 440 

TGCGATTTTA G AT AG ATGGT AG CT AAAAAT CCTGCTACTA CAAGAAAAAG CGAAACTAAA 4 500 

AAATTCCAGA C TTT AAGTTC TTTCAGACGC TTTAAGACTT CTG CTGCTGT GTAAATTAAC 4 560 

AAGGTAGAGA AAATCCAAGC ATAGCGATGT AAAAACATGT TTGGAGTATG CATGCCTTGC 4 620 

CAAAATAAGT CAAGACCTTC TATGTAAAAG CTTGCAATTA GAAATGCAAA GAATATTACA 4 6 80 

TATATGAGTT TCACGTGAAA CTTAATAGAT TTCAGCGTAA AAAATAAAAT GGT C AAAAT A 4740 

AAGGGAAATA GTCCAACAAA AATCATTGGG ATGGCCCCAT ACTTTGTTGT GTCAAAGGAA 4 800 

CCAATGAATT GCTTAGCAAA GAGATCAAGA TACCAGCTAC TTTCAGTTTG AAACTTTGTA 4 860 

ACTTCAGTCA ATTTTTCCCC ATGTGTCTGT AAATCAAATA GAGTGGGAAG AGTCATAATC 4 9 20 

AAACTAGCCA TACCAGCTAA AAAGGACATA ACTATGAAAT CAAGAACAGA TCATTTTCGA 4 9 30 

GTCTTAAAGT CCCACGAAAT TTG AC AG AG A TACCAGAAAA TAAGAAACAA TACTGTCATA 5040 

TATCCAAAAT AATAATTTTG AATAAATAAG ATTG AC AG AC TTGTAAAGTA C AAT AGG ACT 5100 

T TCTTTT C AG TTATCAGTAG ATGTAAACCA GTTATAATTA AAGGAATCAA GATAAAAACA 5160 

TCTAGCCAGG T TTTT ATCTC TAATTGACTG ACAGTGAAAC TCATCAGAGC ATAGGAAGTA 522 0 

GATAAGGCTA GTTTT AAAAT CTGAGGGATA GATTGAAACA ATTTATTCAA ACTAAAAAAG 5280 

GTTGACAGAC CAATCAATCC AAATTTTAAG AGAGTTGTCA GAT A GAT AG C ATCTG GC AT A 5 3 40 

TTCGTTAGAT CAAAAAAGTA AACCAGAGGC GCGAGAAAAC TACCCAAGTA ATAACTAGAT 54 00 

AGGGCATAGA AGTTTAGCCC TAGACCACTT GTAAAGGTGT AAAACAGATT ACTATTTCCA 54 60 

TGTAGGATAT TTCGTAAGGC TACATCAAAA ATAACGTATT GATGAAAGCC ATCTCCTAAT 5520 

AGAGGAGAGT TGTCGCTATT CCAGTAGATA CTTTGAGATA GATATACTCC AGACATAATC 55 8 0 
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ACTACAGGAA TGATGAAAGA AATAAAATAG GTTCGATATG TTTTTAAAAA TGATTTCATG 5640 

TTACCTCGTA GAATGATAGA AAACTCAGTT GCTTAACCCA ACTGAGTTTT GAAGTTTTAT 57 00 

TTAGTCTTTC CAAAGTTCTT TAACTTTTGC TTGTACTTCT GCATTTTCTA GGAATTCATC 5*760 

GTAGGTTTCA TCGATACGGT CAATGACGCC ATTTTTAGAT AAGACAATGA TATGGTTAGC 5820 

CAAAGTTTGA ATAAATTCGT GGTCATGGCT GGCAAAGATG ATTGATTCTT TAAAGTTTTT 5880 

CAATCCATCA TTCAAGCTTG AGATAGATTC CAAGTCCAAG TGATTTGTTG GATCATCAAG 5 94 0 

TACAAGGACA TTTGATTTTA AGAGCATGAG TTTTGAAAGC ATGACACGAA CTTTTTCTCC 6000 

CCCTGACAAG AC ATTT AC AG GTTTGTT AAC TTCATCTCCA GAGAAGAGCA TACGGCCGAG 6060 

GAAGCCACGT ACCAAAGTAT TGTCATCTTC TTC TTT ACT T GCGAATTGAC GCAACCAGTC 612 0 

AAGAATTGAT TCTCCTCCTG CAAAATCAGC TGAGTTATCT TTTCGTAGCT AAGATTGACT 6180 

AGTTGTAACT CCCCACTTGA CAGTTCCTTC ATAGTCAATA TCTCCCATGA TTGCACGAAT 624 0 

TAATGCAGTC GTTTGAATAT CATTTTGTCC AATAAGTGCT GTCTTATCAT CTGGACGCAA 6 3 00 

GATGAAACTA ATATTATCCA AGATAGTTTC ACCATCAATC TTTACAGTTA AATTTTCTAC 5 3 60 

TGTCAAGAGA TC ATT AC C AA TCTCACGTTC CGCTTTAAAG TTGATAAATG GAT ATTT ACG 64 2 0 

ACTAGATGGC AC AAT CTCTT CTAGCTCAAT CTTATCAAGC ATTCTCTTAC GTGATGTTGC 64 80 

CTGCCTTGAC TTAGAAGCAT TGGCAGAGAA ACGAGCAACA AATTCTTGCA ATTGTTTAAT 6 540 

TTTTTCTTCT GCTTTAGCAT TACGGTCTGC TAGCAATTTA GCAGCAAGCT CAGAAGATTC 6 600 

CTTCCAGAAG TCCTAGTTTC CG AC AT AG AG TTTGATTTTT CCAAAGTCAA GGTCGGCCAT 6 660 

GTGAGTACAA ACTTTGTTTA AGAAGTGACG GTCGTGGGAT ACT ACG AT AA CTGTGTTATC 6 72 0 

aaagtcaatc aagaa j .* . / j:\accaaj.' A^r^ArTGC a . . : ..-..v-.c CGTTACTAGG 6**0 

CTCGTCCAAG AGAAGAACAT CTGGTTT AC C AAAAAGTGCT TTGGCGAGGA CAACCTTTAC 6 840 

TTTTTCACCG TTGGCCAATT CGCTCATGTT TTGGTAGTGT AATTCTTCTG GAATCTTTAG 6 9 CO 

GTTTTGAAGT AGTTGAGAGG CTTCACTCTC TCCTTCCCAA CCTCCAAGTT CCGCAAACTC 69 60 

TCCTTCGAGT TCGGCAGCAC GAACCCCGTC CTCGTCTGAG AAATCTTCCT TCATGTAGAT 702 0 

AGCATCTTTC TCTTTCATGA TGCTATAAAG TTTTTCATTT CCCATGATAA CGACATCAAT 7080 

GGCACGTTCA TCTTCGTAGT C AAAGTG AT T TTGACGAAGA AC AG AG AG AC GTTCATCTGG 714 0 

ACCAAGAGAG ATGTGACCAG TAGTAGGTTC GATATCTCCA GCTAAAATTT TTAAAAAGGT 7 200 

TGATTTTCCG GCACCATTAG CACCGATTAA TCCGTAAGTA TTTCCTTCTG TAAATTTGAT 7 2 60 

ATTG AC AT C A TCAAAAAGTT TGCGATCACT AAAACGTAGT GAAACATCAG ATACTGTAAG 7 320 
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CAATGTTTTT CTCCTATATG TGTAATATAT TTATTCTACT AG AAAAT AC A GAAATATTCA 7 3 80 

AATTTTTATT TGTCAATTTT GTGTAAATTA TATTTACAGT ATCCTTTACA CAAATCTGTA 744C 

AAAAGCAAGG CTGATTTATT TTGATAAATT ACGGTTATTT CATTAAAAAA ATGCTATAAT 7500 

TGAAAGGACT ATATCGAAGG AGAACAAAAT G ACT AAACCC ATTATTTTAA CAGGAGACCG 7 560 

TCCAACAGGA AAATTGCATA TTGGACATTA TGTTGGAAGT CTCAAAAATC G AG T ATT ATT 7 620 

ACAGGAAGAG GATAAGTATG ATATGTTTGT GTTCTTGGCT GACCAACAAG C CTTG AC AG A 7680 

TCATGCCAAA GATCCTCAAA CC ATTGT AG A GTCTATCGGA AATGTGGCTT TGGATTATCT 7 740 

TGCAGTTGGA TTGGATCCAA ATAAGTCAAC TATTTTTATT C AAAGC C AG A TTCCAGAGTT 7 800 

GGCTGAGTTG TCTATGTATT ATATGAATCT AGTTTCGTT A GCACGTTTGG AGCGAAATC C 7 8 60 

AACAGTCAAG ACAGACATTT CTCAGAAAGG ATTTGG AG AA AGCATTCCGA CAGGATTCTT 7 920 

GGTCTATCCA ATCGCTCAAG CAGCTGATAT CACAGCTTTC AACGCTAATT ATGTTCCTGT 7 9 80 

TGGG AC AG AT CAGAAACCAA TGATTGAGCA AACTCGTGAA ATTGTTCGTT CTTTTAACAA 804 0 

TGCATATAAC TGTG ATGTCT TGGTAGAGCC GGAAGGTATT TATCCAGAAA ATGAGAGAGC 8100 

AGGGCGTTTG CCTGGTTTAG ATGGAAATGC TAAAATGTCT AAATCACTAA AT AATGG TAT 8150 

TTATTTAGCT GATGATGCGG ATACTTTGCG TAAAAAACTA ATGAGTATGT ATACAGATCC 8220 

AGATCATATC CGCGTTGAGG ATCCAGGTAA GATTGAGGGA AATATGGTTT TCCATTATCT 82 8 0 

AGATGTTTTT GGTCGTCCAG AAGATGCTCA AGAAATTGCT GATATGAAAG AACGTTATCA 83 40 

ACGAGGTGGT CTTGGTGATG TGAAGACCAA GCGTTATCTA CTTGAAATAT TAGAACGTGA 84 00 

ACTGGGTCCG G 84 11 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGCCGTACTC AAGTACAGCC TGCGCTAAGT TTCCTAGTTT GCTCTTTGAT TTTCATTGAG 6 0 

TATTAGTAAC CAAAATCCGA CCACATAGCC AGCCCCTATC AATATAGCCA TTAAAGCTAG 12 0 

CATGGAATTT AGGAAATTAA AAACCACCGC AGATACAAAG GTTAGCACAA AAACATTAAA 13 0 

AGCAATGGTG TCAGAAGCCA AGACTAGAAT AT AGGGTGT C AACCGATCTA AAGTTTTGGA 240 

ATCTAGGAAA AATAACTGTT TATACATGAT GACCTCCTCT ATCGCTCAAA AGCAAGCCTT 3 00 
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TTGTTTTTTT ACCCCAAGAC CCTATGTAGA AAAGTGAGCA AAAACGGGAA GGTCGCTACA 3 60 

ATATTATTGA TCACATGCAC CGC AT AGGAT GGATAAATCC TCTTGCTATA GCGGGTCAAA 42 0 

CCAGCAAAGA TGATTCCAAC TGTTGCAAAG ACGAAGATAT CTAACAGACT AGGCAGCCTT 4 80 

GAAAAATGAG GGAGAGCAAA TAAAATAGAA GGAAGAAGCA AATCAAGACC AAATCGCGAA 540 

TGCTTAAAGA AACCATGTTC CAGTAATCCT CTATAAATCA ATTCTTCCAT CAGTGGAACC 600 

AGAAAGAACA GCCCTATATA AATACCTAGC TCTGCAAAGT TAGTCCCACT ATAACCAATC 660 

AATACAGCCC AACCTTCCGC AGTTGACTGA ACATCTTTAG CTGTCTGAAC GTTAAAAGAG 720 

ATCTGGAACA CTAGCACTAA TACTGTCAAA ATCGAATACC AAAGCCATTT TTTTCTTGGA 7 80 

ATGCGGAAGA GATAACCATG GCCTGTCTTA ACAAGAACCA CAATCATGAC TCCAATAAAA 84 0 

AGTAAACTCA AGATATTTTC AATCCAGAAT AAATTGCCTA TCTGAGAAGA AAATTGCCAA 900 

TAGTTTTGGA CGATAAGCGT CAGCTGAGAA AGACTAAATA CGAAAAATAA GTAAGAGAAG 960 

ACTGCACTTA TTTTGAATAG AAGTTG AT AC TTTTTCATAG AAATCCTCCC TACTATGACC 1020 

TCACCTTGTC AGGCTCTACT GCTGTAAGAT TAAGAAGACA GTTTGTTTTT TTTAAGCCTA 1080 

ACCTGACTAC TAGATAATAG ATACATTAAG GCATTAAAGA CAATGAAAAT ATGTCCATAG 114 0 

AAT AAAATC A ACCTCGCATC C AAACCAAG A TAAAGTTTGA TTATCAAAAA GATGAGCAAA 1200 

AGAATTTGAA ACCATAAGGT TTTTCCAAAA ATAAATTTAA AGCGATTTCG AATATCTACT 1260 

TCCTTGATTT TTACCGCCAC CCCTTTATTA GCAAGAAGGA AAACTCCTGC TTCAAACAAA 132 0 

CCACTGTAAA GAACAAGCCA CCCAATAGAT ACGATACAGA TTTGTAAAAA TGTCCCTAAA 1380 

AG AAT AT C C A AC AC ACT ACT CAAGAAAATA ACAAAAAATA ATCTCTATTT CATATTAAAT 14 40 

ACCTCCATTC ATTTATTTCA CTAACAATTT AAT AG AG CCT TCTACTCAAA TATCCTGTCA 1500 

GAAAAGGATA GAAACCTACT T TT T AT AAT A CTTCAAGCCC CACATGAGCA GAAGCGTGAT 1560 

AAACAAGCAG AGAATACACC TAT AT AAGCG ATTAGTTGTT GATAGAATTC TGTTTCTGAA 1 62 'J 

ATACCTCTAT ACAAACAAAT G AC AAAC AT A AAATCTGCCA AGCCGATAAA CAT AAGTTG A 1680 

TTGGTTCTAG GACTAACCAA ATCATCATTT AC TT AT ATTT AAGAGTATCT CTTTTATTTT 174 0 

AATGTATGTT AGCACTGAAA AGCAAGACAG GCCAATAATA TTTAAAATGA ACAGTAACGG 1800 

GGTTAAGTCT CTAAAAAAAT TATCTACTGA CACTACAAGA AAT ACT AT AC ATATTATAGT 1860 

CCAAACTATC TTTTTCTTAT C C AT AATT AT TTACTCCTTT CCTAACAAAT CCAGCTTATC 1920 

AATCAAGAGC GATTTTTAAC AT AATGT AG C AGCACCCGTT GCAACTTTGA CAAGTTTAGT 1980 

' ATATCATTGT TTTTTAAAAT TTTTCATCCA AATCTTGAAT TGTCATCGAA ACATCTTGAA 204 0 
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TTGTTAAAAA ATTTAAAAAG TAAGCATTAA AAA CAT ACT T TCCTCTTTAT ATTGTATTGA 2100 

TACCAACTTG TTTGT AG ACT TTTCATCCTG CTATCACATA TCATTTTGAC ACGCGAAACA 2160 

AT ATT AAAG A AACTCCCCTG TAAATTAAGC TAGCAAATAC AGGGGAGAAA TTTATTTTTT 22 20 

AGAGAGTACT ATCCGTATCC TTTTTGGAAG ATTTTGAAAA TATTTTTCTA ATTAAGTCAT 2 2 SO 

C C AT AT AAGG ACCAAATATA CCAACTACTA AACCAATAAT AAAACTTTTA AAATCCATAA 2 34 0 

TTACCACCAA CATATTGCTG CATAGGCTAC ACCTCCAAGT ATAGCTCCAC CTGCAGCACC 24 00 

AGTTACACCT ATTCCTATAG CAAATGGTCC CAATAGAAAT GTCAAACCGT TGTTGCACAC 2 460 

CCATCAATTG CGCCATATGC AACCCCTGCT GCACAACTAA TTTTTCTTCC CCAATCAATA 2 520 

TCTCCACCTT CAACGCAAGC AAGCATTTCA TTATCCATAA CTGCAAATTG TGACATCATT 2530 

TTTGTATCCA TATAGTGTAT CACTTTTCAG TTACGGAACA AGTTT AAT AT AAAAATTATC 2 64 0 

AAAAAAACAT AGGCAATAAA GAGAAAAATT AATT T AT CAT AGATTAGAAA TAATATGACA 2 700 

AAACAATTCA ATGATGTTAA TTCAATAGTC TTTTGTTTTT TATCGGAGAT ACTTATGGAT 2760 

AGATAAATAA GATAGGTTTG AAAAGCGAAG AGAATAATAA AGAATATAGC CTTCATAAAA 282 0 

TTTAGCTTTC ATTTTTATGA TGTAGCGGTA TAGGCTAAAT ATCCACAAAC CACTGCTCCT 2 880 

CCAATTCCTC CTATTGCAGC GCCCCATGGT CCTAGAAGTC TCCCATATTT CACTCCACCC 294 0 

GCTGCACAAC CTAAAGCAGC AACTACAGCT GCTCCTCCGG AATTACCTCC ATAAACCTCA 3000 

CTCAGCATTG TTTCATTTAT ATT AC AAT AA GTATTCATAC AAGTCTCCTT TTATTAAAAT 3060 

CCACCCGTTG CCCCTGTTAC TCCTGCCCAA AGATCCACAC CAAATTTAGC TCCTATGTAT 3120 

CCACATGCTC CCATAAATGG TGCTCCAACA CCACTCGCAG CACAAATAGC TGTCCCTAGC 3180 

CCCCAGCCAC CAAAAGCAGC ACCACCACCT TCT AAG A CAT TAGTTTGCCA ATTATTCTTG 3 24 0 

CCTCCTTCAA TACTAGATAA CATAGTTATA TCCATTTCAT GAAATTGTTC CATAATTTTT 3 3 00 

GTATCCATGA CAAATACTCT TTTTTATTTT T AATT TTTGT CTTGTTGTAA CTTTGACAAG 3 3 60 

TTTAGTATAT CATCGTTTTT TAAAATTTTT CATCCAGATT TTGAATAGTC ATCGAAACCT 34 20 

CTTGAATTGC AAAAATTACA TTAGACTTCC TGCAAAACTA GAATCCTAGT TCATGATTGA 34 80 

TAATACCAGC ACTCAAATTC ATTCGTAATC CGAAGCGTTT ACGATGACTT CGATAGGTTG 3 54 0 

TTGAAAACAT TTTAAACGTT TTTACTTTGG CAAAGATGTT CTCAACCTTG CTTCTCTCCT 3600 

TAGATAGCGC ATCGTTACAG GCTTTATCTT CAACTGTTAG CGGTTTGAGT TTGCTGGATT 3 660 

TACGTGAAGT TTGTGCTTGA GGATATATCT TCATGAGCCC TTGATAACCA CTGTCAGCCA 3720 

ACATTTTACC AGCTTGTCCG ATATTTCTGC GACTCATTTT GAACAACTTC ATATCATGAC 37 8 0 

AATAGTTCAC AGTGATATCC AAAGAAACAA TTCTCCCTTG ACTTGTGACA ATCGCTTGAG 3 84 0 
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TCTTCATAGC GTGAAATTTC TTTTTACCAG AATC ATTCG C TAATTCTTTT TTTACCGCGA 3 900 

TTGATTTTTA CTTCCGTCGC ATCAATCATT AC CCTGT CCT CAGAACTGAG AGGAGTTCTT 3 9 60 

GAAATCGTAA CACCACTTTG AACAAGAGTT ACTTCAACCC ATTGGCTCCG ACGGAGTAAG 4 02 0 

TTGCTTTCGT G AAC AC C AAA ATCAGCCGCA ATTTCTTCAT AAGTGCGGTA TTCTCGCACA 4 080 

TATTGAAGAC TGGCCATAAG AAGGTCTTCT AGGCTTAATT TAGGTTTTCG TCCACCTTTT 414 0 

GCGTGTTTAA GTTGATAAGC TGTTTTTAAT ACAGCTAGCA TCTCTTCAAA AGTCGTGCGC 4 200 

TGAACACCAA CAAGACGCTT AAATCGTGCA TCAGTTAGTT GTTTACTTGC TTCATAATTC 4 2 60 

ATAGAACTAT AGTAAAATGA AATAAGAACA GGATAAATCG ATCAGGACAG TCAAATCGAT 4 3 20 

TTCTAACAAT GTTTTAGAAG TAGAGGCGTA CTATTCTAGT TTCAATCTAC TAT ACT AT AC 4 3 80 

CATATTTTGT TTCGCAGGGA ATCTATTATA AAAGGGTAAG TATTGCAAAA ACACTTACCC 4 4 40 

TTTTCT TTT A TACTTCATTA AGCTCTACTT TTTATAATAC TTCAAGCCCC ACATGAGCAG 4 500 

AAGCATGATG ATTAAGCAGA GAACAGCGCC AATATAACCG ATTATTTGTT GGTAGGATTC 4 560 

TCCTGCTGTG ATACCTCTAT ACAAACAAAT AAT AG AC AT A AAACCTGTCA AGCCGATGAA 4 620 

CATAAGTTGA TTGGTT CT AG GACTAACCAA ATCATCATCT TCAAACTCTC TTATCCTCAT 4 630 

TTCCCTAGTG AGATAAACAG TAACCAAAAT AGAAGCCAAG TTAATAACTA CTAAAAGAAA 4740 

TTGGAAAACT ACGGAAAAAT TTAAAAACTG ACGAGATAGA AATAGATAAG T AG AAACAAG 4 800 

CAAGGGCAAC TGACCTAAGA ACAATCTCGC AAGGAAGATG TTCCGTTTTT TAGCAAGAAA 4 8 60 

AGTTTTCATT TCTTTTCTCC TTTCTTTTTA TTGATAGCAA AATAGATCAT AAC TGC AATC 4 9 20 

ACATAGGCTA TGGTATAAAA TAGCTGATAC CAAGCACTCT CCCTAAGCGG ATATAGAAAG 4 990 

ATGGACATGA TTAGATACAG AACGAAAATA ATCAGTATTT TTTTCTTCAT AAGATTTCCT 5040 

CCTAAATGTG CGATTTATCT TAGTTGAGCA AGAACATTTA CACTGCTAGT ATAGCACTTA 5100 

TTTTGACCTT GGATCACTCA AATCATAAAT GGTC AT C AAA ACCTCTTGAA TTGTAAAAAT 5160 

TAAAAAAGCA AGCATGAAAA AC AT A CTTTC CTCTTTATAT TGTATTGATA CCAACTTGTT 5 220 

TGTAGACTTT TCATCCTGCT ATCACATATC ATTTTG ACAG GCGAAACAAT ATTAAAGAAA 5 2 80 

CTCCCCTGTA AATTAAGCTA GCAAATACAG GGGAGAAATT TATTTTTTAG AGAGTACTAT 5 34 0 

CCGTATCCTT TTTGGAAGAT TTTGAAAATA TTTTTC T AAT TAAGTCATCC ATATAAGGAC 5400 

CAAATATACC AACTACTAAA CC AAT AAT AA AACTTTTAAA ATCCATAATT ACCACCAACA 5 4 60 

TGTTGCTGCA TAGGCTACAC CTCCAAGTAT AGCTCCACCC GCAGCACCAG TTGCTGCACC 5 52 0 

TTGCCATGTT CCTGTTTTAA TGCCTACTTG AAGACCTCTT GCTGCTCCTC CTCCAACACC 5 5 80 
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TGCTTTGGCA AAATCTCCCC AATTGCATCC GCCACCTTCA ACCCAAGCAA GC ATT TC AG T 5640 

ATCCATAACA GAAAATTGTG ACATCATTTT TGTATCCATG AC AAAT ACT C CTTTTTTAAA 5700 

AAACTAAAAT AAATCACAAT AGAATCCTCA TAATTTTACT ATAAGTCTTA CCAACTTAGT 5"? 60 

CCCAATTTAT CACCAACCAT ACCTCCTAAG CATGTTAATC CACCCCCAAT TGCACCAATG 5 8 20 

TGTGCTCCAA CAAATGCACC AGCAAGTCCA GCTACTCCTA AAGTGGCCAA ACCTGCTCCA 58 80 

GTTCCACCAG TTATAATTCC CGTAGTGACT CCTGTAATCA CTGCATTTTG ACAATCAGTG 5940 

GAGCTATACC CCCCTTCAAC TTTCGCAAGC ATTTCAGTAT CCATAACCTC TAACTGTGAC 6000 

AACATTTTTG TATTCATGAT GAATACCTCC TTTTTATTTT CAATTTGTTA CCAAAGTCTT 6060 

AAATTCAATA AACAAATAGA TTTTTT AT AG TATCTTTTTG ATTTTCTTAA AAAAGT AT AT 612 0 

ACGT CT ACT A TCTTCTTAAA GGTAGCAGTA CCTATTTTTT AGTCTAAGAT TTCAATAATC 6180 

TTGAGTATCT AAAATATCTT AATTTCGTTA TTCTCCTTGC AATAAAAAGT TTTACTATAC 6 2 40 

TATTTATTAA CTTG C AG AAA GCAAAAAATA TTAGTAAATA ATAGTTTATA CTTAACTTTT 6300 

TTATTCCTAC CAATCCATCA ACTAAGTAAA GCATCAACGA TTACATAAAC GATTGATAAT 63 60 

ATAATTAAAA TTTTCCTAAC TATCTTATTC T CATC ATT CT TAGATAACTT TGATATTTTG 64 20 

TAAGTAAGTA AATAAGACAC TAAATTAATA GCGATAATAA TACTATATTT AAGAATCATA 64 80 

ATCTTACAAA GAGGACATAA TTCCTGAACC TACACAAATA AGTGTTGCTG CTCCCCCAGT 6540 

TATCGGACCA GTCGCAGCAG CTAATAGTAC TGCTCCAATA CAACCACCGA TTGCAGATCC 66 00 

TAAATTGCCT CTTCCTCCAC TAACTATTTC GAGTTCTTCA TTATCCATAA CAGAAAATTG 6 6 60 

TTCCATCATT TTTGTATTCA TGACAAATAC TCCTTTTTTC TTTTTTTATT TTTGTCTTGT 67 20 

TGTAACTTTG ATAAGTTTAG TATATCATCC TTTTTT AAAA TTTTTCATCC AG AT CTTG AA 67 80 

TTGTC AT CG A AACGTCTTGA ATTAGCTTTT TTATTTCAAG CCACCTCTAA ATGTTTAAAA 68 40 

AAAAT AATTT CTAATCACTT TTTTACCATT CAGGAAGTTT TAATGACTAT TCAAGATTTC 6900 

ATAAAATATG AACTTAGTTT TATGACATAA TAGACCTATC CACTATATGA AAGGAATTGC 6 9 60 

CAATGACTTC TTATAAACGT ACATTTGTTC C TC AAAT AG A TGCGAGAGAC TGTGGTGTCG 70 2 0 

CTGCCTTAGC CTCGATTGCT AAATTCTATG GTTCAGATTT TTCTCTAGCT CACTTGAGAG 7 080 

AAC TTGC AAA GACCAATAAA GAAGGGACGA CTGCTCTTGG CATTGTAAAA GCCGCTGATG 714 0 

AAATGGGCTT TGAAACAAGA CCTGTTCAAG CAGATAAAAC GCTCTTTGAC ATGAGTGATG 7200 

TCCCCTATCC ATTTATCGTT CACGTTAACA AAG AAGG AAA ACT CC AAC AT TACTATGTTG 72 60 

TCTATCAAAC AAAGAAAGAC TATCTGATTA TTGGTGATCC TGACCCTTCT GTAAAAATCA 7 3 20 

CTAAAATGTC AAAAGAACGC TTTTTCTATG AATGGACTGG AGTAGCTATT TTTCTAGCTA 7 3 90 
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CCAAACCCAG 
CCTCTGATTT 

ACTATTATCA 
AATCAGATGA 
CAACAAGTCA 
ATTGATCTGA 
ACACGTCGTA 
TTGGCTTCTA 
GTCTTACTGG 
ATGTTCATCA 
AGTAATTCTA 
TCGCTCACGA 
GAAAAATCCT 
AAATTAGTTC 
ATTTCTATCG 
GAAAATATTA 
AACGAAGTCT 
TTTTTGATGG 
ACCTTAACAG 
AGTGGTTCTG 
GGGCATATTT 
CATATTAATT 
ACCTTGGGCG 
GAAATCCGTC 
GCTGGTCTAT 
TCTCCTGTTT 
GTTATAGATA 
AGTATAGCCG 
GGTA 



CTATCAACCC 
TCAAACAAAA 
ATATAGGTCG 
AATCAACTTT 
TGAGCTTCTC 
TTTTATCCTA 
CAGGAGAAAT 
CCATTCTTTC 
CACAAAACCC 
T C TT TT C T T T 
TGGTTAGCTC 
GTGAAGAAAA 
TTAAGCTCAG 
TGAATATCCT 
GTCAGCTGAT 
TCAACCTCCA 
ATCTAGTCGA 
GCGATATTGA 
ATATTAATCT 
GTAAAACAAC 
CCATCAATCA 
ACCTACCCCA 
G T AAT CAT AT 
AAGACATTGA 
CAGGAGGACA 
TAATACTAGA 
ATCTTATGTC 
AACGAACCAA 



CATAAAGATA 
ATCTCTCATT 
TTCTTACTAT 
AGGAATCATC 
CAGAGATTAT 
TATTCGCCAT 
CATTTCACGA 
TCTTTTTCTG 
TAATCTCTTC 
TATGAAACCT 
TGCCATTATC 
TCCCTATCAA 
TAAATATTCT 
TATCCTATGG 
TACCTTTAAC 
AACCAAACTC 
ATCTGAATT7 
ATTTGATGAC 
CACGATTAAA 
TTTAGCCAAA 
TCAGGATATT 
ACAAGCCTAT 
GATTAGTCAA 
AAGAATGCCT 
GAAGCAACGA 
TGAAGCTACT 
TCTAACTGAT 
CCGTGTCATT 
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AAAAGAATGG 
GCTTACATTG 
CTCCAAGGAA 
TCAGTTGGTC 
CTCCTAACCG 
ATTTTTGAAC 
TTCACAGATG 
GATGTTTCTA 
CTTCTTTCTC 
TTCGAAAAAA 
GAAGATATCA 
AATATAGACA 
ATTTTACAAA 
TTTCCCCCTC 
ACACTTTTTT 
CAATCTGCGA 
CAAGTTCAAG 
CTTTCTTA7A 
CAACGACATA 
ATGATTGTCA 
AAAAACATTG 
ATCTTTAATG 
GAAGATATTC 
ATGGGCTATC 
ATCGCTCTCG 
AGCGGTCTTG 
AAAACCATTC 
GTTCTTGACC 



TCTACTAAGC 
TTCTCTCAAG 
TCTTGGATGA 
TGGTTATCAC 
TTCTGAGTCA 
TTCCCATGTC 
CTAACTCTAT 
TTCTGATTCT 
TTATTTCCAT 
TGAACCATGA 
ACGGGATTGA 
GCGAATTTGT 
CGAGTTTAAA 
AATTAGTCAT 
CTTACTTTAC 
ACCTCGCTAA 
AAAACCCTGT 
AGTA7GGTTT 
AGCTTAGCCT 
ATTTCTTTCA 
ATAAAAAAGT 
GCTCTATTTT 
TAAAAGCTTG 
AAACTCAGCT 
CTCGTGCTCT 
ATGTCTTGAC 
TCTTTGTAGC 
AGGGG AAAAT 



AAGCTTCCTT 

CTTATTGGTC 

ATACATTCCA 

CTATATCCTC 

GAGATTAAGT 

TTTCTTTGCC 

TATAGATGCC 

TGTAGGAGCC 

TCCTATATAC 

TGTCATGCAA 

AACTATAAAG 

AG ATT ATTTG 

GCAGGGAACA 

GTCAAGTAAA 

AACTCCTATG 

TAACCGTTTG 

TCATTCACAT 

TGGATGAGAT 

AGTTGGAGTT 

ACCCTACAAA 

CTTCCCCCGT 

GGAAAACTT 

TGAAGTAGCT 

CTCTGATGGA 

TTTAACTAAA 

T G AG AAAAAG 

CCATCGTCTC 

CATTGAAGTT 



7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
3160 
3220 
8280 
3340 
3400 
8460 
8520 
3580 
8640 
8700 
8760 
3820 
8380 
8940 
9000 
9060 
9064 
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(2) INFORMATION r OR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 7780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO : 18: 

CTCCATTTTT TTGATTTCAT AAATAAACAA CCTCTCTGTT AATTTTGTAT AATTATAACG 60 

ATATCCAAGT TACTTGTCAA GTGTTTTTTA AATTTTT AT C TCAAAAATAT TTTTTCGTTC 120 

AAAAAAAGGA GCCATCAGTT GATTTCAAGC TCCCTTTTAT ACAGAATTAA ACT ATT TT AT 180 

AGTTCGACAA TCTTACCTGT TTC AAAG TAG ACAACCCATT CACAGATATT TTTAGCATAG 2 40 

TC ACCG AT AC GCTCCAAGTA GGAAATAACT TGGAAATAAT CACGACCCGT AACAATGGCT 300 

TCTGGATTTT TCTTAATCTC TTC ACT CGC A AGGTCACGGA TAGTTTCAAA AT AGTGGTT A 3 60 

ATTTGCTCAT CCATGGAGGC CACCCGGTAT GCGTCGTCAA CAGAACCATT AAGATAAAGA 4 20 

TCAAGTGCTG CTTCCACAAC GCTTTTAACT TCACGTCCCA TTTTTTTAAT TTCTTC CTCT 4 80 

ACAGCTGGAA TGCGCTCTTC CCCCTTCATA CGGATGGTTG CCTGGGCAAT GGCTACAGCG 54 0 

TGATCCCCCA TACGCTCCAC ATCTGATACA GCCTTAAGGA CAGTCAAGAC TGTACGCAAA 600 

TCTTGAGAGA CTGGTTGTTG GAGTGCGATC ATTTCAAATG ATTTCTTTTC C ACT TTC ACT 660 

TCGTATTCAT TTACTTCTGC ATCATCTTCG ATGACCTCTT TTGCCAGGTC ACGGTCATGC 720 

GTGACAAAAG CACGTACCGT ACGATTGATT TGTGAGAGCA CTTCTTGTCC CATAGCGTAG 7 80 

AACTGGT TAT GTAATTTCTC TAAATCTTCT TCAAATTGAG ATCGTAACAT CTTTCATCTC 840 

CTT AT CC AAA TTTTCCTGTA ATATAGTCTT CCGTTTCCTT GTGTTGCGGA TCAAGGAACA 900 

TCTGCTTGGT ATCATTAAAT TCAATCAAAT CTCCATCTAG GAAAAATCCT CTCTTATCAG 960 

AGATACGTGA AGCTTGCTGC ATGGAACGGG TTACCAGAAG CATGGTGTAC TTGTCTTTTA 1020 

GACCATACAA GGTTTCCTCA ATTTTACCAG CTGAAATCGG ATCCAAAGCC GAAGTTGGCT 1080 

CATCCAAGAG GATGATTTTA GGACTAGTTG CCAAGACACG GGCCACGCAG ACACGCTGCT 1140 

GTTGACCACC TG AC AATCC A ATAGCTGAAT CATATAGACG ATCCTTGACC TCATCCCAGA 12 00 

TAGAGGCACC TTGCAAGGCT TTTTCTACGC CTTCATCCAG AACCTGCTTA TCCTTAATTC 1260 

CATTGATACG AAGCCCGTAG ACAACATTCT CATAGATAGT CATAGGGAAA GG ATTAGGTT 1320 

GTTGGAAAAC CATTCCGATT TCCTTACGTA ATTCAACCGT ATCTGTACGC GGACTGTAGA 13 80 

TGTTGTGACC ATTGTACACC ACGGATCCAG TTGTGGTCAC CTCTGGATTG AGATCTCCCA 144 0 
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TGCGGTTGAG AGACTTGAGG AGGGTTCACT TCCCTGATCC AGATGGACCA ATCAAGGCTG 1500 

TAATTTCCTT AGGTTGGAAA GATAGGGAAA C ACT ATTC AA AGCCTTCTTT TTATTATAAT 15 60 

AAACGGACAG GTCTGATACC TGTAAAATCG C AT CTGTC AT ACGGTTTCCT TTCTAACCAA 16 2 0 

AGTGACCAGA T AC AT AGT C A TTGGTGGACT GTAGCTTGGC ATTTTGGAAA ATAGTTGCAG 16 80 

TCTTGTCATA CTCAATCAAA TCACCCAAGT AAAAGAAGCC TGTATAGTCA CTTGCACGAG 17 40 

CAGCCTGCTG CATATTATGC GTTACAATGA TGATGGTAAA GTTTTTCTTG AGCTCAAACA 1800 

TGGTCTCTTC TAGTTGCATG GTCGCAATCG GATCCAAGGC TGAGGCTGGC TCATCCATTA 186 0 

AGAGGATATC TGGCTTAACA GAGATGGCAC GAG CG AT AC A GAGACGTTGT TGCTGACCAC 192 0 

CTGATAAGGT CAAGGCTGAC TTGTGGAGAT CGTCTTTAAC CTGATCCCAG AGGGCAGCCT 1980 

GACGAAGGGA GGTTTCTACG ATTTCATCTA GGACTTGCTT ATCCTTAACT CCAGCACGTT 204 0 

CATGCGCAAA GGTAATATTA CGGTAAATTG ACTTAGCAAA TGGATTGGGA CGTTGAAAAA 2100 

CCATTCCAAT GTGTTTACGC ATTTC AT AAA CGTTGATTTC TGGACGGTTG AC ATC AATTC 2160 

CACGATAGAG AATCTGCCCA GT T ACTTT AG CAATATCAAT AGT ATC ATTC ATGCGATTGA 2220 

GACTGCGTAA GTAGGTAGAT TTCCCCGATC CCGACGGGCC AATCAAAGCT GTAATTTTAT 2 2 80 

TTCTTTCAAA TTGCATATCA ATCCCCTTAA TGGATTCATT TTTACCATAG TAAACATGGA 2 340 

CATCCTTAGT AGAAAGGGCT ACTTTTTCTT CAGGAAAGGT AAGGATATGC TTCTCATCCC 2 4 00 

AGTTATATGT TGACATGGCT TCTCCTTTAG GCAGCGGTTA ATTTCTTGTG TAGATAGCTT 2 4 60 

CCGAACTTAC GAGCTCCAAA GTTAAAAATC AGGATAAAGA TCAGGAGCAC AGCGGCAGAA 2 5 20 

CCTGCTGATA CAATGGTTCC AT CTGG AAT A GTGCCTTCAC TATTGACTTT CCAGATATGG 2 5 80 

ACAGCCAAGG TTTCTGCTTG ACGGAAGATA GAGATGGGGC TAGTCACACT GAGGATATTC 2 6 40 

CAGTTAGACC AGTCAAGAGC TGGCGCCGAT TGCCCTGCTC TATAGATCAG AGCTGCAGCT 2 7 00 

TCGCCAAAGA TACGACCAGA TGCCAAGACG ACACCCGTTA CAATACCTGG AAGCGCTTCC 2 7 60 

GGAATAACAA CATGAACCAC TGTCTCCCAG CGAGAAATCC CAAGAGCCAG ACCAGCCTCA 282 0 

CGTTGGGTAT GGTGAACGTG TTTCAAACTA TCCTCTACAT TACGCGTCAT CTGACGCAAG 2 88 0 

TTAAAGACTG TCAAGGCCAA GGCACCTGAA ATGATTGAAA ATCCATACTC AAACTGG ACT 294 0 

ACAAAGATCA AGTAACCAAA GAGACCCACC ACCACTGATG GTAAAGAGGA CAAAATTTCA 3000 

ATACAAGTCC GCACAAAGTT GGTAACAGGA CCTTTTTT AG CATATTCAGC CAAGTAAATC 30 60 

CCAGCTCCCA TAGAAAGAGG TACAGAAATA ATCAAGGTAA TG AC C AAT AG GAAAAAGGAA 3120 

TTGTAAAGCT GAATGCCAAT CCCACCACCT GCTTGAAAAG CAGAAGACCT TCCACTCAAG 3180 
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AAAGACCAAG ACATATGGGG CAAGCCCCGA ACCAAGATAT AGAGAATCAA GGAAGCCAAG 3 240 

ATTGTCACAA TGATGCTAGC AATCGTATAG AGGACAGCTG TTGCAACTTT ATCTAATTTC 3 3 00 

TTAGCGCGCA TAATTTTTCT TTCCTCTTTC TTTCGTAATC AATTTAATCA C A CTGTT AAA 33 6 0 

AACTAAGCTC ATCAAGAGCA GTACCAAGGC CAGTGACCAG AGAACATTAT TATTTACAGT 342 0 

TCCCATGACA GTGTTCCCAA TTCCCATAGT TAATATAGAA GTTAAAGTTG CAGCTGGTGT 3 4 80 

GGTCAAGGAA GTTGGGATAA CAGCTGAGTT TCCGACAACC ATCTGGATAG CTAGAGCCTC 3 54 0 

ACCAAAGGCA CGCGCCATCC CAAAGACCAC TGCAGTGAAA ATACCAGAAC GGGCCGCCTT 3 600 

CAAGATCACA CGCCAGATAG TCTCCCAGCG AGTGGCTCCC ATAGCGAAAC TGGCTTCACG 3 660 

ATAATAACGA GGAACCGCAC GCAAGCTATC CGTTGTCATA AAGGTTACGG TCGGCAAAAT 372 0 

CATGACAAAG AGGACGGAAA TCCCTGACAA AATCCCAAAA CCAGTCCCAC CAAAGACACT 3 7 80 

GCGAACAAAG GGAACGACGA CTTGCAAGCC AATAAATCCG T AC ACT ACTG AAGGAATCCC 3 84 0 

AACCAGGAGT TCAATAGCTG GTTGCAAAAT CTTCGCCCCT TTTGGTGATA CTTCGGTCAT 3 900 

AAAAACTGCT GCACCAATAG CAAAGGGTGT TGCGATAAGG GCTGAGAGAA TGGTAACGAT 3 9 60 

AAAGGAACCC AAAATCATAG GAAGGGCACC AAATTCTTTA CTAGAAGGAT TCCAAGTTCC 4 020 

TCCCAAAAGA AAGTCAAAGA TATTCACACC ATTGACAAAG AAGGTCGACA AGCCTTTTTG 4 0 80 

CGCTACGAAA ACCAAAATCA TGGCCACAAG GATGACTATC AAAGAAAGAC AGGCAAAGGT 4140 

CAAACCTTTT CCTAATTTCT CCAGACGAGA ATTCTTTGAT GGAAGCAACA TTTTCTTAGC 4 2 00 

TAATTCTTCT TG ATT C ATT A TTCTCTCCCT TCCAACACTG TCACAGTTCC GGCAGCATCT 4 2 60 

TTTTCAACCT TCATTTCCTT AATCGGAATA TACTTCAATC CTTTGACAAT CCCTTCTTGG 4 320 

GTCTCATCCG AGAGAACAAA ATTGAGAAAT TCTGCAGCCA ACT C ATTGGG CTGCCCCAAT 4 3 80 

GTATACATAT GCTCATAAGA CCACAAGGGC CAATTATTGC TACTTATATT TTCTGGACTT 4 440 

AAGTCATAGC CATTCAACTT CATGCTTTTG ACCGAATCAT CTATATAGGT AAG AG AT AAA 4 500 

TAAGAGATAG CTCCTGGACT TTTTGATACG ATTGATTTTA CCGCTCCATT TGAATCCTGC 4 560 

TCCTGACTTT GCATGGCAGA CTGACCTTCC ATAATGACAG TATCAAAGGT AGCACGAGAG 4 6 20 

CCAGAGCCGG CTGCCCGATT GATAACAGAG ATGGGTAAGT CCTTACCACC AACCTCTTTC 4 680 

CAATTGGTTA CCTCACCTAT GAAGATTTGA CGAAGTTGCT CTGTCGTTAG GTTATCAACA 474 0 

TCAACCTCCT TATTGACAAT CAGAGCCAAG CCAGCTACCG CGACCTTGTG GTCAACAAGA 4 800 

GCAGAAGCAT CAATTCCGTC TTTTTCCTCA CCAAATACAT CTGAGTTTCC TATATCAACT 4 860 

GCCCCAGACT GAACCTGGGA CAAGCCTGTA CCAGAACCTC CCCCTTGGAC ATTGACCGTT 49 2 0 

TTTCCAACAT GGATCGTGCC AAATTCATCT GCCGCTACTT CAACCAAGGG. TTGCAAGGCA 4 980 
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GTTGAGCCAA CAGCCGTTAT GGATTCTCCA CGATCAATCC AGCTAGCACA GCCTACTAAA 5 04 0 

CAAGCCGTCA GCCAAAAAGC GATAAGAGAC AGAGCAAGCT TTTTTCTTTT TTTCACTGTT 5100 

TTTCTCCTCG AAAATAATTA TGAATACTCT GAATTTTTTA AGTAGTTCTT T ATG AGTTG A 5160 

CGCATGAATT CTTACCAAAT TTCTGCGCAA TTGATTATTT ATATAATATA GCCTATATTA 52 2 0 

CTCTTTCCTA ACCTCCTTTT TTCATATGTG GATAAAATCT CTTGTCTATC CCTTCCCCCA 52 80 

TTGTCACCCA TTATAGTCAT TTCGTGTCTC TTTTTCCCCT TTTTAATGCA AGGGAAATTA 5 340 

CTCTCCTTAG ATGATAATCC AAAAGCTAGA AAGGTATCTC AAACCTCTCT ACTCTCCCAC 5400 

ACTAGTTTAC AACTAAAAGG AAAAGATTCT ATTTTATGAG AAATCTAGTT TACAAGCGGT 54 60 

AAGAACGCTA ATAACTAAAC TTCTTGTACT CTTTGAAAAT CTCTTCAAAC CAGTGTTTTG 55 2 0 

AGCTATCTAT GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAG TAAAACTACA 5 5 80 

TGTAATGGCA ATCAAGATAT CAAGAATCAT CCTACTAAAA AAATCCATAC TTTCACTATA 5 64 0 

ACATAGAATA AGATATTTGA CTAGCATTTT CATTTGAATC TGAGGCCTTT TGGAAAATAA 5700 

TTTTTCAAAA CATTTCCAGT AACCTTTGCA AAGCCCAAGC CATTGCCTTT AACCAAAACT 5760 

TGGTACCAAC CATTTGGCAG ACTTTCTGCC AGCTCAACGG TTTCTCCACC CGCATACTTG 582 0 

ACAAACGCTT CTTGGCCAAT TTCAACCGAC TGTTCGACCT CACTCGCTTT CAAGGCTAAA 5880 

CCAAGAGCGA AACTGGGCTC AAAGCGTTTC TTCTTAAAAG TACCCAGATG CACTCCATTG 59 4 0 

CGAGCAATCT TGAGCTTCCA TAAATCTGGC AAAAGTTCTG GCAAGAGATA AACCTGCTCT 6000 

CCAAAAATCT GCAAGATACC CGGTAGATTG ACCTTCAAAT GGTTTTGGGC AAATTCCTCC 6060 

CACAAGGCAA CTTGTTCACG GCTGAGGTTA CTCTTACTTG CCTTAAATTT AGGAGCTGGA 6120 

TTGTTACCCT TAAACTGTAG ATGGGCAACA AACTGACCCT CTCCCTTAAA CTGATGAGGA 6180 

T AC AT C CG AG CCGTTTCTGG CAGCTCAATA CCAGCTACCA TTCCATTGAT ATGCTCTACT 6 2 40 

GGCAACAAGT CAAAATCATA CTCTTCCAGC AACCAATTGA CAATCTCTTC GTTTTCCTCG 6 3 00 

GGTGCCCAGG TACAGGTCGA ATAAACCAGA TGACCACCTT CAGCTAACAT GGTCACTGCA 6 3 60 

TCCTCCAGAA TTTCT CTTTG CAAGCTAGCA CATTGACTCG GATAATCTAA GCTCCAATAG 64 2 0 

TC CAT AG CAT CAGGTTGCTT ACGAAACATT CCTTCACCAG AGCAAGGGGC ATCAAGAACG 6 4 80 

ATTAAGTCAA AATAGCCTTT AAAGACCTTG ACCAAGCGGT CGGCAGATTC ATTCGTCACC 6 54 0 

ACGACATTTG TCGCTCCAAA ACGCTCCATG TTTTCAACCA AAATCTTAGC CCGTTTGCTT 6600 

GAAATTTCAT TGGAAnCAAG TAGCCCCTCC CCTGCTAGAT AGGCTGCCAG TTGAGTTGAT 66 6 0 

TTGCCCCCCG GTGCAGCAGC CAAGTCCAAG ACCTTCATAC CAGGACTGGG TTGGGCTACT 67 2 0 
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TGAGCCACCA TTTGAGCAGC AGGTTCTTGC GAATAAACTA AACCTGTAGC ATGCTCAGGC 67 90 

GATTTCCCTG AAACCTTCCC AT AGTGGC C C CAAGGGGTTT GAGTAATGGC AT C AG AAAAG 684 0 

GAAAGTTCCT CTTCTTTTAA GGGATTGACC CGAAAGGCCG AAACCG CTTC CTCCTCAAAA 6900 

GAGGCAAGAA AATCTCTTGC CTCATCTCCT AGT AT CTCTT TATATTTTTC AACAAATCCT 6960 

TCTGGAAATT GCATTTAAGT TCTTTTCCTT TCGTAAATAT AGGACTGAAT TTCCTCCTGC 7020 

ATCTCAAGAG GC AC CATC AT GACCGGCTGT CTGGTTTGAA AATCAGGAGC TTCACCAAAA 7 0 80 

AGGGTCACAA CCCGATAGCC CAGACTTTCC CCTAAAATAC TAGCTGCGGC ATAATCCCAT 7140 

GGTTGCAGAT AAGTGAGATA GGTCAACAAA CGCCCTGACA AAATCTTGGC AAAACTAATG 7 2 00 

GCCGCACTTC CATAGACACG AACACCAAGA ACCGCTCGGC TCAAATCAGC CAGCCCCCAT 7 2 60 

TCATTGGTTT CCAGCATACC ACTATTCCCT GCAATGAGAA AATCTCCAAG TGGTTTAGTT 7 3 20 

TT AAAAG GAG CTAGGGACCT ATCATTTAGA CAAACTGGAA ATTCCCCACC ACCGTGGTAA 7 3 80 

CAATCCCCTT TGACCACATC ATAAATCAGA CCAAACTGTC CCTGACCATT TTCAAAATAA 74 40 

GCCATCATAA CAGCAAAATC TTCCTGCTGG GCTACAAAAT TATTGGTACC ATCAATGGGA 7 500 

TCAATGACCC AAACCTTCCC CTCTTGAACC GAGGCTCGCA GACAACCTTC TTCAGCACAA 756 0 

ATCTTATCCT CAGGATAACG GGACAAAATC TCACCAACCA AGACTTCCTC AACTTCTTTG 7 620 

TCCAGTCTGG TCACCAAATC TGTTGGAGAG GACTTGGTTT CAACACGCAA GTCTTCCTCC 7 680 

ATATGGTCAA GAATGTACTG ACCTGCTTTC TTAACAACCT CTTTACCAAA TTCAAATTTA 7 740 

CTTTCCAAGA GAAATCTTTC CTTCCCCTTT TTCTTTGGGG 7 7 80 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double — 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GT AATG AT AT AGGAACACCA GGTGACCTGA TGGGACGTCG TAAGCCTATG AACTACTAGC 60 

TGCTAAAGGC TTTAAAGATG GTATGGTACC AT AT AT CT C A AACCAATACG AAGAAGAAGC 12 0 

CAAACAAAAG GGCAAGACAA TCAATCTCTA CGGTAAAACA AG AGGTTTG G TT AC AG ATG A 18 0 

CTTGGTTTTG GAAAAGGTAT TTAATAACCA ATATCATACT TGGAGTGAGT TTAAGAAAGC 2 40 

TATGTATCAA GAACGACAAG ATCAGTTTGA TAGATTGAAC AAAGTTACTT TT AATG AT AC 3 00 

AACACAGCCT TGCCAAACAT TTGCCAAGAA AACTACAAGC AGTGTAGATG AATTACAGAA 3 60 
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ATTAATGGAC 
CAATCCAGAC 
TGACCAAACA 
AGGAAATAAA 
TGATAATGAA 
AT C AACTTCT 
TCATAATATG 
TTCTATATAC 
ATATAATATA 
GAAAACTCTC 
AAAGGAAAAA 
CGACAGGTGC 
TTCCCTCTAT 
TTGATTTTAA 
TTTGAATCAC 
GTTGTAACAG 
GGGATTCACA 
ACTCTG ATG A 
ACAAAAGGTC 
ATTCTCTGGC 
ATCAAAGACC 
GAACATTATC 
CCTGTGATTA 
TACAGTCTCT 
TTGGAAGACC 
GTTAGAAAGC 
AATCAGCAAG 
CATCCTTTCT 
TGAGGTTGGT 



GTTGCTGTTC GTAAGGATGC 
ATAGATAGTG AAGTCCACAA 
AATGATTTTA GAAGTTCAAT 
GTTTAAAAAG GTGATGAAGA 
GTAAGTTATG ATCTTATTTA 
GAGCTTACAG CTTTAGGCCT 
ACGGGAACAG TTTTTTGCTC 
GGTAAAAGAT AGGGTAAAAG 
AAGTAGATTT CTATCTCATA 
T ATG TTCAAA CAATAGTAAA 
ACTCAATGGC TACT ATT CAA 
AGGAGAATTT AAAATTTGTT 
CTAGTCAAAA TCCTATGTTG 
ACAAGGCCGA CTTGGCTGAT 
AAGGAATCCA GACGCTAGCT 
ATGCGGCCAA GAAGCTCATG 
TTGAAACCTT GCGTACTATG 
ACCCTTTCCC TGGTAAAAAG 
AACAATGGCT TAAAACCAAT 
CTAAGTTTGA GGATGAAACT 
AGTTGCTTCC TATGGATGAG 
C AG AAAAG CT GGCTGAACGC 
TTATGGATAT GACCCGCGCC 
TCGTGAAGGA AGTCCGTGAT 
TCGATGGCAA CGATTAAAGA 
CCT ATTTTTT TAGAGCTTGA 
CGTAAAAGAG CCATTCAAGC 
TATGAAAAAG AACTTTATAA 
CGTGGTCCTC TTGCTGGTCC 
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AG AACAC AA T TACTACCATT 
GCTCAAGAGA GCAATCTTTA 
TTTTGAGAAT AAAAAATAGT 
ACAAACCAAG ATTCAAGCAG 
TCAGCAGGAA ACTCTTCCTG 
ATT AGCTGTT GGTAGTTTAG 
CCTCTGAAAA GTCATCATTT 
GCTATCATCG GACAAAATAA 
AAACAAGAAC TGTTTCGACA 
ATAAAATAGG GGATCTAAAT 
TGGTTTCCTG GTCACATGTC 
GATTTTGTGA CGATTTTAGT 
ACCAAGATTG TTGGTGATAA 
CCAGCAATGA CCAAGGAATG 
ATCAACTCCA AAGAGCAAGT 
GCTGATAAGA TTGCTCGCCA 
ATTATCGGGA TTCCAAACGC 
ATTGCTGTTG TTGGAAACAA 
AAAGACCTGG AAATCTTGGA 
GTTGCACTTA AGTTGGCATT 
GTTACCATTT TTGGTATCAA 
TTCAAACAAA TGAAAATTGA 
CTCGGTTTCC GTGATGACTA 
GGCAAACTCG GTAACTATAC 
AATCAAAGAA TTCCTTGTGA 
AAAGG AT AAT CGCTCAGCAC 
TGAATTAGAT GAAAATTTGC 
GCAAGGATTG ACCTTAATTG 
TGTAGTCGCT GCGGCCGTTA 



GGAATAACTA 4 2 0 

AAGCCTATCT 4 80 

CTCTACTATT 54 0 

GAATTCCTAC 600 
CAACAGGTTC 6 60 

TTCTTTTGGT 720 

GATGGCTTTT 7 80 

AGAAGGCATG 840 

TCATTCATTT 900 

CCTTGCTATG 9 60 

TAAAGCTCGT 102 0 

AGATGCACGC 10 8 0 

ACCAAAACTC 114 0 

GCGTCAGTAT 1200 

GACTGTAAAA 12 60 

GAAAGAACGT 1320 

TGGTAAATCA 1380 

GCCAGGGGTC 14 40 

TACACCGGGG 1500 

GACTGGAGCT 1560 

TTATTTCAAA 162 0 

AGAAGAAGCG 1680 

TGACCGTTTT 1740 

CTTAGATACA 1800 

CAGTCAAGCA 1360 

TTCAAAAGGA 1920 

GCTTGGAATC 19 8 0 

CAGGTATTGA 2040 

TTTTATCTAA 2100 
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AAATTG T AAG ATTAAAGGTC TCAACGACAG CAAGAAAATT CCTAAAAAGA AACATCTGGA 2160 

GATTTTCCAA GCCGTTCAAG ACCAAGCCTT GTCGATTGGA ATTCCTATCA T AG AT AATC A 22 20 

GGTCATCGAC CAAGTCAACA TCTATGAAGC AACCAAACTA GCCATGCAAG AAGCAATCTC 2 2 80 

CCAGCTCAGC CCTCAACCAG AGCACCTTTT GATTGATGCC ATCAAACTGG ACTTGCCCAT 2 340 

TTCACAAACC TCCATTATCA AAGGAGATGC CAACTCCCTC TCTATCGCAG CAGCATCTAT 2 400 

AGTAGCCAAG GTAACACGTG ATGAATTGCT GAAAGAATAC GATCAGCAGT TCCCTGGCTA 2 4 60 

TGATTTCGCT ACTAATGCAG GATATGGCAC AGCTAAACAT CTGGAACGCC TCACAAAACT 2 52 0 

AGGAGTTACC CCAATTCACC GAACCAGCTT TGAACCCGTT AAATC ACTGG TTTTAGGTAA 2580 

AAAAGAAAGT TAATTGAAAG GAAATAACAT GGAGGAACAG TCGGAAATAG TCCGTTCTAA 2 64 0 

GAAAGAATTC GCCTTTGCAT CCAGCACTAT ACTATCCCAA GTTGGTCGAG GAATCATTGT 2 7 00 

CGGCCTCATC GTTGGAATTA TCGTCGGATC CTTTCGTTTC TTAATTGAAA AGGGCTTCCA 2 7 60 

CCTGATACAA GGAGTTTATC AAGATCAAGG GTACTTAGTG CGCAATCTTT TTGTACTGGT 2 82 0 

TTTGTTTTAT ATACTCATCT GTTGGCTCAG TGCCAAACTA ACACCGTCAG AAAAAGATAT 2 8 80 

TAAAGGCTCA GGAATTCCTC AAGTCGAAGC CGAACTGAAA GGCCTCATGT CCCTCAACTG 294 0 

GTGGGGCATT CTTTGGAAAA AATATGTGCT AGGTATTCTT GCTATTGCCA GTGGACTCAT 3000 

GCTGGGTCGA GAGGGACCCA GCATTCAACT TGGAGCAGTT GGTGGTAAAG GAATTGCCAA 30 60 

GTGGCTCAAA TCCAGTCCAG TAGAGGAACG TTCCTTGATT GCCAGTGGAG CTGCAGCAGG 3120 

TTTAGCCGCA GCCTTTAATG CTCCTATTGC AGCACTTCTC TTT GTTGT AG AAGAAGTCTA 3180 

TCACCATTTT TCGCGCTTTT TCTGGCTCTC AACTCTAGCA GCCAGCATCG TAGCAAACTT 3 2 40 

TGTGTCTCTA CTCATGTTCG GTTTGACACC AGTATTGGAT ATGCCAGATA ACATTCCTCC 3 300 

CATGACCCTA GATCAGTATT GGATATATCT CGTCATGGCA ATTTTCCTTG GATTTTCAGG 3 3 60 

TTTTCTCTAT GAGAAAGCTG TATTAAACGT TGGAAGAGTT TATGACTTGA TTGGTCAAAA 3 4 20 

AATCCATTTG GATAGGGCTT ATTATCCCAT CTTGGCTTTT ATCCTTATCA TACCAGTCGG 3 4 80 

AATCTTCTTA CCTCAAATCA TTGGTGGCGG AAATCAGCTT GTCCTTTCTT TAACTGAACA 3 54 0 

AAATTTTAGT TTCCAAGTTT TATTAGCTTA CTTTTTAATC CGCTTTATTT GG AGT ATG AT 3 600 

TAGCTATGGA AGTGG ACTG C CAGGAGGAAT TTTCCTCCCC ATTTTAGCTC TTGGTTCTTT 3 660 

GCTTGGTGCC TTAGTTGGTG TTATCTGTGT CAATCTTGGA CTTGTCAGTC AAGAGCAATT 3 7 20 

CCCTATATTT GTCATTCTAG GAATGAGTGG CTATTTTGGA GCCATATCAA AAGCTCCCTT 3 7 30 

AACCGCTATG ATCCTCGTAA CTGAGATGGT AGG AG AT AT T CGCAACCTTA TGCCACTTGG 3 840 

TCTTGTCACT CTTGTTTCTT ATATTATCAT CGATTTCCTC AAAGGTACGC CAGTCTATGA 3 9 00 
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AGCCATGCTG GAAAAAATGC TTCCAGAAGA AGTATCTAGC GAAGGAGAAG TT ACACTT AT 3960 

CGAAATACCA GTTTCTGATA AAATTGCTGG GAAACAACTT CATGAACTCA ACTTACCACA 4 020 

CAACGTCCTC ATCACAACTC AAGTCCATAA TCCCAAGACC CAAACAGTTA ACGGCTCAAC 4 080 

CAGAATGTAT CTGGGTGATA TGATTCACCT GGTTATTCCA AAAAGTGAAA TTGGAAAACT 4140 

CAAAGATTTG TTGTTGTAGT ATGAGTATTT ACATAATTTA TGTTATCTAA ATGATCAGTT 4 200 

TGATTTATTT AGAAAACCGA TTCTCAGGAA TGAGATCGGT TATTTTTTAC TGATGAGGAA 4 260 

TTTTACATAT AAATAATTGA ACTTTATTAA AAATAAGACT ATAATTAAGT TAGAAATGAT 4 3 20 

AAAGTATAAA GCTAGAAAGG AGTTTACTGT ATCAAATCTG TACAGTAAGA TT AAAATC AT 4 3 80 

GAAAAAGAAA ACAATAGCAA TTATATAGAG AAATGAAATA GAAATAGGAT AAAACAATCA 444 0 

GGACAATCAA ATCAATTTCT AGCAATGTTT TAGAAGTCCA GATGTACTAT TCTAGTTTCA 4 500 

AT C T ATT AT A CAATGTGTTT TGTATCTCAT AGCTCCTTAT ATAGCTCTTC AGTTATGTAG 4 560 

T ATT AAC AG A AGTTTAGTGG GTCAGATTTT TATTATTTTC CTTATTCTGT TTTCTTTGTA 4 62 0 

GGTCTAAGTC TTTTTATCAC TTTGAAAAAC TCCTATAACA TCTTTCCGAA AAACTATAAT 4 680 

TTTCTTGAAA AATATACAAG TCTATGCTAT ACTACTAGTA TACTTACTTA TGGAGAAAAT 47 4 0 

ACATGAAACG TGAGATTTTA CTGGAACCAA TCGACAAACT AAAACAACTC ATGCCCTGGT 4 800 

AAGTTCTGGA AT ACT AC C AA 4 82 0 
(2) INFORMATION FOR SEQ ID NO : 20: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTACGACATC ATGATTAACA GTCATGCGCT ACTACCAACT GAGCTATGGC GGATAAAATA 60 
GTCCGTACGG GATTCGAACC CGTGTTACCG CCGTGAAAAG GCGGTGTCTT AACCCCTTGA 120 
CCAACGGACC TTCTATCTGT AGCAGATATA ACCATTATAT CAATTTCTTG CTAATTGTCA 180 
ATCACTTTTG AGATTTTTTC TCTAAAATAT CTTTTAATTT TCTAATTTTT AATCTTGAAA 2 40 

TAGGACAACG ATGGTCTTCA TAGAAAACAA TTTCTAAGTT TTTTCGATCA ATTTCTCTGA 300 
TATTACCTAT ATTTACCAAA AATGACTTGT GAG G AG AAT A AAATCGCTGA GTATGTTTGT 3 60 

CCTTTTCCTG AAT ATCTGT C ATGGTACCAT AAAACTCTTT TGCAAAATTC TTACCAATAA 420 
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TGCGCAATTT ATGAGATACC CCTGTTGTTT CAATATACAA AATATCATGG TAAGGAATTT 4 80 

TTAAATCATT TCCCTTGTAA TTGTAGTCGA AATAATCTAC AACATCTTCA TTTTCAAGTA 54 0 

ACATACTCTT CGTGTAGAAG ATATTTTGCT CAATTCTCTT CTTAAACATC TCATCATTGA 600 
TATCCTTATC AACAAAATCT AGGGCTGATA CCTGGTATTT ATAGGTTAGA GTCGCAAACT 660 
CTGATCGACT AGTGATAAAG ACGATAATAG CGTAAGGATT GTAATGACGA ATGAGCTGAG 720 
CCACTTCAAA TCCCTTTTTC TCAATTCCAT GAATATCGAT ATCTAGGAAA TAAAGCTGAT 7 80 

TTACTTCATC ATTTTCAATG TATTCTTCAA ATTCACGGAC TTTTCCCGTT GTCTTGTATG 94 0 

ATATTGGAAT ATTCGATTCT TTCGAAATTT CAT CC AAT AT TCTCTCTAGT CTCACTTGAT 900 
GTTCAATAAC ATCTTCTAAA ATTAAAACTT TCATTCAAAT TCCCTCTTAA ATCTAATGAT 960 

TTGTCTAAAT GTACTGCCTT CCATCTCTGT TTCTAAAATA AT ATTG TTGT ACTTATCTAG 1020 

TAGTTCTTTC ACATTATTTA ATCCGACTCC GCGATTTCTT CCCTTAGTGG AGAATCCTAA 1080 

GGCAAATAGA TCTCCTGAAG GAGTCATCGT CATTTTACAT GAATTCTGAA TCACAATAAC 114 0 

TGTTTCAGTT TCCATCTTAA TAACTGCTAC TTCCATCTGC TTTTTATAGC TATCAGCCGA 1200 

TCCTTCG AC A GCATT ATT C A ATAAAACGCT CATGATACGA ACCAAATCCA ATAGTTCAAT 1260 

TGGAAGCTTG G T AATCGT AT CTTTTACTTC CAGTGTAAAC TCT AC AC C AT TATTTCGAGC 1320 

ATAGACAATT GACTGAGCAA CCAAACTTCG TAAAGCTGAG TCTTCTATGT TGTTCAAATC 1380 

AAAGTAACTG TACTTATCTC AACGCAATTT ATGATTTGCT TTGACTAAAA CTTCATTGTA 144 0 

AATTCTGTCA ATTTCCTGTA AATT AC C ACT GTCAATTGCC ATCTGCATGC TGACAAGCAT 1500 

TCCAGCATAA TCATGTCGAA AACCACGGAT TTCATTATAC AGACCAACAA TTTCATCTGT 15 60 

GTAATTCTGT AAATGTTTCT GTTCAAATTT CTTCTGCTTC AAAGCAATCT CTTTCTCCAT 16 20 

TTGAACTTTA TGAGAATTCA TTGCAAAGAA GGTCAAAACG AGAGAGATAA AGACAATAGA 1680 

TGACAAAATA CTTCCAAAAC TATTCAAATG TT T AATCGT A CTTACCAT AT CTGAAACGAA 1740 

AGATACAATA TGT AG C AAT A G T AAAGC AAA AAATACTTTT TTCAAGAAAG GATAAAGGTA 1800 

GTCCTTGTCA AAATAGGCTA GTTCCAAATG GAAATAGTAA ATG ATTTTT A ATGTAACAAA I860 

ATAGGTTAAC ACCGTCACAA CGAAAAAGAA TGGGAAATGA TATTGTAAAA CAAAATTATC 1920 

TCCTGTTATA GAGGAGAAAA TTACGGACAG AAAGTTATGA GTGCTCTCAT ATAAAAGAGA 198 0 

TAGTAGTAAA CTTAGGAATA GTCCTCTATC CCTCTCATAC TGTTTC AT CC ATCGAAAATA 204 0 

GGAATATAAG CCCAAAGGAA ATAAAAATCT TTCAATCCCT ATTTT AT CT A AATATAGAAG 2100 

AT AAAAGG AA AATTCAAGTA CTATTTCAGT TAGTAATGTA TAAGCACCAA AAACGT AT AA 2160 

TTCTTTTCTA TTTATTCGAC CTTTACAAAT TAAACGGTAA CTGTGACTAA TAATTAAAAA 2 220 
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ATGAACAATA ACTGTCCCAA ATCCAAGTAA ATCCATTACT CTTTCTCCTT ATTTCATTAC 2280 

TTTTTTCGTA GGAAAAGAAA ATCAAGGATG ATTCTTGAAA TCCTCATCTC CCCACCTTTA 2 3 40 

ATCTTTTGTA AGTCTTTTTC CTTCAAAGCT ACAAACTGTT CCAATTTAAC TGTCTTTTTC 2400 

ATAATAAAAT CTCCTAAAAT GTTTTTTCTT GTAAGCTAAC TTACAAAAAC CATTATACAA 24 60 

AATGGAATTT CGTTTTAGAT AAAATTCTCT CAACTGTCAT TTTTTTCTCC CAAAGTGTAC 2520 

TTTTTTAAGA AAAAAGCCGG GAAAATTCCC AGCTTTGCTA TTATATTGAT CCCAGCAGGA 2 580 

TTCGAACCTG CGACCGTTCG CTTAGAAGGC GAATGCTCTA TCCAGCTGAG CTATGAGACC 2 64 0 

TAATACAATT ATTCTACCAA AAATTCAATT AAAACTCAAT TTTCTATTTA TGGTAGGGGA 27 00 

ATCCCTGCTG AATCGTAAAA GCGCGATAGA TTTGTTCAAC AAGAACTAGT CTCATTAACT 2 7 60 

GATGGGGTAA GGTTAGGCGA CCAAAACTGA CAGAAAGATT GGCTCTATTT TTTACAGATG 2 82 0 

ATGATAATCC TAAACTTCCC CCAATAATAA AAGTAAGAGT AGAAAATCCT TTTATAGAAC 2 88 0 

TTTCTTCTAA CTGCTTACTA AATTCTTCTG AGAAGAAAGT TTTCCCTTCA ATGGCTAACA 2 94 0 

CAATAACGAA ATCACGGTCA G C AATTTTTG ATAAAATTCT CTGACCTTCT ATTTCTAAAA 30QO 

TCTTTTGATT TTCTGATTCA CTGGCCTTAT CTGGTGTTTT TTCATCTGAT AACTCAATCA 3 060 

TTTCAAACTT AG CAAATCT A GAAATTCCT? TTGAATACTC TGCGATACCA TCTTTTAAAT 3120 

ACTTTTCTTT CAGTTTCCCA ACTGTTACAA CTTTAATTTT CATGACTCTA TTCTAACATA 3180 

TTCTCTATTT TTTCACATCT T ATTC A C AAA ATAAAAAATA GATTTCAATT AAGAAAATCA 3 240 

CAATTTCAAA AGAGTTATCC ACAGTTTGTG TAAAACTTTT GTGTTTAAGT TATAATTAAG 3 300 

CTAGTCAGTT TATACTTTCA GTAATTCAAA CATATGGAGG CAAATATGAA ACATCTAAAA 3 3 60 

ACATTTTACA AAAAATGGTT TCAATTATTA GTCGTTATCG TCATTAGCTT TTTTAGTGGA 3420 

GCCTTCCGTA GTTTTTCAAT AACTCAACTA ACTCAAAAAA CTAGTGTAAA CAACTCTAAC 3 4 30 

AACAATAGTA CTATTACACA AACTGCCTAT AAGAACGAAA ATTCAACAAC ACAGGCTGTT 3540 

AACAAAGTAA AAGATGCTGT TGTTTCTGTT ATTACTTATT CGGCAAACAG AC AAAAT AGC 3600 

GTATTTGGCA ATGATGATAC TG AC AC AG AT TCTCAGCGAA TCTCTAGTGA AGGATCTGGA 3 660 

GTTATTTATA AAAAGAATGA TAAAGAAGCT TACATCGTCA CCAACAATCA CGTTATTAAT 3 7 20 

GGCGCCAgCA AAGTAGATAT TCGATTGTCA GATGGGACTA AAGTACCTGG AGAAATTGTC 37 80 

GGAGCTGACA CTTTCTCTGA TATTGCTGTC GTCAAAATCT CTTCAGAAAA AGTGACAACA 3 840 

GTAGCTGAGT TTGGTGATTC TAGTAAGTTA ACTGT AGG AG AAACTGCTAT TGCCATCGGT 3 900 

AGCCCGTTAG GTTCTGAATA TGCAAATACT GTCACTCAAG GTATCGTATC CAGTCTCAAT 3 960 
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AG AAATG TAT CCTTAAAATC GGAAGATGGA CAAGCTATTT CTACAAAAGC CATCCAAACT 4 020 

GATACTGCTA TTAACCCAGG TAACTCTGGC GGCCCACTGA TCAATATTCA AGGGCAGGTT 4 080 

ATCGGAATTA CCTCAAGTAA AATTGCTACA AATGGAGGAA CATCTGTAGA AGGTCTTGGT 4 140 

TTCGCAATTC CTGCAAATGA TGCTATCAAT ATTATTGAAC AGTTAGAAAA AAACGGAAAA 4 200 

GTGACGCGTC CAGCTTTGGG AATCCAGATG GTTAATTTAT CTAATGTGAG TACAAGCGAC 4 2 60 

ATCAGAAGAC TCAATATTCC AAGT AATGTT ACATCTGGTG TAATTGTTCG TTCGGTACAA 4 3 20 

AGTAATATGC CTGC C AATGG TCACCTTGAA AAATACGATG T AATT AC AAA AGTAGATGAC 4 3 80 

AAAGAGATTG CTTCATCAAC AGACTTACAA AGTGCTCTTT ACAACCATTC T AT CGG AG AC 4 44 0 

ACCATTAAGA TAACCTACTA TCGTAACGGG AAAGAAGAAA CTACCTCTAT CAAACTTAAC 4 500 

AAGAGTTCAG GTG ATTT AG A ATCTTAATTG ACATCTATGT AAACAAAGCT TTACATAAGA 4 5 60 

GAAAAGATGT GTTAGTGTAG AATCATGGAA AAATTTCAAA TGATTTCTAT C AC AG AT AT A 4 62 0 

CAAAAAAATC CCTATCAACC CCGAAAAGAA TTTGATAGAG AAAAACTAGA TGAACTAGCA 4 680 

CAGTCTATCA AAGAAAATGG GGTCATTCAA CCCATTATTG TTCGTCAATC TCCTGTTATT 4 7 40 

GGTTATGAAA TCcTTGCAGG AGAGAGACGC TATCGGCCTT CACTTTTAGC TGGTCTACGG 4 8 00 

TCTATCCCAG CTGTTGTTAA ACAGATTTCA GACCAAGACA TGATGGTCCA GTCCATTATT 4 8 60 

GAAAATTTAC AGAGAGAAAA TTTAAACCCA ATAGAAGAAG CACGCGCCTA TGAATCTCTC 4 920 

GTAGAGAAAG GATTCACCCA TGCTGAAATT GCAGATAAGA TGGCCAAGTC T C G TC C AT AT 4 980 

ATCAGCAACT CCATTCGTTT ACTTTCCTTG CCAGAACAGA TTCTTTCAGA AGT AG AAAAT 504 0 

GGCAAACTAT CACAAGCCCA TGCGCGTTCC CTAGTTGGGT TAAATAACGA ACAACAAGAC 5100 

TATTTCTTTC AACGGATTAT AGAAGAAGAT ATTTCTGTAA GGAAATTAGA AGCTCTTCTG 5160 

ACAGAGAAAA AACAAAAGAA ACAGCAAAAA ACTAATCATT TCATACAAAA TGAAGAAAAA 522 0 

CAGTTAAGAA AACTACTCGG ATTAGATGTA GAAATTAAAC TATCTAAAAA AGACAGTGGA 52 80 

AAAATCATTA TTTCTTTTTC AAATCAAGAA GAATATAGTA GAATTATCAA CAGCCTGAAA 5 3 40 

TAAGGCTGTT CTTTTATTTT TTTATCTCAC AAGGTT ATC C ACTATGTTTT TCGATAAAAA 54 00 

GCTTAATAAA TCAATAATTT CTTCTTTTAT CCCCAACCTG TGGATAAAGT TTGGTAACAT 54 60 

TGTGGATTAT TTTTCACAGC TTGTGGAAAA TTCTTGCTAT CTATGGTAAA AT ATCTCT AG 55 2 0 

TATTAAACTT TTAAATAGTA AAGG AGG AG A AAGGATTGAA AGAAAAACAA TTTTGGAATC 558 0 

GT AT ATT AG A ATTTGCACAA GAAAGACTGA CTCGATCCAT GTATGATTTC TATGCTATTC 5640 

AAGCTGAACT CATCAAGGTA GAGGAAAATG TTGCCACTAT ATTTCTACCT CGCTCTGAAA 57 00 

TGGAAATGGT CTGGGAAAAA CAACTAAAAG ATATTATTGT AGTAGCTGGT TTTGAAATTT , 57 60 
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ATGACGCTGA AATAACTCCC CACTATATTT TCACCAAACC TCAAGATACG ACTAGCTCAC 582 0 

AAGTTGAACA AGCTACAAAT TTAACTCTTT ATAACTATAG TCCAAAGTTA GTATCTATTC 5880 

CTTATTCAGA TACGGGATTA AAAGAAAAGT AT ACCTTTG A T AACTTT ATT CAAGGGGATG 594 0 

GAAATGTTTG GGCTGTATCA GCCGCTTTAG CTGTCTCTGA AGATTTGGCT CTGACCTATA 6000 

ACCCT CTTTT TATCTATGGA GGACCAGGCC TTGGTAAGAC TCACTTATTA AACGCTATTG 6060 

G AAATG AAAT TCTAAAAAAT ATTCCTAATG CGCGTGTTAA ATATATCCCT GCCGAAAGCT 6120 

TTATTAATGA CTTTCTTGAT CACCTAAGAC TTGGGGAAAT GGAAAAGTTT AAAAAGACCT 6180 

ATCGTAGTCT TG AT CTTTTG TTAATCGATG ATATCCAGTC ACTCAGCGGA AAAAAAGTCG 62 4 0 

CAACTCAGGA AGAATTTTTC AATACCTTTA ACGCCCTTCA TGACAACCAA AAACAGATTG 63 00 

TCCTAACGAG TGATCGTAGT CCAAAACATC TAGAAGGGCT CGAGGAGAGG CTTGTCACGC 63 60 

GTTTTAGTTG GGGATTGACA CAAACTATCA CCCCCCCTGA CTTTGAAACA CGTATTGCCA 64 20 

TTTTACAAAG TAAGACGGAA CATTTAGGCT ACAATTTCCA AAGTGATACT CTAGAATACC 64 30 

TAGCTGGGCA ATTTGATTCA AATGTTCGAG ATCTTGAGGG AGCCATCAAC GACATCACTT 6 54 0 

TAATTGCCAG AGTAAAAAAA ATCAAGGATA TCACTATTGA TATTGCTGCA CAAGCCATTA 6 600 

GAGCCCGCAA ACAAGATGTT AGCCAAATGC TCGTCATCCC AATTGATAAA ATCCAAACTG 6 660 

AAGTTGGTAA CTTTT ATGGT GTTAGTATCA AAGAAATGAA CGGAAGTAGA CGCCTTCAAA 6720 

ATATTGTTTT GGCCCGTCAA GTAGCCATGT ATTTATCTAG AGAACTAACA GATAATAGTC 67 80 

TTCCAAAAAT TGCGAAGGAA TTTCGGCGAA AAGATCATAC CACAGTCATT CATGCCCATG 6 8 40 

CCAAAATAAA ATCTTTGATT GATCAAGACG ATAATTTACG TTTAGAAATT GAATCAATCA 6 900 

AAAAGAAAAT CAAATAATTT GTGGATAACT TTTAGTTTTT TATCTTTTTT ATCCACATTT 6 9 60 

TTTAAACAAG CTAAAAAACT TGATATGACT TGTTTAAAGG CTGTTTTCCA CAGATTTCAC 7020 

AGACTCTATT ATTACTATTA TCTTTCTAAT ACTAAAAATA AAT AAAGG AG AATCCATGAT 7 0 80 

TCATTTTTCA ATTAATAAAA ATTT ATTTCT ACAAGCATTA AAT ACT ACTA AGAGAGCTAT 7140 

TAGTTCTAAA AATGCCATTC CTATTTTATC AACACTAAAA ATTGACGTGA CCAATGAAGG 7 2 30 

TATTACTTTA ATTGGTTCAA ATGGT C AAAT TTCAATTGAA AATTTTATTT CTCAAAAAAA 7 2 60 

TGAAGATGCT CCTTTGTTAA TTACTTCTTT AG GTTCG AT C CTTCTTGAAG CTTCTTTCTT 7 3 20 

TATCAATGTA GTATCTAGTT TACCTGATGT AACTCTTGAT TTTAAAGAAA TTGAACAAAA 7 3 80 

TCAAATTGTT TTAACCAGTG G C AAAT C AG A AATTACCCTA AAAGGAAAAG ATAGCGAACA 7 4 40 

ATATCCACGA ATCCAAGAAA TTTCAGCAAG CACTCCTTTA ATACTTGAAA CAAAATTACT 7 5 00 
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CAAGAAAATT ATT AAT G AAA CAGCCTTTCC TGCAAGTACA CAACAGAGTC GTCCGATTTT 7 560 

AACAGGTGTC CACTTCGTAT TGAGTCAACA CAAAGAGTTA AAAACAGTTG C AAC AG ACT C 7 620 

TCATCGCCTA AGCCAGAAAA AATTGACTCT TGAAAAAAAT AGTGATGATT TTGATGTCGT 7 680 

AATTCCTACC CGTTCTCTAC GCGAATTTTC AGCGGTATTT ACAGATGATA TCGAAACTGT 77 4 0 

AG AG ATTTT C TTTGCCAATA ACCAAATCCT CTTTAGAAGC CAAAATATTA GCTTCTATAC 7 800 

TCGTCTCCTA GAAGGAAACT ATCCTGATAC AGATCGCTTG ATTCCAACAG ACTTTAACAC 7 8 60 

TACT ATT ACT TTTAATGTGG TAAACTTACG CCAGTCAATG GAGCGTGCCC GTCTTTTATC 7920 

AAGTGCGACT CAAAATGGTA CTGTGAAACT TGAAATTAAG CATGCGCTTG TTAGCGCCCA 7 93 0 

TGTTCACTCT CCAGAAGTTG GTAAAGTAAA CGAAGAAATC GATACTGATC AGGTTACTGG 8040 

TGAAGATTTG ACCATTAGTT TCAACCCAAC TTACTTGATT GATTCTCTTA AAGCTTTAAA 8100 

TAGCGAAAAG GTGACTATTA GCTTTATCTC AGCTCTTCGT CCATTTACTC TTGTGCCAGC 8160 

AG AT ACT G AC GAAGACTTCA TGCAGCTCAT TACACCAGTT CCTACAAATT AAGTG AAAG A 82 20 

GGTTGAGCCT GCCTCCCCTC TTTTATGATA TAATCGAAAA AGAAAAGGAG AGTAGTATGT 8 2 90 

ATCAAGTTGG AAATTTTGTT GAGATGAAAA AATCACACGC TTGTACAATC AAGTCGACTG 3340 

GTAAAAAGGC TAATCGTTGG GAAATTACAC GTGTAGGAGC AGATATCAAA AT AAAATGT A 84 00 

GTAATTGTGA GCATGTTGTC ATGATGCGGC GATATGATTT TGAGCGAAAA ATGAATAAAA 84 60 

TTATTGACTG AGAACCCTTA GTTAGAGGGT TAGCACTTTA TCCCTTTTTG TGTT AT AAT A 8520 

TTAGGGATTG AAATGAAAAC GGAGAATGAG AAATATGGCT TTGACAGCAG GTATCCTTGG 8 5 80 

TTTGCCAAAC GTTGCTAAAT CAACACTATT TAATGCAATT ACAAAACCAG GAGCAGAGGC 8 640 

AGCAAACTAC CCATTTGCGA CGATTGATCC AAATGTTGGA ATGGTGGAAG TTCCAGATGA 8 7 00 

ACGCCTACAA AAACTAACTG AAATGATAAC TCCTAAAAAG ACAGTTCCCA CAACATTTGA 3 7 60 

AT TT AC AG AT ATTGCAGGGA TTGTAAAAGG AGCTTCAAAA GCAGAGGGGC TAGGGAATAA 8 820 

ATTCTTGGCC AATATTCGTG AAGTAGATGC GATTGTTCAC GTAGTTCGTG CTTTTGATGA 8 880 

TG AAAATGT A ATGCGCGAGC AAGGACGTGA AGACGCCTTT GTAGATCCAC TTGCAGATAT 8 940 

TGATACCATT AATCTGGAAT TGATTCTTGC TGACTTAGAA TCAGTGAACA AACGATATGC 9 000 

GCGTGTAGAA AAGATGGCAC GTACGCAAAA AGATAAAGAA TCAGTAGCAG AATTCAATGT 9 060 

TCTTCAAAAG ATTAAACCAG TCCTAGAAGA CGGGAAATCA CCTCGTACCA TTGAATTTAC 9120 

AGATGAGGAA CAAAAGGTTG TCAAAGGTCT TTTCCTTTTG ACGACTAAAC CAGTTCTTTA 9 130 

TGTAGCTAAT GTGGACGAGG ATGTGGTTTC AG AAC CTG AC TCTATCGACT ATGTCAAACA 92 40 . 

AATTCGTGAA TTTGCAGCGA CAGAAAATGC TGAAGTAGTC GTTATTTCTG CCCGTGCTGA 9 3 00 • 
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GGAAGAAATT TCTGAATTGA ATGATGAAGA TAAAAAAGAG TTTCTTGAAG CCATTGGTTT 9 3 60 

GACAGAATCA GGTGTAGATA AGTTGACGCG TGCAGCTTAC CACTTGCTTG GATTGCGAAC 9*120 

TTACTTCACA GCTGGTGAAA AAGAAGTTCG CGCTTGGACT TTCAAACGTG GTATGAAGGC 9 4 80 

TCCTCAAGCA GCTGGTATTA T CC ACT CAG A CTTTGAAAAA GGCTTTATTC GTGCAGTAAC 9 540 

CAT GT CAT AT GAAGATCTAG TGAAATACGG ATCTGAAAAG GCCGTAAAAG AAGCTGGACG 9 600 

C7TGCGTGAA GAAGGAAAAG AATATATCGT TCAAGATGGC GATATCATGG AATTCCGCTT 9 660 

TAATGTCTAA AAATTAATAA ATGGTGTCAA TTAGGTTGGA AAAAAATTCC AACCCTTTTC 97 20 

GCTTTTGAAA GGAAAAATAA ATGACCAAAT TACTTGTAGG CTTGGG AAAT CCAGGGGATA 97 80 

AATATTTTGA AACAAAACAC AATGTTGGTT TTATGTTGAT TGATCAACTA CCGAACAAAC 98 4 0 

AGAATGTCAC TTTTACACAC GATAAGATAT TTCAAGCTGA CCTAGCATCC TTTTTCCTAA 9900 

ATGGAGAAAA AATTTATCTG GTTAAACCAA CGACCTTTAT GAATGAAAGT GGAAAAGCAG 99 60 

TTCATGCTTT ATTAACTTAC TATGGTTTGG ATATTGACGA TTTACTTATC ATTTACGATG 10020 

ATCTTGACAT GGAAGTTGGG AAAATTCGTT TAACAGCAAA AGGCTCAGCA GGTGGTCATA 10080 

ATGGTATCAA GTCTATTATT CAACATATAG GAACTCAGGT CTTTAACCGT GTTAAGATTC 10140 

GAATTGGAAG ACCTAAAAAT GGTATGTCAG TTCTTCATCA TGTTTTGAGT AAGTTTGACA 102 00 , 

GCGATCATTA TATCGGTATT TTACAGTCTG TTG AC AAAGT TGACGATTCT GTAAACTACT 10260 

ATTTACAAGA GAAAAATTTT GAGAAAACAA TGCAGAGGTA TAACGGATAA ATGGTCACCT 10320 

TATTAGATTT ATTCTCAGAA AATGATCAGA TTAAAAAATG GCATCAAAAT TTAACAGATA 10 3 90 , 

AGAAAAGACA ACTAATACTT GGTTTATCAA CATCTACTAA GGCTCTTGCA ATTGCAAGCA 1C4 40 

GTTTAGAAAA AGAAGATAGG ATTGTGTTAT TGACGTCAAC TTATGGAGAA GCAGAAGGAC 10500 

TTGTTAGTGA TCTT ATTTCT ATCTTGGGTG AGGAACTCGT CTATCCATTT TTGGTAGATG 105 60 

ATGCTCCTAT GGTCGAGTTT TTGATGTCTT CACAGGAAAA AATTATTTCA CGGGTTGAAG 10620 

CCTTGCGTTT TTTGACTGAT TCATCTAAGA AAGGGATTTT AGTTTGTAAT ATCGCAGCAA 10 680 

GTCGATTGAT TTTACCGTCT CCCAATGCAT T C AAAG AT AG TATTGTAAAA ATCTCACTTC 10740 

GTGAAGAATA TGATCAACAC GCGTTTATCC ATCAGTTAAA GGAAAATCGC TATCGAAAAG 10 800 

TTACTCAAGT ACAAACTCAG GGCGAATTTA GTCTTCGAGG AG A T AT TTT A GATATTTTTG 10 860 

AAATATCCCA GTTAGAACCT TGTCGAATTG AGTTTTTTGC TGATGAAATT GATGGTATCA 10 920 

GGTCATTTGA AGTAGAAACA CAATTATCGA AAGAAAATAA GACAGAACTC ACTATCTTTC 1099 0 

CAGCTAGTGA TATGCTTTTG AGAGAAAAGG ATT AT C AAC G AGGACAGTCA GCTTTAGAAA 11040 
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TGATTCCGGA GGCTTCGATT GGATATGTTC ATGGTCGAAT GAGTGAAGTC CAGTTGGAAA 12 900 

ATACTCTATT AGACTTTATT GAGGGACAAT ACGATATCTT GGTGACGACT ACT AT T ATTG 12 960 

AGACAGGGGT GGACATTCCA AATGCTAATA CTTTATTTAT TGAAAATGCG GACCATATGG 13020 

GCTTGTCAAC CTTATATCAG TTAAGAGGAA GAGTCGGTCG TAGTAATCGT ATTGCTTATG 13 080 

CTTATCTCAT GTATCGTCCA GAAAAATCAA TCAGTGAAGT CTCTGAAAAG AGATTAGAAG 13140 

CGATTAAAGG ATTTACAGAA TTCGCCTCTG GCTTTAAGAT TGCAATGCGA GATCTTTCGA 13 200 

TTCGTGGAGC AGGAAATCTT TTAGGAAAAT CCCAGTCTGG TTTCATTGAT TCTGTTGGTT 132 60 

TTGAATTGTA TTCGCACTTA TTAGAGGAAG CTATTGCTAA ACGAAACGGT AATGCTAACG 13 320 

CTAACACAAG AACCAAAGGG AATGCTGAGT TGATTTTGCA AATTGATGCC TATCTTCCTG 13 3 80 

ATACTTATAT TTCTGATCAA CGACATAAGA TTGAAATTTA CAAGAAAATT CGTCAAATTG 13440 

ACAACCGTGT CAATTATGAA GAGTTACAAG AGGAGTTGAT AGACCCTTTT GGAGAATACC 13 500 

CAGATGTAGT AGCCTATCTG TTAGAGATTG CTTTGGTCAA ATC AT ACTTG GACAAGGTCT 135 60 

TTGTTCAACG TGTGCAAAGA AAAGATAATA AAATTACAAT TCAATTTGAA AAAGTCACTC 13620 

AACGACTCTT TTTAGCTCAA G ATT ATTTT A AACCTTTATC CGTAACGAAC TTAAAAGCAG 13 680 

GCATCGCTGA GAATAAGGGA TTAATGGAGC TTGTATTTGA TGTCCAAAAT AAGAAAGATT 137 4 0 

ATGAAATTTT AGAAGGTTTG CTGATTTTTG GAGAAAGTTT ATTAGAGATA AAAGAGTCTA 13800 

AGGAAGAAAA TTCCATTTGA TATTTTTCTT CT AT AAAAT A GATAAAAATG GTACAATAAT 138 60 

AAATTCAGCT AATAAGGATG AG ATT AG AT A AATATTTAAA AGTATCGCGA ATTATCAAGC 13920 

GTCCTACAGT CGCAAAGGAA GTAGCACATA AAGGTAGAAT C AAG G TT AAT CGAATCTTCG 139 30 

CCAAAAGTTC AACGGACTTG AAAGTTAATG ACCAAGTTGA AATTCCCTTT GGCAATAAGT 14 04 0 

TGCTGCTTGT AAAAGTACTA G AG AT G AAAG AT AGT AC AAA AAAAGAAGAT GCAGCAGGAA 14 100 

TGTATGAAAT 7ATCAGTGAA ACACGGGTAG AAGAAAATGT CTAAAAATAT TGTACAAT~G 14160 

AATAATTCTT TTATTCAAAA TGAATACCAA CGTCGTCGCT ACCTGATGAA AGAACGACAA 14 2 20 

AAACGGAATC GTTTTATGGG AGGGGTATTG ATTTTGATTA TGCTATTATT TATCTTGCCA 14 280 

ACTTTTAATT TAGCGCAGAG TTATCAGCAA TTACTCCAAA GACGTCAGCA ATT AG C AG AC 14 340 

TTGCAAACTC AGTATCAAAC TTTGAGTGAT GAAAAGGATA AGGAGACAGC ATTTGCTACC 144 00 

AAG TTG AAAG ATGAAGATTA TGCTGCTAAA TATACACGAG CGAAGTACTA TTATTCTAAG 14 4 60 

TCGAGGGAAA AAGTTTATAC GATTCCTGAC TTGCTTCAAA G GTG AT AAAA TGGAAAATTT 14 520 

ATTAGACGTA ATAGAGCAAT TTTTCAGTTT GTCAGATGAA AAGCTGGAAG AATTGGCTGA 145 80 
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TAAAAATCAA TTATTGCGTT TACAAGAAGA AAAGGAAAGG AAG AA TGCGT AAATTCTTAA 14640 

TTATTTTGTT GCTACCAAGT TTTTTGACCA TTTCAAAAGT CGTTAGCACA GAAAAAGAAG 14 700 

TCGTCTATAC TTCGAAAGAA ATTTATTACC TTTCACAATC TGACTTTGGT ATTTATTTTA 14 7 60 

GAGAAAAATT AAGTTCTCCC ATGGTTTATG GAGAGGTTCC TGTTTATGCC AATGAAGATT 14820 

TAGTAGTGGA ATCTGGGAAA TTGACTCCCA AAACAAGTTT TCAAATAACC GAGTGGCGCT L4 8S0 

TAAATAAACA AGGAATTCCA GTATTTAAGC TATCAAATCA TCAATTTATA GCTGCGGACA 14 94 0 

AACGATTTTT ATATGATCAA TCAGAGGTAA CTCCAACAAT AAAAAAAGTA TGGTTAGAAT 15 COO 

CTGACTTTAA ACTGTACAAT AGTCCTTATG ATTTAAAAGA AGTGAAATCA TCCTTATCAG 15060 

CTTATTCGCA AGTATCAATC GACAAGACCA TCTTTGTACA AGG AAG AG AA TTTCTACATA 15120 

TTG AT CAGGC TGGATGGGTA GCTAAAGAAT C AA CTTCTG A AGAAGATAAT CGGATGAGTA 15180 

AACTTCAACA AATGTTATCT GAAAAATATC AGAAAGATTC TTTCTC T ATT TATCTTAAGC 15240 

AACTGACTAC TGGAAAAGAA GCTGGTATCA ATCAAGATGA AAAGATGTAT GCAGCCAGCG 15300 

TTTTGAAACT CTCTTATCTC TATTATACGC AAGAAAAAAT AAATGAGGGT CTTTATCAGT 15 3 60 

TAGATACGAC TGTAAAATAC GTATCTGCAG TCAATGATTT TCCAGGTTCT TATAAACCAG 15420 

ACGGAAGTGG TAGTCTTCCT AAAAAAGAAG ATAATAAAGA ATATTCTTTA AAGGATTTAA 154 80 

TTACGAAAGT ATCAAAAGAA TCTGATAATG TAGCTCATAA TCTATTGGGA TATTACATTT 15540 

CAAACCAATC TGATGCCACA TTCAAATCCA ACATCTCTGC CATTATGGGA GATGATTGGC 15600 

ATCCAAAAGA AAAATTGATT TCTTCTAAGA TGGCCGGGAA GTTTATGGAA GCTATTTATA 15 6 60 

ATCAAAATGG ATTTGTGCTA GAGTCTTTGA CTAAAACAGA TTTTGATAGT CAGCGAATTG 15 720 

CCAAAGGTGT TTCTGTTAAA GTAGCTCATA AAATTGGAGA TGCGGATGAA TTTAAGCATG 157 90 

ATACGGGTGT TGTCTATGCA GATTCTCCAT TTATTCTTTC TATTTTCACT AAGAATTCTG 15840 

ATTATGATAC GATTTCTAAG ATAGCCAAGG ATGTTTATGA GGTTCTAAAA TGAGGGAACC 15900 

AG ATTTT TT A AATCATTTTC TCAAGAAGGG ATATTTCAAA AAGCATGCTA AGGCGGTTCT 15 960 

AGCTCTTTCT GGTGGATTAG ATTCCATGTT TCTATTTAAG GTATTGTCTA CTTATCAAAA 15020 

AGAGTTAGAG ATTGAATTGA TTCTAGCTCA TGTGAATCAT AAGCAGAGAA TTGAATCAGA 16080 

TTGGGAAGAA AAGGAATTAA GGAAGTTGGC TGCTGAAGCA GAGCTTCCTA TTTATATCAG 1614 0 

CAATTTTTCA GGAGAATTTT CAGAAGCGCG TGCACGAAAT TTTCGTTATG ATTTTTTTCA 16 200 

AGAGGTCATG AAAAAGACAG GTGCGACAGC TTTAGTCACT GCCCACCATG CTGATGATCA 162 60 

GGTGGAAACG ATTTTTATGC GCTTGATTCG AGGAACTCGC TTGCGCTATC TATCAGGAAT 16 3 20 

TAAGGAGAAG CAAGTAGTCG GAGAGATAGA AATCATTCGT CCCTTCTTGC ATTTTCAGAA 16 3 80 
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AAAAGACTTT CCATCAATTT TTCACTTTGA AGATACATCA AATCAGGAGA ATCATTATTT 16440 

TCGAAATCGT ATTCGAAATT CTTACTTACC AGAATTCGAA AAAGAAAATC CTCGATTTAG 16 500 

GGATGCAATC TTAGGCATTG GCAATGAAAT TTTAGATTAT GATTTGGCAA T AG CTGAATT 16 560 

ATCTAACAAT ATTAATGTGG AAGATTTACA GCAGTTATTT TCTTACTCTG AGTCTACACA 16 620 

AAGAGTTTTA CTTCAAACTT ATCTGAATCG TTTTCCAGAT TTGAATCTTA C AAAAGCT C A 16680 

GTTTGCTGAA GTTCAGCAGA TTTTAAAATC TAAAAGCCAG TATCGTCATC CGATTAAAAA 167 JO 

TGGCTATGAA TTGATAAAAG AGTACCAACA GTTTCAGATT TGTAAAATCA GTCCGCAGgC 16800 

TGATGAAAAG GAAGATGAAC TTGTGTTACA CTATCAAAAT CAGGTAGCTT ATCAAGGATA 16860 

TTTATTTTCT TTTGGACTTC CATTAGAAGG TGAATTAATT CAACAAATAC CTGTTTCACG 16920 

TGAAACATCC ATACACATTC GTCATCGAAA AACAGGAGAT GTTTTGATTA AAAATGGGCA 16980 

TAGAAAAAAA CTCAGACGTT TATTTATTGA TTTGAAAATC CCTATGGAAA AGAGAAACTC 17 040 

TGCTCTTATT ATTGAGCAAT TTGGTGAAAT TGTCTCAATT TTGGGAATTG CGACCAATAA 17100 

TTTGAGTAAA AAAACGAAAA ATGATATAAT GAACACTGTA CTTTATATAG AAAAAATAGA 17160 

TAGGTAAAAA ATGTTAGAAA ACGATATTAA AAAAGTCCTC GTTTC ACACG AT G AAATT AC 17 2 20 

AGAAGCAGCT AAAAAACTAG GTGCTCAATT AACTAAAGAC TATGCAGGAA AAAATCCAAT 17 2 80 

CTTAGTTGGG ATTTTAAAAG GATCTATTCC TTTTATGGCT GAATTGGTCA AACATATTGA 17 340 

T AC AC AT ATT GAAATGGACT TCATGATGCT TTCTACCTAC CATGGTGGAA CAGCAAGTAG 174 00 

TCGTGTTATC AATATTAAAC AAGATGTCAC TCAAGATATC AAAGGAAGAC ATGTTCTATT 174 60 

TGTAGAAGAT ATC ATTG AT A CAGGTCAAAC TTTGAAGAAT TTGCGAGATA TGTTTAAAGA 17520 

AAGAGAAGCA GCTTCTCTTA AAATTGC AAC C TTGTTGG AT AAACCAGAAG GACCTGTTGT 17580 

AGAAATTGAG GC AG ACT AT A CTTGCTT T AC TATCCCAAAT GAGTTTGTAG TAGGTTATGG 17 640 

TTTAGACTAC AAAGAAAATT ATCGTAATCT TCCTTATATT GGAGTATTGA AAGAGGAAGT 177 00 

GTATTCAAAT TAGAAAGAAT AATCTTTAAT GAAAAAACAA AATAATGGTT TAATTAAAAA 177 60 

TCCTTTTCTA TGGTTATTAT TTATCTTTTT CCTTGTGACA GGATTCCAGT ATTTCTATTC 17 820 

TGGGAATAAC TCAGGAGGAA GTCAGCAAAT CAACTATACT G AGTTGGT AC AAG AAATT AC 17 880 

CGATGCTAAT GTAAAAGAAT TAACTTACCA ACCAAATGGT AGTGTT ATCG AAGTTTCTGG 17 940 

TGTCTATAAA AATCCTAAAA CAAGTAAAGA AGAAACAGGT ATTCAGTTTT TCACGCCATC 18000 

TGTTACTAAG GTAGAGAAAT TTACCAGCAC TATTCTTCCT GCAGATACTA CCGTATCAGA 18060 

ATTGCAAAAA CTTGCTACTG ACCATAAAGC AGAAGTAACT GTTAAGCATG AAAGTTC AAG 18120 
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TGGTATATGG ATTAATCTAC TCGTATCCAT TGTGCCATTT GGAAKCTAT TCTTCTTCCT IB 180 

ATTCTCTATG ATGGGAAATA TGGGAGGAGG CAATGGCCGT AATCCAATGA GTTTTGGACG 18240 

TAGTAAGGCT AAAGCACCAA ATAAAGAAGA TATTAAAGTA AGATTTTCAG ATGTTGCTGG 18 300 

AGCTGAGGAA GAAAAACAAG AACTAGTTGA AGTTGTTGAG TTCTTAAAAG ATCCAAAACG 18 360 

ATTCACAAAA CTTGGAGCCC GTATTCCAGC ACGTGTTCTT TTGG AG GG AC CTCCGGGGAC 18-520 

AGGTAAAACT TTGCTTGCTA AGGCAGTCGC TGGAGAAGCA GGTCT7CCAT TCTTTAGTAT 18480 

CTCAGGTTCT GACTTTGTAG AAATGTTTGT CGGAGTTGCA GCTAGTCGTG TTCCCTCTCT 13540 

TTTTGAGGAT GCCAAAAAAG CAGCACCAGC TATCATCTTT ATCGATGAAA TTGATGCTGT 13600 

TGGACGTCAA CGTGGAGTCG GTCTCGGCGG AGGTAATGAC GAACGTGAAC AAACCTTGAA 18660 

CCAACTTTTG ATTGAGATCG ATGGTTTTGA GGGAAATGAA GGGATTATCG TCATCGCTGC 18 720 

G AC AAACCGT TCAGATGTAC TTGACCCTGC CCTTTTGCGT CCAGCACCTT TTCATAGAAA 18780 

AGTATTGGTT GGTCGTCCTG ATGTTAAAGG TCGTGAAGCA ATCTTGAAAG TTCACGCTAA 18840 

CAATAAGCCT TTAGCAGAAG ATGTTGATTT GAAATTAGTG GCTCAACAAA CTCCAGGCTT 13900 

TGTTGGTGCT G ATT TAG AG A ATGTCTTGAA TGAAGCAGCT TTAGTTGCTG CTCGTCGCAA 18960 

TAAATCGATA ATTCATCCTT CAGATATTGA TGAAGCAGAA GATAGAGTTA TTGCTGGACC 19020 

TTCTAAGAAA GATAAGACAG TTTCACAAAA AGAACGAGAA TTCGTTCCTT ACCATGAGGC 19080 

AGGACATACC ATTGTTGGTC TAGTCTTGTC GAATGCTCGC GTTGTCCATA AGGTTACAAT 19140 

TGTACCACGC GGCCGTGCAG GCGGATACAT GATTGCACTT CCTAAAGAGG ATCAAATGCT 192 00 

TCTATCTAAA GAAGATATGA AAGAGCAATT GGCTGGCTTA ATGGGTCGAC GTGTAGCTGA 192 60 

ACAAATTATC TTTAATGTCC AAACCACAGG AG CTT CAAAC GACTTTGAAC AAGCGACACA 193 20 

AATGGCACGT GCAATGGTTA CAGAGTACGG TATGAGTGAA AAACTTGGCC CAGTACAATA 193 80 

TGAAGGAAAC CATGCTATGC TTGGTGCACA GAGTCCTCAA AAATCAATTT CAGAACAAAC 194 40 

ACCTTATGAA ATTGATGAAG AGGTTCCTTC ATT ATT AAA T GAGCCACGAA ATAAAGCTGC 19500 

TGAAATTATT C AG T C AAATC GTGAAACTCA CAAGTTAATT GC AG AAGC AT TATTGAAATA 19560 

CGAAACATTG GATAGTACAC AAATTAAAGC TCTTTACGAA ACAGGAAAGA TGCCTGAAGC 19620 

AGTAGAAGAG GAATCTCATG CACTATCCTA TGATGAAGTA AAGTCAAAAA TGAATGACGA 19 630 

AAAATAACCC TGAGAGAGGC TGGAGCCTCT CTTTTTTCTG CAGTTTAGGA GCTAAAGGGA 1974 0 

ACAGAATGGA GAAAATGGAA CAAATGTGTT TTCTAATCTG TTAGACTGTA TCTAGAAAGG 19 800 

GGAAAATTAT GATTAAAGAA TTGTATGAAG AAGTCCAAGG GACTGTGTAT AAGTGTAGAA 19 860 

ATGAATATTA CCTTCATTTA TGGGAATTGT CGGATTGGGA GCAAGAAGGC ATGCTCTGCT 19920 
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TATTCCACGT 


T*T A Ann AAA T 




ATTTCAACAC 


CAAGTTTCGA 


AATCGAATTT 


TAG AC TAT AT 


CCGTAAACAG 


G AAAGT C AG A 


2 0040 


AGCGTAGATA 


CGATAAAGAA 


CCCTATGAAG 


AAGTGGGTGA 


GATCAGTCAT 


CGTATAACTC 




AGGGGGGTCT 


CTGGCTAGAT 


GATTATTATC 


TCTTTCATGA 


AAC ACT A AH A 


GATTATAftAA 




ACAAACAAAG 


TAAAGAGAAA 


CAAGAAGAAC 


TAGAACGCCT 






5 O 0 9 n 


GAGGGCGTCA 


AAGAGTATTA 


AGAGACTTAC 


GCATTGTGTT 






*) "5 a n 


CCCACTAGTA 


AGTCATGCAA 


AAAAAATGAA 


AAAAATTAGA 


AAAACTAGTT 


UAUn/lftO 111 


i UJIU 


GAAAAGGCTG 


TATAATAGTA 


AGAGTTGAAA 


ATAACAACTC 


AGGTCCGTTG 


C'CAAfinftGT 

VJ a \_ VJ V-J VJ 1 




TAAGACACCG 


CCTTTTCACG 


GCGGTAACAC 


GGGTTCGAAT 






OU 


TTGCGTCAGG 


ACCACTTGAT 


GAAAAAAAGT 


TTAAAAAAAC 


TTAAAJUTPT 






GTTGACAAGC 


GAAAGCAGTT 


GTGATATACT 


AATATAGTTG 


TCGCTTGAGA 


GAAGCAAflTP. 


20580 


ACAAAGACCT 


TTGAAAACTG 


AACAAGACGA 


ACCAATGTGC 


AGGGCGCTAC 




\J O H \J 


GTAGTACTGA 


ACAATGAAAA 


AAACAATAAA 






f!T A ana a 




AAACTTTTTA 


ATGAGAGTTT 


GATCCTGGCT 


CAGGACGAAC 


GCTGGCGGCG 


TCCCTAATAC 


2 07 60 


ATGCAAGTAG 


AACGCTGAAG 


GAGGAGCTTG 


CTTCTCTGGA 


TGAGTTGCGA 


ACGCG^GAGT 


C \J o £ U 


AACGCGTAGG 


TAACCTGCCT 


GGTAGCGGGG 


CATAACTATT 


GGAAAfnaTA 




") n Q Q A 


CATAAGAGTA 


GATGTTGCAT 


G AC ATTTG CT 


TAAAACCTHC 






2 0940 


GACCTGCCTT 


GTATTACCTA 


CTTGGTCGGC 


TAACGGCTCA 


CCAACCCCAC 


GATACATAGC 


21000 


CGACCTGAGA 


GGGTGATCGG 


CCACACTGGG 


ACTGAGACAC 


GGCCCAGACT 


CCTACGGGAG 


21060 


GCAGCAGTAG 


GGAATCTTCG 


GCAATGGACG 


GAAGTCTGAC 


CGAGCAACGC 


CGCGTGAGTG 


21120 


AAGAAGGTTT 


TCGGATCGTA 


AAGCTCTGTT 


GTAAGAGAAG 


AACGAGTGTC 


AGAGTGGAAA 


21180 


GTTCACACTG 


TGACGGTATC 


TTACCAGAAA 


GGGACGGCTA 


ACTACGTGCC 


AGCAGCCGCTG 


21240 


GTAATACGTA 


GGTCCCGAGC 


GTTGTCCGGA 


TTTATTGGCC 


GTAAAGCGAG 


CGCAGGCGGT 


21300 


TAGATAAGTC 


TGAAGTTAAA 


GGCTGTGGCT 


TAACCATA 






21338 


(2) INFORMATION FOR SEQ ID NO: 21 











(i> SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 6273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(XL) 


SEQUENCE DESCRIPTION : 


SHQ ID NO: 


21: 






TGTTTTTAAA 


GAGCCGTGTC 


TGGATAGACT 


TTCGGACGCA 


ACGCTCTATT 


AGATAATGAA 


60 


CTGCCTATAC 


ACAAGATTTC 


TAACCTTAGT 


CGACATGAGC 


TGAAACCTCT 


TATTTGTTAA 


120 


GTAGTTCACA 


AAATATTATA 


CACCTATTTT 


ATGAATAGTC 


AACTGTCTTT 


ACAGTAAAAT 


180 


TTTAGAAAAT 


CATGAAAATT 


TTCTCTTTCT 


TTCCATTTTA 


AGTGACATTC 


AGTCATTCTC 


240 


ACATCAAAAA 


AGCCCAGACG 


AAATTGTCTG 


AGCATTCTTT 


TATCTAGTCG 


TTTAAGGAAG 


300 


TTGAGTTCAG 


TATGTTTAAA 


GTCTCTGTCC 


CATCATTTCT 


TCAACAAACC 


TTGTTCTTGG 


360 


AGAAACTCCT 


TGGCTACTTG 


CTTTGCTGAC 


TTGCCTTCAA 


CACCGACTTG 


GTAGTTGAGC 


420 


TGGCTCATCT 


GGCTTTCTGT 


AATCTTACCA 


GCCAATGTAT 


TAAGAACTCT 


TTCCAACTCT 


480 


GGGTGTTTCT 


TGAGAAGAGC 


TTCTTTCATG 


AGTGGAGCCC 


CTTGATAAGG 


TGGGAAGAGT 


540 


TGCTTGTCAT 


CTTCCAAGAC 


CTGTAAATCA 


TAACGCTCCA 


ATTCCGCATC 


AGTCG AAT AG 


600 


GCATCCGTGA 


TTTGAATATC 


CCCTGACTGA 


AT AG C CTG AT 


AGCGAAGGGC 


TGGCTCAATG 


660 


GTCGCTACAT 


TGAGATTGAG 


ACCATACATT 


GATTGCAAGC 


CCTTATTTCC 


ATCTTCACGG 


720 


TCGTTAAACT 


CGAGTGTAAA 


ACCTGCCTTC 


AACTGCC CTT 


CCACTTTTTT 


CAAGTCTGAA 


780 


ATGGTCTTCA 


AGCCATATTC 


TTGAGCAATC 


TTTTTCGGAA 


CAGCT ACAGC 


ATAGCTGTTT 


840 


TGATAAGACA 


TGGGTTTGAG 


ATAGGCTAGA 


TGATCCTGCT 


TAGCAATGCC 


ATCACGCGCC 


900 


ACCTGATAAA 


CCTGTTCTGG 


TTCATGACTC 


ACCTTGGGTG 


ATGGTTGAAG 


C AAA CTTTC A 


960 


GTCACCGTAC 


CAGTAAATTC 


AGGATAGATG 


TCAATATCGC 


CTTTTTTCAC 


ACCTTCATAA 


1020 


AGGAAGCTTG 


TCTTCCCAAA 


AT TCGGTT T A 


ACAGTCGCAG 


TCATGCTGGT 


ATTTTCTTCA 


1080 


ATCAGCAACT 


TATA CAT ATT 


GGCCAAAATT 


TCTGGTTCTG 


CACCTATTTT 


CCCAGCAATA 


1140 


ACCAAGTTTT 


CCTTCTCTTT 


TTGAACCAAA 


AGAGCTGGAC 


TATAAGACAG 


ACCCAGTAAT 


1200 


AAAG CC AC C A 


AGGCAAAACC 


TGAGAAAATC 


GTCCGTAATT 


TTGCTTTTTC 


C AT C ACTTTT 


1260 


ACTAGGAAGT 


TAAAGGCAAT 


GGCTAGCACT 


GCAGAAGAAA 


GTGCCCCAAT 


CAAAATCAAA 


1320 


CTGGCATTAT 


TACGGTCAAT 


TCCCAAAAGA 


ATAAAGGAAC 


CTAGTCCCCC 


TGCACCAATC 


1380 


AAGGCCGCCA 


ACGTTGCCGT 


ACCGATAATC 


AAAACAGCTG 


CCGTCCGAAT 


CCCAGACATG 


1440 


ATAACAGGCA 


TGGCGAGTGG 


AATTTCAAAT 


TTCTTGAGAC 


GTTCCCATCT 


GGTCATCCCA 


1500 


AAGGCAATCC 


CAGCCTCTTG 


CAGGTTCGGA 


TCAATTCCCT 


TCAGCCCAGT 


GATAGTATTT 


1560 


TGCAAAATAG 


GGAAAATCGC 


ATAAATCACT 


AGAGCTGTCA 


AAGCCGGCAA 


GGTCCCAATT 


1620 


CCCATCAAAG 


GG AT AAAG AG 


CCCCAACAAG 


GCCAGAGACG 


GGATGGTCTG 


GAAAATACCT 


1680 


GCAATCTGCA 


AGACCCAGTC 


GGCCAGCTTC 


TCATGATAGC 


GAAGAAAAAC 


AGCCAAGGGA 


1740 
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ATCGCAAGCA AAATAGCTAG TAACAAGGTC AAAAGCGACA ACTGCAAATG TTGAGATAGA 18 00 

GCTGTCAACC AATCACTAAA ACGATCCTGA AAAGTTGCAA TTAAATTAGT CATGAACACT 18 60 

ACCTCCAAAC AAGTCTGCTA CAAAGTCTGT TGCAGGCGCT TTTAAAATTG TCTCGGGATT 19 20 

CGCTACCTGG CGAATTTCTC CATCCTGCAA GACAGCAATA CGGTCCGCCA ACTTCAAGGC 19 80 

TTCATCCGTA TCATCGGTTA CAAAAATCGT TGTCATCCCA AACTCTTTAT GCAATTCTTT 2 040 

TGTCAGAACC TGCAACTGTT TTCTCGAAAT AGCATCCAAG GCCGAAAAGG GTTCATCCAT 2100 

GAGGAAAATC TTGGGCTGAC CAATCATAGC TCGGACAATA CCGACCCGTT GCTGTTCTCC 2160 

ACCAGATAAT TCACTAGGTA AGCGATGCCC ATACTCGGCT ACTGGTAAAC CAACCTTAGC 22 2 0 

CAAAAGCTCT TCTGTTTTCT TCGTAATTTC TTCCTTGCTC CACCCCTTCA TTTCAGGAAT 22 80 

GAGAGCAATA TTTTCCGCAA CTGTTAGATT TGGAAAAAGA GCAATAGCCT GT AAAAC AT A 23 40 

ACCAGTAGAA AGACGAAGTT CACGCTCATC ATAGTCTTTG ATGCGCTTCC CATCCATATA 2 4 00 

AATATTTCCA TCAGTTGGTT CCAAAAGACG GTTAATCATC TTGAGCATGG TCGTCTTACC 2 4 60 

TGACCCAGAA GCCCCTACTA AAACCATAAA TTCCCCATCC TCAATCTGTA AGTTGACATC 2 5 20 

TCTCAAGACA TCCTTTTCTG TGTACCCCAG TGCTACATTT TTGTATTCAA TCATTCTTTC 2 5 80 

TCCTCAATTT AAAAC TTCCC TCGATTGGTC AAGTCTTCTA CCTTAGGCAT AACTTCCTTA 2 640 

TTATCCCAAT GCTCCACAAT TTTCCCGTTC TCTAAACGGA AGATATCGTA CTGGGCATAA 2 700 

GCAACGCCAT CAATCTGAGT CTGACCATAG CTAACCACAT ACTTTCCTTC TCCTAAGAGT 2 7 60 

TGGAAAACAA AGTCAAAAGT G AC ACT AT AT TCACCCACAT AGTTTTTATA AGCAGCACTT 2 8 20 

CCTTGTCCAA TATCATCATT ATGCTGAATC AAATCGTCTG CCACATAATC ACTCCACTGC 2 8 80 

TCTAGCTCCC CATTTTGGAA AATTTCTG" AACAAACC" ".V-.CCACCTT TTTA 7TTTCT 2 9 4 0 

GCTTTCTTAT CCAAATCCTT GATTTCAAAA TCTCCAAAAA TTTGATCTAG TTGGTCATTT 3 000 

TCAGGTGTTC GATAGTAGTC AATGACATCC CAATGCTCAA CAATACAACC ATTCTCATCC 3 060 

TCACGGAAAG TATCCGTCGT CACCCATTGA GCTTCTCCAC CATTCAGATA TTGATGAACA 3120 

TGAACAAAGA CCAGATTGCC ATCCTCAATG GTGCGGACAA TCTTAATCTG ACGCTCTGGA 3180 

TGACGCTCAA AGAAATCTGC AAAGAAGGCT GCAAATCCTT CTTTCCCGTC AGGAACACCT 3 24 0 

GTCGAATGTT GGATATAGGT ATCCCCTACA GACTGGGCTT GAGCCTCAGC AACTCGTCCG 3 3 00 

TCTTGAATGG CATGGATGTA T AGGTTGTG A GCATTTTTCA CTTGTTGTGA CATATTCTAA 3 3 60 

ACCTCATTTC CCTTCTCTTT CAGATTCGCC AAAATTCTTT CTTGAAAACC TTCAAATTGG 3 4 20 

TGAATTTCTT CCTCTGAAAA TCCTTTGTAA AAGATAGTAT CCAATTTCTG ACTGACACGA 3 4 80 
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TGCCCCACTT CTTTCTGGGA CTTGCCTAAC TCCGTTAAAA CTAAATACTT CTTACGCTTG 3 54 0 

TCTTTTCCAC ACGGACTAAC AATTACAAGC TTTTGTTCCT CTAGCTTTTT TATCATAGTC 3 600 

GTCAGCCTAT TATTCGCAAG TCCAGTCGCA AGCGCGATAT CTGTCGCAGT TGCGCAGCCA 3 660 

CTTTC ACT AT TCCATAAAAC CGCTAAAATC TTGCCCTGTT CACCCCTATA AAGAGCCTCA 3 7 20 

GGATCTTGAC TCAGTAACTT TTGAAAAATC CGCCCATTCA ACAAACGAAT ATGATGGGCT 3 7 90 

AGC AAATG AC CATCTTTCAT AACACCTCCA ATTTATTTCG ATATCGAAAT GAATAAAACA 3 34 0 

ATTGTAACAC TCATCGTTCT AACTGTCAAC TATTTCGATT TAGAAATAAT TTTTGATAAT 3 900 

TATCCACACC ACCATACTCC GGCTCAACTA ACTTTTAACG AGAGTTTCTA AACTCCTTCG 3 960 

TCCTCCAGTC TACAAAAGCC TTCCATTCCT ACTATCCTAT ATTTTATGAG GGGACACATT 4C2C 

TTTCCTATCA GACCATTTAT TTTAAAGATA GAAGTAAATC ATAATTGCTT C C AT CTGTT C 4 080 

TTTTATAGTA TATTGAAGTT AG ACT AG AG C ACTGTATCTT CTAAAACATT GATAGAAAGC 414 0 

GATTTGAATT TCCCAATCAA TTTGTTCGTA TTTATAGCAT TTCGAAACTG GAATAGGACA 4 200 

CCATGACTGC TAAAAGATTT CTATAAATTC ATTTAATTTC CTCAATCAAT TTGTTC AT AT 4 260 

CTTATTTCAT TCCGCTATAA TTTCACCTTA CCCTATCTTT TTCGTAGCAC CCTTCAAACA 4 320 

GCCTATCCCC TACCGTTTGA CGATTCCTCA CTTCGCTCCA CTTCCATTAC AGAAGTTTCT 4 380 

TC ACT ACT AT GGGCT CGGCT GACTTCTCAT GATTCCTTGT TACT ACT ATT TGAACGCTCA 4 4 40 

CGAGATAGAT CTTACAAAAA ATGCTTTGAT CCACAATGGA ATCAAAGCAT TTTAAAGAGT 4 500 

TCCTCATACA TAAGCGCAGA AGTCGCAGTT CCTCTCTACT TGGCTTCTTC TCTTTTGACA 4 5 60 

AAGCGAGCCA AGTTGAGCAA CTCAGGTCCT CGATGTTTCC CATTTAGGAC CAATTCACGA 4 62 3 

TTGACCAGGC CTGAGAGACG AACTGCCTGC AATTGCT CAT TTGTAGTAGG CAGTTTTTTA 4 63 0 

GTAGTCTCTA GGAGAGCAGC AACTAAATCT TCACTCAAAT CATGTCGAGC ATGATTGTAA 4 74 0 

AGATCTTTTA TAAGGCTTTC TAGGTTTGGT TCTACCATCC CTACCACCTC CCTTATGGTT 4 8C0 

TAATAATGTT TAATCAAATC AACCGTTGAA CGATCCAATT TCTTCACCAA CGCTTGTAAG 4 860 

AAAGCTTGCG CTTCTAGGAA GTCATCCATT GCATAGAGGG T TTGGTG AG A ATGGATATAA 4 9 20 

CGAGCGCAGA CACCGATAGT TGTTGATGGG ACACCACCAT TTTTCAGATG AGCTGCACCT 4 980 

GCATCTGTTC CGCCTTTACC ACAGTAGTAT TGGTACTTGA TACCAGCTTC TTCAGCCGTT 504 0 

GTCAAAAGGA AATCCTTCAT CCCTGGGAGA AGCAAGTGAC CTCGATCATA GAAACGAATC 5 LOO 

AAGGTTCCAT CTCCAATCTT GCCTTGACCA CCGTAGACAT CACCTGCTGG TGAGCAATCA 515C 

ACTGCGAGGA ACACTTCTGG GTCAAACTTG GTTGTAGAGG TATGAGCGCC ACGCAGACCA 5220 

ACTTCTTCTT GGACGTTAGA ACCCAGATAG AGTTCATTGC CGAGTTTTTG ACCCGATAAA 5280 
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GCTTC AGCT A GCTCGCTTAC CATGAGGACA CCGTAGCGGT TATCCCAAGC TTTTGAGATG 53 4 0 

ATATTTTTTT CATTCGCTGT CAAAATTGCA GAACTATCTC GTACAATGGT ATCACCAGGA 54 00 

CGGATGCCAA AACTTTCTCC CTCAGCCTTG TCCGCAAAAC CACCATCAAA AACGATATCG 54 60 

GCAATGGCTG GCATGGTTGG TCCCCCCTTT CCACGAGTCA AATGCGGAGG AACAGAACCT 5520 

GAAATCACAG GAATTTCATG ACCATCACGA GTCAAGAGTT TGAAACGTTG GCTGCTAACC 5 5 80 

ACCATGGGGT TCCAGCCACC GATTTCTACG ACACGGAAGG TACCATCTGG CTTGATTTCG 564 0 

CTGACCATAA AACCAACTTC GTCCATATGA GAAGCGACCA AGACGCGCGG TGCATCCACA 57 00 

GCTTCTGAAT GTTTGATACC AAAAATACCA CCCAAGCCAT CTGTCACCAC TTCATCCACA 57 60 

TGCGGTGTCA ACTTTTCACG AAGATAAGCA CGGACAGGCG CTTCATGACC TGAGACTGCA 58 2 0 

GCAAGTTCTG TTACTTCTTT AATTTTTGAA AATAATGTTG TCATTTCAGT TCCTTCTTTC 5880 

TTTCATCCAT TTTACCACTT TTTATAGGAG AAGG AT AGTG GGAAGGTGGA TTTCTAAGTT 59 4 0 

AGTATCTTAG TCCTGCTCTA TCTTAGAAAA GGATAGTATT CTCTTGCATC TAGTCCAAAA 6000 

TCTAGTAAAC ATTCCAAAAT TAACTCGAAT ATTTATTTCC AAACAAAAAA ACAATACACC 60 60 

ATCAAAGTTC TTTGGATTTT T CAT G AAATT TACAGAAAAT AGTTCACTTC CCTTTCTTCT 6120 

TTCTTTAAAT ATATAGTTGG TTGAGTTTGG AATAGTACGC TGTAGCTGCT AAAACATTTC 6180 

TAGAAATTAA TTTGACTTTC CTAATAGAGT TGTTCATATC TTATTTCAAT TTACTATAGT 624 0 

ACAAAACTAG AAAAGGAAAA AATCATGACC AGG 627 3 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8171 base pairs 
{ B) TYPE: nucleic acid 

(C) STrtANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ACAACCTTTT TCAAAAACTC ACCTT GGT AC GGAGATGTTT TCCTTTCTGC TATTATTTTC 60 

GGTTATATTC ATATCAATTT TGCTTTAACT CCTCTTGCTT TTTTCATTTA TGCTAGTGGA 12 0 

GGTCTTATTT TAGCTCTATT GTATCGCATG ACT AAAAATC TCTACTATCC AA TACT AG TT 180 

CATATTCTCA TTAATATCAC TGCCTTCTGG GATGTGTGGT TGCTCCTATT TTCAGGAAGT 24 0 

TAGCTTACTA AAATAATGTC GGAACTTTC C GGCATTTTCT TTTTTCACAA ATAGTCAACG 3 00 

TTTTTCTTTT CGATATTGTA GTGGTGTGTA TCCAGTTATT TTTTTGAATT GATTTTGAAA 3 60 
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ATAAGGTTGA 


CTTGACAAAG 


GCAGATAGTG 


AAG A T AGTT A 


AGAAGAATAG 


GATGTTCTTT 


-120 


TTTCCTTTTT 


GGAAAACTTC 


TAAAATATGG 


TATAATGAAA 


AGATAAAGAA 


GTTGGGGGTA 


480 


GAAGATGAAC 


ATTCAACAAT 


TACGCTATGT 


TGTGGCTATT 


GCCAATAGTG 


GTACTTTTCG 


540 


TGAAGCTGCT 


GAAAAGATGT 


ATGTTAGTCA 


GCCGAGTCTG 


TCTATTTCTG 


TTCGTGATTT 


600 


GGAAAAAGAG 


TTGGGCTTTA 


ACATTTTCCC 


TCGGACCAGC 


TCSGGGACTT 


TCTTGACCCG 


660 


TCGTGGGATG 


GAATTTTATG 


AAAAATCGCA 


AGAATTGGTT 


AAAGGATTTG 


ATATTTTTCA 


720 


AAATCAGTAT 


GCCAATCCTG 


AAGAAGAAAA 


AGATGAATTT 


T CTGTTGCT A 


GCCAGCACTA 


780 


TGACTTCTTG 


CCACCAACTA 


TTACGGCCTT 


TTCAGAGCGC 


TATCCTGACT 


ATAAGAACTT 


840 


CCGTATTTTT 


GAATCAACTA 


CTGTTCAAAT 


ATTAGATGAA 


GTGGCCCAAG 


G G CAT AG TG A 


900 


GATTGGGATT 


ATCTACCTCA 


ACAATCAAAA 


TAAAAAGGGG 


ATTATGCAAC 


GCGTTGAAAA 


960 


ATTAGGTCTG 


GACGTCATCG 


AATTG ATTCC 


TTTCCATACC 


CATATTTATC 


TCCGTGAGGG 


1020 


TCATCCTTTA 


GCCCAGAAAG 


AGGAATTAGT 


CAT GG AGG AT 


TTAGCGGATT 


TACCAACGGT 


1080 


TCGTTTCAC7 


CAAGAGAAAG 


ACGAGTACCT 


TTATTATTCA 


GAG AACTTTG 


TCGATACCAG 


1140 


CGCTAGCTCA 


CAGATGTTTA 


ATGTGACAGA 


CCGTGCCACC 


TTGAATGGTA 


TTTTCGAGCC 


1200 


GACGGACGCC 


TATGCGACAG 


GTTCTGGATT 


TTTAGATAGT 


GACAGTGTTA 


ATGGCATTAC 


1260 


AGTT ATTCGT 


CTCAAGGATA 


ACCTAGATAA 


CCGCATGGTC 


TATGTTAAAC 


GTGAAGAAGT 


1J20 


GGAGCTTAGT 


CAAGCTGGGA 


CTCTCTTCGT 


AG AAGTC A TG 


CAACAATATT 


TTGATCAAAA 


1380 


GAGGAAATCA 


TGAAAAAAAG 


AGCAATAGTG 


GCAGTCATTG 


TACTGCTTTT 


GATTGGGCTG 


1440 


GATCAGTTGG 


TCAAATCCTA 


TATCGTCCAG 


CAGATTCCAC 


TGGGTGAAGT 


GCGCTCCTGG 


1500 


ATCCCCAATT 


TCGTTAGCTT 


GACCTACCTG 


CAAAATCGAG 


GTCCAGCCTT 


TTCTATCTTA 


1560 


CAAGATCAGC 


AGCTGTTATT 


CGCTGTCATT 


ACTCTGGTTG 


TCGTG AT AG G 


TGCCATTTGG 


1620 


TATTTACATA 


AACACATGGA 


GGACTCATTC 


TGGATGGTCT 


TGGGTTTGAC 


TCTAATAATC 


1680 


GCGGGTGGTC 


TTGGAAACTT 


TATTGACAGG 


GTCAGTCAGG 


GCTTTGTTGT 


GGATATGTTC 


1740 


CACCTTGACT 


TTATCAACTT 


TGCAATTTTC 


AATGTGGCAG 


ATAGCTATCT 


GACGGTTGGA 


1800 


GTGATTATTT 


TATTGATTGC 


AATGCTAAAA 


GAGGAAATAA 


ATGGAAATTA 


AAATTGAAAC 


1860 


TGGTGGTCTG 


CGTTTGGATA 


AGGCTTTGTC 


AGATTTGTCA 


GAATTATCAC 


GTAGTCTCGC 


1920 


GAATGAACAA 


ATTAAATCAG 


GCCAGGTCTT 


GGTCAATGGT 


C AAGTC AAG A 


AAGCTAAATA 


1980 


CACAGTCCAA 


GAGGGTGATG 


TCGTCACTTA 


CCATGTGCCA 


GAACCAGAGG 


T ATT AG AGT A 


2040 


TGTGGCTCAG 


GATCTTCCGC 


T AGAAAT AGT 


CTACCAAGAT 


GAGGATGTGC 


CTGTCGTTAA 


2100 


CAAACCTCAG 


GGAATGGTTG 


TGCACCCGAC 


TGCTGCTCAT 


ACCAGTGGAA 


CCCTAGTAAA 


2160 



WO 98/18931 



PCT/US97/19588 



275 

TGCCCTCATG TATCATATTA AGGACTTGTC GGGTATCAAT GGGGTTCTGC GTCCAGGGAT 2 2 20 

TGTTCACCGT ATTG AT AAGG ATACGTCAGG TCTTCTCATG ATTGCTAAAA ACGATGATGC 22 80 

GCATCTAGCA CTTGCCCAAG AACTCAAGGA TAAAAAGTCT CTCCGCAAAT ATTGGGCGAT 234 0 

TGTTCATGGA AATCTACCTA ATGATCGTGG TGTAATTGAA GCGCCGATTG GCCGGAGTGA 2 4 00 

AAAAGACCGT AAGAAACAGG CTGTAACTGC TAAAGGGAAG CCTGCAGTGA CGCGTTTTCA 2 4 60 

CGTCTTGGAA CGCTTTGGCG ATTATAGCTT AGTAGAGTTG CAACTGCACA CAGGGCGCAC 2520 

TCATCAAATC CGTGTCCACA TGGCTTATAT CGGCCATCCA GTCGCTGGTG ATG AGGTCT A 2 580 

TGGTCCTCGC AAGACTTTCA AAGG AC AT GG ACAATTTCTT CATGCCAAGA CTTTAGGTTT 2 640 

TACTCATCCG AGAACAGGTA AGACCTTGGA ATTTAAAGCA GATATCCCAG AGATTTTTAA 2 700 

GGAAACCTTG GAGAGATTGA GAAACTAAGA ATGAAAAAGA AATTAACTAG TTTAGCACTT 27 60 

GTAGGCGCTT TTTTAGGTTT GTCATGGTAT GGGAATGTTC AGGCTCAAGA AAGTTCAGGA 2 820 

AATAAAATCC ACTTTATCAA TGTTCAAGAA GGTGGCAGTG ATGCGATTAT TCTTGAAAGC 2880 

AATGGACATT TTGCCATCGT GGATACAGGA GAAGATTATG ATTTCCCAGA TGGAAGTGAT 2 940 

TCTCGCTATC CATGGAGAGA AGGAATTGAA ACGTCTTATA AGCATGTTCT AACAGACCGT 3 000 

GTCTTTCGTC GTTTGAAGGA ATTGCGTGTC CAAAAACTTG ATTTTATTTT GGTGACCCAT 3 060 

ACCCACAGTG ATCATATTGG AAATGTTGAT GAATTACTGT CTACCTATCC AGTTGACCGA 3120 

GTCTATCTTA AGAAATATAG TGATAGTCGT ATT ACT AATT CTGAACGTCT ATGGGATAAT 3180 

CTGTATCGCT ATGATAACCT TTTACAGACT GCTCCAGAAA AACCTCTTTC AGTTATTCAA 3 2 40 

AATATCACAC AAGGGGATGC TCATTTTCAG TTTGGGGACA TGGATATTCA GCTCTATAAT 3 3 00 

TATGAAAATC AAACTSATT" ATCGGGTGAA T T AAA G AAAA TTTCGGATGA CAATTCCAAT 3 3 60 

TCCTTGATTA GCGTGGTGAA AGTCAATGGC AAGAAAATTT ACCTTGGGGG CGATTTAGAT 3 4 20 

AATGTTCATG GAGCAGAAGA CAAGTATGGT CCTCTCATTG GAAAAGTTGA TTTGATGAAG 3 4 80 

TTTAATCATC ACCATGATAC CAACAAATCA AATACCAAGG ATTTCATTAA AAATTTGAGT 3 540 

CCGAGTTTGA TTGTTCAAAC TTCGGATAGT CTACCTTGGA AAAATGGTGT TG AT AG TG AG 3 6 00 

TATGTTAATT GGCTCAAAGA ACGAGGAATT GAGAGAATCA ACGCAGCCAG CAAAGACTAT 3 6 60 

GATGCAACAG TTTTTGATAT TCGAAAAGAC GGTTTTGTCA ATATTTCAAC ATCCTACAAG 3 720 

CCGATTCCAA GTTTTCAAGC TGGTTGGCAT AAGAGTGCAT ATGGGAACTG GTGGTATCAA 3 7 80 

GCGCCTGATT CTACAGGAGA GTATCCTCTC GGTTGGAATG AAATCGAAGG TGAATGGTAT 3 840 

TACTTTAACC AAACGGGTAT CTTGTTACAG AATCAATGGA AAAAATGGAA CAATCATTGG 3 9 00 
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TTCTATTTGA CAGACTCTGG TGCTTCTGCT AAAAATTGGA AGAAAATCGC TGGAATCTGC 3 9 60 

TATTATTT7A ACAAAGAAAA CCAGATGGAA ATTGGTTGG A TTCAAGATAA AGAGCAGTGG 40 2 0 

TATTATTTGG ATGTTGATGG TTCTATGAAG ACAGGATGGC TTCAATATAT GGGGCAATGG 4 080 

TATTACTTTG CTCCATCAGG GGAAATGAAA ATGGGCTGGG TAAAAGATAA AGAAACCTGG 414 0 

TACTATATGG ATTCTACTGG TGTCATGAAG AC AG GTG AG A TAGAAGTTGC TGGTCAACAT 4 200 

TATTATCTCC AAGATTCAGG AGCTATGAAG CAAGGCTGGC ATAAAAAGGC AAATGATTGG 4 2 60 

TATTTCTACA AGACAGACGG TTCACGAGCT GTGGGTTGGA TCAAGGACAA GGATAAATGG 4 3 20 

TACTTCTTGA AAGAAAATGG TCAATTACTT GTGAACGGTA AGACACCAGA AGGTTATACT 4 330 

GTGGATTCAA GTGGTGCCTG GTTAGTGGAT CTTTCGATCC AGAAATCTGC TACAATTAAA 4 44C 

ACTACAAGTC ATTCAGAAAT AAAAGAATCC AAAGAAG7AG TGAAAAAGGA TCTTGAAAAT 4 500 

AAAGAAACGA GTCAACATGA AAGTGTTACA AATTTTTCAA CTAGTCAAGA TTTGACATCC 4 560 

TCAACTTCAC AAAGCTCTGA AACGAGTGTA AACAAATCGG AATCAGAACA G T AGT AG AAA 4 620 

AGAAGGTTTT AGGG CCTTCT 7TTTCCTATC AACTCTTTTC TATTTCCTGT TATTCATCTT 4 6 30 

ATAATGGATA AATATGAATA ATCGGAGTGA GACTATGAAA TACAAACGGA TTGTCTTT AA 4 7 40 

GGTGGGTACT TCTTCTCTGA CAAATGAGGA TGGAAGTTTA TCACGTAGTA AGGTAAAGGA 4 800 

TATTACCCAG CAGTTCGCTA TGCTGCACGA GGCTGGTCAT GAGTTGATTT TGCTGTCTTC 4 860 

AGGTGCCATT GCGGCTGGTT TTCGAGCCTT AGGATTTAAA AAGCGTCCGA CTAAGATTCC 4 92 0 

TGATAAACAG GCTTCAGCAG CGGTAGGGCA GGGGCTTTTG TTGG AAG AAT ATACAACCAA 4 980 

TCTTCTCTTG CGTCAAATCG TTTCTGCACA AATCTTCCTG ACCCAAGATG ACTTTGTGGA 5C4G 

TAAGCGTCGT TATAAAAATG CCCATCAGGC TTTGTCGGTT TTGCTCAACC GTGGGGCAAT 5100 

TCCTATCATC AA7GACAATG ATAGTGTCGT TATTGATGAG CTCAAGGTTG GGGACAATGA 5160 

CACTCTAAGT GCTCAAGTAG CGCCGATGGT CCAAGCAGAC CTTTT AG TTT TCTTGACAGA 5 2 20 

TGTGGACGGT CTCTATACTG GAAATCCTAA TTCAGATCCA AGAGCCAAAC G CTTGG AG AG 5 2 80 

AATCGAGACC ATCAATCGTG AG ATT AT TG A TATGGCTGGT CCAGCTGGTT CGTCAAACGG 5 3 40 

AACTGGGGGT ATGTTAACCA AAATCAAGGC TGCAACTATC GCGACGGAAT CAGGAGTTCC 5 40 0 

TGTTTATATC TGCTCATCCT TGAAATCAGA TTCCATGATT GAGGCGGCAG AGGAGACCGA 5 4 60 

GGATGGTTCT TACTTTCTTG C T C AAG AG AA GGGGCTTCGT AC C C AG AAAC AATGGCTTGC 5 52 0 

CTTCTATGCT CAGAGTCAAG GTTCTATTTG GGTTGATAAA GGGGCTGCGG AAGCTCTCTC 5 5 80 

TCAATATGGA AAGAGTCTTC TCTTATCTGG TATCGTTGAA GCAGAAGGAG TCTTTTCTTA 564 0 

CGGTGATATC GTGACAGTAT TTGACAAGGA AAGTGGAAAA TCACTTCGAA AAGGACGCGT 5 7 00 
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GCAATTTGGA GCATCTCCTT TGGAGGATAT GTTGCGTTCT CAAAAAGCCA ACGGTGTCTT 57 60 

GATTTACCGT GACGACTGGA TTTCCATTAC TCCTGAAATC CAACTACTTT TTACAGAATT 5820 

TTAGAGGTAA ACTATGGTGA G TAG AC AAG A ACAATTTGAA CAGGTACAGG CTGTTAAAAA 5880 

ATCGATTAAC ACAGCTAGTG AAGAAGTGAA AAACCAAGCC TTGCTAGCCA TGGCTGATCA 594 0 

CTTAGTGGCT GCTACTGAGG AAATTTTAGC GGCTAATGCC CTCGATATGG CAGCGGCTAA 6000 

GGGGAAAATC TCAGATGTGA TGTTGGATCG TCTTTATTTG GATGCAGATC GTATAGAAGC 6060 

GATGGCAAGA GGAATTCGTG AAGTGGTTGC CTTACCAGAT CCAATCGGTG AAGTTTTAGA 6120 

AACAAGTCAG CTTGAAAATG GTTTGGTTAT CACAAAAAAA CGTGTACCTA TGGGTGT CAT 6180 

CGGTATTATC TATGAAAGCC GTCCAAATGT GACGTCTGAT GCGGCTGCTT TGACTCTTAA 6240 

GAGTGGAAAT GCGGTTGTTC TTCGTAGTGG TAAGGATGCC TATCAAACAA CCCATGCCAT 6300 

TGTCACAGCC TTGAAGAAGG GCTTGGAGAC GACTACTATT CATCCAAATG TGATTCAACT 63 60 

GGTGGAGCAT ACTAGCCGTG AAAGTAGTTA TGCTATGATG AAGGCCAAGG GCTATCTAGA 64 20 

C CTTCTC ATT CC TCGTGG AG GACCTGGCTT GATCAATGCA GTGGTTGAGA ATGCGATTGT 64 80 

ACCTGTTATC GAGACAGGGA CTGGGATTGT CCATGTCTAT GTGGATAAGG ATGCAGACGA 654 0 

AGACAAGGCG CTGTCTATCA TCAACAATGC TAAAACCAGT CGTCCTTCTG TTTGTAATGC 6600 

CATGGAGGTT CTGCTGGTTC ATGAAAACAA GGCAGCAAGC TTCCTTCCTC GCTTGCAGCA 6660 

AGTG TTGGTT GCAGAGCGTA AGGAAGCTGG ACTGGAACCA ATTCAATTCC GCCTAGATAG 672 0 

CAAAGCAAGC CAGTTTGTTT CAGGTCAAGC AGCTGAGACC CAAGACTTTG ACACCGAGTT 67 80 

TTTAGACTAT GTCCTTGCTG TTAAGGTTGT GAGCAGTTTA GAAGAAGCGG TTCCGCACAT 684 0 

TGAATCCCAC AGCACCCATC ATTCGGATGC TATTGTGACG GAAAATGCTG AAGCTGCAGC 6 90 0 

ATACTTTACA GATCAAGTCG ACTCTGC AG C GGTGTATGTT AATGCCTCAA CTCGTTTCAC 6960 

AGATGGAGGA CAATTTGGTC TTGGTTGTGA AATGGGGATT TCTACTCAGA AATTGCACCC 702 0 

GCGTGGTCCC ATGGG CTTG A AAGAGTTGAC CAGCTACAAG TATCTGGTTG CCGGTGATGG 7 080 

GCAGATAAGG GAGTAAGAGA TGAAGATTGG ATTTATCGGT TTGGGGAATA TGGGTGCTAG 7140 

CTTGGCAAAA TCTGTCTTGC AGACTAGGAC GTCAGATGAG ATTCTCCTTG C C AATCGT AG 7 200 

TCAAGCTAAG GTAGATGCTT TCATTGCAGA CTTTGGTGGT CAGGCTTCCA GCAATGAAGA 7 260 

AATGTTTGCA GAAGCAGATG TGATTTTTCT AGGAGTTAAG CCTGCTCAGT TTTCTGAACT 73 2 0 

GCTTTCTCAA TACCAGACCA TCCTTGAAAA AAGAGAAAGT CTTCTTTTGA TTTCGATGGC 7 3 80 

AGCTGGATTG ACCTTAGAAA AACTAGCAAG TCTTATCCCA AGTCAACACC GAATTATTCG 74 4 0 
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TATGATGCCT AATACCCCTG CTTCTATCGG GCAAGGAGTG ATTAGTTATG CCTTGTCTCC 7 500 

TAATTGCAGG GCTGAGGACA GTGAGCTCTT TTATCAGCTT TT AG C CAAGG CTGGTCTCTT 7 5 60 

GGTTGAACTA GGAGAAAGTT TAATCGATGC AGCGACAGGT CTTGCAGGTT GTGGACCAGC 7 6 20 

CTTTGTCTAT CTTTTTATCG AGGCCTTGGC AGATGCAGGT GTTCAGACAG GATTACCACG 76 9 0 

AGAAATAGCA TTGAAAATGG CAGCACAAAC TGTGGTAGGA GCTGGGCAAT TGCTCCTTGA 7 74 0 

AAGTCAGCAA CATCCTGGAG TATTGAAAGA CCAAGTCTGT AGCCCAGGCG GTTC G ACT AT 7 800 

CGCTGGTGTA GCAAGCCTAG AAGCGCATGC TTTCCGAGGA AC AG T C AT GG ATGCAGTTCA 7 960 

TCAAGCCTAC AAACGAACAC AAGAACTAGG TAAATAAGAG GTAGTTTTGA CTGCCTCTTT 7920 

TATGGTGGCT GAAATGAGAA GACACAAAAA GATTGTCACA AACCCCTATT TTTTTGATAG 7 9 80 

AATAGAAGTA GTAAAAAAGA AATGAGTTAG ACATGTCAAA AGGATTTTTA GTCTCTCTTG 804 0 

AGGGACCAGA GGGAGCAGGC AAGACCAGTG TTTTAGAGGC TCTGCTACCA ATTTTAGAGC 8100 

AAAAAGGAGT AGAGGTGTTG ACGACCCGTG AACCTGGCGG AGTCTTGATT GGGGAGAAGA 8160 

TTCGGGAAGT GATTTTGGAT CCAAGTCATA CTCAGATGGA TGCTAAAACA GAGCTACTTC 82 20 

TCTATATTGC CAGTCGCAGA C AG C ATTTGG TGGAAAAAGT TCTTCCAGCC CTTGAAGCTG 82 80 

GCAAGTTGGT CATCATGGAT CCTTTTATCG ATAGTTCTGT TGCCTATCAG GGATTTGGTC 8 340 

GTGGCTTAGA TATTGAAGCC ATTGACTGGC TCAATCAGTT TGCGACAGAT GGCCTCAAAC 8400 

CCG ATTTG AC ACTCTATTTT GACATCGAGG TCGAAGAAGG GCTGGCTCGT ATTGCTGCTA 84 60 

ATAGTGACCG CGAGGTTAAT CGTTTGGATT TGGAAGGGTT GGACTTGCAT AAAAAAGTTC 8 52 0 

GTCAAGGCTA CCTTTCTCTT CTGGATAAAG AGGGAAATCG CATTCTCAAG ATTGATGCTA 8530 

GTCTCCCTTT GGAGCAAGTT GTGGAAACTA CCAAGGCTGT CTTGTTTCAC GGAATGGGCT 3640 

TGGCCAAATG AAACAAGATC AACTAAAGGC TTGGCAACCA GCTCAGTTTG ACCGTTTTGT 8700 

CCGTATCTTA GAACAAGACC AGCTCAATCA CGCCTATCTC TTTTCAGGTT TCTTTGAAAG 8 7 6C 

CTTGGAAATG GCGCAATTTT TAGCTAAGAG CCTCTTTTGT ACGGATAAAG TTGGCGTCTT 3820 

ACCATGTGAG AAATGCCGAA GTTGCAAGCT GATTGAACAG GGAGAATTTC CCGATGTCAC 88 80 

CTTGATTAAA CCAGTTAATC AGGTCATTAA GACGGAACGC ATTCGAGAAT TGGTGGGTCA 8 94 0 

GTTTTCTCAA GCAGGGATTG AAAGCCAGCA ACAGGTCTTT ATCATCGAGC AAGCGGATAA 9 000 

AATGCATCCC AACGCAGCCA ATTCTCTGCT CAAGGTCATC GAAGAACCCC AGAGTGAAGT 90 60 

TTATATTTTC TTCTTGACTA GCGATGAGGA AAAGATGTTA CCGACAATCC GAAGTCGGAC 9120 

TCAGATCTTC CACTTTAAAA AGCAAGAAGA AAAACTTATC TTACTCTTAG AACAAATGGG 9180 

ACTTGTTAAG AAAAAAGCCA CTCTTTTACC TAAGTTTAGT CAATCGCGAG CTGAAGCAGA 9240 
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AAAGTTGGCT AATCAGCCAA GTTTTTGGAC CTTGGTCGAT GAAACTGAAC GCCTGCTGAC 9 3 00 

TTGGTTAGTA GCTAAGAAAA AAGAAAGTTA TCTACAGGTT GCCAAATTAG CCAACTTGGC 93 60 

AGATGATAAG GAAAAACAGG ATCACCTTTT ACGGATTCTT GAAGTTCTCT GTGGGCAGGA 94 20 

CCTCTTGCAG GTAAGAGTAA GAGTGATTCT ACAAGATTTA CTAGAAGCTA GAAAAATGTG 94 80 

GCAAGCTAAT GTCAGCTTTC AAAATGCCAT GGAATATCTG GTCTTGAAAG AAATATAAAC 954 0 

T C AAAAATG A ATGATAAAGA AAGGAAAGGG CTGTTTTATG GACAAAAAAG AATTATTTGA 9 600 

CGCGCTGGAT GATTTTTCCC AACAATTATT CCTAACCTTA GCCGATGTGG AAGCCATCAA 9 660 

GAAAAATCTC AAGAGCCTGC TAGAGGAAAA TACAGCTCTT CGCTTGGAAA AT AG T AAGTT 97 20 

GCGAGAACGC TTGGGTGAGG TGGAAGCAGA TGCTCCTGTC AAGGCCAAGC ATCTTCGTGA 97 80 

AAGTGTCCGT CGCATTTACC GTGATGGATT TCACGTATGT AATGATTTTT ATGGACAACG 984 0 

TCGAGAGCAG GACGAGGAAT GTATGTTTTG TGACGAGTTG CTATACAGCG AGTAGGCATG 9900 
CAGATTCAAA AAAGTTTTAA GGGGCAGTCT CCCTATGGCA AGCTGTATCT AGTGGCAACG 99 60 

CCGATTGCCA ATCTAGATGA TATGACTTTT CGTGCTATCC AGACCTTCAA AGAAGTGGAC 10020 

TCGATTGCTG CTGAGGATAC GCGCAATACA GGGCTTTTGC TCAAGCATTT TGACATTTCC 10080 

ACCAAGCAGA TCAGTTTTCA TGAGCACAAT GCCAAGGAAA AAATTCCTGA TTTGATTGGT 10140 

TTCTTGAAAG CAGGGCAAAG TATTGCTCAG GTCTCTGATG CCGGTTTGCC TAGCATTTCA 102 00 

GACCCTGGTC ATGATTTAGT TAAGGCAGCT ATTGACGAAG AAATTGCAGT TGTGACAGTT 102 60 

CCAGGTGCCT CTGCAGGAAT TTCTGCCTTG ATTGCCAGTG CTTTAGCGCC ACAGCCACAT 103 20 

ATCTTTTACG GTTTTTTACC GAGAAAATCA GGTCAGCAGA AGCAATTTTT TGGCTTGAAA 103 80 

AAACATTATO CTGAAACACA GATTTTTTAT GAA7CACC7C ATCZ-TCTACC AC AC ACG7T G 10440 

GAAAATATGT TAGAAGTCTA CGGTGACCGC TCCGTTGTCT TGGTCAGGGA ATTGACCAAA 10500 

ATCTATGAAG AATACCAACG AGG TACT AT C TCTGAGTTAT TAGAAAGCAT TGCTGAAACG 105 60 

CCACTCAAGG GCGAATCTCT TCTCATTGTT GAGGGTGCCA GTCACGGTGT GGAGGAAAAG 106 20 

GACGAGGAAG ACTTGTTCGT AGAAATTCAA ACCCGCATCC AGCAAGGTGT GAAGAAAAAC 10630 

CAAGCTATCA AGGAAGTCGC TAAGATTTAC CACTGGAATA AAAGTCAGCT CTACGCTGCC 10740 

TACCACGACT GGGAAGAAAA ACAATAAAGG GAGACAGGAT GTAATAATTC TGTCTGTTTC 10800 

TGTTTAACTT AATTAGTGAT GATAATATAA AG ATGT AT C A CTTGGTATAG AAGCTTTGGT 108 60 

ATTAAGTTTT TTATTAAGCC CATACGGAAT ACCGATGGTT GGAGCAGCAG TTATAGCGTT 10920 

CTTAGAAGGT ATAAATAGAA AAATAAGGTC ATTTTAAATC AAAGGATTGA TAAATCAGAA 10980 



WO 98/18931 



PCT/US97/19588 



230 

AG AAGGTG AT TTTTTGCGAA CATACGAAAA TAAAGAAGAA GWW^'vGCTG AG AT AG AG AA 11040 

AACATTTGAG AAATATATTT TAGAATTTGA TAATATTCCA GAAAATTTAA AAGATAAGAG 11100 

AGCTGATGAA GTTGACAGAA CTCCAGCAGA AAACCTTGCT TATCAGGTTG GTTGGACCAA 11160 

CTTGGTTCTT AAATGGGAAG AAGATGAAAG AAAGGGGCTT CAAGTAAAAA CACCATCGGA 11220 

TAAATTTAAA TGCAATCAAC TTGGTGAATT ATATCAGTGG TTC AC AG AT A CCTACGCTCA 11280 

TTTATCTCTG CAAGAGTTGA AAGCAAAATT AAATGAAAAT ATTAATTCTA TCTCTGCAAT 113 40 

GATTGATTCG TTGAGTGAGG AAGAATTATT TGAACCGCAT ATGAGAAAGT GGGCTGATGA 11400 

AGCGACTAAA ACAGCGACTT GGGAAGTGTA TAAGTTTATT CATGTAAATA CGGTTGCACC 114 60 

TTTTGGAACT TTCAGAACTA AAATC AG AAA ATGGAAGAAG ATAGTATTAT AAA TT AT ATT 11520 

TTTAACTTTA AAAAATTTCA TAAAAATGGT TACCAAAGGC GATAGAAGAA AAA CT AT CGT 11580 

CTTTTTCTTT GCAAATTTTT AAGAAGGGAG GTGATCTTGC ATGGACTTTG AATATTTTTA 11640 

T AAC AG AG AA GCGGAAAGAT TTAACTTCTT AAAAGT AC CG GAG AT ATT AG TTG AT AG AG A 11700 

AGAATTTCGG GCCTTATCAG CAGAAGCAAT TATCCTTTAT TCCATACTTC TTAAACAGAC 117 60 

AGGAATGTCA TTTAAGAATA ACTGGATAGA CAAGGAAGGC AG AG T ATTT A TCTATTTTAC 118 20 

TGTCGAAGAA ATTATGAAAA GAAGAAATAT CTCAAAGCCA ACTGCCATAA AAAC ATT AG A 11880 

TGAGCTTGAT GTAAAAAAGC AATAGGACTG ATCGAAAGAG TAAGGCTTCG ACTTGGTAAG 11940 

CCGAACATCA TTTATGTTAA AGACTTTATG AGTATATTTC AGGTAAAAGA AAATG ACTT A 12 000 

CAGAAGTCAA AAAACTTAAC TTCAGAAGTA AAAGATTTTA ACCTCAGAAG TAAAGAAAAT 12 060 

GAACTTCAAG AGGTTAAGAA CCTTGACTCT AACTATATAG AGAATAATAA GAGTAAGTAT 12120 

AGTAAGAGAG AATATAGTTT TGGTGAAAAC GGACTTGGAA CATTTCAAAA TGTGTTTTTA 12180 

GCTGCTGAAG ATATATCGGA TTTACAAATC ATAATGAACT CACAGCTTGA GAATTACATT 12 240 

AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA ATGCCAGCAA TCAAATTCAT 12 300 

TCGTAATCCG AAG CGTTT AC GATGATTTCG ATAGATTGTT GAAAACATTT T AAAC GTT TT 12 3 60 

TACTTTGGCA AAGATGTTCT CAATCTTGCT TCTCTCCTTG GATAGCGCAT GGTTACAGGC 12420 

TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA CGTGGAGTTT GTACTTGAGG 124 80 

ATATATCTTC ATGAGCCCTT GATAACCACT GTCAGACAAG ATTTTACCAG CTTGTCCGAT 12540 

ATTTCTGCGA CTCATTTTGA AC AACT TC AT ATCACGACAA TAGTTCACAG CGATATCCAA 12 600 

AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC TTCATAGCCT GAAATTTCTT 12660 

TTTACCAGAA TGATTCGCTA ATTCTTTTTT T AGGGCG ATT GATTTTTACT TCCGTCGCAT 1272 0 

CAATCATTAC CGTGTCCTCA GAACTGAGAG GAGTTCTTGA AATCGTAACA CCACTTTGAA 12 780 
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CAAGAGTTAC 
CAGCCGCAAT 
GGTCTTCTAG 
TTTTTAATAC 
ATCGTGC ATC 
TGTTTCGCAG 
TTGTGAGCGT 
AGGAGATCAC 
AATAACAGTC 
TTCTTAACTG 
GCCTTCTTCC 
CAAACAGCCC 
TCGTCTCTCA 
ACCAAATCGC 
CACCCCAGCA 
ATGCTTTGAC 
TCTTGGAATC 
ATGGTT7CTC 
GTGATGAAAT 
7A.*.3"7 "ATGC 
ACGATGCGGC 
AATCACGTTA 
TTATGGATTT 
AAGAAAAGTA 
CTATGCTTTA 
CTCGTGATGG 
CTGGTAGCCT 
GTGTCTTCCA 
ATCGCAATTC 



TTCAACCCAT 
TTGTTCATAA 
GCTTAATTTA 
AGCTAATATC 
AGTTAGTTGT 
GAAGTCTATT 
GGTGCTATTT 
TATGTTGAAT 
CCTATCGATC 
GAATACTCAC 
CTTTTTTTGT 
AGTGGGGTGT 
ATCTTATATC 
AGAGCGTGGT 
TAAAACAGAC 
AAATGCAGTT 
TGCTGAGGCT 
TCAAATGGTG 
TACAGAATTG 

AGCGCCTATT 
TGATAAGGCA 
CCATGAAGCT 
CTTTGAAGGA 
TGGCCCTATG 
AGAATTTAAA 
CTACAATATT 
AATGATTCCG 
TTACATCGAT 



TGGCTCCGAC 
GTTCGATATT 
GGTTTTCGTC 
TCTTCAAAAG 
TTACTTGCTT 
GG AAAGTAAG 
TTTCAGGTAA 
GTTATGCTGT 
CGTTATGTGG 
TATCTCTTTA 
T AT ACT AG T A 
TTTAATATGG 
AATGTTATCG 
ATTCCAGTTA 
AATTTTGCTG 
GGTCTTCTCA 
ACACGTGTTC 
ACCGAAAAAG 
CCGACAGATG 
AAGATTCYTG 
ATCGATGTCA 
GAAGCGGCCT 
TTGGTCAATG 
TGTATCCCTA 
AAGCCAGTCG 
ACACCTTATC 
GTTGGTTTCC 
GGTCTTGAAA 
TCACCAAATC 
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GCATTAACTT 
CTCGCACATA 
CACCTTTTGC 
TCGTGCGCTG 
CATCATTCAT 
AAATATTGAA 
AATAAAATAT 
TTTAGATAAT 
AAAAGATTTT 
CATCAAGAAA 
GAAGAAAAAA 
ACTTAGGTCC 
GTG CTGGTTT 
AACTATATGA 
AGTTGGTTTG 
AGGAAGAAAT 
CTCCAGGTGG 
TTGCCAACCA 
TTATTACCCT 

ACACTATCGA 
ACCTCAATGC 
CAGAAGAAGC 
T CG AAGTC AT 
GTCTTGAGTA 
CGGTTGTGCA 
AGACCCACCT 
ATGCGGAGTT 
TTCTTGAGCA 



GCTTTCCTGA 
TTGAAGAGTG 
GTGTTTAAGT 
AACACCAACA 
AGAACTACTA 
GCTGAGGCTA 
CACGAAGATT 
ACGGTTATAG 
ATAGAGTAGC 
ATGACTAAAC 
TTAGAAAGAT 
CACCCAAAGA 
GGCAGGTTCT 
AATGCGTGGT 
TTCCAATTCT 
GCGTCGCTTG 
TGCCCTTGCA 
CCCCTTCATT 
TATCGCTACT 

TATGAGCAAG 
CCCTATGACC 
ACCGCTTAGT 
GGCCAAACGT 
CCCAGACGAC 
ACTTCCTCAG 
CAAATGGGGA 
TGTCCGTTAT 
GACTTACCGT 



AT AC C AAAAT 
GCCATAAGAA 
TGATAAGCTG 
AGACGCTTAA 
TACCATATTT 
TTAGAAGAAA 
CACAGTTTAA 
CAAGAGATAG 
ATATAATTGA 
AGGGAAGTTT 
TTCTGCCTCT 
GGTATTAGTG 
GAAGCAGCTT 
GTCAAGTCTA 
TTGCGTGGGG 
GGTTCTGTTA 
GTGGACCGTC 
GAAGTGGTTC 
GGTCCTTTCA 
TTTTATTTCT 
GTCTACCTCA 
AAGCAAGAVT 
TCTTTTGAAA 
GGCATTAAAA 
TATACAGGAC 
GATAATGCAG 
GAACAAAAGC 
GGTGTGATGC 
TCTAAGAAAC 



12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13930 
14040 
14100 
14160 
1422C 
14280 
14340 
14400 
14460 
14520 
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AACCAAATCT CTTCTTTGCT GGTCAAATGA CGGGTGTCGA AGGCTATGTT GAGTCGGCGG 14 530 

CTTCAGGCTT AGTTGCGGGA ATTAACGCAG CTCGTCTCTT CAAGGAAGAA AGCGAGGCTA 14 640 

TTTTCCCCGA GACGACAGCG ATTGGAAGCT TAGCTCATTA CATTACCCAT GCCGACAGCA 14700 

AACATTTCCA ACCAATGAAT GTCAATTTTG GGATCATCAA GGAGTTGGAA GGCGAGCGTA 14 7 60 

TCCGTGATAA G AAGGCT CGT TATGAAAAAA TTGCAGAGCG TGCCCTTGCC GACTTAGAGG 14 820 

AATTTTTGAC TGTCTAATTT TTTTGAAAGA ATTGCTCATG AT ACT AT AAA AATCTTAGAA 14 380 

ATTGTGATAA AATAGGTAGG ATGAAAGAAG GAGAGTGAAA ATGGCGAATC CCAAGTATAA 14 940 

ACGTATTTTA ATCAAGTTAT CAGGTGAAGC CCTTGCCGGT GAACGTGGCG TAGGGATTCA 15000 

TATCCAAACA GTTCAAACAA TCGCAAAAGA GATTCAAGAA GTTCATAGCT TAGGTATCGA 15060 

AATTGCCCTT GTTATCGGTG GAGGAAATCT CTGGCGTGGA GAACCTGCAG CAGAAGCAGG 1512 0 

TATGGACCGT GTTCAGGCAG ATTACACAGG AATGCTTGGG ACTGTTATGA ATGCTCTTGT 15180 

GATGGCAGAT TCATTGCAAC AAGTTGGGGT TGATACGCGT GTACAAACAG CTATTGCCAT 15240 

GCAACAAGTG GCAGAGCCTT ATGTCCGTGC ACGTGCCCTT CGTCACCTTG AAAAAGGCCG 15300 

TATCGTTATC TTTGGTGCTG GAATTGGTTC ACCTTACTTC TCGACAGATA CAACAGCGGC 15360 

CCTTCGTGCA GCTGAAATCG AAGCAGATGC CATCCTCATG GCTAAAAATG GTGTCGATGG 154 20 

TGTTTACAAT GCCGATCCTA AGAAAGATAA GACAGCTGTT AAGTTTGAAG AATTGACCCA 154 80 

CCGTGACGTT ATCAATAAAG GTCTTCGTAT CATGGACTCA ACAGCTTCAA CCCTCTCAAT 1554 0 

GGACAACGAC ATTGACTTGC TTGTATTCAA CATGAACCAA CCAGGCAACA TCAAACGTGT 15 600 

CGTATTTGGT GAAAATATCG GAACAACAGT TTCAAATAAT ATCCAAGAAA AGGAATAAGA 15 660 

AAGAATATGG CTAACGCAAT TATTGAAAAA GCTAAAGAGA GAATGACCCA GTCTCACCAA 1572 0 

TCACTTGCTC GTGAATTTGG TGCTATCCGT CCTGGTCGTG CCAATGCAAG CTTGCTTGAC 15790 

CGTGTACATG TAGAATACTA TGGAGTCGAA ACTCCTCTTA ACCAAATCGC TTCAATTACG 15 840 

ATTCCAGAAG CGCGTGTTTT GTTGGTAACA CCATTTGACA AGTCTTCATT GAAAGACATC 15900 

GAACGTGCCT TGAACGCTTC TGATATTGGT ATCACACCGG CTAATGACGG TTCTGTGATT 15960 

CGCTTGGTTA TCCCACCTCT TACAGAAGAA ACTCGTCGTG ACCTTGCTAA AGAAGTGAAG 160 20 

AAGGTCGGCG AAAATGCTAA AGTGGCTGTC CGCAATATCC GTCGCGATCC TATGGACGAA 16080 

GCTAAGAAAC G AG AAAAAG C AAAAGAAATC ACTGAAGACG AATTGAAGAC TCTTGAAAAA 16140 

G AC ATT C AAA AAGTAACAGA CGATGCTCTT AAACACATCG ACGACATGAC TGCTAACAAA 16 200 

GAGAAAGAAC TTTTGGAAGT CTAAAAATAA ACAGAAAAAC TCACTTGGCA TTGCTGG CTG 152 60 

AGTTTTATTC GAAAGAAGGA AATATGAATA CAAATCTTGC AACTTT7ATC GTTGGACTGA 163 20 
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TCATCGATGA AAACGACCGT TTTTACTTTG TGCAAAAGGA TGGTCAAACC TATGCTCTTG 16390 

CTAAGGAAGA AGGCCAACAT ACAGTAGGGG ATACGGTCAA AGGTTTTGCA TACACGGATA I 64 40 

TGAAGCAAAA ACTCCGCCTG ACAACCTTAG AAGTGACTGC CACTCAGGAC CAATTTGGTT 16500 

GGGGACGTGT CACAGAGGTT CGTAAGGACT TGGGTGTCTT TGTGGATACA GGCCTTCCTG 16560 

ACAAGGAAAT CGTTGTGTCA CTCGATATTC TCCCTGAGCT CAAGGAACTC TGGCCTAAGA 16620 

AGGGCGACCA ACTCTACATC CGTCTTGAAG TGGATAAGAA AGACCGTATC TGGGGCCTCT 166 30 

TGGCTTATCA AGAAGACTTC CAACGTCTTG CTCGTCCTGC CTACAACAAC ATGCAGAACC 16 740 

AAAACTGGCC AGCCATTGTT TACCGTCTCA AGCTGTCAGG AACTTTTGTT TACCTACCAG 16800 

AAAATAATAT GCTTGGTTTT ATTCATCCTA GCGAGCGTTA CGCAGAGCCA CCTTTGGCGC 168 60 

AAGT ATT AG A TGCGCGCGTT ATTGGT TTCC GTGAAGTGGA CCGCACTCTG AACCTCTCCC 16920 

TCAAACCACG CTCCTTTGAA ATGTTGGAAA ACGATGCTCA GATGATTTTG ACTTATTTGG 169 80 

AAAGCAATGG CGGTTTCATG AC CTT AAATG ACAAGTCATC TCCAGACGAC ATCAAGGCAA 17 040 

CCTTTGGCAT TTCTAAAGGT CAGTTCAAGA AAGCTTTACC TGGTCTTATG AAGGCTGGTA 17100 

AAATCAAGCA GGACCAGTTT GGGACAGAGT TGATTTAGGG AGGCTTATGA GAAAATCATT 17160 

TTACACTTCG CTCATGACCG AGCGCAATCC TAAAAGTAAC AGTCCCAAAG CAATTTTGGC 17 2 20 

AGACCTCGCT TTTGAAGAGT CAGCCTTTCC AAAACACACA GATGATTTTG ATGAGGTCAG 172 80 

TCGCTTTTTG GAGGAGCATG CCAGTTTCTC TTTTAACCTA GGAGATTTTG ACAGCATTTG 17 3 40 

GCAGGAATAT CTAGAACACT AGCATTTATT CATTGGGTTT GGGCTAGTAA TTTCTCCATC 17 400 

CCTCTG CT AT AATAAAAAGA AATAAAACGA TTAGAGAGGT TCTTTATTTG AAGGAACATT 17 4 60 

CAATAGACAT TCAACTGAGT CATCCAGATC ACCTCTTTCA TCTTTTTCGT TCCAATCAAC 17 5 20 

GCCATCTTCG TTTGATGGAA GAAGAGCTTG ATGT TGTG AT TCATGCTCGT ACGGAGATTG L7 5 80 

T CC AGGTTTT GGGAGAAGAG TCTGCCTGTG AGGAAGCCCG TCAAGTTATT CAGGCTTTGA 17 640 

TGGTCTTGGT AAATCGTGGG ATGACCGTTG GTACGCCAGA TGTAGTCACT GCGATTAGCA 17700 

TGGTCAAAAA TCATGAAATT GACAAGTTTG TCGCCCTTTA CGAAGAAGAA ATTATCAAGG 17760 

ATAATACTGG GAAACCTATC CGTGTCAAAA CCCTAGGGCA AAAGCTTTAT GTGGACAGTG 17 820 

TCAAACAGCA TGATGTGACC TTTGGAATTG GGCCAGCAGG TACAGGGAAG ACCTTCCTTC 17 880 

CAGTGACCTT GGCAGTGACT GCCCTTAAAC GTGGGCAAGT CAAGCGAATT ATCCTAACTC 1794 0 

GTCCAGCGGT GGAAGCGGGA GAGAGTCTTG GATTTCTTCC GCGTCATCTT AAGGAGAAGG 180 00 

TGGATCCTTA CCTTCGTCCT GTTTACGATG CCTTGTATCA AATTCTTGGG AAAGACCAAA 180 6 0 
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CGACTCGTCT 


CATGGAGCGT 


GAAATTATCG 


AAATTGCGCC 


CCTTGCCTAT 


ATCCGTGGCC 


18120 


GGACCTTGGA 


TGATGCCTTT 


GTCATTCTCG 


ATGAGGCGCA 


AAACACGACC 


ATCATGCAGA 


13180 


TGAAGATGTT 


CTT G ACGCGT 


TTAGGTTTTC 


ATTCTAAGAT 


GATTGTCAAT 


GGAGATATTA 


18240 


GTCACATTGA 


CCTGCCACGT 


AATGTCAAGT 


CCGGTTTGAT 


TGATGCTCAA 


GAGAAACTCA 


183C0 


AGAACATCCA 


TCAGATTGAC 


TTTGTTCATT 


TTTCAGCCAA 


GGATGTGGTT 


CGCCATCCTG 


18360 


TTGTCGCTCA 


GATTATCCGA 


GCCTATGAAT 


ATTCTACTGA 


AGTTGCACAC 


GACTGATTTT 


18420 


GAGGAAGTTC 


GCCTGCAAAA 


GAATAGACTT 


GTTCGGTAAC 


TGTAAAAAGT 


GTTATACTAT 


18480 


TTTTATGGAA 


ACAGTATACG 


ACAAAGCACA 


AAAACTTAAC 


TCAAAAAACT 


TCAAACTATT 


18540 


GATTGGTGTC 


AAAAAGGAAA 


CCTTTCAACT 


CATGCTAGAA 


CACCTGAATT 


CAGCCTATCA 


18600 


GATTCAGCAC 


CGAAAAGGTG 


GACGTCCACG 


TAGTCTGCCC 


ATGGAAGACC 


AGCTCATTAT 


18660 


GACCCTCCGT 


TACTTGCGAT 


ATTATCCCAC 


TCAGCGTCTG 


CTCCCCTTTG 


ATTTTGGCGT 


18720 


CGGTGTAGCT 


ACGGTAAATG 


CCATCATCAC 


TTGGGTGGAG 


GATACACTTC 


GTGCGTCAGG 


18780 


TAGCTTTGAT 


TTGG AC C ATT 


TAGAAGCCCC 


GAGTGCTGCT 


GTGGCTATTG 


ACGTCACCGA 


18840 


AAGTCCGATT 


CAGCGTCCAA 


ACAAAACCAA 


AGCAAAAATT 


ATTCTGGTAA 


AAAG AAACG A 


18900 


CACACCTTAA 


AAACTCAAAT 


TATGCTGGAT 


TTGACGACAC 


AT AAAG TCTG 


TCAAATGGCC 


18960 


TTTTCTG AC G 


GACATACGCA 


TGATTTTACT 


CTCTTCAAAG 


AAACTATTGG 


ACAAAGTTTG 


19020 


C CTG AAACG A 


CGCTTGCCTT 


TGTTGACCTA 


GGTTATTTAG 


GCATCTTGAA 


ATTTCATGAG 


19080 


AATACTTTCA 


TTCCTGCTAA 


AAATTCCAAA 


AATCGCCGCC 


TGAGTGAGGA 


TGATAAGCAG 


19140 


TTAAATAAAG 


AGATGTCAGC 


GATACGAATT 


GAAATTGAAC 


ATTTTAACGC 


TAAATTCAAG 


19200 


ACCTTCCAAA 


TCATGT C ACT 


CCCTTATCGT 


AACCGCAGAA 


AACGTTTCGA 


GTTACGGGCG 


19260 


GAATTAATTT 


GTGCCATCAT 


CAATTATGAA 


GTGAACTAGA 


TTCCGAACAA 


GTCTAATATA 


19320 


CTTTTGAGAG 


AGGAAAATCC 


AGTTG TAT AG 


GCTAAAGGTT 


TTATCCAAAG 


G T CT GAG AC A 


19380 


ACGATTAGGC 


ACGATGGAAA 


GAACTTTTAT 


GTGGCTGATG 


ACGATCAGTG 


CATCTTCCTG 


19440 


TGTCATAATC 


ACAGGGCACA 


AGAAAGTAGG 


AATTTGAAAA 


GATGATTGAC 


CAACTATCTA 


19500 


AGTATTACAG 


TTGTAGGATA 


CTAACTGAAA 


AGGATATTCC 


AAGTATTTTA 


TCTTTATATG 


19S60 


AAAGTAATCC 


TCTGTATTTT 


CAGCATTGTC 


CACCAGAGCC 


AAATTTTGCA 


A C T GT AAAAG 


19620 


AGGACATGCT 


TTGTCTACCT 


GAAGGTAAAG 


CTAAGGCTGA 


TAAGTTTTTT 


GTTGGATTTT 


19680 


GGAATGGATC 


TGACCTTGTG 


GCTGTTATGG 


ATTTTCTCTA 


TGCATATCCT 


GATGAGGAGA 


19740 


CTGTTTTTAT 


TGGTTTGTTT 


ATGGTTGATC 


AAGCCTATCA 


GAGAAAAGGG 


ATTGGTAGTC 


19900 


ATATTGTGAC 


AGAAGCACTA 


GCTTATTTTG 


CTAAGAACTT 


TCGAAAGGCA 


CGTTTGGCTT 


19860 
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ATGTTAAGCG AAATCCCCAA TCTCAGCATT TTTGGGAAAA GCAGGGCTTT AAATCAATTG 1992 0 

GATGCGAGGT TAAGCAAGAA CTCTATACGG TTGTTATCGC TGAACAGAGC CTAGAAGATT 19980 

AGAAATGGCA TCAAGTAAGA ACT ATTTGG A ATTTGTTTTG GAACAATTAT CAGGATTAGA 2 0040 

TGATGTGACT TACCGTTCCA TGATGGGGGA GTATATTCTT TACTTCCGCG GCAAGATTAT 20100 

TGGCGGCATT TATGACGATC GCTTTTTAGT TAAACCCGTG CAAGCAGTCT TAGATAAGAT 20160 

TGACCAATCT TCTTTTGAGT TTCCATACAA AGGTGCCAAA GAAATGATTT GAGTGGAAGA 2022 0 

ACTTGATAAT AAGATGTTTC TATAAGACCT AATTTTAGCT ATGTATAACC AACTGCCAAC 2 02 80 

GCCCAAACCT AAAAAGAAAA AGCAAGGGTG AACGAAGTAA AAAAGAAGTC TGCTAAGGCC 203 4 0 

CTGTCTTTGC ACGGGTAAAA TTTTATATAT AAAAAGAAGC TGGGACTAAA GAGCTCACCT 204 00 

TCCTTTGGTT TATATAATTG TCATTACAAG ACGAAGTGGT TGGGCGAAAC TCTGTTGACT 204 6 0 

TTATTCAATT TAGAGTTTCT TATGCACAAT TGAGTCTGGA ACGAAAGTCT CCAGTTCCAA 205 2 0 

AGTATACACT ACAATAAACC AACGATGTAA TACCTGATGA CACAAAGCAC AGTGGGTAGG 205 80 

ACTTGCGAAG TCACCCTTTT CTTTTCAAAA TTTATACTAA ATCATTGATA TCAGTGTACT 2 064 0 

CACGATTAAG TCCTTGAGCA ACTGGTAGGT TAGTCAAGTA ACCTTGATAA GTAGTCACAC 2 0700 

CTTGACGCAA GCCTTCATCT TCAGAGATTG CTTGTGCGAA TCCTTTGCCA GCCAAAGCTT 207 60 

CCATATAAGG AAGAGTGACA TTGGTTAGGG CGATGGTTGA AGTGCGAGCA ACCGCACCAG 2 0820 

GGATATTGGC AACGGCATAG TGGAGAACAC CGTGTTTTTC ATAGACGGCT TCATCGTGCG 2 0880 

TTGTCACACG GTCAGCTGTT TCCATAACGC CACCTTGGTC AACAGCAACG TCAACGATAC 2094 0 

AGAGCCTGGA CGCATTTCTT TGACCATCTC ATCTGTCACC AATTCCGGTG CTTTTGCACC 21000 

AGGGATGAGA ATGGCTCCAA TCACCACATC AGCATCTCTC ACACTTGCTT CAATGTTGAA 210 6 0 

TGAATTAGAC ATAAGAGTTT GAATTTGACT TCCAAAGACT TCTTCTAGAA CTGAGAGACG 21120 

CTTGGAACTA ATATCTAAAA TAGTCACTTG AGCACCAAGA CCAAGGGCGA TCCGCGCAOC 21180 

ATGTGTACCG ACGACACCAC CACCGATGAT AGTTACTTTT CCTTTTGGAA CACCTGGTAC 212 40 

ACCACCAAGT AGAACACCAG AGCCACCAGC TTGCTTAGTA AGGAAGTGAC CTCCGATTTG 213 00 

AACAGCCATA CGACCTGCAA CCTCACTCAT AGGAACGAGG AGCGGTAGTT GTCCTTGATT 213 60 

GTCACGAACA GTTTCAGTTG TTTTTGCTGT TAACATAGCA TCTGCTAATT CTGGAGCAGC 214 20 

GGCCATGTGC AAGTAGGTGA AGAGAAGAAG ATCGTCGCGC AAGTAACCCT ATTCAGAACT 214 80 

TAAAGATTCT TTTACTTTCA CAACCAACTC TGCTGCCCAA GCTTCACCAG CAGTAGCGAC 21540 

AATCTCAGCT CCTTCCTTTT GATAGTCAGC ATCAGTAAAG CCAGAACCGA GACCAGCATT 21600 
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TGTTTCGATA AGGACACGAT GACCACGACT AACTAAGCTA TGAACACCTG CAGGTGTGAG 21560 

GGCGACACGG TTTTCGTTAT TTTTAATTTC TTTTGGGATT CCGATTAACA TTGAGATAAC 217 20 

CTACCTTTCA ATTGACGGTC TTGTTTTGGT TGTCACATTC CAGTTCATAA ATCAAAAATG 21790 

TGACGGTTTC ATTGTATATG AAACCGCTTC AAAAATCAAG AAAAACTTGT CATCCAAATT 21840 

TTTTTATGCT AGACTAGTGA AAATCAAGCT CTAATGGAGG GAAAAGTATG GAATCAATAT 21900 

TTGTGAAATT TGCCCAGTAT CCGTCTATAG AAACGGAGCG TTTATTGCTC AGACCTGTAA 21960 

CTTTGGATGA TGCGGAAcAA TGTTTGACTA TGCCTCGGAC AAGGGTAATA CACGTTACAC 2 2020 

TTTTC C AACC AATCAAAGCT TGGAAGAAAC CAAGAATAAC ATTGCTCAGT TCTACTTGGC 22030 

TAATCCCTTG GGACGTTGGG GAATAGAACT AAAAAGCAAT GGTCAGTTTA TTGGAACCAT 2214 0 

TGACTTGCAC AAGATTGATT CTGTTCTTAA GAAGGCAGCT ATTGGCTACA TTATCAATAA 2 2 200 

AAAGTATTGG AATCAAGGAT TAACGACAGA AGCCAATCGT GCTCTGATTG ACCTACCTTT 222 60 

TGAGAAGATA GGGATGAATA AGTTGACTGC CCTTCACGAT AAGGCTAATC CCGCGTCAGC 22320 

AAAGGTCATG GAGAAATCAG GCATGCGTTT TTCCCATGCA GAACCATATG CTTGT ATGG A 2 2 380 

CCAGCATGAA AAAGGCCGAA TCGTGACAAG ACTTCATTAT GTCTTGACCA AGGAAGACTA 22440 

TTTTCCAAAT AAATAAGCAG TTGAAAAGAA ATTTTTCGAC TGTTTTTTCT TCCTCTTACG 22 500 

AATAATCTAA GAG AG GAG AA AATATGGAAG CAATTATCGA GAAAATCAAA GAGTATAAAA 225 60 

TCATCGTCAT CTGTACTGGT CTGGGCTTGC TTGTAGGACG ATTTTTCCTG CTAAAACCAG 22620 

CTCCACAAAC ACCTGTCAAA GAGACCAATT TGCAGGCTGA AGTTGCAGCT GTTTCCAAGG 22680 

ACTCATCGAC CGAAAAGGAA GTGAAGAAGG AAGAAAAGGA AGAACCCCTT GAACAAGATC 227 40 

TAATCACAGT AGATGTCAAA GGTGCTGTCA AATCCCCAGG GATTTATGAC TTGCCTGTAG 22 900 

GTAGTCGAGT CAATGATGCT GTTCAGAAGG CTCCTGGCTT GACAGAGCAA GCAGACAGCA 228 60 

AGTCGCTCAA TCTAGCTCAG AAAGTT AGTG ATGAGGCTCT GGTTTACGTT CCTACTAACC 22 920 

GAGAAGAAGC AGTTAGTCAA CAGACTGGTT CGGGGACAGC TTCTTCAACA AGCAAGGAAA 22 9 80 

AGAACGTCAA TCTCAACAAG GCCAGTCTGG AAGAACTCAA CCAGGTCAAG GGACTGGGAG 2 3 040 

GAAAACGAGC TCAGGACATT ATT G AC CATC GTGAGGCAAA TGGCAAGTTC AAGTCAGTAG 2 3100 

ACGAGCTCAA GAAGGTCTCT GGCATTGGTG GCAAAACAAT AGAAAAGCTT AAAGACTATG 23160 

TTACAGTGGA TTAAGAATTT CTCTATTCCC CTAATTTACC TGAGTTTTCT ATTACTTTGG 23220 

CTTTATTACG CTATTTTCTC AGCATCTTAT CTTGCTTTGT TGGGCTTTGT TTTTCTGCTA 2 3280 

GTCTGTCTCT TTATCCAATT T C CGTGG AAA TCTCCTGGTA AAGTTCTAAT AATTTGCGGA 2 3 340 

ATCTTTCGAT TTTGGTTTGT TTTTC AAAAT TGGCAACAGA GTCAAGCCAG TCAAAATCTG 2 3 400 
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CCGGATTCTG TTGAAAGGGT ACGGATTTTG CCTGATACTA TTAAGGTTAA TGGTGATAGT 2 34 60 

CTATCCTTTC GTGGCAAGTC TAACGGTCGT GCTTTCCAAG T CT ATT AT AA ACTCCAGTCC 2 3520 

GAGGAGGAGA AAGAAGCCTT TCAAGCTTTA ACTGACCTGC ATGACATACG AC T AG AAGGG 23580 

AACCTTTCGC AGCCAGAAGG GCAGAGAAAT TTTGGTGGCT TTAATTACCA AG C CT ATCTG 23640 

AAGACTCAGG GAATTTACCA GACTCTCAAT ATCAAAACAA TCCAGTCACT TCAAAAGATT 2 3700 

GGCAGTTGGG ATATAGGAGA AAACTTGTCC AGTTTACGTC GAAAGGCTGT GGTTTGGATT 23 760 

AAGACGCACT TTCCAGACCC TATGGGCAAT T AC AT G AC AG GACTCTTGCT GGGACATCTG 2 3 820 

GACACCGACT TTGAGGAGAT GAATGAGCTT TATTCCAGTC TAGGAATTAT CCACCTCTTT 23 8 80 

GCCCT ATCTG GCATGCAGGT AGGTTTTTTC ATGAATGGAT TTAAGAAACT TCTCTTGCGA 2 3 940 

TTGGGCTTGA CCCAAGAAAA GTTGAAATGG CTGACTTATC CCTTTTCCCT TATCTATGCG 2 4 000 

GGACTAACTG GATTTTCAGC ATCGGTTATT CGCAGTCTCT TGCAAAAGCT ACTGGCTCAA 2 40 60 

CATGGGGTTA AGGGCTTGGA TAATTTTGCC TTG ACGGTG C TTGTCCTCTT TATTGTCATG 2 412 0 

CCAAACTTTT TC TTG AC AG C AGGAGGAGTC TTGTCCTGCG CTTATGCTTT TATCCTGACC 2 4180 

ATGACCACCA AAGAAGGGGA GGGGCTCAAG GCTGTTACTA GTGAAAGTCT AGTCATCTCC 24240 

TTGGG CAT AT TGCCCATTCT ATCCTTCTAT TTTGCGGAAT TTCAACCTTG GTCTATCCTT 24 3 00 

TTGACCTTTG TCTTTTCCTT TCTTTTTGAC TTGGTCTTCT TACCGCTCTT GTCTATCTTA 24 3 60 

TTTGTCCTTT CCTTTCTCTA TCCAGTCATT CAGCTGAACT TTATCTTTGA ATGGTTAGAC 24 4 20 

GGCATTATTC GCTTGGTCTC CCAGGTGGCA AGG AG AC C AC TTGTCTTTGG TCAACCCAAC 24480 

GCATGGCTTT TAATCTTATT GTTAATTTCC TTGGCTTTGG TCTATGATTT GAGGAAAAAC 24 540 

ATTAAAGGAT TAACAGTATT GAGTTTATTG ATTACAGGTC TCTTTTTCCT TACCAAGTAT 24 600 

CCACTGGAAA ATGAAATCAC CATGCTGGAT GTQGGGCAAG GAGAAAGTAT TTTCTACGGG 24 660 

ATGTAACTGG GAAAACCATT CTCATAGATG TAGGTGGTAA GGCAGAATCT TATAAGAAAA 2 4720 

TCAAAAAATG GCAAGAAAAG ATGACGACCA GCAATCCCCA GCGAACCTTG ATTCCCTATC 24780 

TCAAAAGTCG AGGAGTAGCT AAGATTGACC AGCTAATTTT GACTAACACG GACAAGGAGC 2 4 840 

ATGTTGGAGA TTTGTCAGAG ATGACCAAGG CTTTCCATGT AGGGGAGATT CTAGTATCAA 2 4 900 

AAGACAGTCT GAAACAGAAC GAATTTGTGG CAGAACTACA GGCGACTCAA ACAAAGGTGC 24 960 

GTAGTATGAT AGTAGGGGAG AACTTGCCCA TTTTTGGAAC TCAGTTAGAA GTTCTATCTC 2 5020 

CAAGGAAAAT GGGAGATGGA GGACACGATG ATACCCTAGT TCTGTATCCG AAATTCTTGG 2 50 80 

ATAAGCAATT TCTCTTCACG GGAAATTTGG AGGAGAAAGG AGAGAACGAC TTGCTGAAGC 25140 
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ACTATCCAGA 


CTTGAAAGTA 


AATGTTTTGA 


AAGCTAGCCA 


ACATGGCAAT 


AAAAAATCAT 


25200 


CAAGTCCAGC 


CTTTCTAGAA 


AAACTCAAAC 


C AG AG CTT A C 


TCTTATCTCA 


GTTGGAAAGA 


25260 


GCAATCGAAT 


GAAACTCCCC 


CATCAGGAAA 


CATTGACACG 


ACTGGAACGT 


ATCAATAGCA 


25320 


AAGTTTATCG 


AACTGACCAG 


CAAGGAGCTA 


TACGTTTTAA 


GGGGTTGGAT 


AGTTGGAAAA 


25380 


TCGAAAGTGT 


TCGATAGGAA 


GGATAAATGT 


TGTAG ATT AG 


TGAAATAAAC 


TAAAAATTTG 


25440 


TTGCATAATA 


ATGATAAAAA 


TGGTATAATG 


AAAACGTATT 


CAATATTGAG 


GATATAAAAT 


25500 


CATTAAAAAT 


CAGCAAAAGT 


TGTTTTATTA 


GTTAGTTTAT 


AATCTATTGG 


TCTTCTTCAG 


25560 


TCCAGTGTAT 


CTGCTGTGAC 


AGTC ACT AAA 


AGTTACAAGT 


ATGATTGGAA 


TACGGTTTGG 


25620 


GAATATAGTA 


CCAACTATCA 


CGACCATCAG 


TAT G CTT GG A 


TTCCGTCATG 


GTCTCGTTAT 


25680 


GACAGCTATT 


CTGAGTATAA 


AGTTGGCGGA 


GGCTGGAACT 


ACGCTCGTTA 


TGAGGTCATA 


25740 


AACTATTACA 


GCGGAGGCTA 


TTAATTCTTA 


AAGAGTGAGA 


AAAAGGAGGG 


CTAGATATGT 


25800 


TGCAGCTTAC 


TCATGTGACC 


TTAAAAACGC 


GACAAGTCAT 


CTTGCAAGAT 


GTGGATTTCA 


25860 


CCTTTAAAAA 


GGGTAGGGTT 


TATGGTCTTC 


TTGCTATCAA 


TGGCTCTGGA 


AAGACGACCC 


25920 


TGTTCCGTGC 


CATTAGCAAT 


TTAATTCCCA 


TAAGTAGTGG 


AAATATCGCA 


GCCCCTCCTT 


25980 


CTTTATTTTA 


TTATGAGAGT 


ATTGAATGGC 


TGGATGGAAA 


CTTAAGTGGG 


ATGGACTACC 


26040 


TTCGTCTTAT 


CAAAAACATC 


TGGAAGTCAG 


GTCTGAACTT 


GAGGGATGAA 


ATCGCCTATT 


26100 


GGGAAATGTC 


TGACTATATC 


AGTCTTCCCA 


TTCGCAAGTA 


TTCCTTAGGC 


ATGAAGCAAC 


26160 


GCTTGGTGAT 


TGCCATGTAT 


TTCCTCAGTC 


AGGCCAAATG 


CTGGCTCATG 


GAT GAG ATT A 


26220 


CAAATGGCTT 


AGATGAGTAT 


TATCGACAGA 


AGTTT TTTG A 


TAGGCTAGCA 


CAAATCGATA 


26280 


GACAAGAACA 


GCTGGTTCTT 


TTAAGTTCCC 


ACTATAAGGA 


AGAGTTGGTT 


GATGTCTGCG 


26340 


ATAGAGTAGT 


AACCATTCAT 


CAGGGGCAGA 


TAGAAGAGGT 


TTAGTTTATG 


AAAGATGTTA 


26400 


GTCTATTTTT 


ATTGAAAAAA 


GTTTTCAAAA 


GCCGCTTAAA 


CTGGATTGTC 


TTAGCTTTAT 


26460 


TTGTATCTGT 


ACTCGGTGTT 


ACCTTTTATT 


TAAATAGTCA 


GACTGCAAAC 


TCACACAGCT 


26520 


TG G AG AG C AG 


GTTGGAAACT 


CGCATTGCAG 


CCAACGAGAG 


GGCTATCAAT 


GAAAATGAAG 


26580 


AGAAACTCTC 


CCAAATGTCT 


GATACCACCT 


CGGAGGAATA 


CCAGTTTGCT 


AAAAATAATT 


26640 


TAGACGTGCA 


AAAAAATCTT 


TTGACGCGAA 


AG AC AG AAAT 


TCTGACTTTA 


TTAAAAGAAG 


26700 


GGCGCTGGAA 


AGAAGCCTAC 


TATTTGCAGT 


GGCAAGATGA 


AGAGAAGAAT 


TATGAATTTG 


26760 


TATCAAATGA 


CCCGACTGCT 


AGCCCTGGCT 


TAAAAATGGG 


CGTTGACCCC 


GAACGGAAGA 


26320 


TTTACCAAGC 


CCTGtATCCC 


TTGAACATAA 


. AAGCACATAC 


TTTGGAGTTT 


CCGACCCACG 


26880 


GGATTCATCA 


. GATTCTCTGG 


ATTTTAGAGG 


; TTATCATCCC 


AAGTTTGTTT 


' GTGGTTGCTA 


26940 
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TTATTTTTAT CCTAACACAA CTATTTGCAG AAAGATATCA AAATCATCTG GACACAGCTC 27000 

ACTTATATCC TGTTTCAAAA GTGACATTTG CAATATCCTC TCTTGGAGTT GGAGTGGGAT 27060 

ATCTAACTGT GCTGTTTATC GGAATCTGTG GCTTTTCTTT TCTAGTGGGA AGTCTGATAA 27120 

GTGGTTTTGG ACAGTTAGAT TATCCCTACC CAATTTATAG CTTAGTGAAT CAAGAAGTAA 27180 

CTATTGGGAA AATACAAGAT GTATTATTTC CTGGCTTGCT CTTAGCTTTC TTAGCCTTTA 27240 

TCGTCATTGT GGAAGTTGTG TACTTGATTG CTTACTTTTT CAAGCAAAAA ATGCCTGTCC 27 300 

TCTTTCTTTC ACTCATTGGG ATTGTTGGCT TATTGTTTGG TATCCAAACC ATTCAGCCTC 27 360 

TTCAAAGGAT TGCACATCTG ATTCCCTTTA CTTACTTGCG TTCAGTGGAG ATTTTATCTG 27420 

GAAGATTACC TAAGCAGATT GATAATGTCG ATCTAAATTG GAGCATGGGA ATGGTCTTAC 27 480 

TTCCTTGCCT GATTATCTTT TTGCTATTGG GAATTCTATT TATTGAAAGA TGGGGAAGTT 2 7 540 

CACAGAAAAA AGAATTTTTT AATAGATTCT AGCTTTCCTA TAGGTAGGGA AAATAAGTAA 27600 

AAACTAACAT ACAGAGCGAA TCAACTTGAT TCTCTCTTTT TGATTCGAAA ACCAAACCAA 2 7 560 

AATACAAACA CAAACTTTTC AAAAAATAAC TTTTTATCTT GACAAGAGCT AGAAAACTTG 2 7720 

GTATCATATA AAAGTTG AG A AAAGCAGAAG TGAGAGCTTC TCGCCTTGTG ACATTAAGTT 277 80 

GCCTGGCCCT ACGGATGAAA AGTTTCGAAG AAACGCTATC ATAACGTGCG GGCTTGTATA 2 7 840 

TTTACAAGTC CGCTATTGTT TTTCTCTAAT AAAACAAAAG AGGTGAAAAC CATAGCAAAG 2 7 900 

CAAGACTTAT TCATCAATGA TGAGATTCCT C7ACCTGAAC TTCGCTTGAT TGGTCTTGAA 27960 

CGAGAACAGC TAGGTATCAA GCCACTCAGT GAAGCGCAAG CTTTGGCTGA TAACGCTAAT 2 8020 

GTTGACCTAG TATTGATTCA ACCCCAAGCC AAACCGCCTG TTGCAAAAAT TATGGACTAC 2 8080 

C GT AAC TT C A AAT7?CAC7\ C T. AG A AG AA Z :AAUA:A^.C AACGTAAAAA \ TA-AA^CCTT 2 3140 

GTT ACT GTG A AAGAAGTTCG TCTAAGTCCG G 2 8171 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 7147 base pairs 
< 8) TYPE : nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 23: 
CCGCTCAACT TTTGCAATCA AGGCTAAGTA GACAGCAGCA AATTTCATAT TGTATAATTT 60 
CTGACTCATA CTTCTCTCTT TCTATGTGTA CTAGTATAAA TAAGAAAAAG AAGGCCGTCA 12 0 



WO 98/18931 



PCT/US97/19588 



290 

AGCCTTCTTT TGATTTATTC TTCTG CTT C A TCTTCTGTAA ATTGACTATT GTACAAGTCA L30 

GCGTAGAAGC CACCTTGCGC CATCAGTTCC TCATAGTTGC CTTGCTCGAT GATATTTCCA 240 

TCTTTCATGA CCAAGATCAA GTCTGCATTT CGGATGGTTG ACAAGCGGTG GGCAATGACA 3 00 

AAGGATCTGC GTCCTTCCAT CAAACGGTCC ATGGCTTTTT GGATCAATTC CTCTGTCCGT 3 60 

GTGTCAACAG AAGAAGTCGC CTCATCCAAA ATCAAAAGCG GTGCATCCTT AAGAAGGGCA 4 20 

CGAGCAATAG TCAATAGTTG TTTTTGTCTT ACAGACAAGG TCACGGTGTC ATCCAAGATG 480 

GTATCATAGC CATCTCGCAA GGTCATAATA AAGTGGTGAA TTCCCACAGC CTT ACT AG CT 54 0 

TCCATCATTC GTTCATCACT AATCCCTATT TG ATT AT AG A TGAGATTGTC TCGAATAGTT 600 

CCTTCAAAGA GCCAGGTATC CTGCAAGACC ATTGAAAAGG CATCATGCAC TTCTGAACGC 6 60 

GTCATAGCCT TGGTATCCAC AC CAT C AATG CGAATACTTC CCTTATCAAT CTCATAGAAT "7 20 

TTCATCAAAA GATTGACAAT GGTTG T CTT A CCAGCCCCAG TCGGCCCAAC AATGGCAACC 1BQ 

TTTTGACCAG CATGAGCTGT CGCAGAGAAG TCATAGTCTT GAACATTGAC ACCGTCCACC 84 0 

AGAATTTCTC CTGCTCACAC GTCGTAGAAA CGTGGAATCA GATTGACCAG AGTTGATTTA 900 

CCAGAACCTG TTCACCCAAT AAAGG CC ACT GTTTGACCAG TTTCTGCTTT AAAGCTAACA 9 60 

TGTTCAATAA CTGCCTCCGA ATTTGCCGCA TAGCGgAAGG TCACATCCTT AAACTCGACC 102 0 

TGACCTTTGA AGTTTTCATC AGTCAGCTGC ACTTG AAC AG GGTTTTGGAT AGAAGAATGC 1080 

AAATCTAAAA CTTGATTAAT CCCCTTAGCA GAGACCATAG TTCGGGGAAG AACGATGAAG 114 0 

AGTGCTCCCA TGAGAAGGAA GCCCATGACA ACCTACATGG CATAAGACAT GAAAACAATC 1200 

ATGTCACTAA AGAGAGGCAG ACGCGCTATC GGACCAGCGT CGTTAATCAC ATAGGCCCCA 12 60 

ATCCAGTAAA TCGCCACACT CAAACCACTT GAAATCCCCA TCATCATAGG ATTCAAAATA 132 0 

GCCATAAGAC GGTTG ACAAA CAAATTCAAA CGGGTCAATT CATCATTTAC TGCTGCAAAT 13 90 

TTTTCATTTT GATAATCCTC TGCATTGTAG GCACGAACGA CACGAATACC TGTTAAACTC 144 0 

TCACGAGTGA TACTGTTCAG TTTATCTGTC AGCCCCTGAA TCAAGGACTG TTTTGGAAAG 1500 

GCTAGCGTCA TCAAAACGGT CGTCATCAGG ACGTTGATAA TCACTGCCAC AAGTACGGCC 1560 

CAGAGCCAGT ATTCTGAATG ACCTAAAATC TTCCCAATAG CCCAGATAGC CATAATTGAA 162 0 

CCACGCGTTA CCACTTGCAA GCCCATAGTA AT C AAC AT TT GAACTTGAGT AATGTCATTG 1680 

GTAGTACGCG TCAAGAGGCT AGGAATTGAA AATTTCTTAA TCTCTGTCTG CGAGTAATCC 17 4 0 

AAAACTCGGT T AAAAAT AT C ACTTCTCAGC CTACTAGTAT AAGAAGCCGC C A CTCGG G AT 1900 

GCAAAAAATC CAACTGCAAC TACGGACAAG AAGGCAAGAA AGGACATTCC CATCATCATG 18 60 

CTTGCCGACT GCCACAACTC ATCTAAATTA GTTTCTTGAC TACCTAGCAA ATCCGT AATT . 192 0 



WO 98/18931 



PCT/US97/19588 



291 



TTCGAGATAT 


AGGTCGGCAC 


TTCCAACTCT 


AGATAGACCG 


AAAAGCAAGT 


AAAGAGAATG 


1980 


GCTAGTAAAA 


TCATCCCCCA 


TTCTTTTCTA 


CTAATTCTTT 


TGGCTAATTT 


CTTTATTCTC 


2040 


TCCTCCTATT 


CCCTTGATAT 


TTTGCCTGTA 


GTTGACCGAG 


AACCTTCTCA 


AAAATCAGTA 


2100 


ATTCATCTTC 


AT C AATGTCT 


TCCATCAACT 


GCTTGTCTAT 


GCGTTCAAAA 


AAAGCCTTAA 


2160 


CCTGTTGCAT 


CTGAGAACGT 


GCTTTGTCCG 


TCAGACGAAC 


AAACTTAGCC 


CGCTTATCAA 


2220 


CAGGACTCGC 


CTCCAATTCC 


ACCAAACCAT 


TTTCCACTAT 


ACGCTTAACC 


AG ATT ACT AG 


2280 


CAACAGGCTT 


GGTAATATTG 


ACTTCCTGCT 


CGATATCTTT 


AATCAAGACC 


AAGTCTTGGT 


2340 


TTTTCTCGCG 


ATTATCCAAA 


AAACGCACAA 


CCTGACCTTG 


CGGCCCACCC 


ATAAATTCAA 


2400 


TGCCGCAACG 


TTTGGCTTCC 


TTTTGCACCA 


TCAGGTGAAT 


TTGATGACCA 


AAACGCTTAA 


2460 


AGACTAACAT 


CGGTTTATCC 


ATAATCTCCC 


CCTTCTAAAT 


AAAAATAGTT 


CT CT GG AG AA 


2520 


TAATTAAATT 


TCTATGAGAA 


CTATTTTCTT 


GATTAAAAAA 


ATCCCAAGTG 


ATTTTCTCAC 


2580 


TTACGATCAT 


GTTCTATAGG 


TTAAATTAAA 


ACCCATCTAC 


GTTCGTATAA 


ATCTTTTGGA 


2640 


CGTCTTCGTC 


CTCTTCAAGA 


ACGCTGTAAA 


GTTTTTCAAA 


GGTTTCAAGG 


TCTTCGCCTG 


2700 


ACAATTCCAC 


TTCTGACTGA 


GCAATCATTT 


CCAATTCAGT 


CACTTGGAAT 


TCTTCAATAC 


2760 


CAGACTCACG 


GAGGGCAACG 


ATAGCCTTGT 


GAAGGTCAGT 


TGGCGCTGTG 


TAAACTGTGA 


2820 


TTG7ACCTTC 


TTGTGCTTCT 


ACGTCATCCA 


CATCCACATC 


CGCTTCGAGC 


AATTGCTCAA 


2880 


AGACTCCCTC 


CGCATCTTCA 


CCTCCAAATA 


CAATAACACC 


TTTGTTGTCA 


AAGAGGTAAG 


2940 


AAACAGAACC 


TGAAGCGCCC 


ATGTTTCCGC 


CCTTTTTACC 


AAAGGCTGCA 


CGGACATTGG 


3000 


CTGCTGTACG 


GTTGACGTTA 


CAAGTCAAAG 


TATCCACAAT 


TAGCATAGAG 


CCATTTGGCC 


3060 


CAAAACCTTC 


GTAACGTCCT 


TCTGTAAAGG 


fT»fT>»p f+ G** 1 G 


GTTTCCTTTG 


G CTTT AT C AA 


3120 


TCGCTTTATC 


GATAATGTGT 


TTTGGCACTT 


GGGCTTGTTT 


AGCACGGTCG 


ATAACGAATT 


3180 




OTTTGATTCT 


GGA i L i GGAT 


CACCTTTTTT 


AGCTGCTACA 


TAGATTTCTA 


3240 ■ 


CACCAAATTT 


TGCATATACT 


TTACAGTTAG 


CTCCATCTTT 


AGCCGTTTTC 


TTGGCTACGA 


3300 


TATTGGCCCA 


TTT ACGTCCC 


ATTAGGAATC 


TCCTTTTTTC 


ACATTTTAAT 


CTTT CTT ATT 


3360 


ATAACACAAG 


TTTTTTTGAT 


TTTCACTAGA 


GGAAATGGAT 


TTTATTAGCA 


AATCAAGCTA 


3420 


GG AT AGC ACT 


TTACCTGCTA 


AGATGGTCTT 


GCCTTTCTAT 


CTTTATCAAC 


ACGCACTCAT 


3480 


CCACATTCAA 


AAAACAAACT 


AGACCATTAT 


CTGCAAATAG 


AAACTTTCAG 


CCAAGTTTGA 


3540 


CAAAGTCAGC 


TCAAATTACT 


CTTTGAACTT 


TGTAGATATA 


AGCGACAAAA 


ACAATCATAC 


3600 


TGCACCTTTT 


GTTGACAGTC 


TACTCCAGAC 


ATATCATAGT 


TCAACTAAAT 


ACTTTGAAAT 


3660 
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TCAACAGTTC 


TTATAGGCGC 


TATTGTATTC 


TAAGAAATCA 


ATASA^CAGT 


TTCTAAGCAA 


3720 


ACCTCTAATA 


CTCAATAAAA 


ATCAAAGAGC 


AAACTAGAAA 


GCTAGCCTCA 


GGTTGCTCAA 


3780 


AACACTGTTT 


TGAGGTTCCG 


GATGGGGCTG 


ACATGGTTTG 


AAGAGATTTT 


CGAAGAGTAT 


3840 


AATTTACGTG 


TTCCCAAGAT 


GGAGAACTTA 


GACTAGTACA 


CTGGCACTTC 


TAAAACATTG 


3900 


CTAGCAATTG 


ATTTGTT CAT 


ATTT AATTTC 


ATTTTTTCCA 


TAAATGGGTA 


TTAGATATAA 


3960 


ACAGCAAAAT 


ATTTCCGATA 


CGTGTCGTTC 


TTGAATTTCC 


AATCATCTAA 


AAC AAGT AAA 


4020 


GGATAATCAA 


TCCCCTCTAT 


ATCAAGGAAT 


TGGCTACCCT 


TTTT ACT TTT 


TTACACATTC 


4080 


TGTTTGATAG 


ATTCATTTTA 


ACATCACGAG 


CATACTCCAA 


TGGAAATCGC 


TAGGCAAGAG 


4140 


ATAAACTTTC 


AGATATCCGC 


AGAGAGATCA 


TCCCCTCTTT 


TTGTCGCAAG 


CATTCTCCTC 


4200 


TCCTAGTCAT 


TTTCTACCTT 


ATCTTCTACC 


TGAGGATAGA 


GAGTTGTTCC 


CCAAATAGAA 


4260 


ATCGTCCGCT 


TACGCACTAG 


TGGCAAATCG 


GTTTTTTCAT 


AAACCGTACG 


CCACCATTCC 


432C 


CAGGCAAGCC 


CGGTACACTC 


TCTAATTTTG 


ACAGAGAGAT 


TACGAACATT 


CCCTTTTAAA 


4380 


GGAATACTAG 


TGGTAAAGTG 


AGCC GTT AAA 


TCCTGCCCAT 


TTCTGTCCCA 


AGCCTTAGGA 


4440 


GTCAAGACTT 


CCTTACCTTG 


ATGATCATAG 


GATAATTCAT 


TCCAAGTAAT 


ATAATATTGG 


4500 


GCAACATAGG 


CACCACTATG 


ATCCAGCAGT 


AAATCTCCGT 


TTCTGTAAGC 


TGTAACCTTA 


4560 


GTCTCAACAT 


AGTCTGTACT 


ATTTTGAAAG 


GTCGCAACTA 


CATTGTCACG 


TAAAAAAGAA 


4620 


GTTGTATAGG 


AAATCGGCAA 


GCCTGGATGA 


TCTCCTCTAA 


AGCGACTGCC 


TTCTTGAATC 


4680 


AAGTCCTCTA 


CCATATCCAC 


CTTGCCTGTT 


ACAACTCCGG 


CACCCGAACT 


TGGGTCGCCC 


4740 


CCTAAAATAA 


CCGCCTTCAC 


TTCTGTATTG 


TCCAAAATCT 


GTTTCCACTC 


TGTCTCAGGA 


4800 


GCTACCTTGA 


CTCCTTTTA7 


CAAAGCTTCA 


AAAGC AG CCT 


CTACTTCATC 


ACTCTTACTC 


4860 


GTGGTTTCCA 


ACTTGAGATA 


GACTTCGCGC 


CCATAAGCAA 


CACTCGAAAT 


AT AG AC C AAA 


4920 


GG ACGCTCTG 


CAGAAATTCC 


TCTCTGTTTT 


AAATCCTCTA 


CCGTTACAGT 


ATCTTGAAAC 


4980 


ACATCTCCTG 


GATTTTTAAC 


AGCATCTACG 


CTGACTGTAT 


AATAAATCTG 


CTTAAAATTA 


5040 


ACAATCTGAA 


TCTGCTTTTC 


GCCTGAATGG 


AC AG AGTT AA 


AATCAATATC 


AAGAGAATTC 


5100 


CCTGTCTTTT 


CAAAGTCAGA 


ACCAAACTTG 


ACCTTGAGTT 


GTTCCATGCT 


GTGAGCCGTG 


5160 


ATTTTTTCAT 


ACTGCATTCT 


AGCTGGGACA 


TTATTGACCT 


GACCATAATC 


TTGATGCCAC 


5220 


TTAGCCAACA 


AATCGTTTAC 


CGCTCCGCGA 


ACACTTGAAT 


TGCTGGGGTC 


TTCCACTTGG 


5280 


AGAAAGCTAT 


CGCTACTTGC 


CAAACCAGGC 


AAATCAATAC 


TATAAGTCAT 


CGGAGCACGA 


5340 


TCGACCGCAA 


GAAGAGTGGG 


ATTATTCTCT 


AACAAGGTCT 


CATCCACTAC 


GAGAAGTGCT 


5400 


CCAGGATAGA 


GGCGACTGTC 


GTTGG7AGCT 


GTTACAGAAA 


TATCACTTGT 


ATTTGTCGAC 


5460- 
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AAGCTCCGCT TCTTTCTTTC GATAACAACA AACTCATCGG GTAGCTGATT ACCCTCTTT'G 552 0 

ATGAAACGAT TTTCAATACT TTCTCCCTGA TGGGTCAAGA GTTTCTTTTT ATCGTAATTC 5 580 

ATAGCTAGTA TAAAGTCATT TACTGCTTTA TTTGCCATCT TCTACCTCCT AATAAGTTCC 564 0 

TGGATTGAGT TGCATAAACT CAGACTTGTT CAGCGAAATC AGCCGTGCTT GGACTAAGTA 5*700 

ATCCAAAATT TCCTCGTACA ATTCTTCTGA GACATTGCCT CGCCGTCTGG CTAAATAAGA 57 60 

AGTCGCAATG AC CGT ATT AT CCAACATAAA TACCTTATCT AAGTCAATCA AGGTTGGTCT 5820 

TGT AAAAGG A TTACGAGCTA GATCCGGCTC TTCTATCATA AAGTTCTTGA CCAAACGTCT 5880 

GGTCAAGAGA GCTGGTTTGA AGGTCTGATT TTTAACCAAC TCTTTGTTTT TAGTCATGCT 5 94 0 

GTTGTCAATA CAGATATACA TATGATTCTT C AC AGCC AAA TCGCTACTAA TAGTCGGAAA 6000 

AGGCAAATAA AGAGCTACAA CATCTCCTCT CTTAATCAAG CAAGAGCACC CCCTTTTCTC 6060 

CTAATGTAAC ATAGACAGGA TTG AC CAAGT CTTCTGATTG ACTCAGAATT TCCAAAGTTT 6120 

GAGTTTGGCG CGCTGTCAAT TTAGTAGCAT CTTGTCTCTT CAATACAAAA TGCTTGTCGC 6180 

CAATAACCTT GACAATATAA TCCTTCTCCA AAGCTGACTG GTAAATCCAC ATCAGATGTT 624 0 

GTCTGTCCTG AGAACTCAAG AG AG AAG GAT TTTCAAGCCT CCCGATAGTC TGATAAAAAT 6300 

CAAAAACAGG AGCT AACTC C TGCCAATCTG ATTGGCTAGT TGTCAAGGCT AGAAAAAGGG 63 60 

CTTTGCGAGC TGATACTTCT TGGTTAGCCT TGAGAGTTAC TTTCCCCTCC AACTTTTTTA 6420 

GAAATCGGGA AACTC C AG AA AG C AAATTTT TCTCTAACTG CGAGAAATAA AAACCTTTCC 64 30 

TTCCCAGACA TAAGTCTTTC ATGTCGCTTT CTCTAGCAAA TAAGAGCTCA AACATTTGAT 6 540 

AGTAAAAGAA AAATATCTGG CACTGGGTCG CGCTCATCTT TTCCTTATCG GCTTCTTTTT 6600 

TTAACCAGAG CAAGGGCGAC AGGTAGCTGG ATT GAGA CAT TTCCTCTACC TCCTACTCTT 6660 

TTTTAACTGG AGCATCTGCA CTAGCTGCCA CTTCTTTTGA CTGGATACTT TCCCACTGGT 67 2 0 

TAATCTCCTC TGAGATAAGA CCTTCGCATG TCTTGACAAA TAGGGCAAAA GCCTTGGT r ?T 67 80 

TTCCTGCATA TTTCTCCGTT TGGCATTGAT AGAGGAATTT TTCTTTCTCC AGGAGTTGCG 684 0 

CAGTTTTTTC GTAAGAAATC CAATTTTCCT TTGCATTATA CAAATTGATA ATCCCCTCAC 6 900 

ACAGCAAGCC GAGACTGGAT AAGGCAACCG AAATCAAACG GTAGCGATCA CCTGGCATAG 6960 

GAATAGCACA AAAGAC AG C T ATGAGGAAAC CTGCCACGAT TTCTCTTATT TTTAATACCT 7020 

TATAGCGCCT ACGATGTTGA ACGCTTTTCT TTAAAAAATG AGCTATCTGT ACGTCTAATC 7 080 

GCTCTGTCAG GTACATTTCT TCTGGCGTCA TATTCGTAAC TCCTTTCATT TACTTTGATA 7 14 0 

ATCAGGG 714 7 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CCGCATGGGA TTGGTGTCCT TTTGGGCAAT CTCTTTGACC AAACTGGAAA CATGTTTTAT 60 

GCGCCTGCCT TTACTGCCCT TGTCGGCGGT ACGTCTATAT GATCCTAGTC GCAAAAGTTC 12 0 

CGCGCTTTGG AGCCATTACC ACTATCGGCC TTGTCATTGC CCTCTTTTTC TTGGGAACTA 180 

AACACGGTGC TGGTTCCTTC CTTCCTGGAA TTATCTGTGG CCTCCTAGCA GATGGAGTAG 240 

CTCATTTAGG AAAATACAAG GACAAAACAA AGAACTTCCT TTCTTTCATT ATTTTCGCCT 3 00 

TTAGTACAAC AGGACCAATC TTGCTTATGT CGATTGCGCC CAAAGCCTAT ATGGCTACTC 3 60 

TTCTGGCAAG AGGAAAATCC CAAGAATATA TCGACCGTAT CATGGTCGCT CCAAACCCTG 420 

GAACTGTCCT TCTATTTATC GCAAGTATTG TCATCGGAGC CCTAGTGGGT GCCTTGATTG 4 80 

GACAAGCCTT GAGTAAAAAA TTTGCCCAGA AAATCTGATC AGTTAAAAAG AGCCACGCGG 54 0 

CTCTTTTTTA TTTATGGCTC AATTTCTTAG TCAAGAAATC TCCCAAGAAT TCGATTCCAA 600 

AGATAATCAA AATGATAATA ATGGTTGCCA AGATGGTCAC ATCGTGATTG TAGCGGTTAA 660 

ATCCATAAGC GATGGCTACG TTACCGATAC CACCAGCTCC AACCGCACCG GCCATAGCTG 7 20 

TTtcCCAACA AGGGaAtCAA GGTcACAGTC GTCAC 7 5 5 



{2) INFORMATION FOR SEQ ID NO : 25: 

[il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTCAATTGGT ATCTCAATCA ACGGTCTTCA CATGGTTTCA ACTGGTTTGA CTCTTGAAAA 60 

AGCGAAAGCT GCTGGTTACA ACGCAACTGA AACAGGCTTT AACGATCTTC AAAAACCAGA 12 0 

ATTCATGAAA CATGACAACC ATGAAGTAGC AATTAAGATT GTCTTTGACA AAGATAGCCG 130 

TGAAATTCTT GGTGCCCAAA TGGTTTCACA TGATATTGCA ATTAGCATGG GAATCCACAT 24 0 

GTT CTCACTT GCTATCCAAG AG C ATGTG AC AATTGATAAA TTGGCATTGA CAGACCTCTT 3 0C 
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CTTCTTGCCA CACTTCAACA AACCATACAA CTACATCACA ATGGCTGCCC TTACGGCTGA 3 60 

AAATTAAAAA TGAATGAGCT ATCTGGCCTT AAGTTAAGGT CAGATAGTTT TTAGCTAATT 4 20 

TGTCCCCATA CAATTATAGT TTTTTTATCT TGTGCTTCAT TCTGTTCTGA CTTAAAATGA 48 0 

AAAGGTAGCT ACCAATACAA ATGATGAGGA TAAAACAAAT GACTGAAAAT CGTTATGAAC 54 0 

TAAATAAAAA CTTGGCACAG ATGCTCAAGG GTGGTGTTAT TATGGATGTG CAGAATCCTG 500 

AACAGGCTCG TATCGCAGAA GCTCCTCGTG CGGCAGCTGT GATGGCCTTG GAACGAATTC 6 60 

CCCCTGATAT TCGTGCAGCT GGAGGAGTTT CCCGCATGAG CCACCCAAAG ATGATTAAGG 72 0 

AAATCCAAGA AGCCGTTAGT ATTCCAGTAA TGGCTAAGGT CAGAATCGGG CATTTTGTTG 7 80 

AAGCTCAGAT TTTAGAGGCT ATTGAAATTG ATTATATCGA CGAGAGTGAA GTTCTATCTC 84 0 

CAGCTGATGA CCGTTTCCAT GTGGACAAGA AAGAATTCCA AGTTCCTTTT GTCTGTGGTG 900 

CTAAGGATTT CGGTGAAGCC TTGCGTCGTA TCGCTGAAGG TGCTTCCATG ATTCGTACCA 96 0 

AAGGAGAACC AG GG AC AGGG GATATCGTCC AAGCTGTTCG TCATATGCGT ATGATGAATC 1020 

AGGAAATTCG CCGCATTCAA AACTTACGTG AGGACGAGCT TTATGTTGCT GCCAAGGATT 108 0 

TGCAAGTCCC TGTAGAATTG GTCCAATATG TTCATGAACA TGGAAAATTG CCAGTTGTAA 1140 

ATTTCGCTGC TGGAGGTGTT GCAACGCCAG CAGATGCTGC GTTAATCATG CAATTAGGGG 12 00 

CAGAGGGGGT CTTTGTCGGT TCAGGTATTT TCAAGTCAGG AGATCCTCTT AAACGAGCGA 12 60 

GTCCCATTGT TAAGGCTGTG ACTAACTTCC GTAATCCTCA AATCCTAGCT CAAATCTCTC 132 0 

AAGATTTACG AGAAGCCATG GTTGGTATTA ATGAAAATGA AATCCAAATT CTCATGGCTG 13 80 

AACGAGGAAA ATAGATGAAA ATCGGAATAT TGGCCTTGCA AGGGGCCTTT GCAGAACATG 144 0 

CAAAAGTGCT AGATCAATTA GGTGTCGAGA GTGTAGAACT CAG AAATCT A GATGATTTTC 1500 

AGCAAGATCA GAGTGACTTG TCGGGTTTGA TTTTGCCTGG TGGTCAGTCT ACAACCATGG 1560 

GCAAGCTCTT ACGTGACCAG AACATGCTAC TTCCCATCCG AGAAGCCATT CTATCTGGCT 1620 

TACCAGTGTT TGGGACCTGT GCGGGCTTAA TTTTGCTGGC TAAGGAAATC ACTTCTCAGA 1680 

AAGAGAGTCA TCTAGGAACT ATGGATATGG TGGTCGAGCG TAATGCTTAT GGGCGCCAAT 17 4 0 

TAGGAAGTTT CTACACGGAA GCAGAATGTA AGGGAGTTGG CAAGATTCCA ATGACCTTTA 1800 

TCCGTGGTCC GATTATCAGT AGTGTTGGTG AGGGTGTAGA AATTTTAGCA ACAGTGAACA 18 60 

ATCAAATTGT TGCAGCCCAA GAAAAAAATA TGTTGGTAAG TTCTTTTCAT CCAGAATTGA 1920 

CTGATGATGT GCGCTTGCAC CAGTACTTTA TCAATATGTG TAAAGAAAAA AGTTGAGATT 1930 

■ GAATTTCTCA ACTTTTTTAC ATGTAATAAA CAATAGCGAT GTATTGAAGT GCGGACGCAG 20 4 0 
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CTAGGATAAA GAGATGCCAA ATCATGTGGA AATAAGGTTT TTTCTTGGCA TAAAATCCAG 2100 

CTCCAACTGT ATAACAGAGT CCGCCAGTTA CCATGAGACT CCAGAAAACG GGTGT CGTTT 2160 

GACTGATAAT GGCAGGAATG ATAGCCAGAA CCAACCAGCC CATAATCAGG TAAAGAGCAA 2 220 

GGCTAAATTT CTCATTGACC TTTTTAGCAA AGATTTTATA GAGAATACCA AAGATGGTCG 2280 

TTCCCCATTG GATGACAATA ATC AG AT AG C CAAACCAGTT ATTCATCAAG GTCAAGACAA 2340 

CGGGCGTGTA TGAGCCGGCA ATGGCAACGT AAATCATAGA ATGGTCAATG ATTCGCAAAA 2400 

CATATTTGTG GGTCGAACCA TAGGCCATAG AGTGATAAAT GGTGGATGAT AGGAACATGA 2460 

GAAAGAGACT GATGACGAAA ATGGAAACGC CGATAGAGGA TAAAAATCCG TGTGCTTCAT 252 0 

AACT AT AG AT GGATGAAATA GGCAGCAAGA TAAGCATGAT GACTGCACCC ACAGCATGGG 2 580 

TCACGCTATT AGCAATCTCC TCTCCAAAAC TGAGTTGTTT GCTGAGTTTA AGACTAGTGT 2 64 0 

TCATTGGATT ACCTCCTCTT GAGTATGATC GATTAAGTCT AGAGTTTGAT GATAGAGTTT 27 00 

AACGGTTTGG CAGCTGGTTT GGATAATAGG GTTAGCTGGG TCAATTCCTT CGTTCATGTA 2760 

GTCCACAAAA GCATCGTAGA GTTGGTCTGA ACTTGCTTCA GTTTGTAGAG TATTAAGTGT 282 0 

CTGGGCTATT TCTTGAATAG AAAATACAGA CTTGAGGGTT GTGATAGCAA TCAAACGGGC 2880 

AATCTGTTGG CGTTGGTATT TTTTTTTGTC AGGCTTTGTC AGGTAACCAT TTTTCACATA 294 0 

ATTGTTGACC ATAGATGCTG TTAGCCCCTT GTCTTTATTA GG AG AG AT AG GGGCGCAGAC 3 000 

CTGATTGACA 3010 
(2 } INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15213 base pairs 

( B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CATAAATCGG TGCAAATAAC TTAATAGTGA AGTAGCCATT TCTTTCGTAT TTACCTGAGG 60 

CATATTCCCT AGACGAAAGA AT ATT ATT AT CAATCAAATC ATTGAATGAA CGTAGTCTTT 12 0 

CAACTTCTTC TACTGTTAGA TTTCTGACAA CATTTGTTGC ATAGACCTTA TTTCCATCAG 180 

GATCAGGATG GTACTCATTT GTAACTTTTC TAAGAAGTTG TTGTTTTTGA TTCGTATCCA 240 

ATTTAAGAAT TGAATTTCCT TCGAGATATT CCAACATATA AACAACGTCA AACATGTTGT 3 00 

GGACATATTG CTTCAAATCA TCTGCATTAT TAAATCTTGT AGTTGG AT C A AGTACTTGTA 3 60 

ATCGTCGACT TTCTGTACTA TCAGATTTTG AATGTTTCAA . G ATGGAGTTG ATGGTAATGG 4 2 0 
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TCGCATCATC 


TGGATGGTCT 


GGTGCTTGTA 


ATAATCCTTT 


AGCAAAGAAC 


TCTGGTCCCA 


480 


AGCCACTTCT 


TCGACCATAT 


CCTCCAAGAT 


AAATGTCCTG 


ATCTGAGTCA 


TGTGTCATCT 


540 


C ATGC GT AT A 


AGTAATAGCT 


CCATCCTTAT 


CCAACATTCG 


ATAACCCATA 


TAATAAACTG 


600 




AGCATAAGCA 


CCGTGTTGAT 


TATGCCCAAC 


TTTATTTCCA 


ACAGGTCCAA 


660 


an AAATGTTC 


CATTGCAGGA 


TTTGGATTAT 


CAAAATCTGC 


CACTTCTGTA 


GCTTTCCCTA 


720 




ATCGCCAAAT 


TTATAAGCAT 


CGTAAAGCAA 


AATATTTCTA 


TAAAGTTTTT 


780 




GTCGTCTAAA 


ATACGATACC 


AATAATCGTA 


GTGATCTCGC 


TGACGTTTGG 


840 


CT GTT T C AC G 


CGCATTTTCT 


TCAACAAAAT 


CATTGAGAGC 


CTTCCCCGCT 


TTATGGTCAC 


900 


I /\^w J. VjV_ l«*VJ 1 *• 


GCGATCATAA 


GCTCCAAATC 


CTAGACTAGA 


CATGGTCGAG 


ATGACAAATA 


960 




TGGCAAGG^C 


AGGAGAGGCA 


AGACCATATT 


GCGGTATTTC 


CATGTGGCAC 


1020 




ATCATAAACA 


CCGATAGAAT 


ACTTGGTGCC 


AGCTAACCCT 


TGCTTCGTTT 


1080 




GATAGTGGAT 


TTTTCTTCGA 


CAATGTAAGC 


CTTAGTCTCT 


GATTTAAACC 


1140 




GCTTGTATTT 


GGTAAAAAGA 


CTTTTCGGT A 


ATGTTCCAGC 


GTGCTAAACA 


1200 


AATCTGTCCT 


TCCATGTTGA 


CTGGCAAGAC 


TGATACCATA 


ACTATCGACA 


TTATTCTTAG 


1260 




GTTAAAGCCA 


GATTTACCCA 


ACTCAATCAG 


AGTATCTAAT 


GGTGAAGCAT 


1320 




AAAGAAGTCC 


AAATGGTACA 


GAACTAGGTC 


TTTGACATTC 


ACCTGACCAT 


1380 


AP.fTAAAG'' , T 


ATACCACCGT 


TCCAGATAGG 


TCAAGCCAAG 


TAGCAAGGCT 


TCCTTGTTGC 


1440 


U l 4 Ixjnl 4 I 1 


ATCTACAAGA 


TAACCTTCAG 


TGACGGGGTT 


AGC ACT AG CC 


AGTCCAGCAT 


1500 


C CC* PTn AC AA 


G ACTTTTTT C 


AAACTCTCTT 


CCAGTTGTTG 


TTTTGTTTTG 


GCGAACTGGT 


1560 


CTTCTAGATA 


GAGCTCAGTT 


TGCTTGACGT 


TTGGAGAAAT 


ACCCAGCGTC 


TTTCTGATGG 


1620 


PTTC"^ AA^G 


ATAGTCAACC 


TTTTGTAAGT 


CAGGTAAGAC 


TTGCTTGATG 


ATAGAGGTTT 


1680 


GGTCATACAG 


GAATTGGTTT 


GGCGTATAGA 


G AAG T C C AG T 


A*. * Aljrt. 




1740 


CTAATTTGCC 


GAAATCATTC 


TGGTATTTGA 


GATCCAGCTT 


C TC AG AT AAA 


TCATCCTTGT 


1800 


AGTGAAGCAA 


GAGTTTGTTT 


GCAGTCTGTT 


TGTTAGAAAC 


AATGTCTGTG 


ATGACTTGGT 


1860 


TGTCCTTCAT 


CATGACTGCT 


GACAAGAGTT 


CTTTTTGATA 


TAAAAGACTG 


TTCTCATTGA 


1920 


CCAGGTTTCC 


GTATTTGACG 


ATGGTTGCCT 


TGTTGTAGAA 


AGGTAGCAAT 


TTTTCAATGT 


1980 


TTTTATAAGT 


CAAGTTGCGC 


TTAGCTTGAT 


AATAGGCCAC 


CTTAGAAAAA 


TCACTGTCTT 


2040 


TTTTGCCACT 


TGTTGAAAGT 


GGCTCCACTG 


TTGGTAAAAT 


GAGAGGATTG 


ATTTCTGCTT 


2100 


TTTTGCTTGC 


AATTTGAGAA 


GCATCTAGCA 


TTGTTCCTCT 


TTCTTCAAAG 


GATTCCTTGC 


2160 



WO 98/18931 



PCT/US97/19588 



298 

TGACGACCTC ATCCTTGACC AAGGTGACAT TGTAGACTCT GTTGGC CTTG CTGCTGAATG 22 20 

TGTCCTTTAC CTTCATTTCG TTATAGTGGT AACCAGTGAT GGCATTTCCG TTGGTT AC AT 2 2 80 

TAACATCGCT GAGAACATTG GTCAAACTTC CAGCATGCCT AACATCACCA GAAGTTCGAT 2 3 40 

CCCACAAATT GCCTGCCACT CCAGCGACTC TACCAAAGTG CTTGACATTC TTGATATCAC 24 00 

CTTCAGCATA GCTATCTTGG ATCTGTGCAT CTCGGTCTAC TAGGCCTGCA AGTCCACCCA 2 4 60 

CAGTCTGATC TGAAGTATTT GTGTTAGATG AAATGG CT AC TCTCGCTTTT GACTTAGTAA 252 0 

GTAAAGCCTT GTCACCTGTC AAATGACCGA CCATACCACC GATATTGTAG GCAGCAGTCG 2580 

TTTCATAAGT GTTGATAATT CTTCCCTTGA AACTGCTCTC TGTGATGCTT GATTGCTCAG 2 640 

CCTTAGCCAG CAAACCACCG ATACCACGTT CACCAGCCAG AACACCATCG ACGTGAACTT 2 7 00 

GCTT AATTTT TGTGTTATTC TGAGCTTCAT TTGCCAGTGA AC CG AT AT C A TCTTTCCCTG 2760 

AAATAGCAAC ATTTTTTAGA CTCAGTTTTT CTACTGTAGC ACCACTCAAG TTTTCAAACA 2 820 

GAGGTTTTTT CAAATTATAG ATAGCATAAT TCTTGCCATC TTTTTCACCG ATTAAACGAC 2 880 

CAGTAAAGGT GTCCTTGATA TAGGATCTTT CATCAGGACC AAGCTCCACT TCGTTAGCAT 2 94 0 

TCAGGCTGGC CGCTAAATGA TAGGTTCCAG AGGGATTTTG GTTTATAGCT TTGACCAGAT 3000 

TACTAAAGGA AGTAAAGTTT GTTGTTTCTT CTGTTCCCTT CTTAGCTAGA TAGAAGGTAA 3 060 

AATTATCTTT ATATCTGCTT TCTATCTCCT GCTGAAGCTT CTCTACTTTT GCTGTGATTT 3120 

TAT AAAGG AT TTTATCATTT TTTCTTTCCT CTGATATTGA TGCTACTGGT AGGTATACAT 3180 

CTTTGAATGA AGAAGATTTC ACTTTAACAA AGTAGCTATT TGG ATTG CTT GGAACTTGCT 3240 

CTAACGAAAT GTGTTGTTTA TAAGTACCAT TTGACAAACT GTATAACTCT AGGTCGGAAA 3 300 

CATTTCTTAA TTCAAGTGTT TTCTCTGGTT CTTCTACCTT TTTATCAGGG TCTAGTTCAT 3 3 60 

TTTCTTGTTT AATTTCTTCC TTTCCATTTG AATTGGATGT GTTTGATTCG GTTGAAACAT 34 20 

CCTCAGTTGA ATTTCCGTTT GATGGTTCTG GTTCTGTTTG TCCATTCTCT GATGTTGTAT 3 4 80 

TACCTGAATT TTCTGGTTTT GTTGCAGTTC CGTTTTTTTC TGGTTCATTT GATTCTTCAA 354 0 

CTGGTGGTTT TGAATCACTA GGTTTATTGG AT ACTTC TCC AGTATTTTCG TTAGCTATTT 3 600 

TCCCAGAGTT TGTTTGTGTT TCTTCTGCAG GTTGAACTGG TTTTTCTGTT TCTTGATTTG 3 660 

AGGT AC CTTC TACTGTGCCT TCATTTGGAT TTACTGGAAC TTCTTCTACA GTTTTTTCTG 3720 

AATTTTCATT TTTAGAGTCA TTATGTTCTG GTTTATTTGA TTCTCCAACT GAGGTTGTCG 3 7 80 

AATCACTAGG ATTACTGGAC ACTTCCCCAG TATTTTTGCT AGATGTATCT GGTGATACTT 3 84 0 

TCTCTGAATT CGTTGTTGAT TCTTCTGCAG GTTGAACTGG ATTTTCTGCT TCTTGAATTG 3 900 

AGGTTCCTTC TGTAGTACCT TCATTTGGAT TTACTGGTGT TTCTTCTGTT GGTTTTACTG 3 9 60 
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GAACTTCTTC AGTTTTTTCT GGACCTTGTT CTTTGGTCTT CTCAACCGGA GTTTCAGGTT 4 02 0 

TTACTTGCTC AATATTACCC TT AT ATTCTG GAAGCGGTGC TACCTGCTCT GGTTCACCTT 4080 

TATCACTTAC CACAGTATCT GGCGACTCTG GTTGAACCTC AGTCTCACCT TTGTCGGTCA 414 0 

CAACTGCTTC GGGTAATGTA GGTTGAACTT CTGGTTCGCC TTTGTCACTT AC T AC AG CTT 4 200 

CGGGCAACTC AGGCTGAATT GCGGGTTCAA CAATAGCTCC AGACTGTACG TCCTTATGTT 4 260 

CTACACCAGT CTCAGGTTGT TCCTTTATAA CTTGACTTTT TTTAGTACCT TTTTCGACTA 43 2 0 

TTCTTGGACT AGGCGCAGTC GTTGAAGTTG AAACAATTTC TCGCGAAACT TCTTCCTTGT 4 380 

TTACAGAGAA TATTCTGACG ATTTCAACTT TCTTACCTAA TTTACCTTCT TCTTTTACTC 4 44 0 

TTACAGTTCC TTCAGCTAAA TCAGGATTTT CTTGAATTTC TTCTTGAAAA TCTATTTTTG 4 500 

TCTCCATAGT TTCCTCACGA TATAAGAGTT CAGGTTTGTT CAATTGACCT G AT AAAACTT 4 560 

CATCCTGTGG ATTTAATGTA TTTACCCCAG TCTTTTCTTT TGGAGAAATC TTCTCCTCTT 4 62 0 

TCTTCGTTTC TAGATTCTTA TGTTCGGCTA ATTGTTCTTG AGAATCTGAA GATTGTTTCT 4 680 

CTTCTTTTCT TGGATTGATT AATTCAGTAG AGAAAGGTTT TT C AAC TACT TGAACTTCTG 4 740 

TCGGCTTAGT TGAAGAAACA GGTGTTTGTT CCTGAATAGC TTGT ACTGTT GATGGATGGT 4 800 

CTACAAAATT CGGTGT AAC A TTATAATCCA CCTTTTGTTG TTTTGTAGGA GTGGCAACTG 4 8 60 

AACTCTTTTG ATTACTTACT TCAGACTCAG AAGTCGTTTT TCCCTCTTTG ATATATCCAA 4 92 0 

TATAAGTGTA ACCTGAAATC TCTTTAGGAA GAG GT AATTT TTCTCCAGAG GTCAATTCAT . 4 980 

AGTCCGTATT GT AATTT ACC AAAAGATGAT TTTCTAAAGC ATGGACTGAA ACTAAGACAC 504 0 

t 

CATTTCCTAT CCCTGCAACC AATACTAAAT GTAATACCGT TTTATTCTTA ACCTTTTTCT 5100 

TGGAAACAGC AAAAATTAAA ATTCCCATAC CAGCTAAGCT AGCACCAGCA ACTAGGGCTT 5160 

GCCTCTCATT CTTGCTTCCA GTATTTGGCA ATTCCGCCAG TTGATTTTGA GAATTTAACT 5220 

TATAAACAAG ATAATAAGTT TCATCATCAT TCTCCACGTA TGTCGGAATA TCATAGACAA 52 80 

CCTGCTTCTT TTCTTCTGAT GATAGCTCTG AATCTGCCAC ATATTTATAG TGAACTCCCG 5 340 

CAGTTTCTTG AG CAT CC AC A GATGAACTAG CTAATACAGA CATAAAAAAT AAACTTGAAA 5400 

TCGTTGCAGA TACAAGTCCT ACTGATAATT TTCTAAATGA AAAACGCTCT TGTTTTTCAC 54 60 

CAAAATACTT TTCCATTATT CCTCCTTGAA ATAAAATTTA TATATGTTAC AAAGACCTTT 5 520 

ATTATATTAG TGTATTATCT ATTATCTATA GAAAAGGCAG TATACCTTAA TTATACTCTT 5580 

AATTT ACAAA AAAGTCTTAA AATTGAGATG CGCTTTCATA CTTTGTTTTA T ATT ATTTGG 5 64 0 

AGGTACAATA ACACCTACCA TGAAATTTAC ACGGTAGGTG TTACTCATAT CACTAATCGT 5700 



WO 98/18931 



PCT/US97/19588 



300 



TCTAAAAATG 


GTTTGAGGCA 


GTTGAGGAGA 


ATTCC7TCTA 


TCCAGCTTCC 


TTGTGCTGAT 


5760 


GAGCGATGGT 


CTTCCTGCAG 


GCTTTTTTTT 


AGAAAATCTC 


GGACTTGTTC 


TGGTGCGATT 


5820 


TCAAATTCAA 


AGG CTTTCAT 


TTTATAGAAA 


AAGTCGATGA 


GATGATCTGA 


CAGGTATTCA 


5880 


GTTGAAAAGG 


GTACTTCACC 


ACTTTTTCTA 


TATTCTAATA 


AGAGTCTAGA 


AAATCGAGCT 


5940 


TTTTCTTCAG 


GAAGCTCACG 


AAAATAGGAA 


TTGAGGATCC 


AAGTCTGCTT 


CTGTTTTCTT 


6000 


TCAATTGGAT 


CCTGACTGGC 


AATTC CTTGG 


TCTTTTTCCA 


GCTCTTTTTG 


GTATTGTTTG 


6060 


GCCTTGATAG 


CTCGTTCTGC 


T CT ATTTTT A 


CCAAAAAGAA 


TTTTTTCCCA 


CTTGCGTTCT 


6120 


TCTTGAGTCA 


GGGTCTCTGT 


AAAGCCAAAG 


TAATCTTGAT 


AAGCACGCTC 


TGCGGGTCCC 


6180 


ATGGCTAGAA 


CCAGATTGTC 


TGCATATTGC 


TTGGCGATTT 


TATCCCTCTT 


CTTGCGTTCT 


6240 


TTCTCTGCCT 


GGATACGGAG 


TTCTTGTTCG 


TAGTCAATTT 


TCTCCTTGCC 


TAGCTTGACA 


6300 


AGGTAGAGTT 


GGTCATCCGA 


TTTCCCAAGT 


AAAAAGGGTT 


TGATACACTT 


TTCAAGGACT 


6360 


TCTTCCATCC 


GAGCCTTTTT 


CTTTGGTTCC 


GCCTTGGTCC 


AACTTCCTCC 


CTGAAAGACT 


6420 


TCTAGGAAAA 


CCTGGTAGTC 


TCTCTCAGGC 


GCAAATTGAT 


TGCCACGATT 


GGGTTTGAAA 


6480 


ACACCTTTTT 


CCCAGAGCCA 


TTTTAGAAGT 


CGCTCGTCAA 


AG TT ACT TTT 


ATTGACCTTG 


6540 


ATTTTTTCCT 


TTTTCTGAGC 


TTTTCTGGTT 


AGATTTTCAA 


CCTTTCTGAG 


CAGTTTTTCT 


6600 


TCCTCTTCCA 


ATTGCTGGTC 


AAGGGACAAT 


CGATGAAAAT 


GACGAACACA 


GTCGCTACCA 


6660 


ATTCGAAAGA 


GGCGTTGGCC 


TGTGAC AC CG 


TTAAAGAGTT 


CATAAGCGTA 


TTTGATGGCA 


6720 


TTTCCACAGA 


CACAATTGCT 


ACGGCCGATA 


CCGTTAAAAA 


TAAAGGAAAC 


TTCATTCCAT 


6780 


TCCTTGGTAG 


CTTCTTCCCA AGTATCCGCT TTCGAAG CCT 


GTAAAACTGC 


ATCGTGCAGG 


6840 


GATTTTCTAA 


CTGGAAGTGT 


CATGAGGTCT 


CCTTTCTAAT 


ACTCAATAAA 


AATCAAAGAG 


6900 


C AAACT AG AA 


AGCTAGCCGC 


AATCAGCTCA 


AAACACTGTT 


TTGAGGTTGT 


AGATAGAACT 


6960 


GACGAAGTCA 


GCtCAAAACA 


CTGTTTTGAG 


GTTGTGGATA 


GAACTGACGA 


AGTCAgTAAC 


702C 


CATATATACA 


GCAAGGCGAA 


GCTGACGTGG 


TTTGAAGAGA 


TTTTCAAAGA 


GTATAAGTTA 


7080 


TACTTTTACA 


ACTTGAACCT 


CGTCTTTACC 


GAGTAAAATC 


AAGT ATTTTT 


CAATATTTTC 


7140 


AATCGAATAG 


GCTCGTGATA 


AAGCCTCTTC 


CTATAGAGCT 


AACTGACCAC 


GATAGCGGTC 


7200 


TACGAGTTCA 


CTTGGTTCAT 


CAT AGCGGT C 


TGTCTTGTAG 


TCGAACAGAA 


CAATTTTGTT 


7260 


TTCGTAAAGC 


AGATAGCCAT 


CAAGGATACC 


ACGGACAACA 


AAGTCTTCCT 


GACTCTTTTG 


7320 


GTCTCGTTTG 


AGCATGGAGA 


AAGGTTGCTC 


GCGATAAAGA 


TGGTCGGTAT 


TAG C AAGAAT 


7330 


TTCCTGACCG 


AGTACTGTGT 


CAAAGAAAGC 


AAGAAT TTT A 


TCAAGATTGA 


TCTTGTCTCT 


7440 


G AC AG CTTGG 


CTAGTTTGAA 


CTTGTTTGAG 


TGTTT CTGTT 


AGGCTAGCAA 


GGGTTAGTTG 


7500 
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CTGGCTGAGG 


TCAATTCTCT 


GCATGAGTTC 


GTGACTAGCA 


CTACCAATCT 


CAGCTCCAGT 


7560 


rn A C C TTT 


TTGGTTGAAA 


AATCTGGCAA 


ATCGAAGCTG 


ATTTTCTTGC 


CTACTGACTG 


7520 


ACCTTGACCA 

r\ U V» i L ' — J r\ *\ 


GCAATCTCGA 


CACCTTCCAT 


ATCCATAACT 


GGTTCGTAGA 


ATTTCTTGAT 


7680 


1 1 LjAV_ L 1 UUU 


GTTTGAACAC 


TAGGAAGTTC 


AATAGCTGCG 


CGGTGAAGAG 


T ATT AT AAAC 


7740 




TTCAGCATTT 


CCAGAGCTTC 


TTTGATGGTA 


TCTGACTGAC 


GATTGTCTGC 


7800 




TCTTGGAGAG 


G ACT CTTGGT 


TTCCAACTCT 


CCGATAGCTT 


CTCTGGTCAA 


7860 




CCAATAAAAC 


GATAACTAAA 


GTTGAGCTTG 


TCCTTAGTAA 


ACACTTTACT 


7920 




AGCCAATCTT 


GGAAATTCCG 


TGCTTGCAGT 


CTAGTATTGC 


T ATT T AG TTT 


7980 


LLLA1 i I I 1 vj 


GCTGCTGGGT . ATTCCTTGGA 


TTCCAGCTTT 


TCACGAGAAC 


CCTTGCCGAC 


3040 


AAGATAOALit- 


TTTTTCTCAG 


CCCGCGTCAT 


AGCAACATAC 


AGCAAACGCA 


TCTCCTCAGA 


3100 


Al AOL L ivjt- 1 


AGCTGTAATT 


CCTCTTCGTT 


CTGCCTATAG 


GTCAGACTAG 


GAATGCAGAC 


8160 


tttgatcjU i r 


TTAGGATAGT 


GGTCTTCTAC 


TGCCCCTGTC 


TCCATCTTCG 


CAATATATTT 


3220 


GACACCAAGA 


CCATTCTGAC 


GACTGAGAA T 


GACTTCTGAC 


ATAGAGTCTT 


GCTTGTTGAA 


3230 


AIL 1 i L>A X V.L. 


ATATTGAGGA 


TAAAGACGTA 


AGGAAACTCC 


AGCCCTTTAC 


TCTTGTGGAT 


8340 




TCTACTCCAT 


CTTTTGGCGG 


TGCGACGGCC 


ACGCTTGCCA 


AATCCTCCTG 


8400 


GoCTTCTAAu 


ACTTGGTCAA 


TCATACGAAT 


AAAACGCGAC 


AAAC CTTTG A 


AATTGCTCTT 


3460 


t 1LAAAI ILiA 


TCAGCACGCA 


GTGCTAGGGC 


AT AG AG ATT G 


GCCTGCCT AG 


CAGGACCATT 


8520 


r~~ <~* \ i a rrr* 
LUuLAAAOtL 


CCAACATAGT 


CATAATAAAA 


ACCCTCGTTC 


TAAATCTTCC 


AAATCAAGTC 


as^o 


A*r»A/""A/ , " , Afii™ 


TGGGTTTTGG 


CATACAAGCG 


CCAAGAAGCT 


AGGATATCCA 


TGAATTGCTT 


8640 


L AO I. 4 L I iUn 


GCTAGAGCTG 


TGTGAATCAA 


GCCTTTTTGA 


CTACTTGCCA 


TTTTTTGTGC 


8"?00 


Ai HjnL^Au 1 


TTCTCATAGA 


CATTTTCCTC 






GAAGGGACAA 


8760 


ACCTGCTAGC 


TCATCCTCAT 


CAAAACCAAA 


CATTGGAGAC 


T TCATAAGGvj 




3 3 2 0 


CTAGTCTTCC 


AGGGGATTCT 


GAATGACACG 


AAGAGTGTCT 


- AGCATGACTT 


GCACTTCTAG 


3330 


GGATTGCACA 


TAATTGTTTT 


GCTCTCCGTC 


AGTTTTGACA 


GGAATTCCGT 


ACT CAG ACAG 


8940 


GGCGAGGAGA 


ATCTGGTCAT 


TACGACTGCG 


GCTGGAGGTC 


AGAAGGGCAA 


TTTCCTTAAA 


9000 


GGCAACACCT 


TTTTCTTGAT 


GAAGTTTCAG 


AATCTCCTTG 


ATAACTAAGC 


GCATTTCGCC 


9060 


TGTTAGTTTC 


GTTTCTGTTT 


G ACTCT CTTC 


TTCCTCACCT 


GTATCGTCCT 


TGTCGTAGAG 


9120 


CAGAAATGCT 


CCCTTGTTGT 


CTGGATTGGG 


AGTCAGTTTG 


GTATTGCCAA 


AAACAAGCTG 


9180 


GTCCTTGTTA 


TCATAGTTGA 


TTTCGCCGAC 


CTCTTGGTCC 


ATGAGACGTT 


CAAACACATC 


9240 
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ATTGGTTGCT 


GACAGCACTT 


CTCAACTACT 


ACGGAAATTT 


TCCTTGAGGA 


TAATGAGCCT 


9300 


GCCTTCTTGG 


GGATTTTGCG 


CATAGCGTTG 


GAATTTCTCA 


TTGAAAATCT 


GCGGGTCTCC 


9360 


CTGACGGAAA 


CGATAGATGG 


ATtGCTTGAT 


ATCTCCCACC 


ATAAAGCGAT 


TGTGGCCATT 


9420 


AGACAACAAT 


TCCAGCATCC 


GTTCTTGAAT 


ATGGTTGGTA 


TCCTGATACT 


CATCGACCAT 


9480 


GACTTCATGG 


AAGCGCTCCT 


GATAAGACTC 


ACGAACTTGT 


GGGAAATTCT 


CTAAAATCTC 


9540 


AATGGTGTAA 


TGGCTGATAT 


CAGCGAATTC 


GAAGGCATTT 


TCCTGTCGTT 


TTCTCTG AC G 


9600 


ATAAGCCTCT 


ACAAAATCGC 


TCATGAAAGA 


TTGGAAGGTT 


TTAGCTAGTT 


TCCAAGTGTC 


9660 


TCCATGATAA 


CGTTCTTGAT 


AGTCGAGAAT 


CGCTATCTGG 


TCTGATAATT 


GTCCTAGTTT 


9720 


AGCAAACTGG 


GTCTTTCTCT 


CTTCCTTGTA 


GGCATCAGCC 


AGGGGCTTCA 


AATCAGCCTA 


9730 


CGGCTGGCAT 


TAGTCAGAGC 


TCGACCGTTT 


TTCTCCTTAG 


AGATGGCGAC 


AACACGCGCA 


9840 


AGCACTGCCT 


GATAAGCCTG 


ACTATCGGAC 


TCCTGATTTA 


GGGAGCCAAT 


TTCATCCAGA 


9900 


ATTAACTGAA 


CATTTTCTAA 


ATAGGCAGCC 


TTTGCAAACT 


CCTTGGCATC 


GTTATCCAGA 


9960 


TGGTAACGGA 


AAAAGCTTTC 


CAAATCCCAA 


ACGCCTTCTT 


TGATTTGCTC 


G GTC AGTTTT 


10020 


TCTTTTTCAC 


TGGTAAAATC 


AGCTTTCTCA 


AATCCTTTGA 


GGAAAGATTC 


ACTCAGCCAC 


10080 


TTTTGAGGAT 


TACTGGTGGA 


TTGGAGGAAG 


TCATAGATTT 


TATAGACCTG 


CTGGCGCAGA 


10L40 


CCCCGTTCGT 


CCTTGCCACG 


CCCAGCAAAG 


TTTTTCAGCA 


AATGACTAAA 


GGTCTCTTTC 


10200 


TGTTTACCTT 


GGTAATGCGC 


TTCAAAGACC 


TCATGAAAGA 


CTTC GTTTTC 


GAGAATAAGT 


10260 


TGCTCGCTTT 


GGTTTTGTAA 


AATACGGAAA 


TTAGCTGCAA 


TATCAAGCAG 


ATAACCATGT 


L0320 


TTGCCAACGA 


ATTTTTGTGT 


GAAAGAATCC 


ATGGTTCCAA 


TGGCAGCGTT 


GGGTAGGTCT 


10380 


GCCAACTGGC 


GACCCAAGTG 


TTGTTTGAGC 


TCGACATCAT 


CTGTTTCTTG 


G ATT TTCTTG 


10440 


CTGATTTTTT 


TCTCTAAACG 


TTCTTTAAGT 


TCAGTTGCAG 


CCTTGACGGT 


AAAGGTTGAG 


10500 


ATAAAGAGTT 


GAGAAATTTC 


GACACCACGC 


GCCAATTGGT 


CCAGAATGCG 


CTCTGCCATG 


10560 


ACAAAGGTCT 


TTCCAGAACC 


AGCCGATGCT 


GAGACCAGGA 


TATTCTGGGC 


AGAAGTGTAG 


10620 


ATAGCTTCGA 


TTTGCTCGGC 


AGTTTTCTTC 


TGTTCCTTGC 


TCGAATTTGC 


TTCTGCTTCT 


10680 


TGCAGTTTTT 


GAATCTCCTC 


CTCACTTAAA 


AAGGGAATAA 


GCTTCATCGA 


TTCAACTCCT 


10740 


CTCTTATTTT 


TTCAAGCCAA 


GCTTG CTTG A 


GTTTTTCTCC 


GACCAGACGC 


TTGCCATCAG 


10800 


CTAGGTCCAA 


CTTTTCTAGG 


AAACGGGCTT 


GGCCCAGATG 


GTAATTGGCT 


TCAAAGCCTG 


10860 


TAATAGCCTG 


ATGTTGCTGG 


ACGTATGGGG 


CAATGCTTCT 


GCCATTTTCA 


GTATAAGGAT 


10920 


TGATGGCGAA 


CCGGCCTGCT 


AAAATCTTCT 


CAGCAGC7TT. CTTGTAAAGA 


TAGGCATTGT 


10980 


AGTCCAGTAG 


GAGCTGAAAT 


TCCTCATCTG 


TCAGTTGATT 


AGCCTTCTTT 


TTGTTATAAA 


11040 
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ATTCGCCTAA AT AACTG CTT TCTTTTTCCA AGAAGAGCCC TTGGTATTTC AT AG ATTT GC 11100 

TGGCTTCTAC CACTGCTCCT GCCAGACTTT TTACCGCCAT CAGAGATTGG ACAGGTTCAG 11160 

CCATTTCCAA GTACATGGCG C CG AAAAAGT TCTGCTCCCC TTCTCTTTTT AGCCCAGCAA 11220 

GATAGGTTGG T AACTG AG AA TTGAGCCCAT TAAAGAAATG AGGAAACTGG AACTGAGTCA 11280 

GACTGGATTT GTAGTCTACT ACT C CT AT CG CT C C ATT AG C TTTCAAACGG TCAATCCGGT 11340 

CCACCTTGCC TCGTACAAAG ACACTGCGTC CATTGTCTAA TTGAATAAAG GCTTGGTCTT 11400 

TTCCACCAAA ATTTGCTTCT TCTTTGATGG TTTCGATGGC TGGATTGTGT CGGAGAATAT 114 60 

GTCCACTTGT CCGTGCAACA " TCAAGCAAAA CTTCCTTGGT AAACTGGGCT TCCAAACTTT 11520 

CTTGATAAAT AGCTTCAAAT TCGCGTTCTT GACTGGTTTC TTGAATAGCT TGTTCTAGAC 11580 

GTTGGTCAAA GGAATCTTCA TTAGGCAACT GTAAGGCGCC TTCAAAGATA CGATGCAAGA 11640 

AATTCCCGTG ACT ACGGGC A TCAGGATGCA AACGTAATTC CTCCTGCAAG C CT AAAACGT 11700 

AGCGTAGGAA ATAACTGTAT TCATTGCGAT AAAACTCTGT CAAACCCGAC GTAGACAGGT 11760 

AAAACTCCTG TTTGGCAGGA TAGAGAGCTT GCAAGGTGTC CTTGGCTAAG GTCTTGCTGC 11820 

TTGGACTGGT TGGGATAGCT GGATTTTCCA GACCTTGCTG ATCTAGTTTT TTACCTATGA 11880 

CACGCGACAG AACCTTGACA AAAGT C AAAT CTTGCTCAGT ATCGCTCATC TCACCCTGCT 11940 

GGTGATAGGC AACCAGACTA GACAAAAGAC TGTGATAGGA CCCCATATCC TCCTTAGACA 12 000 

GTCCTTTGTG ATTCATCCTC TTCTCTCTCC CCCTAAATCC AAAATGGATC AACTCTTGAA 12060 

GATAGGCAGA TTCCTTACTT TCACTTTCGT TAAAAAGGCT TGGAGCCGAC AAGAACAACT 12120 

GCTTACCAGC AGAATTGACC AAGCAAAGCA TAGTGTAGCG ATTTTTCTTG AGATTTTCAC 12180 
TGCTGGCAAT CAGTAATTGA ACGCCTTCTT CGCTCGCTTC GTTT ' ZZTT? TGTCTTTCTT 12240 
CATCTGTCAG AAGACTGGTG TTTTGAGAAA TTTTTGGTAA ATTGTCCTGA GTTAGTCCAA 12 300 
TAGCATAGAC AAAGTCAGCA GTCAATGGTG CAATCAAATC GTAACTCTGC ACCAGAACAG 12 3 60 

TGTCCACTGT TGCTGGAATG GTACGGTATT GGCACAAACT CATTCCAGAA TGGAGCAAGG 12420 
CTAGGAAGTC TTCCAGACTA ACCTGTGAAC CAGCAAAAAC AGTCGCAAAT TGTTCTAAAA 12490 
CAT GGC AG AA AGCCTTCCAA ACTTCGGCTT GTCTTTCCTG TTCTACAGCT TCCAAAGTGG 12 540 

TTCTCAAATC TTGTAACTGC TTGGTCACAG CTCCTTCTTT TAGAAAGACA CTCCATTTTT 12 600 

GTAGGAGTTT TTCAGCCTTT TGTTTTCGGC TGGCAAAGAG GGTTTCAAGA GGTGCTAAAA 12660 
TTCTCAGGCG GAGGACATTC AAACGCTCAA G ATT AAAT TT TCCATGGTGG GATTTGGTGA 1272 0 

AGGTTTGCTG AAAGGCTGGC AAGCCATTGA TACCAAGATA GCGGATATAT TGCTCAAAAG 12780 
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CATCAATATC 


AGACTCACTG 


AGGTCAGTAT 


ACAAATCAGT 


TOTAAGAAGA 


TTAATCAAAT 


12840 


CCTCCTGACG 


AAAACGGTAA 


CGTTTTAAAG 


CTAAAATAGA 


CTCG AC AAAC 


TCAGTCAAGG 


12900 


GATGATGAGC 


CATGGCTTCG 


CTTCTACCAA 


GATAAAAAGG 


AATCTGATAC 


TGGTCAAAAA 


12960 


TGGTTTTGAG 


AGATAACTGG 


TAAGAAGCTA 


CATCCCCCAA 


GAGAATACGA 


AAATGCTTGT 


13020 


AGCTCAGGTC 


TGAGTTCTCA 


TGTAATTTCT 


G AC G AAT ACT 


ACGGGCTACT 


AGCTCCAACT 


13080 


CCTCCTTTTG 


CGTCAAACAA 


GACCAGATTT 


GTAAATTTTC 


ACGGTCTTTC 


TCATCGACAT 


13140 


CCAAAGCGAG 


TTCTGAAAAG 


TCATAAGAAG 


ACTCCAACAA 


ACGAGAGGCC 


TTGTCAAAAC 


13200 


TATCCATCTT 


CTCATGAGTT 


TGAGAACAGT 


CCTGAGCAGG 


CGTTTGGTAT 


TTAGAAGCCA 


13260 


GATGATGGAG 


AAATTTTACG 


CTGGCTTGGT 


AGAGATTGCC 


CTCGCTAAAA 


GGACTGGTAT 


13320 


AGGCTTTCTT 


ACTAGCATAA 


GCCCCGATAA 


CAATCTCAAC 


ACCTTTGCCG 


TGAAGTAAGT 


13380 


CCACAACCCG 


CTCTTCCTCA 


GCAGAAAAAC 


GAGTAAAGCC 


GTCAATGACC 


AAGGCGATTT 


13440 


GATTAAAATC 


ACTACTTACC 


TTGTCATTCT 


CAATACCCTC 


AATCAAATGG 


GACAACTGAC 


13500 


TTTCCTGGGC 


TAACTGACCT 


TGATTAAGAT 


AGGCTGTTAC 


TTTCTCAAAA 


ATCAAGAGTA 


13560 


AATCCGCCCT 


CTTATCCTCA 


TCTGTTAAAT 


TCTCCAACTC 


CAAAAAACTC 


ATCTG AG ATT 


13620 


TGGTCATCTC 


ATGGT AAAC C 


TCAATTAACT 


GCTGGATCAA 


TTGAGGATCC 


TGCTTAATAC 


13680 


CGCCATAAAC 


ACGCAAGTCC 


TTGGGATCGA 


GTTCGGCAAG 


CCATTTCTAA 


AAGGCCAACC 


13740 


CAAGACCGAT 


ATCATCAAGA 


GTAGTTTTAG 


CTGGTAAATC 


ATTCAAGACC 


AGATAGCGAG 


13800 


CCATTTGAGC 


AAAGCGCGTG 


ACGGTAATCG 


AAAAAGAAGC 


CTGCTGGGAC 


AAGTATTCCA 


13860 


GCACGGCGCG 


TT C CTTTTC A 


AAAGAAAGAG 


AGTTGGGGGC 


AATGTAGAAG 


ACCCGCTTGC 


13920 


CAGCTGCAAC 


TAGCTCTTCT 


GCCTCTCTTG 


TTAGAATTTC 


TGTCAAAGAA 


GTCCGAATAT 


13980 


CAGTATAAAG 


TAATTTCATC 


TCACCCTCGT 


TGGAATTTTT 


CATCACCCTA 


TATTATACCA 


14040 


TGATTAGCCT 


CGTAAATCTG 


TTAAAATATT 


TAGGCCATCC 


TTTCTTTTCT 


T C AT C ATCTG 


14100 


CTAAATCTTA 


AATACTTAGC 


TTTACTTGTA 


TTAGATAGAA 


TAAGTCTGGC 


TACTCAAAAT 


14160 


CACATAATAA 


AAAAGCCTCC 


CTAACAAGGC 


TTTGAGTTTT 


ATGATTGTTT 


CTTAGGTACC 


L4220 


GAATACACTT 


CAATGTGTTG 


TCCCAGTATC 


TTAATGTCGA 


CTGGTAGATT 


GTCTGATTTA 


14280 


TCGCCATCAA 


CATCGGACTC 


TAATTCGATA 


TCAGAAGAAG 


TTTTAATATT 


ACGTGCCTTT 


14340 


ATATATTCAA 


TATTCTTGAT 


AGAATGATTG 


AACTATAGTA 


AATTGAAACT 


ATAATAGTAC 


14400 


ACCGTGGATG 


CTAAAATATT 


TCTAGAAATT 


AATTTCATTT 


CCCTAATCAA 


GCTATTCGTA 


14460 


TCTTATTTCA 


ATCTACTATA 


ATAAAATGAA 


CCAAAAATAG 


TACACAATGT 


GGTATAATCT 


14520 


TCTTATGGCA 


TATTCAATAG 


ATTTTCGTAA 


AAAAGTTCTC 


TCTTATTGTG 


AGCGAACAGG 


14530 
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TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 14 640 

AAAGCTAAAA GAGAAAACAG GAG AG CT AAA CCACCAAGTA AAAGGAACAA AACCAAGAAA 14 700 

AGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CC AG ATGCTT ATTTGACTGA 14 7 60 

AATACCTTCT GACTTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 14 820 

CTACACTCGA AAAAAAGAAC CACACCTACT ATGAACAAGA CCCAGAAAAA GTAGCCTTAT 14880 

TTCTTAAGAA TTTTAATAGT TTAAAGCACC TAGCACCTGT TTAGATTGAC GAAACAGGAT 14 940 

TCGATACTTA TTTTTATCGA GAATATGGTC G CTC ATT AAA AGGTCAGTTA ATAAGAGGCA 15000 

AAGTATCTGC AAGAAGATAT CAGAGGATTT CTTTGGTTGC AGGTCT AAC A AATGGTG AAT 15060 

TAATCGCTCC AATGACTTAC GAAGAGACGA TGACGAGCGA CTTTTTTGAA GCTTCGTTTC 15120 

AGAAGTTTCT CTTACCAACA TTAACCACAC C AT CGGTT AT TATAGTAAAA TGAAATAAGA 15180 

ATAGGGGGGG GGGGGGAGGG GGGGGGAGGG AGA 15213 
{2} INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6004 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTATTACCTG AAACATTAAA TTTAATTGGA CATCCCGTTA TCAATTTTAT AATATCATCA 50 

AGATTTTTAT TATCTGATTC AGGAATTTTA TCTGATATAA CAACACCATT TTCAAGATAG 12 0 

TTCATTAAAT TATTTGATTC ACTAACATTA GTGTTTTGAT CTCrATCAAC CCAAAAATAA 13 0 

TGGTTATCGG AATCTAAATA CGATGAGTTT AAAAT AT T AT TACAAATTAT TTGATTTGCT 24 0 

CCACCAGGAA TATATCTCAC TACTAAATTC TGTTTAAGAT TCTCACTACC TGAATGAGTG 3 00 

ATAACAAACT CTAGAATATA TTTAGCTAGT CTATCTTCAA CATAAATCAT CTTCCT AG AA 3 60 

TGATACACAT CACCTAATTC AAAAAATGCA TCCTGATAAT CAATATTTTC AATAACATCT 42 C 

ACCTTTTCTC CGTTTTTCAC TAAAAGTTTC ACGGCTTCTC TAGGAAAATC TTTTATAAGT 4 80 

TGTG TAG AAT GTGT AGTG AT AATAATTTGA TGTTTTTTAT TTAAACACTC TTGAAGTAAA 54 0 

AACTCTTTAA ATTTATAGAT TGCACTCGGA TGAAGTGAGA TTTCAGGTTC ATCTATTAAT 600 

ATTAATGAAT TTGATTGCGC AT TT ACT AT A TCATTTACTA AC AAAAT AAT TCTAGCCTCA 660 

CCTGTTCCTG CAAAAGCCT C GGAATATTCT TTTCCAGATT TTTTC AT CCA AAT AG TTTTG 720 
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GAAGCTTTTA TATCATCACC TTTTGAATAC AACTTATGTG TTAAAATTTG AATCTCTGTA 7 30 

TAAGATTCAT CCATTATTTC ACTAATAATT TCACAAACTT TATCATCAAC TTTAACATTA 84 0 

TCTATAACCA TTTCCTTTTT ATAACGCGTA T AG CT ACTTG TATTATTCTT T AAAAT AT C A 900 

GCAACTGGCT TAGATCGTAA TCTTATAAAA TCTT GTTT AC TACGTTGAGT AG AAATTTTT 960 

TTAAAATTAT AGTGATAGAA AAATAAATCA AAAGCAGAAA CATATTCTTT ACAATCACAA 1020 

AAGACAACAT TTTTTTCAAT GCCATCCCAT CTGTCTGTCG AAGAACTTCC AATATATTTA 1080 

TTTTTGGGTA ATCTTTCCAT CTCATATTGT TTTTGAGGAG CATATGGTTC CCAATAATCT 114 0 

AATCCTTTTT TTGTTCCAGA ACGGCCTTTA AG AACTT CT A CATTTCTAGA AGCTTTAATG 1200 

TTATAATATG AATAGATTAA ACATTGTTTC CCATCCACTT CATCTATTTG ATCAACATTT 12 60 

G T ACT AAACC AATATTCAGA CACACTTTTA TTGGCTGGAG AACCATATAA AG C TTGT AAA 13 20 

ATTGAAGTTT TATTTACTCC ATATCTATTA CAGACACCTC AGGATTATTT AACTT AT AAG 13 80 

TTTTAACAGC TACGGAATCA ATTTCAACAG CAACTTGAAC ATCTATGCCT GATTTTTTAA 1440 

GGCCACTTGT AGTGCCACCT GCACCGTTAA ATAAATCAAT AGCAACAATT TTCCCCATAG 1500 

TATTCTCCTA AAGTTTCTCC TTTTTATTAT AACATTATCA AATGTAAAAC CCAACCCGAT 1560 

AGGGTTAGGT TTTTAACATC ATTTCACCAA CTTCTTCATC TCATCAATAC GTGCGACGGT 152 0 

CGCGTCATAT TTACCTTGGT AGTCAGCTTG TTTGTCGCAT TCTTTTTCGA CGACTTCTGG 1680 

TTTGGCGTTG GCTACGAAGC GTTCGTTAGA GAGTTTCTTA CCAACCATGT CCAGTTCTTT 174 0 

TTGCCATTTA CCAAGTTCCT TGTCGAGACG GGCCAGTTCT T CTT CAA CAT TGAGGAGATC 1800 

GGCCAGTGGC AGGTACATTT CTGCTCCTGT GATCACACTT G AC AT AG CCA GTTCAGGTGC 1860 

AGGGATGGTT GATGCGATTT CCAAGTGTTC TGGATTTGTA AAGCGTTTGA TATAGTTGAC 1920 

ATTGCTGTTA AAGAAGGCTT CCAAGTCGCT ATCGCTTGTC TTAACAAGGA TGGTGATAGG 1980 

CTTGCTTGGT GCTACATTTA CTTCCGCACG CGCATTCCGA ACAGCACGAA TCAAGTCTTT 2 04 0 

GAGACTTTCC ACACCAGTGT GAGCCGCAAG GTCTTCAAAG GCTAGATTAA CAGTTGGGTA 2100 

TGCAGCTGTC ACGATAGAAC CTTCTGAGAT TTGTCCAAAG ATTTCCTCTG TCACGAATGG 2160 

CATGATTGGG TGAAGGAGAC GAAGGATCTT C7CCAGCGTA TAGAGGAGAA CAGATCGAGT 2220 

AATGACCTTA TCGTCTTCAT TGTCGCTGTA TAGAACTTCC TTGGTCAACT CAACATACCA 2280 

GTTGGCAAAT TCTTCCCAGA TGAAGTTGTA AAGGATATGA CCAGCCACAC CAAACTCGAA 2 34 0 

CTTATCAAAG TTTTCAGTAA CTTTTGCAAT GGTTTCGTTG AGATTGTGGA GAATCCAGCG 24 00 

GTCCGTCACA TTACCAGCCT CACCTGTTGC AACTTTTGTG ACATTGTCAT GCGCCACATC 24 6 0 

CAGCGTCAAA CCTTCATTGT TCATGAGGAT ATAGCGAGAA ATGTTCCAAA TTTTGTTAAT 252 0 
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AAAGTTCCAT 


GAAGCATCCA 


TTTTCTCGTA 


AGAGAAACGA 


ACGTCTTGAC 


CTCCTGCGGA 


2580 


ACCGTTTGAA 


AGGAACCAAC 


G AAGGG CATC 


AGCACCGTAT 


TTCTCGATGA 


CATCCATTGG 


2640 


Vj i i v_ \— o 


TTACCGAGAG 


ATTTAGACAT 


CTTGCGTCCT 


TGCTCGTCAC 


GGATGAGACC 


2700 


riTrvr at a jvr.r 


APnTT^TGGA 


ATGGCTGACG 


ACCAGTAAAT 


TCCAAGGACT 


GGAAGATCAT 


2760 


wnw — * — 


C AG AAG AAGA 


TGATGTCGTA 


ACCTGTTACC 


AAGGTTGAAG 


TTGGGAAAT A 


2820 


a r^TTT A A nr 




CGACTTCAGG 


CCAGCCCATG 


GTTGAAAATG 


GCCAGAGGGC 


2880 




TAAGTATCCA 


AGACGTCTTC 


GTCCTGAGTC 


CATCCGTCAC 


CTTCTGGAGC 


2940 


1 1 L 1 i V.uCL\j 


AfATACATTT 


CACCATCAGC 


ATTGTACCAG 


GCAGGGATTT 


GGTGACCCCA 


3000 


f" A A A C. r**T*C. A 


r*rt Afi A d AT AA 


CCCAGTCGTG 


GACATTTTCC 


ATCCATTGAA 


GGAAGGTATC 


3060 






ATTCGACCTT 


GTCCTCTGTG 


TCTTGGTTAG 


CAATGGCGTT 


3120 






TGACGAACCA 


TTGAGTAGAC 


AAGCGTGGCT 


CAACTACGAC 


3180 


Al_l_ L ALU I 




CAACACTGTG 


G AC AC GTT TT 


TCGATTTTGA 


CAAGGGCACC 


3210 




AAfTTACCAA 


CG ACTGC CTT 


ACGAGCTTCA 


AAACGATCCA 


TGCCTGAAAA 


3300 




AGCTCATTCA 


TAGTTCCGTC 


GTCGTTCATG 


ACGTTGACTT 


GTGGCAAGTT 


3360 




rPAACCAAGA 


AGTCATTTGG 


ATCGTGGGCA 


GGTGTGATTT 


TCACGACACC 


3420 




Tran^ATfTfi 


CGTGCTCATC 


TCCAACGATT 


GGGATGAGTT 


TATTAGCGAT 


3480 


Trr a a rr a tt: 


A J " ? "1 w I w A ("* 
nuo i i i i * 


CAATCAAGTC 


CTTGTAGCGC 


GGGTCTTCTG 


GATTAACCGC 


3540 


M>\1_ l_ t\n v_ o 


TrrrrAAAfA 


TAGTCTCAGG 


ACGAGTTGTA 


GCAACTTCAA 


GGGCGCGTGA 


3600 






TCATGTGGTA 


GAAGGCACCT 


TCTACATCCT 


TGTGAATCAC 


3660 


CTCAATATCA 


GAAAGGGCTG 


TCCGACCTGC 


TGCG7CCCAC 




ACTCAC7ACG 


3^:0 






AAAGGT TC AC 


AAAGACCTTA 


CGAACAGCTT 


TTGACAAACC 


3780 


TTCATCAAGA 


GTGAAACGCT 


CACGAGAATA 


GTCTACAGAA 


AGCCCCATCT 


ft /"*("*/"* JITTT 

rOLLLLA L 1 L> 




TTCCTTGATG 


GTAGTGGCAT 


ATTCGTCTTT 


CCATTCCCAG 


ACCTTCGTCA 


AGAAAGACTC 


3900 


ACGACCTAGG 


TCATAACCCG 


TAATACCCTC 


ACCACGTAAG 


CGCTCCTCAA 


CCTTAGCCTG 


3960 


AGTCGCAATA 


CCAGCGTGGT 


CCATACCTGG 


AAGCCAAAGG 


GTATCAAAGC 


CTTGCATGCG 


4020 


TTTTTGACGG 


ATGAT GAT AT 


CCTGCAAAGT 


CGTATCCCAA 


GCGTGACCAA 


GGTGAAGTTT 


4080 


CCCAGTTACG 


TTTGGTGGTG 


GAATCACGAT 


TGAATAAGGC 


TTAGCCTTTT 


GATCGCCTGA 


4 140 


AGGCTTGAAA 


ACATCCGCAT 


CAAGCCATTT 


TTGGTAACGA 


CCAGCCTCAA 


CCTCCGCTGG 


4200 


ATTGTATTTA 


GGTGAAAGTT 


CTTTAGACAT 


GTGTGTGTCC 


TTTCTCTATT 


T TG T TT ATTT 


4260 
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TATTTTGAAT TTGCTTAGCA GCTTCTTCTG C AG AC AAATT CGTATTATTT ATTTTAAAGT 4 320 

AGTGGTGCAA CTCATTCCGT TGATGTTGGG AATTTAATTG AAGTGTTTCA GCGGTCTCTA 4 3 80 

AAATTTCTCT TTCAGATACC TCAATATGTC GTTTTAAGGG TTTGTGCTTT AATCGATTCT 44 4 0 

CCGTTCGATT TCGACGTATG CACTCTTCAA GACTTGTTTC CAATTCAACA AACAGAATCT 4 500 

CTTG ATG AAA GTTATCCAAT AAATCCTGAA TTTGCTTTAA ATACATCAGC TGGTACTGAT 4 560 

TTGAAAAATC AATTACGTCT GTTAAAATTA CTGATCGCTG ATTTCTTGCA CTTGCTCCAA 4 620 

GGAAAGAAAA GGTAATTCCA C G AAC AAATT CCCACATCTC CTCGGTATAA TCCTGATAGA 4 680 

TCTCTAGTGC AAAATCAATG GCTTGATGGT T AT AAAAT AG GGTAGCATCC GTCAGTCGAG 4 740 

ATAATTCTTG ACCAATGGTC ATTTTTCCTC ATGCTGGAGC ACCAATGATG AAAAGATGCA 4 8C0 

TCAAATCACC TCCCACTCAC TCCTCAGCAA GCCATATCTC AAATCATCAC AGCAGTTGCC 4 360 

TTGAGCATCT TTGCGGTCTC TTATCCGAGC TTCGAGGGTA AAGCCAAGCT TTTCCGAGAC 4920 

TCGTTGACTT TGAACGTTAT ATCCAAAGCA AGTTAGTTCA ATCTTGTGAA GACCAAGTTC 4 980 

TTTAAAAGCT AGATCAATCA AGGAACACGC TGCTTCTGGA ACATAACCTC CACCCCAATA 5040 

GTCTGGGTGC AAGGTATAGC CAAGCTCTAG CACATCATCC GCATGAAGAT GGTTGAAGTC 5100 

AACAGAACCA ATGACTTTAT CGGTTCCTTT GACGACAATC CCATAGCCAG CTGGGAGATT 5160 

TTCCTTTTGA GTACGCTCCG GAAGAATGTG CTCCAGATAA TAAATCTCAT CTTCCAAGAT 522 0 

CTTGACTGCA GGAAAACCTG CTGGATAGGC GACCTCTGGC AAACTAGCGT AGGTATGGAT 5280 

ATCCTCAGCA TCCACCACTG TGCGGACTCG TAAAACGAGA CGTTCTGTTT CGATTTTATC 5340 

TGGCAGCTCA GTTCTTGCCA TCCTTCTTCC TCGCTTTTTT GATGAAACTG CCCTTCATAT 54 00 

CTACACGCTT GTCCAGATAG CGATAAACGC GCTGATATCC ATCTCCCATG AAATAGGTTG 54 60 

GGGCAAACAG TTG ATTTTT A AAATGTC CCT TTTCATCCAG GAGTTCTGGG GCAACAAGTC 5520 

GCTCAAGAAT CTTGGCAAAG ATGTGGCAAA TACCGTCTTC CTCAACAATC CTATCTACCC 5580 

GACAATCTAA AACAAGTGGA CAGGCGTCTA AAATAGGAGT CTGAGTTCGT TCAGAAAT FT 564 0 

C AT AATGC AC TCCCAAACGT TCCAATTTCT CCTG AT G ACT GATAAAACCA GCCTGCTCCA 5700 

TCGCAAGCAT AGAAGTTTCA TCAGAAAT AT TCACAGTAAA TTTTTGATAC TGTTTGATCT 57 6 0 

GCTCTGCGGC ATTCTCTCTC GCAACGACTC CAATCACAAC CCAATCTCCT AGACTATAAG 5820 

AGGAACTACA GGTCGTGATG TTATAGCCAA AATTCTAATC TTGATATCCT AAAAT AAAAA 5880 

CAGGAAAACC ATAATATAGT TTACTTGTGT TAAAAGATTG CTTCATAACA ACCCCCTTTG 5940 

ACTAAGACGT AAAAGAAAAG CCCTGCCATC TACATGACAG GGACGAATGT GTTTATCCGC 6000 

GGGG 6004 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5857 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 28: 



TGTAGAATTC 


ACGACAATGC 


TTCGTTGATT 






T*TCT GGC AAG 


60 


CGAGTCAATG 


AACCAAAAAT 


AGTACACAAT 




rrrTTTiTnr. 


("ATATTCAAT 


120 


AGATTTTCGT 


AAAAAAGTTC 


vCTCTTATTO 




nn*r Ar; t a T ajv 


CAGAAGCATC 


180 


ACACGTTTTC 


CAAATCTCAC 


G T AAT ACC AT 




-prTi kk > r*/T ft. A 
1 1 AAAoU 1 AY/\ 


A At*l AH AAAAC 


240 


AGGAGAGCTA 


AACCACCAAG 


T AAAAGG AA<_ 




A ft. A nTTfi AT A 


GAGATAGACT 


300 


TAAAAACTAT 


CTTACTGACA 


ATCCAGATGC 


TTATTTGACT 


r- & A AT AflPTT 


CT G ACTTTG G 


360 


CTGTCATCCA 


ACTACCATCC 


ACTATGCGCT 


CAAAUL 1 Alb 


rrrr ft c* a p* 1 **"* 


n AAAAAAGAA 


420 


CCACACCTAC 


TATGAACAAG 


AC CC AG AAAA 


ACj I AbL LI 1ft 


I i 1 V_ 1 1 


ATTTTAATAG 


480 


TTTAAAGCAC 


CTAACACCTG 


TTTAGATTGA 






ATTTTTATCG 


540 


AGAATATGGT 


CGCTCATTAA 


AAGGTCAGTT 


AATAAGAGGC 


AAAGTATCTG 


GAAGAAGATA 


600 


TCAGAGGATT 


TCTTTGGTTG 


CAGGTCTAAC 


AAATGGTGAG 


TTAATCGCTC 


CAATGACTTA 


660 


CGAAGAGACG 


A7GACGAGCC 


ACTTTTTTCA 


AGCTTGGTTT 


CAGAAGTTTC 


TCTTACCAAC 


720 


ATTAACCACA 


CCATCGGTTA 


TTATTATGGA 


TAATGCAAGA 


TTCCATAGAA 


TGGGGAAGCT 


780 


AGAACTCTTG 


TGTGAAGAGT 


TTGGGTATAA 


ACTTTTACCT 


CTTCCTCCCT 


ACTCACCTGA 


840 


GTACAATCCT 


ATTGAGAAAA 


CATGGGCTCA 


TATCAAAAAG 


CACCTCAAAA 


AGGTATTACC 


900 


AAGTTGCAAT 


ACCTTTTATC 


AGGCTTTTTT 


GTCTTGTTCT 


TGTTTCAATT 


GACTATATAA 


960 


ATTGTCTAAG 


CGAAACAACC 


GATAAGAATT 


GGCACAAAAG 


CGACCGTATT 


TTTGTTACCA 


1020 


ATACAGGAAA 


AACAGTTCAT 


AGTTCTATCT 


TGAGCAAGTC 


TCTCCAGCGA 


GCAAACGAAC 


1080 


GCCTTAAAAA 


ACCAATTCCC 


AAACATCTGT 


CCCCTCACAT 


CTTCAGACAC 


ACCACTATTA 


1140 


GCATCTTATC 


AGAAAATAAA 


ATTCCTTTAA 


AAACAATCAC 


GGACAGGGTT 


GGTCATCCCG 


1200 


ACTCTGAAGT 


CACTACTTCC 


ATCTACACCC 


ACGTCACAAA 


GAACATGAAA 


GATGAAGCAA 


1250 


TCAATGTACT 


GGATAAAGTT 


ATGAAAAAGA 


TTTTTTAAAA 


AGTTTTGTCC 


CTTTTTTGCC 


1320 


CTCTAAATAC 


AAAAATAGCC 


CTTCGGATAA 


AATCCGAGGG 


GCTAGAAACG 


TTGTTAAATC 


1380 
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AACGGCCGAA C TTTTG AATT TCATGGTTCG GGATAAAATA CTTCACTGAA CTATTTTATT 144 0 

TTTTAAGGTT ATCATAATAT CAAATAGTTC AATTAAATAC GCTAAATTAC TAATATACTT 1500 

TT T ACCTTTT TCATTCTAAA ATGT AAAGT A CAAACAATTA CAATATACTA GAGGGGGAGT 1560 

AAAAAAGGTA TTAAATCGAT GAGTTCAGCA GGCAAGAAAA TAGCACCTTT ACGGGTGCTA 1620 

TTTT T TAATT AACGCCACGT TAACTTTTGA TTGATGAATT TTATTGTTTG GCACTTCTTT 1680 

CATTTCACGG TAAACATCGA TGAAATTCTT TCCAACATTA TTTTTGGAGT TAACTGCATT 17 40 

TATTTTTGTA TTAATAACTT TTTTAGTATC GAAAGAATGG TTTAAGAAAT CCATAACTAA 1800 

CTCTCCTTTC TCATCCTGTA ATCAAGATTT TTATCAATGT CAAAATAGTA TTTTCTATCA 1860 

ATCCAAATTG GTCCTTCTCC TTTAGAAATA GCAAGTACAT CTACCGGACC TCCTACTGTT 1920 

TCAAGAGTGT TGACAATTTT TCTCTTAAAT GAAGTTAATT CAATAAATGT TTTAGCTGTA 1980 

CTCGCCATTT CATTAAGTGC TTGCATTCCA ATAAGGTCTA TTATAGGATT TATATAATAT 204 0 

TTTTGCTGTA TAGATGATAT ATTTTCAAAT ATATTCTCAA TTTCATCACC CAATCCATTT 2100 

TTCTCCATAA CTGATGATAC TTGCTCTGCG ATATATACAT TTAAGTTAGG ATCTATACCA 2160 

TTCATAATCG TCTCAACCAT CTCTGACTGT GCAAAAGGGA TTATATGACA AGTTTTATGA 22 20 

TGATTTATCA CACTTTCATT AATAACTTTC CAAATTAATC GTTTAGAAAA AATTCCATAT 22 8 0 

AATTCAATTT GTCTT AT AG A TGGAAAT ATC TCGTCTGTAC CATAACCTGC TATAACTAAT 2 340 

CCAGTTATGT TTGTTGAGTC ATATCCAATG AAAATCGCTT TATATAAAGA TTTAGCAATA 24 00 

ACTTCAACCT CATCATCAGT ATGAGGAAAG GATTTAAAAA CATCGTCTAC AATGCTTTTT 24 60 

ATTAACTCTA ACTCAGCTTC AAAAAATTCA AAATTACTTT CAGCTTCTAC TTTTG AAATT 2 520 

TCTAAACTAA AATT AG TT AT AGCATTTAAT AAAATTTTAT TAAAATCATC TAGAGTGATG 2 580 

GTTTCACCAT TAGAAACTCT TAAATCAGCT GTTTCTTGCG CTTCATAGGC AATGCTGTCC 2640 

AAAATACTTC TTGTACTTCT GACAATATAA TT TCTTAAT A AATCCTCAAC TTGTAGATGT 27 00 

TTAAAGGAAA TTAAAAATTC TATTAGCTTT TCAACGTATT GGGCAGTATT ATCTAATAAA 2 7 60 

TCTGTGCCAA TAGCCTGCTT AAACTCATTT AAAATTACCT CCCACGGAAT TTCCATAAAC 2 82 0 

GAAGCGTTCC CATATATCAT GATCCCCACG GAATGTTCTT TTG AT AAAGT GAATAATTTT 28 80 

CGGGCGCTAT TAAAAACTTT TGAATTTTTC CCGTCTGATA AGGTTACAGC GCTATCAGAA 2 94 0 

GCCAATACAA CACCATTTTT ATTTAATATT CCAATTTCTG CTGTCAAAAT ATCACCTAAA 3 000 

CTTTCTAAAC CTGCT C ATGC TCTAATGGTA CAACAGCTAA GGTCTTACCA AGACTTGCCA 3 0 60 

AC ACTTTT AA TACTGTATCA AGTTGTGGGC TTGTCTTTCC TGTTTCCATT CTAGCGATAA 3120 

CTGGCTGACT AACACCGCTC ATCTCCTCTA GTTTCTTCTG ACTAATACCC TTTTCATTTC 3180 
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TAGCCTCCAT AAGCTCACTC ATGATAGCCA CGCGCATATC ACTTTCCAAA ATTTCCTCTT 3 24 0 

TGCTGAATAA TTCACCTCTT ACATCTTTCC AGTTACTACC AATAGCATTA TTTTTCATTG 3 3 00 

TCTAAACCTC TTTCTTTTAA ATCTGCAAGT TCACGTTTAG CTTGCTCAAT CTCTCTTTTG 3 360 

CGTGTTTTCT GTGTCCTTTT CAT AAAATG A TGCAGTAAAA CAAAACTACC ATCCATCCAA 3 4 20 

GCAACAAATA AAATTCT ATC TCTAAGTGGT CTCAGCTCCC AAATTTCAGC ATCTAAATGC 34 80 

TTAATATATG GTTCGCCTGC GCGTGTTCCA TGTTGGCTTA ACAACTCAAT ATAATCATTA 3 5 40 

ATTTTATTAA GCTTAATTCT GCTATCTTTC CCTTTTTTAC TGGTAAGCTC TCGCATATAA 3 600 

T C AAAAAC AG GCTCATTGCC GTTTTTATCC TTGTAAAAAT AGATATTATG CACTATTAAC 3 6 60 

ACCTCTTCCT AATAACAATT ATAACCTAAA AGTTATTGTT TGTAAATACT TTTAAGTTAT 3720 

TAAAATAAAA AGCACCTAGT TTCCTAGATG CTAGCACAAT GACACGGATT CGCACCGTGG 3 7 80 

CTACCTCTAT CAAGGTGTAC TCCTTCTATA CTATCCCTTG TGCTTTAGAA TATTATACCA 3 84 0 

CACAATCAAC TAGATACCTA CCATCTCATG ATATACCCCC ATTTTGGGCA AGGGTACAAC 3 900 

GCTAAAATAC AAATCAGAAT AGATATTAAA CCACTTATTT AACTTATCAT AAGCTGGTGA 3 9 60 

TTGACTGATA AATAATATCC GCTGACAAGC TCCGATAACA TTCATGTGAT TGTACACATA 4 020 

AACCTCTTTT ACAGCCTCTA AAATGTCAGC CTCACTTGTT TGTACCCTAA TATCTGTTAT 4080 

CTGCTTGATA GTTGCGTATT TTTGATAAGC TAGCATATCT TGATTTTTAG CAG CATC AAA 4140 

CATTTTACGC TCAAGGACAC TATACTTAGG TTGTTCTTTA TCTCGCATGA AATACCACTT 4 2 00 

GAGCCATAAA ATCTTTTCTC GGTGTATTAC AGAAATACGC TCAATTTTCT TCTTTGTCAT 4 2 60 

TGCTACCTCC T AAAT CATC A ATTTAACAAT TCTAACCACT CACTTTTAGA AATAGTTGCA 4 3 20 
T AG ATCTTG T TCGATGTATG ATACAAAGGT TCTAAATCTT TTTCCACCCT AAT AT AGTTC . 4 3 80 

ATCTTATCCT CATGAGTAGG AAAGTATAGT ATTTCCCTTT CATCCTCGTT TACGATACGA 4 44 0 

TTGCACCAAT CATCAATAAT AACTGGCACT TCCCACTCAC GCCATTTTTT AAGCTTTTCT 4 500 

AAAAGTTCAT TATCACTAAA TAGCTCGCCA TCTATTTGGA AAAATTCCCC TAAGTCATTG 4 560 

T1TCCTTCAA C AAT AAT AAA CTCTGGCATA TTTCTATTAC TTAATAACTC CTTGAGTTCT 4 62 0 

TGTAACTCTT TGATTTCCTT TAG AT ACTTC CTCAATTTCC AACCTCAATT CTTCAATCTG 4 680 

C CTT ACT ACT CCAAAAATTT CATGGGTCTT ATAAGATTGT TCAAGTATAG CCTTTGCTGC 4740 

TTGAGTTCTT ATAAACGGGT TGACCTTACT GTCCATCATA ATATCATTGA GTACAGAAAC 4 800 

AGCGTTAGAT GATGCTAAAT AAAGCATTTG AGTTGTTTTA TCCATCATCT CATCTTGCTT 4 860 

TATCCTCAAT GTCTTTTTAA CCGCTGCAAC TTTTAGATAC TTATGACCTG TTGCGCGTGA .4920 
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TACCCCTGCT TTTTGACATG CTTTGTCTAT CGTTGGCTCG GTAAGCATGG CATCTATGAA 4980 

TTTAATTTGC TTGGACCTAA GGTTATCATT TTCATTTCCT GCCATCTATT ACCTCCTCAT 504 0 

TATCAAAATA AAGGGTTGCC CCTTTATTTC CCTATGCTAG ATAATTCTGC AATTCTGCAT 5100 

CCATTCCCTC TGAATTGCCC TCAACAATCA TTTCATGCTG TACTAAATCA ATCTTATCTC 5160 

CGTTAATAAG TAAACCACCG TGGAAATAAT CAATTTTTCT ATCAAGGAAA TGTACTAGCT 5220 

TTTCAAGGCG TTGCTGTTGG CTGAATTGCT CCATGTCAAT TTCGATATAA GCAAGGGTAG 5280 

TATCATTATC CATAATATCT TCTAATTTTC T AAG AG C TAG AGGTTTATTT TTATATTTTT 5 340 

CTAGGTATTC TCTCATTTCT GCCACTGTTA ATTTGATACT AG AT AAT AAA CTTAGTTCAG 5400 

CTGCATCATC TGCTGTAATA GGCTCTTCTT TTGATTCATG GTTTGCTAGT TCAGCATTTT 54 60 

TCTCTTTTTC TAGTTGCTGA TACAATAGCT GAGCAGTATT TTGGGAATAG TTTTCGCCCT 5520 

CTTTTTTATA TTTTAAAAGT TCTTGCTCTG CATACACTTT CCCGATAATC ACTTCCTTAT 5 580 

AAACTAATTG CCCATCTTGA GCTTTTAGCT TAATACTCCC ATGCTCTGGA ATTT CAAT AT 5 64 0 

ACTTAATTAT ACCATTTTTT GAGTATAAAA CAAAGCCTTT CTCCATCATT TTTAATAATT 57 00 

TATCATCCTT GTTTTCAGTC ATGCTTTTCT CCTTTATTTC ATTTTATTAT AATCTG AAT A 57 60 

CCCCTAGTCT ATTTATTTCA CTAGCTTTTT AGGGTTCGTA TGCTAAAATA CTACCCTTTT 5 820 

TCTCTACCTT ATGGCTGACT TTTCAAATTG GTTAGTT 5 857 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10254 base pairs 
(3) TYPE : nucleic acid 

(C) 5TRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION : SEQ ID MO: 29: 

AAAATGATAG CAGGAGAGTT TTCCCGTCCA TCACACCCAC AACTGAGAGC CTTAGCTCAG 60 

GCTTCTCGCC AAAAACAGGC CGCCTTT AAC AAGGAAGAGA ACCCCTTGAA GGGAGCCGAA 120 

ATCATCAAGA CTTGGTTTCC CTCAACCCGG AAAAATCTTT ACATCAACAC TCGCTTGATG 130 

GTGGACTACG GTCTCAACAT CCATCTAGGG GAAAATTTTT ATTCT AATTG GAACTTGACC 2 40 

ATGCTGGATA TCTGTCCCAT TCGTATCGGG GACAATGCTA TGATTGGTCC TAATTGTCAG 300 

TTTTTG AC AC CCCTCCATCC ACTAGATCCA CAGGAACGCA ATTCAGGTAT CGAGTACGGA 3 60 

AAGCCTATCA C AATCGG AG A TAATTTCTGG ACTGGTGGTG GCGTCATTGT CCTTCCTGGA 4 20 

GTGACACTGG GAAATAATGT CGTTGCACCA GCAGGGGCAG TAATTACCAA ATCTTTTGGC 4 80 
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GACAACGTTG TCCTAGCTGG CAATCCTGCG CGCGTGATTA AGGAAATACC TGTTAAATAG 54 0 

AAGTAAAAAG GAACAGCTGC GGTTGTTTCT TTTTTGTAGG TTTCATCATT TTTTACCCAG 600 

TTCACATTTA CCTACTCTAT CTCTTAGCAA GTCTGTTTCA TTAAGCAAGT TCAAAGCATC 660 

TCGTAAGTGG GATGTTTTTC TCCTCAGTTC ATCAGCTTCC TCCTTGACAC TCGGTCAGAT 720 

TTTGATACAA TAGTACAAAA TTAGAGGAGG CAGCCTATGA TTCAGAAACA TGCGATTCCT 7 80 

ATTTTAGAGT TTGATGACAA TCCTCAGGCG GTTATCATGC CCAATCACGA GGGGCTGGAC 84 0 

TTGCAGTTGC CAAAGAAGTG TGTTTATGCA TTTTTAGGTG AGGAGATTGA CCGCTATGCG 900 

AGGGAAGTAG GGGCGAACTG TGTTGGCGAA TTTGTTTCTG CCACCAAGAC CTATCCAGTT 9 60 

TATGTCGTGA ACTACAAGCA CGAGGAGGTC TGTCTGGCTC AGGCTCCTGT TGGCTCCGCT 1020 

CCAGCAGCCC AGTTTATGGA TTGGTTGATT GGCTATGGTG TGGAGCAGAT T ATCT CT ACT 1080 

GGGACCTGTG GTGTCCTAGC TG AT AT AG AG GAAAATGCCT TTCTAGTCCC TGTTCGCGCT 1140 

CTGCGAGATG AAGGAGCCAG TT ACC ACT AT GTGGCACCTT GTCGTTATAT GGAAATGCAG 1200 

CCAGAGGCTA TTGCTGCTAT TGAGGAAGTT TTGGAAGACA GAGGGATTCC TTATGAAGAA 12 60 

GTCATGACCT GGACGACAGA CGGTTTTTAC CGAGAAACGG CTGAAAAGGT GGCTTATCGT 13 2 0 

AAGGAAGAAG GCTGTGCTGT TGTGGAGATG GAGTGTTCTG CTCTTGCGGC AGTAGCTCAA 13 80 

TTGCGTGGGG TTCTCTGGGG TGAATTGTTG TTCACAGCAG ATTCTCTAGC GGACTTGGAC 14 40 

CAGTACGACA GTCGTGACTG GGGCTCGGAA GCTTTTAATA AGGCGCTAGA ACTCAGTTTA 1500 

CCAACTG7TC ACCACCTTTA GTTGTACTGG CAAAGGATTT CTTTTATCAT AAAATGTCTA 156 0 

GCTCATACTT TTCAAAAATA TGTTTAAACG ACGTCACCTT CCTCTTGTCC TAGGCATGTT 162 0 

GAGGTTGGGA AAAATCTTTA AAATCAGAAA AACGTATCAT ATCAGGTGAT GAAAACTTTG 1680 

ACACTATGCG TTTTATGTCG ATAAGATTTA GAGTGAGATG AAATGATACT CTTCGAAAAT 1740. 

CTCTTCAAAC CAGGTCACCT TCACCTTGCC GTAGGTATAT GTTACTGACT TCGTCAGTCT 1900 

TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTTCTATT TCCAACCTCA 1860 

AAACAGTGTT TTGAGCAACC TGTGACTAGC TTTCTAATCG ATGCCTTGGT TTTCATTGCC 19 20 

T AT AAT C AAA AAGAGAAATT TTCTCCTGAA AAGCATATAG AGTAGCTGGC GTTAAAAGCT 1980 

CCTGTCTTGC TTTTTTGACC TATAGTCACA TCTATCAAGT ATTGTTCTTG CCTAAGCTAT 204 0 

CAATAAAAAG GTGGCATTTT TTAGGCTTGG TGTTAGTAGA TTTTGCCTTA TCCTATCTAA 2100 

GTCATTTCGA ACTTTTTATG GTACAATGGA AACATGTTAT TCAAATTATC TAAGGAAAAA 2160 

ATAGAGCTAG GCTTATCTCC TTTATCGCCA GCCCGTCGTA TTTTTTTGAG TTTTGCCTTG 2 2 20 
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GTCATTTTAC TAGGCTCTCT TCTTTTGAGC TTGCCCTTTG TCCAAGTTGA AAGCTCACGA 22 80 

GCGACTTATT TTGATCATCT TTTCACTGCT GTCTCTGCAG TCTGTGTGAC GGGTCTCTCA 2340 

ACCCTTCCAG TAGCTCACAC CTATAATATC TGGGGTCAAA TAATCTCTTT GCTCTTGATT 2 400 

CAGATCGGTG GTCTAGGGCT CATGACCTTT ATTGGGGTTT TCTATATCCA GAGCAAGCAA 2460 

AAGCTTAGTC TTCGTAGCCG TGCAACTATT CAGGATAGTT TTAGTTATGG AG AAAC TCG A 2 520 

TCTTTGACAA AGTTTGTCTA TTCTATTTTT CTCACGACCT TTTTGGTTGA GAGCTTGGGA 2 580 

GCTATTTTGC TTAGTTTTCG CCTTATTCCT CAACTTGGCT GGGGACGTGG TCTTTTTAGT 2 64 0 

TCCATTTTTC TAGCGATCTC AGCCTTCTGT AATGCCGGTT TTGATAATTT AGGGAGCACC 2 700 

AGTTTATTTG CTTTTCAGAC CGATTTACTG GTCAATCTGG TGATTGCAGG CTTGATTATT 2 7 60 

ACAGGCGGCC TTGGTTTTAT GGTCTGGTTT GATTTGGCTG GTCATGTAGG AAGAAAGAAA 2 820 

AAAGCACGTC TGCACTTTCA TACGAAGCTT GT ACT ATT AT TGACTATAGG TTTGTTGTTA 2 880 

TTTGGAACAG CAACTACTCT CTTTCTTGAG TGGAACAATG CTGGAACGAT TGGCAATCTC 2 940 

CCTGTTGCCG ATAAGGTTTT AGTTAGCTTT TTTCAAAC AG TGACGATGCG AACAGCTGGC 3 000 

TTTTCTACGA TAG ATT AT AC TCAGGCTCAT CCTCTCACTC TTTTGATTTA TATCTTACAG 3 060 

ATGTTTCTAG GTGGGGCACC TGGAGGAACA GCTGGGGGAC TCAAGATTAC CACATTTTTT 3120 

GTCCTCTTGG TCTTTGCACG AAGTGAGCTT CTAGGCTTGC CTCATGCCAA TGTTGCGAGA 3180 

CGAACGATCG CGCCGCGAAC GGTTCAAAAA TCCTTTAGTG TCTTTATTAT CTTTTTGATG 3 2 40 

AGCTTCTTGA TAGGATTGAT TCTGCTACGG ATAACAGCCA AAGGCAATCC TCCCTTTATC 3 300 

CACCTCGTAT TTGAAACCAT TTCAGCTCTT AGTACAGTTG GTGTAACGGC AAATCTGACT 3 3 60 

CCTGACCTTG GGAAATTGGC TCTCAGTGTT ATCATCCCAC TTATGTTTAT GGGACGAATT 3 4 20 

GGTCCCTTGA CCTTGTTTGT TAGCTTGGCA GATTACCATC CAGAAAAGAA AGATATGATT 3 4 80 

CACTATATGA AAGCAGATAT TAGTATTGGT TAAGAAAGGA AAGAGCATGT CAGATCGTAC 3 54 0 

GATTGGAATT TTGGGCTTGG GAATTTTTGG GAGCAGTGTC CTAGCTGCCC TAGCCAAGCA 3 600 

CGATATCAAT ATTATCGCTA TTGATGACCA CGCAGAGCGC ATCAATCAGT TTGAGCCAGT 3 6 60 

TTTGGCGCGT GGAGTGATTG GTG AC ATC AC AGATGAAGAA TTATTGAGAT CAGCAGGGAT 3 720 

TGATACCTGC GATACCGTTG TAGTCGCGAC ACGTGAAAAT CTGGAGTCGA GTCTGCTTGC 3 780 

GGTTATGCAC TGTAAG AGTT TGGGGGT AC C GACTGTTATT GCTAAGGTCA AAAGTCAGAC 3 840 

CGCTAAGAAA GTGCTAGAAA AGATTGCAGC TGACTCGGTT ATCTCGCCAG AGTATGAAAT 3 900 

GGGGCAGTCT CTAGCACAGA CCATTCTTTT CCATAATAGT GTTGATGTCT TTCAGTTGGA 3 9 60 

TAAAAATGTG TCTATCGTGG AGATGAAAAT TCCTCAGTCT TGGGCAGGTC AAAGTCTGAG 4 020 
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TAAATTAGAC 


CTCCGTGGCA 


AATACAATCT 


GAATATTTTG 


GGTTTCCGAG 


AGC AGG AAAA 


4080 


TTCCCCATTG 


GATGTTGAAT 


TTGGACCAGA 


TGACCTCTTG 


AAAGCAGATA 


CCTATATTTT 


4 14 0 


GGCAGTCATC 


AACAACCAGT 


ATTTGGATAC 


CCTAGTAGCA 


TTGAATTCGT 


AAAGAGGGAT 


-4200 


GACCCCTCTT 


TTTTGATGCC 


TAAGATGGCA 


AATAGAGACA 


GAAGCCCCTT 


GTCTTCTAGT 

W 4 4 4 w 4 r\\J 4 


4 2 60 


AAAAGTT CTT 


CAAAGGCTGG 


ACTTTATGGT 


AAAATAGAAA 


GAAGTGACAA 


GAGAGAGTAA 


4 3 2 0 


T ACT C AATG A 


AAATCAAAGA 


TCAAACTAGG 


AAACTAGCTA 


CGGGCTGCTC 


AAAACACTGT 


4 380 


TTTGAGCTTG 


CAGATAGAAC 


TGACGAAGTC 


ACT AAC ATCT 


ATACGGCAAG 


GCGACGTTGA 


4 4 4 0 


CGCGGTTTGA 


ACAGATTTTC 


GAAGAGTATA 


AGAAAAAATC 


AGTCCCCTAA 


AGGAGTAGAT 


4 50 0 


TATGAAGTTA 


TTGTCTATCG 


CAATTTCTAG 


CTATAATGCA 


GCAGCCTATC 


TTCATTACTC 


4 5 60 


TGTGGAGTCG 


CTACTGATTG 


GTGGTGAGCA 


AGTTGGGATT 


TTGATTATCA 


ATGACGGGTC 


4620 


TCAGGATCAG 


ACTCACGAAA 


TCGCTGAGTG 


TTT AG CT AGC 


AAGTATCCTA 


ATATCGTTAG 


4680 


AGCCATCTAT 


CAGGAAAATA 


AATGCCATGG 


CGGTGCCGTC 


AATCGTGGCT 


TGCTAGAGCr 


4 7 4 0 


TTCTGGGCGC 


TATTTTAAAG 


TAG TTG AC AG 


TGATGACTGG 


GTGGATCCTC 


GTGCC^APTT 


4 800 


GAAAATTCTT 


GAAACCTTGC 


AGGAACTTGA 


GAGCAAAGGT 


CAAGAGGTGG 


ATGT CTTT GT 


4 860 


GACCAATTTT 


GTCTATGAAA 


AGGAAGGGCA 


GTCTCGTAAG 


AAGAGTATGA 


GTTACGATTC 


4 9 20 


AGTCTTGCCT 


GTTCGGCAGA 


TTTTTGGCTG 


GGACCAGGTC 


GGAAATTTCT 


CCAAAGGCCA 


4 9 80 


GTATACCATG 


ATGCACTCGC 


TGATTT AT CG 


GACAGATTTG 


TTGCGTGCTA 


GCCAG^TCTA 


504 0 


ACTGCCTGAA 


CATACTTTTT 


ATGTCG AT AA 


TCTCTTTGTC 


TTTACGCCCC 




5 100 


CAAGACCATG 


T ACT ATCT G C 


CTGTCGATTT 


CTATCGTTAT 


TTGATTGGGC 


GTG AGG AC C A 


5160 


GTCTGTCAAT 


GAGCAAGTGA 


TGATTAAGTG 


CATTGACCAG 


CAACTCAAGG 


TCAATCGACT 


5220 


CTTGATAGAC 


CAACTTGATT 


TGTCCCAAGT 


GAGTCATCCC 


AAAATGCGAG 


AATATCTGCT 


5 2 80 


GAATCATATT 


GAACTCACGA 


CGGTGATTTC 


CAGTACCCTG 


CTCAACCGAT 


CTGGAACAGC 


5340 


GGAGCATCTG 


GCAAAAAAAC 


GCCAATTGTG 


GACCTATATT 


CAGCAGAAAA 


ATCCAGAACT 


54C0 


CTTTCAGGCT 


ATTCGTAAGA 


CCATGTTGAG 


CCGTTTGACC 


AAACATTCTG 


TCTTGCCAGA 


5460 


TCGCAAACTG 


TCCAATGTCG 


TCTATCAAAT 


CACCAAATCT 


GTTTATGGAT 


TTAATTAATA 


5520 


TAAGTGTTTT 


ATAAGAGGGA 


TTTAAGAAAA 


ATTTTAACTT 


TTTCTTAGTC 


CTTTTTAATT 


5580 


TCAGGAGATT 


ATACTAGAGT 


CATCAAATAA 


AGAAAGACTC 


TAAGGAGAAT 


CCTATGAAAT 


5640 


TCAATCCAAA 


TCAAAGATAT 


ACTCGTTGGT 


CTATTCGCCG 


TCTCAGTCTC 


GGTGTTGCCT 


5700 


CAGTTGTTGT 


GGCTAGTGGC 


TTCTTTGTCC 


TAGTTGGTCA 


GCCAAGTTCT 


GTACGTGCCG 


5760 
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ATGGGCTCAA TCCAACCCCA GGTCAAGTCT TACCTGAAGA GACATCGGGA ACGAAAGAGG 5820 

GTCACTTATC AG AAAAAC C A GGAGACACCG TTCTCACTCA AGCGAAACCT GAGGGCGTTA S880 

CTGGAAATAC GAATTCACTT CCGACACCTA CAGAAAGAAC TGAAGTGAGC GAGGAAACAA 594 0 

GCCCTTCTAG TCTGGATACA CTTTTTGAAA AAGATGAAGA AGCTCAAAAA AATCCAGAGC 6000 

TAACAGATGT CTTAAAAGAA ACTGTAGATA CAGCTGATGT GGATGGCACA CAAGCAAGTC 60 60 

CAGCAGAAAC TACTCCTGAA CAAGTAAAAG GTGGAGTGAA AGAAAATACA AAAGACAGCA 612 0 

TCGATGTTCC TGCTGCTTAT CTTGAAAAAG CTGAAGGGAA AGGTCCTTTC ACTGCCGGTG 6180 

TAAACCAAGT AATTCCTTAT GAACTATTCG CTGGTGATGG TATGTTAACT CGTCTATTAC 624 0 

TAAAAGCTTC GGATAATCCT CCTTGGTCTG ACAATGGTAC TGCTAAAAAT CCTGCTTTAC 6 3 00 

CTCCTCTTGA AGGATTAACA AAAGGGAAAT ACTTCTATGA AGTAGACTTA AATGGCAATA 63 60 

CTCTTGGTAA ACAAGGTCAA GCTTTAATTG ATCAACTTCG CGCTAATGGT ACTCAAACTT 64 2 0 

ATAAAGCTAC TGTTAAAGTT TACCGAAATA AAGACGGTAA AGCTGACTTG ACTAATCTAG 64 80 

TTGCTACTAA AAATGTAGAC ATCAACATCA ATGGATTAGT TGCTAAAGAA ACAGTTCAAA 654 0 

AAGCCGTTGC AGACAACCTT AAAGACAGTA TCGATGTTCC AGCAGCCTAC CTAGAAAAAC 6600 

CCAAGGGTGA AGGTCCATTC ACAGCAGGTG TCAACCATGT GATTCCATAC GAACTCTTCG 66 60 

CAGGTGATGG CATGTTGACT CGTCTCTTGC TCAAGGCATC TGACAAGGCA CCATGGTCAG 672 0 

ATAACGGCGA CGCT AAAAAC CCAGCCCTAT CTCCACTAGG CGAAAACGTG AAGACCAAAG 67 80 

GTCAATACTT CTATCAACTA GCCTTGGACG GAAATGTAGC TGGCAAAGAA AAACAAGCGC 6840 

TCATTGACCA GTTCCGAGCA AAyCGTACTC AAACTTACAG CGCTACAGTC AATGTCTATG 6900 

GTAACAAAGA CGGTAAACCA GACTTGGACA ACATCGTAGC AACTAAAAAA GTCACTATTA 6 9 60 

AC AT AAACGG TTTAATTTCT AAAGAAACAG TTCAAAAAGC CGTTGCAGAC AACGTTAAAG 7020 

ACAGTATCGA TGTTCCAGCA GCCTACCTAG AAAAAGCCAA GGGTGAAGGT CCATTCACAG 7080 

CAGGTGTCAA CCATGTGATT CCATACGAAC TCTTCGCAGG TGATGGTATG TTGACTCGTC 7140 

TCTTGCTCAA GGCATCTGAC AAGGCACCAT GGTCAGATAA CGGTGACGCT AAAAACCCAG 72 0 0 

CCCTATCTCC ACTAGGTGAA AACGTGAAGA CCAAAGGTCA ATACTTCTAT CAATTAGCCT 7 2 60 

TGGACGGAAA TGTAGCTGGC AAAGAAAAAC AAGCG CT CAT TGACCACTTC CGAGCAAACG 73 2 0 

GTACTCAAAC TTACAGCGCT ACAGTCAATG TCTATGGTAA CAAAGACGGT AAACCAGACT 7 3 80 

TGGACAACAT CGTAGCAACT AAAAAAGTCA CTATTAACAT AAACGCTTTA ATTTCT AAAG 74 4 0 

AAACAGTTCA AAAAGCCGTT GCAGACAACG TTAAGGACAG TATCGATGTT CCAGCAGCCT 7 5 00 

ACCTAGAAAA GGCCAAGGGT GAAGGTCCAT TCACAGCAGG TGTCAACCAT GTGATTCCAT 7 56 0 
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A^GAAl. 1L1 1 


r~cLC ar.r.Tr. i^t 


GGC ATGTTG A 


CTCGTCTCTT 


GCTCAAGGCA 


TCTCACAAGG 


7620 


LALLA 1 bt> 1 l~ 


a n a t & & c 




ACCCAGCTCT 


ATCTCCACTA 


GGTGAAAACG 


7680 


Tva AA(j A(- 1_ AA 


nuu 1 AA I 


TTfTATCAAG 


TAGCCTTGGA 


CGGAAATGTA 


GCTGGCAAAG 


7740 




Lilw 1 L~A 1 1 unC 




CAAACGGTAC 


TCAAACTTAC 


AGCGCTACAG 


7800 


TC_AATGTt.TA 






CAGACTTGGA 


CAACATCGTA 


GCAACTAAAA 


7860 


AAGTCACTAT 


1 AAG A 1 AAA 1 


sji 1 Annu/uU\ 


CAT C AG AC AC 


AGCAAATGGT 


TCATTATCAC 


7920 


CriLiAACTC 


TGGTTCTGGt- 


Vj L Vj/\\_ 1 t_ LWn 


T G AATC AC AA 


TCATGCTACA 


GGTACTACAG 


7980 


AT AGC ATG C C 


TGC. I tjALALL 


A 1 bALAAo I * 


CTACCAACAC 


GATGGCAGGT 


GAAAACATGG 


8040 


CTGCTTCTGC 


TMCAAuA I L» 


1 1_ I IjA 1 Ak, Ijrt 


TGATGTCAGA 


GGATAAAGCT 


ATGCTACCAA 


8100 


ATACTGGTGA 


GaCTlaaala 


1 LAAioVjLn/V 


GT ATTGGTTT 


CCTTGGGCTT 


G CGCTT G C AG 


8160 


GTTTACTCGG 


TGGTCTAGG I 


i l^AAAAALA 


AAAAACAAGA 


AAACT AATC A 


GCTAAGGAAA 


3220 


TAAATGATGG 


ATAGTGCjIjL I 


rap"?aAriTT 
L? Av_ I AnuA 1 I 


ACTTTAACAA 


CTC AATC AGC 


AATCAGGACT 


8290 


TTCTTTCAAT 


AGCAGATTAA 


AA 1 l_A ILwIA 


AAAfAATAAA 


AAT AGT GTT A 


TACTTAAAGC 


8340 


AGTATAGCAC 


TGTTT L TATL 


AAAuuAOAvj A 


r*AGATGGGAA 


AGACAATTTT 


ACTCGTTGAC 


8400 


GACGAGGTAG 


AAATC AC. AG A 


I A i I L. A 1 V- AVJ 


AG AT ACTT AA 


TTCAGGCAGG 


TTATCAGGTC 


8460 


TTGGTAGCLC 


A IajA 1 obAt i 


op a agpgct a 


GAGCTGTTCA 


AGAAAAAACC 


GATTGATTTG 


8520 


ATTATCACAG 


ATGTCA i GA I 


<vjV_ k_ I L»Vj(jA 1 *~» 


\Jt\ I KJ\J l » AIM 


ATTTAATCAG 


TGACGTTCAA 


8580 


TACTTATCAC 


CAGAGCAULW 


1 1 I I A I l l 


ATTACTGCTA 


AGACCAGTGA 


ACAGGACAAG 


8640 


ATTT ACGGCC 


TGAGCTTGGG 


AbLAon 1 VjA i 


*™TTATTGCTA 


AGCCTTTTAG 


CCCACGTGAG 


8700 


CTGGTTTTGC 


L> 1 L> I v^L. A\_>Vn. 




CGCCTTCATC 


GTGGGGGCGA 


AACAGAGCTC 


8760 


ATTTCCCTTG 


GC AA i v- I AAA 


a A TP. A A TC Jd 1 ^ 


AGTAGTCATG 


AAGTTCAAAT 


AGGAGAAGAA 


8820 


ATGCTCGATT 


TAACTCTTAA 


ATCATTTGAA 


TTGCTGTGGA 


TTTTAGCTAG 


T AAT C C AG AG 


8880 


CGAGTTTTCT 


CCAAGACAGA 


CCTCTATGAA 


AAGATCTGGA 


AAGAAGACTA 


CGTGGATGAC 


8940 


ACCAATACCT 


TGAATGTGCA 


TATCCATGCT 


CTTCGACAGG 


AGCTGGCAAA 


ATATAGTAGT 


9000 


GACCAAACTC 


CCACTATTAA 


GACAGTTTGG 


GGGTTGGGAT 


AT AAG AT AG A 


GAAACCGAGA 


9060 


GGACAAACAT 


GAAACTAAAA 


AGTTATATTT 


TGGTTGGATA 


T ATT ATTT C A 


ACCCTCTTAA 


9120 


CCATTTTGGT 


TGTTTTTTGG 


GCTGTTCAAA 


AAATG CTG AT 


TGCGAAAGGC 


GAGATTTACT 


9180 


TTTTGCTTGG 


GATGACCATC 


GTTGCCAGCC 


TTGTCGGTGC 


TGGGATTAGT 


CTCTTTCTCC 


9240 


TATTGCCAGT 


CTTT ACGTCG 


TTGGGCAAAC 


TCAAGGAGCA 


TGCCAACCGG 


GTAGCGGCCA 


9300 
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AGGATTTTCC 


TTCAAATTTG 


GAGGTTCAAG 


GTCCTGTAGA 




TTAGGGCAAA 


9360 


CTTTTAATCA 


GATGTCCCAT 


GATTTGCAGG 


TAAGCTTTGA 


TTCCTTGGAA 


GAAAGCGAAC 


9420 


GAGAAAAGGG 


CTTGATGATT 


GCCCAGTTGT 


CGCATGATAT 


TAAGACTCCT 


ATCACTTCGA 


9480 


TCCAAGCGAC 


GGTAGAAGGG 


ATTTTGGATG 


GGATTATCAA 


GGAGTCGGAG 


CAAGCTCATT 


9540 


ATCTAGCAAC 


CATTGGACGC 


CAGACGGAGA 


GGCTCAATAA 


ACTGGTTGAG 


GAGTTGAATT 


9600 


TTTTGACCCT 


AAACACAGCT 


AGAAATCAGG 


TGGAAACTAC 


CAGTAAAGAC 


AGTATTTTTC 


9660 


TGGACAAGCT 


CTT AATTG AG 


TGCATGAGTG 


AATTTCAGTT 


TTTGATTGAG 


CAGGAGAGAA 


9720 


GAGATGTCCA 


CTTGCAGGTA 


AT C CC AG AGT 


CTGCCCGGAT 


TGAGGGAGAT 


TATGCTAAGC 


9780 


TTTCTCGTAT 


CTTGGTGAAT 


CTGGTCGATA 


ACGCTTTTAA 


ATATTCTGCT 


CCAGGAACCA 


984Q 


AGCTGGAAGT 


GGTGGCTAAG 


CTGGAGAAGG 


ACCAGCTTTC 


AATCAGTGTC 


ACCGATGAAG 


9900 


GGCAGGGTAT 


TGCCCCAGAG 


GATTTGGAAA 


ATATTTTCAA 


ACGCCTTTAT 


CGTGTCGAAA 


9960 


CTTCGCGTAA 


CATGAAGACA 


GGTGGTCATG 


GATTAGGACT 


TGCGATTGCG 


CGTGAATTGG 


10020 


CCCATCAATT 


GGGTCGGCAA 


ATCACAGTCA 


GCAGCCAGTA 


CGGTCTAGGA 


AGTACCTTTA 


10080 


CCCTCGTTCT 


CAACCTCTCT 


GGTAGTGAAA 


ATAAAGCCTA 


AAACCCCTTT 


ACAAATCCAG 


10140 


CTATTCATGG 


TAGAATAGAT 


TTTGTGTGAA 


AT AT C AG C AG 


GAAAGCATGA 


AGCTCGTCAA 


10200 


CACGTGTCTT 


ATGACAACTA 


ACCTTGGCTG 


TTTAGGCGAA 


GGGCATCTGC 


ACGG 


10254 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 9769 base pairs 
(B> TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CCGGCGACTA TCGATAACAC TTGACTTGGT AGCCCCACAT TTTGGACAAC GCATCCTTTC 60 

CCTCCTTATC GTTTTCTTTT CATTATACCA TTTTTTAAGC GATTCCCAAA ACAATTCTTC 120 

TTTTTGCTTG ACAAGTTTTT TGTTTTGTTG TATTATTTAA TTAAGACAAC AAGGTAAAAG 180 

AAAGGAGACT AAGATGTCCT GGACATTTGA CAACAAAAAA CCCATCTATT TACAGATTAT 2 40 

GGAGAAAATC AAGCTTCAGA TTGTTTCCCA TACACTGGAA CCCAATCAAC AACTTCCAAC 3 00 

CGTGAGGAGC TAGCTAGCGA GGCTGGTGTC AATCCCAATA CCATCCAAAG AGCCTTATCA 3 60 

GACCTTGAAC GAGAAGGATT TGTCTACAGC AAGCGAACAA CTGGACGATT TGTGACTAAG 4 20 

GATAAGGAGC TAATCGCCCA GTCACGCAAA CAATTATCAG AAGAAGAATT GGAACACTTC 4 80 
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GTTTCCTCCA 


TGACCCATTT 


TGGCTATGAA 


AAAGAAGAAC 


TACCAGGCGT 


AGTCAGTGAT 


540 


TATATTAAAG 


GAGTTTAAGC 


CTATGTCATT 


ACTAGTATTT 


GAAAATGTAT 


CCAAATCATA 


600 


TGGAGCAACA 


CCAGCCCTTG 


AAAATGTTTC 


TCTTGACATT 


CCAGCTGGAA 


AAATTGTCGG 


660 


CCTTCTTGGG 


CCAAACGGCT 


CAGGAAAAAC 


AACCCTGATT 


AAACTAATTA 


ATGGCCTCTT 


720 


ACAACCAGAT 


CAAGGACGTG 


TCCTCATCAA 


CGACATGGAC 


CCAAGCCCAG 


CAACCAAGGC 


780 


CGTTGT AG CT 


TATTTGCCTG 


ATACGACCTA 


TCTCAATGAG 


CAAATGAAGG 


TCAAAGAAGC 


840 




TTCAAGACCT 


TCTATAAAGA 


TTGTCAGATC 


TTGAACGCGC 


CCATCATCTA 


900 


CTTGCAGACC 


TGGGCATTGA 


TGAAAATAGT 


CGTCTCAAGA 


AACT AT C AAA 


AGGAAACAAA 


960 


GAAAAGGTTC 


AACTGATTTT 


GGTTATGAGC 


CGTGATGCTC 


GTCTCTATGT 


TTTGGACGAA 


1020 


CCCATTGGTG 


GGGTGGATCC 


AGCAGCCCGT 


GCTTATATCC 


TCAATACCAT 


TATCAACAAC 


ioao 


T ACTC AC C AA 


CTTCTACCGT 


TTTGATTTCT 


ACCCACTTGA 


TTTCTGATAT 


CGAGCCAATC 


1140 


TTGGATGAAA 


TTGTC TTCCT 


AAAAGACGGA 


AAAGTCGTCC 


GTCAAGGAAA 


TGTAGATGAT 


1200 




AGT C AGGTG A 


ATCCATTGAC 


CAACTCTTCC 


GTCAGaATTT 


AAGGCCTAAG 


1260 


CAAAGGAGAT 


TATTTATGTT 


TTGGAATTTA 


GTTCGCTACG 


AATTTAAAAA 


TGTTAACAAG 


1320 




CCCTCTACGC 


AGCCGTGCTA 


GTCCTTTCTG 


CCCTCATCGG 


AATACAGACA 


1380 


CAAGGCTTTA 


AAAATCT AC C 


TTACCAAGAA 


AGTCAGGCTA 


CTATGCTACT 


TTTTCTAGCT 


1440 


ACAGTCTTTG 


GTGGCTTGAT 


GCTTACACTT 


GGGATTTCAA 


CCATTTTCTT 


GATTATTAAA 


1500 


CGCTTCAAAG 


GTAGTGTCTA 


CGACCGACAA 


GGCTATCTGA 


CTTTGACCTT 


GCCACTTTCT 


1560 


GAACACCATA 


TCATCACAGC 


CAAACTAATC 


GGTGCCTTTA 


TCTGGTCATT 


GATTAGCACC 


1620 


GCTGTATTGG 


CTCTAACTGC 


T GTT ATT AT T 


CT1CCTTTAA 


CACCTCCAGA 


ATCCATTCCT 


1580 


CTTTCTTATG 


TGATTACATT 


TGTAGAAACA 


CATCTCCCTC 


AGATCTTTCT 


TACAGGTATA 


1740 


TCCTTCCTAC 


T AAAT ACT AT 


TTCAGGAATC 


CTCTGCATCT 


ACCTGGCTAT 




1 8 00 


C AGCTTTTC A 


ATGAATACCG 


TACAGCACTC 


GCTGTTGCAG 


TCTACATTCG 


TATCCAAATC 


1860 


GTCATTGGAT 


TTATTGAACT 


TTTCTTCAAT 


CTTAGTTCTA 


ATTTCTATGT 


CAATTCACTG 


1920 


GTAGGACTCA 


ATGACCATTT 


CTATATGGGA 


GCAGGTATAG 


CCATTGTTGA 


AG AA CT CAT A 


1980 


TTCATAGCTA 


TCTTTTATCT 


CGGAACCTAC 


TACATCTTGA 


GAAATAAGGT 


TAATTTGCTT 


2040 


TAAATAATTT 


TTACCTAGAT 


ATGTAACATA 


CTCATAGAAC 


AAAAGAGACC 


AGGCAAAAAG 


2100 


TCTTTAAAAT 


T AG AAAACG C 


ATAGTATCAG 


GTGTTGAATA 


TGTACTGCcC 


CCCAAAAGTT 


2160 


AGATTTTTTC 


TGTCTAACTT 


TTGGGGGCAG 


TTCATAAGAA 


CCTTGGTAAT 


ATGCGTTTTT 


2220 
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TGTGAGCTGA 


CTTATTTCCT 


TTCACTATAT 


CGCAAAATGA 


AATAAGAACG 


GAACGATGGG 


2280 


ATTTTGGAAT 


TCAAATCAAT 


TTATAAGAAT 


GTTTTAGAAG 


TAATATTATC 


CT ATTC CAG A 


2340 


TTCAGTTCAC 


T AT AC AATT G 


AGTTTTCAAG 


CAACCTGTTT 


ACATAATGTG 


TACATAATTA 


2400 


GGTTCGTGAT 


TCCACCCTTT 


TCACCTTTAA 


AAACCTCGCT 


TTCGCAAGGC 


TCTTCTATTT 


2460 


ATAAGATAAG 


GCACGTTTAA 


AGGTTTTCCA 


AATCCCTAAA 


TCATCCGTTT 


GAAGAACGAG 


2520 


ACTAGCATAC 


ATGCGTCCGA 


TAAATCCTGT 


TGCTACCACC 


GCAAAAATCA 


CTGTAATAGC 


2530 


AAGTGAAATC 


CATGCTTCTG 


CTCCCCCCGC 


ATAGTCATTA 


ATCGTTCGAA 


ACGGCATAAA 


2640 


GAAGGTCGAA 


AT AAAGGG AA 


TATAAGAACC 


AATCTTCAAG 


AGGAGATTGT 


CACCAGCTGC 


2700 


ACCTAGAGCT 


GTCACTCCAA 


AAAAACCACC 


CATAATCAAA 


ATCATCAAAG 


GCGACAAGGC 


2760 


TTTCCCTGAG 


TCCTCAGGAC 


GAGAAACCAT 


AGATCCTAGG 


AAGGCTGCCA 


AGACTACGTA 


2820 


CATGAAAAGA 


CTGATCAAAA 


TAAAGAGCAA 


CCTATTCACT 


GAGATAGCAT 


CTCCCAAGTG 


2890 


ATCCAAAATA 


CCAGACTGAG 


CCAAGAATGG 


CAAATCTTTA 


AAGAGCAAAA 


CGGC AGCC AG 


2940 


ACCACCTACA 


ACATAGATCC 


CAATATGCGT 


TAAAATCACT 


AGAAACAGAG 


CCATCATCCG 


3000 


CGCATAGAAA 


TAGTGACTTG 


CCCTTATGCT 


AGAAAAAACG 


ACTTCCATAA 


TTTTGGTGCC 


3060 


TTTTTCACTG 


GCAACTTCCT 


GAGCTGTTAC 


ACCCGCATAG 


GTAATCAGAA 


TCATATAAAG 


3120 


AAAGAATCCT 


AAGGCACCTG 


CTGCAATTGT 


TTGAATAAAC 


TTTTTATTTT 


CCTTGGCTTC 


3180 


ATCAATCTTT 


TCTGTGAATT 


GAATTGTCTG 


CGCTAAGCGT 


TTTTCCTGCT 


CTTGAGACAA 


3240 


GGAAGCAGTT 


GAACGATTAA 


GCTGATTTTG 


CAGTTCATTG 


AGTGTACCTG 


TAACCTCAAA 


3300 


TTTAATTCCA 


TTTTCAAGCG 


ATGTTTCGCC 


ATGATAAACT 


GCCTTTAGAA 


CACTATCTTC 


3360 


TTGATCAATG 


GTCAAATAAC 


CTTTTAATTT 


TTCTTCTTTA 


ATTGCTTCTT 


TGGCACTTGC 


3420 


TTCGTCTTTA 


TAGTCGAAGT 


TAACACCATT 


TACATTCTTC 


AGTCCTTCTG 


CTACAGATGG 


3480 


CACTGTTGTC 


AC T ACTGCC A 


CTTTATTATT 


TTTAGCCATA 


GAAGAACCTT 


GGAGATGCCC 


3540 


AATTCCTACA 


GAGATTCCTA 


AAAAGAGGAA 


CGGCGAAATC 


ACCATAAAGA 


AGAAACTCCA 


3600 


TGACTCGACA 


TGTCGAAGAT 


AGGTTTCCTT 


GATTACAACC 


CACATATTTC 


TCATACTTCC 


3660 


ACTCCTGATT 


CTAGTTTAAA 


GATTTCATCG 


ATAGTTGGCG 


CTTGTTGGTC 


AAATGTTGCG 


3720 


ATATATTGAC 


CTTGAGTCAA 


GATTGAGAAG 


AGTTCCCTTC 


CAGCGCTCTC 


ATCCTCCAAA 


3730 


ATCAATTTCC 


AACTGCCTTG 


TTTGGTCAAG 


CTCACCTGTT 


TGACATGAGG 


AAGATTTTCC 


3840 


AATTCTT CCT 


TGCTTCGTTC 


ACTTGAAACA 


AAGAGACGCG 


TTTTCCCGTA 


TTGATTGCGG 


3900 


ACATCCTGAA 


CTGGTCCGTG 


CAAGACCACA 


CGGCCATCTC 


GGATCATCAG 


AATATCGTCA 


3960 


CAAAGTTCCT 


CAACATTGGT 


CATGACATGG 


TCAGAAAAGA 


TAATGGTTGT 


CCGCGCTCTT 


4C20 
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TTTCCTGAAA AATGACTTGT TTGAGCAATT CTCTATTAAC TGGGTCCAAT CCACTAAAAG 4 080 

GCTCATCCAA GATAATCAGG TCTGGTTCAT GAATCAGAGT AATAATGAGC TGAATCTTCT 414 0 

GCTGATTTCC TTTTGACAGA CTCTTGATTT TATCTGTCAG CTTTCCTTTC ACTTCCAACC 4 200 

TCTTCATCCA TTGAGGGAGT TTTT CTTTG A CTTCTTTGGC ATCCATGCCT TTTAGAGTCG 42 60 

CCAAGTAGCG AACTTGTTCA AGAACTGTCA ATTTAGGCAT GAGATGCGTT CTTCAGGCAG 432 0 

ATAACCAATC CGAGCATAGG TCTCCTGACG AATATCCTGA CCATCCAGAC CGATTTCTCC 4 38 0 

CTGATATTCT AGG AATTTC A AAATACTATG GAAAATCGTT GTTTTTC C AG CACCATTTTT 44 4 0 

TCCGACTAGT CCCAAAATAC GACCTGGTCG CGCTTGAAAG TCAATACCAA ACAAAACTTG 4 50 0 

CTTGGATCCA AAACTTTTCT CTAGACTTCT TACTTCTAGC ATCTTTCACC TCCGAAATTT 4 560 

CTTGCACTCA TTATACTCCT TTTTGATACC CTTTACAATG TTTTTTGTCC AT TTTT AG AA 4 62 0 

GACTATTCCT GTGTAAAATA TGGCCTGGAG CACTTTTATA CTCAATGAAA ATCAAAGAGC 468 0 

AAACTAGGAA GCTAGCCGTA GACTGCTCAA AGTACAGCTT TG AG GTTGC A GATAAAACTG 47 4 0 

ACGAAGTCgA CTCAAAACAC TGTTTTGAGG TTGTGGATAG AACTGACGAA kCrTAaCTAT 4 80 0 

ATCTACGGCA AGGCGAAcTG ACGTGGTTTG AAGAGATTTT CGAAGAGTAT TAGTGATAAA 4 86 0 

TCCATTATAC AGCAGCAAAC TTAATTTATA CCTTCCGCTC CTCAACTCTC TATTTTTAAT 49 2 0 

CCTGAATTGT TATTTGAGTA ACTCCTTTTT CCTCGTAAAG TTTTCTTCCT CTAAAACTTC 4 9 80 

TGGAAAAAGG CTAATACTTT CAGACAACAT TTTT AT AAG A AACAAGTTCA TCTGTCATTT 504 0 

CAAGAAGGAG TAATCCTTTA TCTACTAATG GACGGAACAG AATTCAACCG CTTGTCCGAT 5100 

ATGTTTTCTA AGGATTATAT AGTAAAATGA AATAAGAACA GGACAAATTG AT C AG G AC AG SI 60 

TCAAATTGAT TTCTAACAAT G TTTT AG AAG T AG ATGT AT A CTATTCTACT TTCAATCTGC 52 2 0 

TATATCTATT ATGCACACCC CTATAGGATC TAATGAAAAT CACAACAGGC TCATTCATAG 52 80 

ATGGTTACCT AAGCCTAAGG GAACTAAGAA AACG ACT AC C AAGGAAGTCG CATTCATCGA 534 0 

AAAGTAGATT AACAACTATC CTAAAAAATG CTTG AACT A C AAGTCCCCCA GAGAAGACTT 5400 

CTGGATGACT AA CTTG AACT TGAAATTTAG CAATAATTAA TTCACTATCT AACTATATTT 5 4 60 

AGTAATTATT TCAGAACTGA TTAATATTAA AATTAACTAA CAATTCAAAG GATTCATACT 5520 

AGCCATAAAT TACGTCCATC AGAGAGAGAC TCTTACTACT TTTAGATTTT AGTCTTTCTA 55 80 

GCTTCAGAAT ACATCTAAAC TTTAGGGAAA ATGACTATTC GAAAGCGCGA ATGCCTCAAA 5640 

ATT AT CTC AG ATAAGCTATT CGAAACTTAG AATGCTTTTA AATTTATGGA AT TGCG ATT A 57 00 

TTCGAAACCT AGAATGCATA TAACCTTTAG TTGACAGACC TATTCTAAGT CTCGAAGGGC 57 60 
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TATTTACTTT CTATTCCTTA TCAAAAAAGA CTCATTCCCC CTTTCTCCTC CAAAATATGG 5 820 

TATAGTAGAA ATATACTATC TATGAGGAGT T T AC ATGTC A CAGGATAAAC AAATGAAAGC 5880 

TGTTTCTCCC CTTCTCCAGC GAGTTATCAA TATCTCATCG ATTGTCGGTG GGGTTGGGAG 5940 

TTTGATTTTC TGT AT TTGGG CTTATCAGCC TGGGATTTT A CAATCCAAGG AAACCCTCTC 6000 

TGCCTTTATC CAGCAGGCAG GCATCTGGGG TCCACCTCTC TTTATCTTTT T AC AG ATT TT 6060 

ACAGACTGTC GTCCCTATCA TTCCAGGGGC CTTGACCTCG GTGGCTGGGG TCTTTATCTA 6120 

CGGGCACATC ATCGGGACTA TCTACAACTA TATCGGCATC GTGATTCCCT GTGCCATTAT 6180 

CTTTTATCTA GTGCGCCTAT ACGGAGCTGC CTTTGTCCAG TCTGTCGTCA GCAAGCGCAC 62 4 0 

CTACGACAAG TACATCGACT GGCTAGATAA GGGCAATCGT TTTGACCGCT TCTTTATTTT 6300 

TATGATGATT TCCCCCATTA GCCCAGCTGA CTTTCTCTGT ATGCTGGCTG CCCTGACCAA 6 3 60 

CATCAGCTTC AAGCGCTACA TGACCATCAT CATTCTGACC AAACCCTTTA CCCTCCTGGT 64 20 

TTATACCTAC GGTCTGACCT ATATTATTGA CTTTTT CTGG CAAATGCTTT GACACGTAAA 64 80 

AAATCCGTTT GGTTTCCCAA GTGGATTTTT AAAGCGTAGA TTAACTATAG CTTGATACTA 6 54 0 

AATATACTTT GGTATGGAAA TCATGCATAT TTTTCGATAG TGACGCGAGG ACTTACCTAG 6600 

CCTTTCCGCC GTGATAGAAA CACCTGAAAT CTAATGGTTT CAGGTATTCG GAAACTTTGA 66 60 

GCCTAGTGTC TCAAAGTTTA GGTATGGAAT TTTGAAGAAA GTCGCTACCG TCCGTAATCA 67 2 0 

CTTAAGGAAA GGCTCAAAAA T ATTGTTTT C AACCACAAAA TCCGTTTGGT TTCCCAAGCG 67 8 0 

GATTTTGTGC TTTATTTTGA AACTTCTTTT GCAAGAACAA AGTTCCCAAG TGT GGC AG AA 6 84 0 

CCATTTCCTG CGACTGCTGG CGTCACGATA TAGTCACGCA CATCTGGTAC TGGTAGGTAA 6 90 0 

CCATTAAGAA GAGATGTAAA TTTCTCACGG ACACGGTCCA GCATATGTTG TTGAGCCATG 6 960 

ACCCCTCCAC CAAAGACAAT CACGTCTGGG CGGAAAGTCA CTGTCGCATT AACCGCAGCT 7 020 
TGAGCGATAT AGTAGGCTTG AACATCCCAA ACAGGGTTGT TGAGTTCAAT AGTTTCCCCA 7 080 

CGTACACCTG TACGAGCTTC CAAACTTGGA CCACCTCCA7 AAC CTTCT AG ACATCCCTTA 714 0 

TGGAAAGGAC AAAC AC CCTT AAACTCTTTT TCAATATCCA TTGGGTGTCT AGCAACATAA 7200 
TAATGACCCA TTTCAGGGTG ACCCACACCA CCGATAAACT C AC C ACGTTG GATGACGCCT 72 60 

GCACCGATAC CTGTACCGAT TGTGTAGTAA AC C AAGTT TT CGATACGACC ACCAGCATTG 7 3 20 

TTACGGGCAA CCATTTCACC G T AAGC AG AG CTGTTTACGT CTGTTGTGAA CTACATTGGC 7 380 

ACGTTTAGGG CGCGACGAAG GGCACCAACC AAGTCTACAT TTGCCCAGTT TGGTTTTGGA 7 4 40 

GTCGT CGTG A TAAAGCCATA AGTTTTTGAG TTTTTGTCAA TATCAATCGG CCCAAATGAA 7 500 

CCAACTGCAA GACCAGCAAG GTTATCGAAT TTTGAGAAGA ACTCAATGGT T TT AT CG ATT 7 560 
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GTTTCGATTG GAGTTGTTGT TGGAAATTGT GTTTTTTCTA CAACGTTAAA GTTTTCATCA 7 620 

CCGACAGCAC AGACAAACTT TGTACCGCCC GCTTCCAAGC TTCCATATAA TTTTCTCATC 7 680 

ATAAACCTCT TGTTTTTATT TTCTTTATTA TAGCATACTT CGAAAGTCTA AATGTCTCTA 7 740 

TTTTTTAGAT TTTCCTCTGT AAATCTTACT ATCTAATAAA AACGAACAAA CATGTCATTT 7800 

GTTCGTTTTC ACATTAGAGA GGATTGATTA CATTTTCACT TCGATCACAG CATCCCCCTT 7 8 60 

AGCAACTGAA CCTGTTGCGA CTGGAGCTAC TGAAGCGTAG TCACCTGTAT TTGTAACGAT 79 20 

AACCATTGTT GTATCATCAA GTCCAGCTGC AGCGATTTTG T TTGAGTCAA ATGTTCCAAG 7980 

AACATCGCCA GCTTTCACCT TATTACCTTG AGCAACTTTT GTTTCAAAAC CGTCACCGTT 804 0 

CATAGATACA GTATCAATAC CAACATGAAT CAAAACTTCA GCACCATTTC TTGTTTTCAA 8100 

ACCAAAAGCG TGCCCTGTTG GAAAGGCAAT TGAAACTTCA GCATCAGCTG CTGCATAGAC 8160 

CACGCCTTGG CTTGGTTTCA CAACGATACC TTGTCCCATA GCTC C ACTTG AGAAGACTGG 8220 

GTCATTGACA TCAGCAAGAG CGACAACATC ACCGACGATA GGAGTTACAA GTGTTTCATT 8 2 80 

TTGAAGAGCT GCTGGCGCAA CTTCTTCTTT TTCTTCAGCC ACTTCAGCTC GTTTTGCACC 8 34 0 

TGCAGTTGCG TCTACTTCAT CTTCGTAACC AAACATGTAA GTAACAGCAA AACCAAGGGC 8 4 00 

AAATGATACA GCTACCATAA CAACGTATTG TGGAAGTTGT CCGTTACCAA CATAAAGCAT 84 60 

TGTACCACGC ATGATGGTGA TACCATTACC AGTACCAGCA AGTCCAAGGA TAGAAGCCAA 85 20 

TCCACCACCG ATTGCACCAG CAATCAATGA AAGGAAGAAT CGTTTACGCA AGCGCAAGTT 8 5 80 

CACCCCGAAG ATACCAGCCT CTCTAATACC TAGGAAGCCA GAAACAGCAG CCGGGAAAGC 3 640 

AAGTGTTTTC AGTTTTGGAT TTTTTGTTTT AACACCAACC GCAACAGTAG CAGCACCTTG 8 7 00 

AGCTGTCATA GCACCTGTGA TGATAGCGTT GAATGGGTTA GCATGGTCAG CAGCAAGTAA 87 60 

TTGCACTTCA AGCAAGTTGA AGATGTGGTG CACACCTGAC ACGACGATCA ATTGCTGAAC 88 20 

CCCACCAATC AAGAAACCAC CAAGACCAAA TGGCATGCTA AGAATCGCTT TTGTAGCAAT 8880 

AAGGATGTAG TTTTCAACAA CGTGGAAAAC TGGTCCAATG ACAAAGAGTC CAACGATAGA 8 94 0 

CATGACCAAA AGTGTCACGA AT GGTGTT AC CAAGAGGTCA ATGACATCTG GAACAACTTG 900C 

CGGACAGCTT TTTCAAATTT AGCTCCGACA ACCCCGATGA TGAAGGCTGG AAGAACGGAA 9060 

CCTTGCAAAC CAACAACAGG GATGAAACCA AAGAAGTTCA TCCCTGTTAC TTCACCACCT 9120 

TGAGCAACTG CCCAAGCGTT TGGAAGTGAG CCAGAGACAA GCATCATACC AAGAACGATA 9180 

CCAACGGCAG GATTTCCACC AAATACACGG AAGGTTGACC ACACAACCAA ACCTGGCAAG 92 40 

ATGATGAAGG CTGTATCTGT CAAGATTTGT GTGTAAGTTG CAAAGTGACC TGGAAGTGGC 9 300 
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ATTTCAAGAG CGTTGAAAAG ACCACGCACA CCCATGAAGA GACCTGTCGC TACGATAACT 9 3 60 

GGGATGATTG GAACGAAAAC ATCACCAAAA GTACGGATAG CACGTTGGAA CCAGTTCCCT 94 20 

TGTTTAGCAA CTTCTGCTTT CATGTCATCC TTAGATCATG TTGGTAATCC AAGTACAACA 94 30 

ACTTCATCGT ACATTTTGTT AACTGTACCT GTACCAAAGA TAATTTGGTA TTGCCCTGAG 9 540 

TTAAAGAAAG CACCTTGAAC TTTTTCCAAG TTCTCAATCA CTTCTTTATT GATTTTCTCT 9600 

TCATCTTTGA CCATGACACG TAGACGAGTC GCACAGTGGG CAACACTATT GACATTTTCA 9 660 

CGTCCGCCCA AGGCATCGAT GACTTTTTTT GCAATTTCCT G ATTGTT CAT TTGCAAAAAT 9720 

CTCCTTATAT AACATTTTGT TCTTGTTTGA AAGCGATTTT ATTCGCCGG 976 9 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3149 base pairs 
(3) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCCTTGAGTG CTAATTCATA GTTCTATTCT ATCACTTGGT CAGAAATAAT CAAGAAAAAA 60 

G TCTG ACTTT CTCAAGATAA AAAGC CTG AG AC CAACTC AG ACTTTTTAAT TCTTAAAATG 120 

GCAATTCTTC CTCTTCCAAG ACCAAATCTC CCAAATCTTG GCCTGCATTA TTTTCACGCA 190 

TAGCACGTTG CGCACGACTT TCCAAGAGTT GGAATCCTGT GACAAGTACT TCGGTCACGT 2 40 

ACTTC ATTT G GCCATTTTTC TCAAAGCGAC GGGTACGCAA TTCTCCATCA ACGC-AAATGA 300 

GACTACCTTT GGTTGCGTAC TTGCCAAAGT TTCTGCTAGT CTGCCCCATA GGACCATATT 3 60 

GACAAAATCA GCTTCACGTT CACCGTTTTG CTCTTTGTAA CGACGGTTCA CACCGATACT 4 20 

TGCTCGCGCT ACCGACTTGT CATTGTTGGT TTTGTGCAAT TCTGGTGTAG ACGTTAAACG 4 80 

TCCAATCAAG ATAACTTTAT TATACATATT TTCTTCCTCC TACTTATCTA TTCGTAGGAA 540 

ATCAAAAAAA GTTACAGAAA TTTGTAACTT TTCGAGAAAA TTTTTTATTT TTTATGAACC 600 

ATGAAACCTG TCGCCTGTTG ATTGGCCATA ATGGTCATAT CTGTAATCTG AACACGACGA 6 60 

GGTTGACTAG TCACATAGAC TACTGTATCT GCAATATCCT GAGCTTGCAA AGCTTCTATT 720 

CCTTGGT AAA CGGACGCAGC TCGTTCTTTA TCACCATGAA AACGCACTGT AGAAAAATCT 7 80 

GTTTCGACAA TTCCAGGCTG AATGGTCGTC ACCTTGATAT CCGTTGCGAT GGTATCAATT 84 0 

CGCAGTCCAT CTGAAAAGGT CTTAACTGCC GCCTTGGTCG CTGAGTAAAC AGCTGCACCA 900 

GCATAGGCAT AAATTCCTGC GGTTGACCCC ATATTGATAA TATGACCTTG ATTGGCTTTT 960 
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ACCATTGCTG GCAAGAAACA GCGAGTGACT GCCATCAAAC CTTTGACATT GGTATCCAAC 1020 

ATGGTCAGCA TATCCAACTC TTCATAGTCT TGATAGGGAG CTAAGCCAAG AGCCAGTCCT 1080 

GCGTTATTGA CCAGGATGTC AATCTGACCT ATCCTTTCTA AAAT ATC AG A GCAGACAGTC 1140 

TTTACCATTG TCATATCCGT GACATCTAGG AGAAAAGTCC AAACTGTTTG ATTTGGAAAA 1200 

GTTTCTGCAA ACTCCGCCTT AAGAGCTTCT AGTCTGTCTA TCCCTCCTCC TGTTAGAACG 12 60 

ACATCCTCAC CCTGCTCCAG ATAAGCACGC GCAATCGCTT CACCGATTCC TGATGTCGCT 13 20 

CCTGTAATCA CAACATTTTT TGCCATCTTA TTTCCTTCTA GCTGGTCTAT CAGATATTAA 13 80 

CAACTTCTTA GGCAGTCCAG TGTTTCGCTG GGTCGAACGG TGTTCCGACA ACTTGGTCTT 14 40 

CTGATAATTC AAGCACCCCA CGTTTTTGTG GAGCATTTGG CAGATGCAAT TCACGAGGAC 1500 

T G C AC ATC AT ACCAAAACTC TTTTCACCAC GAAGTTCACC TGGGAAAATG AGATTCCCTT 1560 

TTGGCATCAT AGCTCCAGGA AGCGCGACAA TGGTTTTCAA CCCCACACGC GCATTGGGAG 16 20 

CTCCTGCAAC GATTTGTACA GTCTTATCAC TTGCGACTGC AACTTGGCAG AT GTTG AGGT 16 80 

GGTCACTATC TGCATGGGCT ACCATCTCAA CAATTTCACC TACAACAAAC TTAGGTTCCT 174 0 

T ATC ATT AAC AATTTCTTCT GTAAAACCTT CCGCCTGCAA CTCTTGGTTC AAACGAGCGA 1800 

CTTGCTCATC TGTCAAAAAG ACTTGACCGC GCTCTGCAAT TTC AAAT AAA CTTGAAACTT 18 60 

CCAAAATATT CCAAGCCACT GTTTCCCCAT TATCTTTGAG AAAAACACGG GCTACCTTGC 19 20 

CTTTGCGCTC CACATCCAGT TTGGCATCTC CGCTATTTTT CACGATGACC ATAAGGACAT 19 3 0 

CACCCACATC TTCTTTATTA TATGTAAAAA TCATTGTTTC CTTTTTCTCC TATTTCAGTC 2 04 0 

CTGCTAAAAA GTCATTGATT TGTTG CTTGC TTTTACGGTC GCGATTGACA AAACGACCGA 2100 

TTTCCTTGTC CTTTTCTAGA ACAACAAGGC TAGGAATTCC GTAAACATCC CAGAGTTTGG 2160 

CCAAATCCAT AT ACT G ATCT CGGTCCATTC GAATAAACGT GAACTCTGGA TTGGTCTCCT 222 0 

CAATCTCTGG TAAGGCAGGA T AAAT AT AAC GACAATCGCT ACACCAGTCT GCCACAAAAA 2 280 

TGAAGACCTT CTTGCCCGCT TTTTCCACTA AAGATGCTAA TTCTTCTAAA CTTGCTGGCT 2 34 0 

GTATCATAAG ACTTCCTCCT CATAGACTAG GTCTTCATTT TC AT AG AC AA AGGTATAATG 2 4 00 

ACGCCCATCC TCAAAAATGA CGCCACCAAC CAAGCTCTCC ACACTGCTTT CGTAAACTTG 2 4 60 

AACATAAAGG GTCGCAATTT CCCCCATGTC GGAAAAATGG TCTCGCACAA TCTCTGTCAA 2 5 20 

CTCTTCCTGA GTCTTCATGA GCTTACGGTC ATCTGCAACT TTTTTCGTAG CAAGAGCAAG 2 5 80 

GCTTCCGATA CCTAGCAGAG CCAAGCCTGC CATCCACATT TTTTTAGCTT TCATACCATT 2 64 0 

CATTTTAACA CAAAAAAGGC TTCAGGACAA ATGAGGAAGC AGCAGAAAAG CAAGTAAAAA 2 7 00 
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GCCTCTTCCT TTAAGGAAAA GGACTTCTTA TACTCAATGA AAATCAAAGA CCAAACTAGG 2*760 

AAGCTAGCCG CAGGCTGCTC AAAGCACTGC TTTGAGGTTG TAGATAGAAC TGACGAgTCa 2820 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AGCTGACGTG GTTTGAAGAG ATTTTCGAAG 2880 

AGTATTATTC TTATTGCCAG GCACCTAAGT TGCCAACGTA GTAACTATCA GGTGTGT AGG 2 94 0 

TATTGCGAGC ATCTTACCTG ATGAAGCCAG ATAATACTAC TTGCCATTGT CTTTGACCCA 3000 

ATCATTCGCA ATCATGGAAC CAGAAGAACT TACATAATAC CATTCTCCCT TGTCATAAAC 3 0 60 

CCAAGTACTG ACTTTCATGG TTCCTGAGCA ATTAAAGGCA AAAAAACTGT CCAATAACAT 3120 

TCGTTTTTTA AAAGCATTTG AC ACT A CAT 3149 
(2) INFORMATION FOR SEQ ID NC : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CCAAAAATTC AACCTTTAAG GGGAGTCCAG AGAGACTCAC AAGGTGTCAG AT AAAAG AAT 60 

GGTGCAATTT TCTAGAGGAG ACTTTTTGAG TCTGCTCTCT TGTGTTGT AC GATTTTAACT 12 0 

GAGGCCTTGC ACTAGCAAGG TCTTTTCTTT ATCTGGTCCC C TT AAAATTT AAGGAGGAAA 180 

AGTTATGAAT CCCACATGTA AGAAGCGTTT GGGTGTCATT CGGTTGGAAA CCATGAAGGT 2 40 

CGTTGCACAA GAGGAAATCG CGCCACAATC TT7CAATTAC TCCTAGAAGG AGAAATGGTT 3 00 

GAAGCCATCC GAGCAGGCCA ATTTCTTCAT CTGCGTGTAC CGGACGATGC CCATCTCTTA 3 60 

CGTCGTCCTA TTTCAATTTC GTCTATTGAC AAGGCAAACA AGCACTGTCA CCTCATTTAT 4 20 

CGGATTGACG GAGCTGGGAC TGCAATTTTT TCAACCTTAA GTCAGGGAGA CACTCTTCAT 4 80 

GTGATGGCGC CTCAGGGAAA TCGTTTTGAC TTGTCTG AC C TTGATGAGCA GAATCAGGTT S4 0 

CTCCTTGTTG G T GGTGGG AT TGGTGTTCCA CCCTTGCTTG AGGTGGCCAA GGAATTGCAT 600 

GAACGTGGAG TGAAAGTAGT GACACTCCTC GGTTTTGCTA ATAAGGATGC TGTTATTTTG 6 60 

AAAACGGAAT TGGCTCAGTA TGGTCAGGTC TTTGTAACCA CAGATGATGG TTCTTATGGC 720 

ATCAAGGGAA ATGTTTCCGT TGTTATCAAT GATTTAGACA GTCAGTTTGA TGCTGTTTAC 780 

TCGTGTGGGG CTCCAGGAAT GATGAAGTAT ATCAATCAAA CCTTTGATGA TCACCCAAGA 840 

GCCTATTTAT CTCTGGAATC TCGTATGGCT TGTGGGATGG GAGCTTGCTA TGCCTGTGTT 9 00 

CTAAAAGTAC CAGAAAACGA GACGGTCAGC CAACGCGTCT GTGAAGATGG TCCTGTTTTC 960 
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CGCACAGGAA CAGTTGTATT ATAAGGAGAA AATT AT G ACT ACAAATCGAT TACAAGTTTC 102 0 

TCTACCTGGT TTGGATTTGA AAAATCCGAT TATTCCAGCA TCAGGCTGTT TTGGCTTTGG 1080 

ACAAGAGTAT GCCAAGTACT ATGATTTAGA CCTTTTAGGT TCTATTATGA TCAAGGCGAC 1140 

AACCCTTGAA CCACGTTTTG GGAATCCAAC TCCAAGAGTG GCAGAGACGC CTGCTGGTAT 12 00 

GCTCAATGCA ATTGGCTTGC AAAATCCTGG TTTAGAGGTT GTTTTGGCTG AAAAGCTACC 1260 

TTGCCTGGAA AGAGAATATC CAAATCTTCC TATTATTCCC AATGTACCTC GTTTTTCAAA 13 20 

ACAAGAGTAT GCAGCTGTTT CTCATGGGAT TTCCAAGGCA ACTAATGTAA AAGCTATCGA 13 80 

CCTCAATATT TCTTGTCCCA ATGTTGACCA CTGTAATCAT GGACTTTTGA TTGGTCAAGA 1440 

TCCAGATTTG GCTTATCATC TGGTGAAAGC AGCTGTGGAA GCCTCAGAAG TGCCAGTTTA I 500 

TGTCAAATTA ACCCCGAGTG TGACCGATAT CGTTACTGTC GCAAAAGCTG CACAAGATGC 1560 

GGGAGCAAGT GGCTTGACCA TGATCAATAC TCTCCTTGGA ATCCCCTTTG ACCTCAAAAC 1620 

TAGAAAACCA ATCTTGGCCA ATGGAACAGG TGGAATGTCT GGTCCAGCAG TCTTTCCAGT 1680 

AGCCCTCAAA CTCATCCGCC AAGTTGCCCA AACAACAGAC CTGCCTATCA TTGGAATGGG 17 40 

AGGAGTGGAT TCGGCTGAAG CTGCCCTAGA AATGTATCTC GCTGGGGCAT CTGCTATCGG 1900 

AGTTGG AAC A GCTAACTTTA CCAATCCTTA TGCCTGCCCT GACATCATCG AAAATTTACC 1860 

AAAAGTCATG GATAAATACG GTATTAGCAG TCTGGAAGAA CTCCGTCAGG AAGTAAAAGA 19 20 

GTCTCTGAGG TAAACTGCAA TCAATCTGTT CTTGATTTTT TATTAGTTTG TAATATGAAT 19 80 

TTAGGAGAAT TTTGGTACAA TAAAATAAAT AAGAACAGAG GAAGAAGGTT AATGAAGAAA 204 0 

CTAACATTTA TTTTTTTAGC TCTGCTATTT TTCTTAGCTA GTCCAGAGGG TGCAATGGCT 2100 

AGTGATGGTA CTTGGCAAGG AAAACAGTAT CTGAAAGAAG ATGGCACTCA AGCAGCAAAT 2160 

GAGTGGGTTT TTGATACTCA TTATCAATCT TGGTTCTATA TAAAAGCAGA TGCTAACTAT 2 22 0 

GCTGAAAATG AATGGCTAAA GCAAGGTGAC GACTATTTTT ACCTCAAATC TGGTGGCTAT 2 2 80 

ATGGCCAAAT CAGAATGGGT AGAAGACAAG GGAGCCTTTT ATTATCTTGA CCAAGATGGA 2 34 0 

AAGATGAAAA GAAATCCTTC GGTAGGAACT TCCTATGTTG GTGCAACAGG TGCCAAAGTA 2 4 00 

ATAGAAGACT GGGTCTATGA TTCTCAATAC CATGCTTCGT TTTATATCAA AGCAGATGGA 2 4 60 

CAGCACGCAG AGAAAGAATG GCTCCAAATT AAAGGGAAGG ACTATTATTT CAAATCCGGT 2 52 0 

GGTTATCTAC TGACAAGTCA GTGGATTAAT CAAGCTTATG TGAATGCTAG TGGTGCCAAA 2 580 

GTACAGCAAG GTTGGCTTTT TGACAAACAA TACCAATCTT GGTTTTACAT CAAAGAAAAT 2 64 0 

GGAAACTATG CTGATAAAGA AT GG ATTTTC GACAATGGTC ACTATTATTA TCTAAAATCC 2 7 00 
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GGTG GyT AC A TGGCAGCCAA TGAATGGATT TGGCATAAGG AATCTTGGTT TTATCTCAAA 27 60 

TyTGATGGGA AAATrGCTGA AAAAGAATGG CTCTACGATT CTCATAGTCA AGCTTGGTAC 2 82 0 

TACTTCAAAT CCGGTGGTTA CATGACAGCC AATGAATGGA TTTGGGATAA GGAATCTTGG 2 880 

TTTTACCTCA AATCTGATGG GAAAATAGCT GAAAAAGAAT GGGTCTACGA TTCTCATAGT 294 0 

CAAGCTTGGT ACTACTTCAA ATCTGGTGGC TACATGGCGA AAAATGAGAC AGTAGATGGT 3000 

TATCAGCTTG GAAGCGATGG TAAATGCCTT GGAGGAAAAA CTACAAATGA AAATGCTG CT 3 060 

TACTATCAAG TAGTGCCTGT TACAGCCAAT GTTTATGATT C AG ATGGTG A AAAGCTTTCC 312 0 

TATATATCGC AAGGTAGTGT CGTATGGCTA GAT AAGG AT A GAAAAAGTGA TGACAAGCGC 3130 

TTGGCTATTA CTATTTCTCC TTTGTCAGGC TATATGAAAA CAGAAGATTT ACAAGCGCTA 3 2 40 

GATGCTAGTA AGGACTTTAT CCCTTATTAT GAGAGTGATG GCCACCGTTT TTATCACTAT 3 300 

GTGGCTCAGA ATGCTAGTAT C CC AG TAG CT TCTC AT CTTT CTGATATGGA AGTAGGCAAG 3 3 60 

AAATATTATT CGGCAGATGG CCTGCATTTT GATGGTTTTA AGCTTGAGAA TCCCTTCCTT 342 0 

TTCAAAGATT T AAC AG AG G C TACAAACTAC AGTGCTGAAG AATTGGATAA GGTATTTAGT 3 4 80 

TTGCTAAACA TTAACAATAG CCTTTTGGAG AACAAGGGCG CTACTTTTAA GGAAGCCGAA 3 54 0 

G AAC AT T AC C ATATCAATGC TCTTTATCTC CTTGCCCATA GTGCCCTAGA AAGTAACTGG 3 600 

GGAAGAAGTA AAATTGCCAA AGATAAGAAT AATTTCTTTG GCATTACAGC CTATGATACG 3 660 

ACCCCTTACC TTTCTGCTAA GACATTTCAT GATGTGGATA AGGGAATTTT AGGTGCAACC 3 720 

AAGTGGATTA AGG AAAATT A TATCGATAGG GGAAGAACTT TC CTTGG AAA CAAGGCTTCT 3 7 80 

GGTATGAATC TGGAATATGC TTCAGACCCT TATTGGGGCG AAAAAATTGC TAGTGTGATG 3 84 0 
ATGAAAATCA ATG AG AAGCT AGGTGGCAAA GATTAGTACT AT AAGTG AAT ATCATTTGAC 3 900 

TGAATAGTAA GTTAAAAATC CTGATTTCAA GTAAAATCAG GATTTTTTCA TGGATGCAAT 3960 
TTTTTTGGAG TCTGGTGTGA CGCGG AG GGT CTTTTGTCCT GTGTAAGTGA CAAAGCCGGG 4 02 0 

TTTTCCACCA GTTGGTTTAT TGAGTTTTTT GACTTCAATC ATATCTACCT GCACCAGATT 4080 
CGACAGGCGC CCTTGAGAGA ACT AGG C AG C TAACTCTGCT GCGT CTGTCT TGACTGCATC 414 0 

AGATGGGTCA AG ATTTC CTG AGATGACAAC ATGGCTTCCA GGAATGTCCT TAGCATGGAA 4 200 

CCAAAGTTCC TCCTTGCGGG CCATTTTAAA GGTCAATTCC TCATTTTGAA GATTGTTTCG 42 60 

TCCGACATAG ATGATGGTTT TGCCATCGCT TGCT AG AT AT TGTTCTAGTT TTTT GCGTTT 4 3 20 

CTGGATTTTC TCCCGTTGTC TTCTGCGGAT AAAACCTGTT TGAATCAATT CTTCACGGAT 4 3 80 

TTC AG C G ATT TCTTCCAGTC CAGCTTGGTT GACCACCCTT TCTACACTTT CC AG AT AG AG 4 4 40 

AATAGTGGCT TTGGTTTCTT CAATCAAATC AGTCAAGTAT TTGACAGCTT CTTTGAGTTT 4 500 
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CTGATACCGT TTAAAATAGC GTTGGGCATT CTGGTTGGGA GTCAGAGCCT TATCAAGCGC 4 560 

AATCATGATA GGTTGGTTGG TATAG7AGTT GTCTAGGATA ACCTGGTCTT G GTCGTT AG G 4 62 0 

CACTTGGTGG AGGAAGGTTG TCAGCAATTC TCCTTTTTGA CGAAATTCTT CAGCGTTGTC 4 630 

TGTCGCCAGT AACTCTTTTT CCTGTTTTTT GAGTTTGTGT CGGTTTTTCT GAAGTTCATT 47 4 0 

TTCAACACGA CGAATCAGTT CACTGGCCTG CTGTTTCACG CGGTCGCGCT CAGCCTTATC 4 800 

CTTATAGTAG GTGTCCAACA AATCAGAAAG ATTTGCAAAA GGCTCTCCCA CCTGATTTGC 4 860 

AAAAGGAACT GGACTGAAGG AACTCTCAGT CAAGCATGGC TTGGTTTCTT GATTGAAAAA 4 9 20 

ATTTCGGAAA GCGCAAAGTT TTTC ACT AAC CAGTATCCTT TCCAATTCAT TTGCCGTATC 4 930 

GCGTCCCAGA CCTTGAAAGA GGCTTTGAAG ATTTTTTGCT GTTAGTTCTT GGGTTTGCAG 504 0 

GATTTCAAAG AGCTTTTCAT CCTTGATAGT AAAAGGATTG AG AG ATTTTG TACTTGGCGG 5100 

AGCGATATAG GTCGATCCTG GAAGTAAGGT GCGGTAGCTA TTTTGTGAAA AGCCGACGTG 5160 

TTTGATAACT TCGAGGATTT TATGACTGCT TTTATCGACC AG T AG AAT AT TACTGTGTTT 52 2 0 

CCCCATAATT TCGATAATCA AGGTAGCCTG GATATGGTCT CCAATCTCGT TTTTATTGGA 52 80 

AACTGTAATT TCCACAATAC GGTCATTTTC CACTTGCTCA ATCGACTCAA TCAGGGCCCC 53 4 0 

CTGCAAATAC TTTCTCAAAA C C ATG AT AAA GGTAGAAGGT TGAGCTGGAT TTTCAAAACT 5400 

CGTTTGGGTC AGCTGAATGC CTCCAAAAAC TGGATGGGCA GAAAGGAGCA GCCGATGGCT 54 60 

TTGGCGATTG CTGCGGATTT GCAAGACCAA CTCTTGTTCA AAAGGCTGAT TGATTTTCTG 55 20 

CATCCGACCA TTCACTAATT CGCTTCGCAA TTCCTCAACT ATGTGGTGTA AAAAAAATCC 55 80 

GTCAAATGAC ATCGTTCTCT CCTTGTGATT GTATTCCATA GTATTATATC AAAAAGGTAG 564 0 

AATAAAATCA TGGAAATGTG GT AT AAT AAA GCCAAGTAAA GAGAAACGAG AAGCACATGT 57 00 

ATATTGAAAT GGTAGATGAA ACTGGTCAAG TTTCAAAAGA AATGTTGCAA CAAACCCAAG 57 60 

AAATTTTGGA ATTTGCACCC CAAAAATTAG GAAAAGAAGA CAAGGAGATG GCAGTCACTT 5 82 0 

TTGTGACCAA TGAGCGTAGT CATGAACTTA ATCTGGAGTA CCGTAACACC GACCGTCCGA 5 38 0 

C AG ATGTC AT CAGCCTTGAG TATAAACCAG AATTGGAAAT TGCCTTTGAC GAAGAGGATT 594 0 

TGCTTGAAAA TTCAGAATTG GCAGAGATGA TGTCTGAGTT TGATGCCTAT ATTGGGGAAT 6 0C0 

TGTTCATCTC TATCGATAAG GCTCATGAGC AGGCCGAAGA ATATGGTCAC AGCTTTGAGC 6060 

GTGAGATGGG CTTCTTGGCA GTACACGGCT TTTTACATAT TAACGGCTAT GATCACTACA 6120 

CTCCGGAAGA AGAAGCGGAG ATGT TCGGTT TACAAGAAGA AATTTTGACA GCCTATGGAC 6180 
TCACAAGACA ATAAACGAAA ATGGAAAAAT CGTGACTTGA TATCCAGTTT AGAATTTGCT . 6 2 4 0 
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TTGACAGGTA TTTTTACTGC TATCAAGGAA GAACGCAATA TG6SA^^CA CGCAGTGACG 6 3C0 

GCTCTAGTGG TCATCCTTGC AGGTTTTGTT TTTCACGTGT CACGAATCGA ATGGCTCTTT 63 60 

CTCCTATTGA GTATTTTCTT GGTAGTAGCC TTTGAGATTA TCAACTCTGC TATTGAAAAT 6420 
GTGGTGGATT TGGCCAGTCA CTATCACTTT TCCATGCTGG CTAAAAATGC CAAGGATATG 64 30 

GCGGCCGGCG CGGTATTAGT GGTTTCTCTT TTCGCAGCCT TAACAGGCGC ATTGATTTTT 6 540 

CTCCCACGAA TCTGGGATTT ATTATTTTAA ACAGT AAG AG GAAATTATGA CTTTTAAA7C 6 600 

AGGCTTTGTA GCCATTTTAG GACGTCCCAA TGTTGGGAAG TCAACCTTTT TAAATCACGT 56 60 

TATGGGGCAA AAGATTGCCA TCATGAGTGA CAAGGCGCAG ACAACGCGCA ATAAAATCAT 67 20 

GGGAATTTAC ACGACTGATA AGGAGCAAAT TGTCTTTATC GACACACCAG GGATTCACAA 67 90 

GCCTAAAACA GCTCTCGGAG ATTTCATGGT TGAGTCTGCC TACAGTACCC TTCGCGAAGT 684 0 

GGACACTGTT CTTTTCATGG TGCCTGCTGA TGAAGCGCGT GGTAAGGGGG ACGATATGAT 6 900 

TATCGAGCGT CTCAAGGCTG CCAACGTTCC TGTCATTTTG GTCGTGAATA AAATCGATAA 69 60 

GGTCCATCCA GACCAGCTCT TGT CTC AG AT TGATGACTTC CGTAATCAAA TGCACTTTAA 70 20 

CGAAATTGTT CCAATCTCAG CCCTTCAGGG AAATAACGTG TCTCGTCTAG TGGATATTTT 7080 

GAGTGAAAAT CTGGATGAAG GTTTCCAATA TTTCCCGTCT GATCAAATCA CACACCATCC 7140 

AG AACGTTT C TTGGTTTCAG AAATGG TTCG CGAGAAAGTC TTCCACCTAA CTCGTGAACA 7 200 

GATTCCGCAT TCTGTAGCAG TAGTTGTTGA CTCTATGAAA CGAGACGAAG AGACAGACAA 72 60 

GGTTCACATC CGTGCAACCA TCATGGTCGA GCCCGATAGC CAAAAAGGGA TTATCATCCG 7 3 20 

TAAAGGTGGC GCTATGCTTA AGAAAATCGG TAGCATGGCC CGTCGTCATA TCGAACTCAT 73 80 

GCTAGGAGAC AAGGTCTTCC TAGAAACCTG GGTCAAGGTC AAGAAAAACT GGCCCGATAA 7 440 

AAAGCTAGAT TTGGCTGACT TTGGCTATAA TGAAAGAGAA TACTAAGTAG AGGTAGGCTC 7500 

ATGCCTGCTT CTTGTTTTTA CAGAAGGAGG ACTTATGCCT GAATTACCTG AGGTTGAAAC 7 5 60 

CGTTTGTCGT GGCTTACAAA AATTCATTAT AGCAAAGAAG ATTTCGAGTA TAGAAATTCG 7 620 

CTACCCCAAG ATGATTAAGA CGGATTTGGA AGAGTTTCAA AGGGAATTGC CTAGTCAGAT 7 6 80 

TATCGAGTCA ATGGGACGTC GTGGAAAATA TTTGCTTTTT TATCTGACAG ACAACCTCTT 77 4 0 

GATTTCCCAT TTGCGGATGG AGGGCAAGTA TTTTTACTAT CCAGACCAAG GACCTGAACG 7300 

CAAGCATGCC CATGTTTTCT TT C ATTTTG A AGATGGTGGC ACGCTTGTTT ATGAGGATGT 7860 

TCGCAAGTTT GGAACCATGG AACTCTTGGT GCCTGACCTT TTAGACGTCT ACTTTATTTC 7 9 20 

TAAAAAATTA GGTCCTGAAC CAAGCGAACA AGACTTTGAT TTACAGGTCT TTCAATCTGC 7 980 

CCTTGCCAAG TCCAAAAACC CTATCAAATC CCATCTCCTA GACCAGACCT TGGTAGCTGG 804 0 
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ACTTGGCAAT ATCTATGTGG ATGAGGTTCT CTGGCGAGCT CAGGTTCATC C AG CT AG AC C 3100 

TTC CC AG ACT TTGACAGCAG AAGAAGCGAC TGCCATTCAT GACCAGACCA TTGCTGTTTT 8160 

GGGCCAGGCT GTTGAAAAAG GTGGCTCCAC CATTCCGACT TATACCAATG CCTTTGGGGA 82 20 

AGATGGAAGC ATGCAGGACT TTCATCAGGT CTATGATAAG ACTGGTCAAG AATGTGTACG 82 80 

CTGTGGTACC AT C ATTG AG A AAATTCAACT AGGCGGACGT GGAACCCACT TTTGTCCAAA 3 340 

CTGTCAAAGG AGGGACTGAT GGGAAAAATC ATCGGAATCA CTGGGGGAAT TGCCTCTGGT 84 00 

AAGTCAACTG TGACAAATTT TCTAAGACAG CAAGGCTTTC AAGTAGTGGA TGCCGACGCA 84 60 

GTCGTCCACC AA CT AC AG AA ACCTGGTGGT CGTCTGTTTG AGGCTCTAGT AC AG C AC TTT 852 0 

GGGCAAGAAA TCATTCTTGA AAACGGAGAA CTCAATCGCC CTCTCCTAGC TAGTCTCATC 8530 

TTTTCAAATC CTGATGAACG AGAATGGTCT AAGCAAATTC AAGGGGAGAT TATCCGTGAG 864 0 

GAACTGGCTA CTTTGAGAGA ACAGTTGGCT CAGACAGAAG AGATTTTCTT CATGGATATT 87 00 

CCCCTACTTT TTGAGCAGGA CTACAGCGAT TGGTTTGCTG AGACTTGGTT GGTCTATGTG 37 60 

GACCGAGATG CCCAAGTGGA ACGCTTAATG AAAAGGGACC AGTTGTCCAA AGATGAAGCT 38 20 

GAGTCTCGTC TGGCAGCCCA GTGGCCTTTA GAAAAAAAGA AAGATTTGGC CAGCCAGGTT 3880 

CTTGATAATA ATGGCAATCA GAACCAGCTT CTTAATCAAG TGCATATCCT TCTTGAGGGA 8940 

GGTAGGCAAG ATGACAGAGA TTAACTGGAA GG AT AATCTG CGCATTGCCT GGTTTGGTAA 9000 

TTTTCTGACA GGAGCCAGTA TTTCTTTGGT TGTACCTTTT ATGCCCATCT TCGTGGAAAA 9060 

TCTAGGTGTA GGGAGTCAGC AAGTCGCTTT TTATGCAGGC TTAGCAATTT CTCTCTCTCC 9120 

TATTTCCGCG GCGCTCTTTT CTCCTATTTG GCGTATTCTT GCTGACAAAT ACGGCCGAAA 9180 

ACCCATGATG ATTCGGGCAG GTCTTCCTAT GACTATCACT ATGGGAGGCT TGGCCTTTGT 92 4 0 

CCCAAATATC TATTGGTTAA TCTTTCTTCG TTTACTAAAC GGTGTATTTG CAGGTTTTGT 9 3 00 

TCCTAATCCA ACGGCACTGA TAGCCAGTCA CGTTCCAAAG GAGAAATCAG GCTCTGCCTT 9 3 60 

AGGTACTTTG T CT AC AGGCG TAGTTGCAGG TACTCTAACT GGTCCCTTTA TTGGTGGCTT 94 2 0 

TATCGCAGAA TTATTTGGCA TTCGTACAGT TTTCTTACTG GTTGGTAGTT TTCTATTTTT 94 80 

AGCTGCTATT TTGACTATTT GCTTTATCAA GGAAGATTTT CAACCAGTAG CCAAGGAAAA 95 4 0 

GGCTATTCCA ACAAAGGAAT TATTTACCTC GGTTAAATAT C C CT AT CTTT TGCTCAATCT 9 600 

C TTTTT AACC AGTTTTGTCA T C CAATTTT C AGCTCAATCG ATTGGCCCTA TTTTGGCTCT 9660 

TTATGTACGC GACTTAGGGC AG AC AG AG AA TCTTCTTTTT GTCTCTGGTT TGATTGTGTC 97 20 

CAGTATGGGC TTTTCCAGCA TGATGAGTGC AGGAGTCATG GGCAAGCTAG GTGACAAGGT 97 80 
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GGGCAATCAT CGTCTCTTGG TTGTCGCCCA GTTTTATTCA GTCATCATCT ATCTCCTCTG 9840 

TGCCAATGCC TCTAGCCCCC TTCAACTAGG ACTCTATCGT TTCCTCTTTG GATTGGGAAC 9 900 

CGGTGCCTTG ATTCCCGGGG TTAATGCCCT ACTCAGCAAA ATGACTCCCA AAGCCGGCAT 9 960 

TTCGAGGGTC TTTGCCTTCA ATCAGGTATT CTTTTATCTG GGAGGTGTTG TTGGTCCCAT 10020 

GGCAGGTTCT GCAGTAGCAG GTCAATTTCG CTACCATGCT GTCTTTTATG CGACAAGCCT 10080 

TTGTGTTGCC TTTAGTTGTC TCTTTAACCT GATTCAATTT CGAACATTAT TAAAAGTAAA 1014 0 

GGAAATCTAG TGCGAGTAAA AATCAATCTC AAATGCTCCT CTTGTGGCAG TATCAATTAC 10 2 00 

CTAACCAGTA AAAATTCAAA AACCCATCCA GACAgATTGA 102 4 0 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(Ai LENGTH: 13 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(Dl TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CGCTTTATCG 


TGGACGTGGT 


CAAGCCGAGA 


ATTTCATCAA 


GGAGATGAAG 


GAGGGATTTT 


60 


TTGGCGATAA 


AACGGATAGT 


TCAACCTTAA 


TCAAAAACGA 


AGTTCGTATG 


ATGATGAGCT 


120 


GTATCGCCTA 


CAATCTCTAT 


CTTTTTCTCA 


AACATCTAGC 


TGGAGGTGAC 


TTCCAAACTT 


130 


TAACAATCAA 


ACGCTTCCGC 


CATCTTTTTC 


TTCACGTGGT 


GGGAAAATGT 


GTTCGAACAG 


240 


GACGCAAGCA 


GCTCCTCAAA 


TTGTCTAGTC 


TCTATGCCTA 


TTCCGAATTG 


TTTTCAGCAC 


300 


TTTATTCTAG 


GATTAGAAAA 


GTCAACCTGA 


ATCTTCCTGT 


TCCTTATGAA 


CCACCTAGAA 


360 


GAAAAGCGTC 


GTTAATGATG 


CATTAAAGAA 


CAGTCGAGAT 


GAAAAAATCG 


TGTGACGCAC 


420 


CAAGGGAGGA 


GTCTGCCCTT 


TTGAGGAAAT 


CTAGCGAGGA 


AAAACGATAC 


TGGAACAGCA 


480 


GAAAGTAAAA 


CTGACCTCAT 


GAGGAGGAAG 


AAAGTGGCTC 


ATGAGGTCAG 


GGGTTTTG7A 


540 


AGTTACATCT 


AGTTGAGAGA 


GGTATGAATG 


ATTTGGGATT 


AATCATTTCT 


TGTTTTAAAT 


600 


CAGGAGAATA 


GTAACGATTT 


TTTCCTTTTT 


TGACGAACTC 


TATTCCCTAA 


CGATCAATCA 


660 


ATTTAATCAT 


GTACCTAATA 


TTAGAATTGT 


TTATCCCAAA 


TTTATTTGAA 


AGCTTCTCTA 


720 


AGCTATATCC 


TTGTTTTCTA 


AG TT C AT AG A 


TCTGAACTTT 


ATCATCATAA 


GTTAGTTTCA 


780 


TAATAAAAAC 


ACCCCAAAAG 


TTAGATTTTT 


TCTGTCTAAC 


TTTTGGGGGG 


CAGTTCATTC 


340 


AACACCTGAT 


ACTATGCGTT 


TTTCTTATTT 


GAAATACTTT 


TTACTCAACC 


TCTTTATACT 


900 


CAATGAAAAT 


CAAAGTGCAA 


ACTAGAAAGC 


TAGCCTCAGG 


CTGCTCAAAA 


CAGTGTTTTG 


960 
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Abu 1 i UUAoA 


tcc a arr^r a. 




AGAGATTTTC 


GAAGAGTATT 


ACTTAATCTT 


L020 


C TTG AT ACTT 


•Pi" IPTMril 


TAAATCCTAC 


AATCATCCCT 


AC C AT ATTTT 


GCATAAAATT 


ioao 


CGGTAGAAT I 


I CTuGGAGOVj 




GCCATTCATC 


AAAGCAGAAC 


CCAAGGCGTA 


1140 


GCCTL.LTALL 


A 1 L><*t<_ An I A*j 


TTGCTAAAAT 


AAGGCCTAAC 


CACTGACTTT 


TTCCTTTAAA 


1200 


TCCTGCGAAA 


AATCCCTGLA 




GACCAAGCTA 


AAGAACATCC 


ACTGAGGGTA 


1260 


GCCTGATAAG 


ri-AA i L A 


AnAAAtTTGC 


TAGTCCTCCG 


ACTACCGCTC 


CTTCACGACT 


1320 


ACCAAAGTAA 




a n A AG AO AC C 


AGCATCTAAA 


AGAGTTAGAA 


TTCCTGTAGG 


1380 


TGTTGGGATT 


nwrwn iv a iv iv a t* 
i I 1 AAGAAA I 


a a.<T"PAftA AP 


C A C AG AAA GG 


GCGGTT AATA 


GGGATACAAG 


1440 


GGCGATTTTA 


GTTGTTTTTG 


'T w T f T*r*T**T w Pf^ A^P 
111 yt 1 i. LA i 


ATTGTC^TAC 


TCCATACTGA 


TCTGCTTGTG 


1500 


CAATAGCACG 


ATAAACGAAA 




TTTCTACTGC 


TGGCAAAAGT 


TTATCACCTT 


1560 


TAACCAGGTG 


ACTGGCAA 1 U 


V» 1 AVj nua^ A#tv 


AGGTACAAC s 


TGCACCAGCA 


TTTTGGCCTT 


1620 


GGATAACTGtj 


A 1 I I I V. 1 Atjvj 


ATA^TA AAfin 


TCTGTCCATC 


ATAAAAGACA 


TCCACAGCCT 


1680 


TGTCCTGACT 


AAGACGA 1 i tj 




T AATG ACTG t 


GGCGCTCCTA 


AATCATGCAA 


1740 


T TTCTGCGCT 


GCAGTT^l I LA 


t U I 1 A 1. l_V_.S\n 


GGTTTT AA TT 


TCCTGACCGG 


ATAATAATTC 


1800 


TGCTTCTGGG 


AGATT AGt» t- Li 


1 AA 1 LnLnL 1 


GACATAAGGG 


AAAAAGCGAA 


TCAACTCTTG 


I860 


GCAGAGCTCA 


/""I**"* » ik rp jv 
(_ 1 tj At- AL»L 1 A 


LA 1 LA l ycu * 


TTCCTTGCAG 


ACCAAGACAG 


GATCCAACAC 


1920 


CACAGGTACT 


CCTGGGC^l * 




GTCCAAGGlC 


TTCTCAGCCA 


CCCTGACAGT 


1980 


AGGGAGAAGA 


LLAA rt- I I AA 




TTCCACATCA 


CGCAAGCTAT 


CTAATTCATC 


2040 


TTGAAAAATG 


GTATn- An (- Avj 




TTCAAATCCT 


TTTTCTGT C A 


AGGCTGT CAA 


2100 


, A.-voT_ACT 






GTTC.\.\GGT*. 




,V\TCAGCTGA 


2160 


CAGTlCACCA 


ff^Tir^T a a a a a 


T ATATTTCC 


AGAAAGTGCT 


AAAATACGAT 


TATTCTTCAT 


2220 


AACGAATCTC 


CTTTAAATAC 


AAACCATTTG 


GTGCTGCACT 


GGGACCTGCA 


AOTTbLL iU L 


9 9 a n 

i. £. a u 


CCTTCTTCTC 


CAAGATGAGA 


TCAATCTGCT 


CTACTGGCAT 


GCGGTTGTTA 


CCGATTTTGA 


2340 


GAAGAGTCCC 


CACCATATTG 


CGAATCTGTT 


TATACAAGAA 


ACCATTTCCT 


GAAAAGGTAA 


2400 


AGGTCAAAAA 


TTGTCCTGTC 


TCATCGACTA 


TTAAACTAGC 


TTCTGTGATG 


GTGCGAACCT 


2460 


TATCCTCTAC 


ACTAGTCCCA 


GAGGCTGTAA 


AACCGGTAAA 


ATCATGGGTT 


CCCTCTAGCT 


2520 


TTTTGATTGC 


AATCTGCATT 


CGTTCCACAT 


CGAGTGGGTA 


GGGAAAGTGG 


GTGGCATAGT 


2580 


GACGGCGCAT 


CGGATTTTTG 


GGACGTCCTC 


TATCCACAG7 


AAACT CAT AG 


GTCTTG CT AT 


2640 


CCTTGGCATA 


ACCGCAATGA 


AAATCATCTG 


CCACAAGCTC 


AATCGAAATC 


ACATCAATAT 


2700 
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CTTCAGGAGA CTGGGTATCC AAGGCAAAAC GGAGTTTCTC CTCATCCATC TGATAAGGCA 2 7 60 

GGTCAAAATG AATCACCTGT CCCAGGGCAT GAACCCCACT ATCTGTCCTA CCAGCACCGT 2 820 

GAACAGTAAT GGCTTGCCCT TTATTTAATC TGGTCAAGGT TTTTTCAATT TCTTCCTGAA 2880 

CGCTACGCGC ATGAGGCTGG CGCTGAAAGC CAGCAAAGGC ATAACCATCA TAGGAAATAG 2940 

TTGCTTT AT A TCTCGTCATA GCCTCTATTT TATCAAGAAA TTAGTCTGTA AACAAGGACC 3 000 

TAAAACAAAT ATTGTATGGG TATAAAAATC T C AT ACT CTT CGAAAATCTC TTCAAACCAC 3 0 60 

GTCAGTTTCC ATCTGCAACC TCAACACACT ATTTTGAGCA ACCTGCGGCT AGCTTTCTAT 312 0 

AGTAGATTGA AATAAGATAT GAACAACTCT ATTAGGAAAG TCAAATTAAT TTCTAGAAAT 3180 

ATTTTAGCAG CTACAGCGTA CTATTCCAAA CTCAATCAAC T AT AGTTTG C TCTTTGATTT 32 40 

TCATTGAGTA TCAAAAGAAA AACTTAGGAA TCAATCCTAA GCTCTCTTCT GAAGTAGGTA 3 3 00 

CATGACAAAG ATAGAGATTA CAATCAACCA ACCTCCTAAG ATACTAAAGA CCAACATCCC 33 60 

ATTGTGAGTT AGTAAGCCAA TTGCACCTAG AACG AATGGG GTCGTAAAGG CTCCGAAACT 3 4 20 

ACAGCCTAAT ACAGCAAATG AAGTTGCTTG ATTGAGGAGT TTAGCTGGAA TTCGTTCAGA 34 8 0 

GACAAGTTGA AAGACCGTCG TCAAGACTAC ACTATAGGCA AATCCAGCCA GAACACTTCC 3 54 0 

TGCTACTACC ACCCACAAGG ATCAAGACAA GGCAATCACG ATTTGCCCCA AGCCAAAGGT 3 600 

AATACCAGAC CAGAGGAGCA GTTTCTCTTT AAAGATAGAA ATCAAGAAAG AAAAACTCAC 3 660 

CCCAGCCACA ATCCCGATCA ACTGCATGAT ACTAAGAACA AAACT AG AT A ACTGGGCATC 3 72 0 

CCCCAATCCT CTTTCCACCA TCAAACTTGG AATACGGATG GTAATAGCTG TATTGGTACA 37 80 

AACTACAACT GCCGCTTCGA TACCTAAGGT AAAAATCAAG CCTTTCATTT CTCGAGTTAA 3 8 40 

ACGACTTGCT TCCTTCGCTC TTTTCT T GAC TTCTTTCTTT GATTTTCCAT AAGGGACAAA 3 9CO 

G AG C AG AT AA AGGGGCAGCA CCAAAAA7CC AGCACTATAG CCTAGAAAGA T AGCTGTC C A 3 9 60 

ACCAAAGGCC AACAACTGAC CGACGGCCAA GGTAATGAGA GAAGCTCCAA CGACCTCTGC 4 020 

AGAAGCGCGT AGCCCTAACA TCTGAATTCG CCTTTTTCCT TGGTAGCGTT CACTGATAAT 408 0 

AGAAATGGCC TTGGCATTGA TCATCCCAAG ACCCAAACCA AAGAGAAGCC GTGTTCCAAA 414 0 

GACAAAGGGA TAGGCTTGGT ACCAGAAGGG AGCTGTACCG CTCAATGATA AAATCAGCAA 4 200 

GCCCAAACTA ATCTGTAAGC GCTCAGGAAA TATTTTTTCT AAGAAACCAT TTAGCAGTAA 4 2 60 

CATCATCATG ATTCCAAAGG AAGGCAAGCT CACCAAGAGC TCAATTTGTT CCTTAGAATA 4 320 

ACCCTGATAA TAGTCAAACA TGGCTGGTAG GG C ACT C G AA ATGGAAAAGG AGGTAATCAA 4 3 80 

AACGAGGGAG AG AGC CAAAA TGCTGGCCCG TTCTAAAAAT TGTTTCATGA AATCT CTTTC 44 4 0 

TATATTTCTC TTAATCTTCT ACTTTTTTGA TAGTTATCAA ATAAGCAAGA AAAGAAGAAG 4500 
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CCTCATTGGT 


TTGTAGACTC 


C* TTCTTAAAT 




ATrrrTTrTA 

/\ 1 \w ^ >_ . 


TPTT ft 1* ft /"^n/™ 1 

It IT, ATACTC 


4 5 60 


AATGAAAATC 


AAAGAGCAAA 


CTAGGAAGCT 




TGTTPAA A AP 


nu lui I I I Li A 


4 62 0 


GGTTGCAGAT 


GGAAACTGAC 


GTGGTTTGAA 


VJ/ftVji"\ 1111 ^.Ij 


/VrtUAlj 1 A i 1 A 


OOATGACTTT 


-1 630 


CTCTTGATTT 


G CT TG AT AAA 


GTAGAAAATA 


AATfrTnrT a 


V- <_ A 1 A 1 AVjvjV- 


ft a /"* A ft ft ft rp ft 

AAL.AAACjA I A 


4 7 4 0 


ATCAGACACC 


ACTTAAACAC 


AACATTCCAA 


^» v— 1 1 \j 1 i \« A 


riTTr a a & a & 

<~ A i l LAAAaA 


f*a a/*"Tft a/^ 
Lt AALi 1 AA<jLiO 


4 800 


AAAGGATTAT 




TGG AAT AT^n 


A\j 1 i . L A\j AA 


t_t-A,AC»Q-v_A I 1 


l\ ft ft ft ft n rr* ft 

aaaaagagca 


4 860 


AACATCATAT 


ACAGAAAGGG 




\— ac A*- I \j\w 1 w 


ft rr* ^ ^ ft fti 1 

OA I Q-CCAAAT 


CTTGTATTGA 


4 920 


CCCTGTTTGT 








T/^/^f* ft ^ » rt - * 


ATAGTGGCAA 


4980 


AGGAAATTTT 


CTAGGGTATA 






& A/^ s, rv a a 

LLAAOrtOVjAA 


a Tr*r"f ft ft ft t*/^ 
a rGGTAAATC 


50 40 


AC AC AGGT AA 


TCATCATACP 




^~ Av_ ^_ I 1 1 1A 


AbUuLAAljAlj 


ACTTGGCCTT 


5100 


TGCCAATTTT 


CACCTACACG 


GCTC A t* A A PC 


ill au/v\ua 1 


AAAvjUvj 1 AAA 


AAT AGTT AC C 


5 L 60 


AAGAGGTTGG 


AC AG AAC CGT 


GTAATAGAf.A 


"OUA I \_ l_ AA 


AALLAV.\.A 1 L» 




52 2 0 


TCAAGATAAA 


CTCCCGTAAA 




iAPiiriaCA 
AAl_ AA\j nAkjA 


I ALboL i A i A 


AAATACAAGT 


52 30 


TTATAGTGTT 




TAAATPTTrf 


t»p a cb. a a r~Tr^ 


1 VjA ill AAL» 1 


TT C ATGGC A C 


5 3 4 0 


CAAAACCATC 


AAT CTT AC AG 


TCCATATTGT 


GGTrrrrTTr" 

1 Vj*— v. l LL 




ft «T1 ft ifHllMHIMliP ft 

A * ATTTTTCA 


5400 


CGCGCCTCCC 




1U1 1 t 1 UvjLVj 


L.AL.U I 1 I I At_ 


TTTCAAGTCC 


TTGATGAGAG 


5460 


TTACTGTATC 


ACCA^CAGPC 

V— ^— J\ * V- AVJV- V— 


fVrt lit ./t. 1 1 i v_ 




IjA I At_it_L»At_ A 


AOACCvTCTT 


5520 


CTACTTCTGC 


AACTTCAGCA 


GGATTCCACT 




rTPTrrr a a 

L i L l VjUOLAA 


ALk.Avj i Atjvju 


5580 


CACCGTCTTC 


GT AC AC AT AC 


TPTGACTTAr 


Alii* uvjnLA 


A i « 1 v_jL» 1 AAA 


L lUlTt-ATOCl 


564 0 


TTTCTC CTTA 


TCATCATTCA 


CTATTCTTTC 






a/~f— a a c'v &tt 


2 7 0 0 


ATACCCTAAA 


ATCAGCATTT 


TGACAAATTT 


AGAAAAAAAC 


rGiTATrai'? 

v—OA I A 1 Laa i 


PT ITTrr /"'TT 
L. * A i t_L)L*L. I i 


57 6 0 


TTCTACATTT 


ACATTCTTTT 


TTCAGCTTCT 


GCTTTGATTT 


TTTCAACTAC 


TTCTTGAATG 


3320 


TTCAAACCAG 


TTGTATCAAG 


GTAGACAGCA 


TCCTCTGCTT 


GTTTGACAGG 


AGAAGTCTCA 


5880 


CGATGACTAT 


CCTTGTAGTC 


ACGCGCAGCA 


ATTTCCTTTT 


TTAGGGTTTC 


AAGGTCTGTT 


5940 


TCAATTCCCT 


TGCCAATATT 


TTCCTTGTAA 


CGACGCTCTG 


CTCTCTCATC 


AACAGAAGCT 


6C00 


ACTAGGAAAA 


TTTTCAATTC 


TGCTTGTGGC 


AAT AC AAC AG 


TTCCAATATC 


GCGACCATCC 


6C60 


ATGACAATCC 


CGCCTTGCTG 


GGCAATTTCT 


TGTTGGAGAG 


AAACCAGTTT 


CTCACGCACT 


6120 


TGAGGAATTG 


CTGCAATAGC 


AGAAACATGA 


TTGGTCACTT 


CATTTTCACG 


GATAGGATGG 


6130 


GTAA'f ATCCA 


CATCTCCTAC 


AAAAACAAGC 


TGGTCTCCAG 


TTTCTGAACG 


TCCAAAGCTG 


6240 
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ATTGGATGCT GGTCCAACAA GGCTAGAAGG GCTTCGACTT CTTCAACTCC TAATTGGTTC 6 3 00 

TTAAGAGCCA TATAGGTCGC TGCACGATAC ATAGCTCCTC TATCAACGTA GGTGAATCCA 6 3 60 

AAATCCTTAG CAATAATCTT TGCGACCGTA CTCTTACCGC TGGAAGCAGG ACCATCAATA 64 20 

GCAATTTGAA TTGTTTTCAT ATCGGCTCCT ATTTTATTTT TATAACATCA CCTGGATTAG 64 8 0 

CAAACCAAGA TCCTGTAGCC ATGTGCCCAG GATTCAAGGC CTCTAACTGA GCAATGGAGA 654 0 

TTCCTGCACG AG CGGC AAT A CCTCCTTCCC CTTCTCCTGC GAGAACTTTA ATCGTTCCTT 6 600 

CAGGATTAGC AGCTTCTTCT GAACTACTAG AAGT AG ATTC TGGCTCTGAA CTCTGCTCAG 6 660 

GCTGAGAACT AC TTG AAG AT GAGATTTGTA CTACACTGGC ATCAGAATCA TGAAAGCCTT 67 2 0 

TTAAGGCTGC TGTGCGATTA CTCCCCCCCG ATGATAGATA GATGAGAACG ATCACCATCA 67 3 0 

CCACCACAAT TACAAAGAAA ATACTAGCTA GGATCGTCAA AATACGATTA CCCATCCTAT 684 0 

CAGCCCCTCC G7GGTTTCGA TGCCGACGCT CTGCTCTTGA TT CTTCTTG A T CAT AG AT AT 69 00 

CTTCTTGCCA CGGTTCTTTT GCCATACCTT ACTCCTTGTT TTTTTTTACT TTTCTTATTA 6960 

CAATATAAAT ATGAACATGA AAATCACACT TATACCTGAA CGATGTATCG CCTCTGGCCT 702 0 

TTGCCAAACT TATTCTGATT TATTTGATTA CCACGATAAT GGAATCCTGC CTTTTTACCA 7080 

TGACCCTGAC CAACTGGAAA AAGAAATTTC TCCTACTCAG GAT AT CTT AG AGGCTGTTAA 7140 

AAATTGCCCA ACTCGCGCCC TG ATT GG AAA CCAGGAAGCC TAAATCAATG GCGATAATCC 72 00 

ACTCCCTCTA GTTTAGCACA TTTCCATGTA AAATT AT AG T CTTTTCACTT TATTTTTTTC 72 6 0 

TGTAAAATCA GGAAGGTCAC TTTTTTCTTT GATAAGATAA AGTGGTCTTT TTTTACTCTC 73 2 0 

T AAAT AAAT C TTACTGATAT ACTTGCCGAG AATCCCAATG GTCAAGAGTT GAATCCCTCC 7 3 80 

AAGAAAGAGA ATAACAGCCA TCAGAGAGGT CCAACCAGAT GTCCCATTGC CCAAAATGAG 74 40 

GGTCCGAACC ACAACAAAAA AGGTCATCAG C AG AG AAAG A AAA C AAG ATA GG AG A C C AG C 7 500 

TACAAAGGCT ATAATCAAGG GAAAATCTGA AAAATT AAT A ATCCCTTCAA TGGAGTAGAA 7 560 

AAAGAGTTGC CTAAAACTCC AACTTGTCTT GCCAGCCTGC CTTTCGACAT TTGGATAGTC 7 6 20 

CAAATAGTAG GTTTTGAAAC CCACCCAGGC GAAGAGCCCC TTTGAAAAAC GATTGGACTC 7 6 80 

GGTCAAGCTT AAAATGGC AT CGACTACAGA CCTTCTCATC ATACGAAAAT CACGGACACC 77 4 0 

CGACGGCAGA GCTACTCGGC TGATTTTTTG CATGAGGCGA T AAAAG AG AA CAGCACAGAA 7 800 

ACTGCGAAAG AAGGGTTCTC CCTCCCGACT AGTTCTCCGT CTCCCAACGC AG TC C AAG T C 7 860 

TACATTTTTG TCTAATACAT TTT7CATCTC AAACAACATA CTAGGAGGAT CTTGGAGGTC 7 9 20 

TGCATCCATC ACCACCACCA AATCTCCTGT CGCATATTGC AAGCCTGCAT AAAGGGCTGC 7 9 80 

TTCTTTGCCA AAATTTCGAG AGAAAGAAAT AT AAT GG ACT GCCGGATTTT GCTCCCGATA 8 04 0 
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CGCCTTTAAG AGTTCCAAGC TCCCATCACT TGATCCATCA TCGACAAAGA CATACTCGAT 8100 

TTCTCTTTCC AAATCTGGAA GTAAAGCTTC CAGAGCCTGA TAAAAAAGAG GAAGTACTTC 3150 

CTCTTCGTTT AAACAAGGGA CGATGATTGA AATCATCATC TTAGTCTTCA AATCCATTTG 8 2 20 

CATGCTTGCT TTGCCAACGC CATGCGTCTT CACACATTTG GGTGATGTCG AGTT CTGCTT 8 230 

CCCAACCGAG TTCTGCTTTA GCTTTTGCCG GGTCTGAGTA GCAGGCAGCG ATATCACCTG 8 3 40 

GGCGACGTTC TACGATGCGG TAAGGAATAG GACGGCCCAC CGCTTTTTCC ATGTTTTGGA 84 00 

TAATTTCAAC AACTGAGTAA CCTTTACCAG TTCCAAGGTT ATAAACGTTT AGTCCTGAAC 84 60 

CTTTTTGGAT TTTTTTCAAA GCTGCAACGT GACCCTTAGC CAAAf CGACA ACGTGGATAT 8 52 0 

AGTCACGAAC ACCTGTTCCA TCTTCCGTAT CGTAATCGTC TCCAAACACT TGCACTTGCT 8 58 0 

CTAATTTTCC AACGGCT AC T TGAGTCACAT ATGGCAAGAG ATTGTTTGGA ATACCGTTTG 8 64 0 

GATTTTCTCC CAAATCACCA CTCTCATGGG CTCCGATTGG GTTAAAGTAA CGAAGCAAGA 87 00 

CAACATTCCA TTCTGAGTCT GCTTTGTAAA TATCAGTCAA AATTTCCTCT AGCATGAGCT 87 6 0 

TAGTACGACC GTATGGGTTG GTCACTGAAA GTGGGAAATC TTCCAAGATG GGCACTGTGT 3 8 20 

GCGGATCCCC GTAAACTGTC GCAGAAGAAC TGAACATGAT GTTTT T AC AG TTGTTTTCTT 8 8 80 

CCATGGCTTT CAAAAGGCTG ACACTTCCAG CGATATTGTT GTCATAGTAG GCAAGAGGGA 894 0 

TACGTCTTGA TTCGCCAACA GCCTTCAAAC CAGCAAAGTG AATCACACCA GTCGGTTCTT 9000 

CCTGCTTGAA AATATCTCTG ACCGTATCTC TGTCACGAAT ATCTGCCTCA TAGAAAGGAA 90 6 0 

TCTCAACTCC TGTGATTCCT TCAACAACTT CTAAACTCTT ACGATTGCTA TTGACAAGAT 9120 

TATCCACCAC AACAACTTGA TGACCTGCTT GGATCAATTC AATAACAGTG TGGGTTCCAA 9180 

TAAAACCGCC ACCACCAGTT ACCAAAATCT TTTCTTGCAT -TTTTTTCCT CCYTTCTCAG 9 2-50 

ATTATTTTTT CTTATTTTAC CATTTTTGAC AGGGAATGTC ATTTGCCATC CTAAACTACC 9 300 

TG AT AAAATT TCAGTAAAAT GCTTATACTC TTCGAAAATC CAATTCAAAC TACGTCAACG 9 360 

TCGCCTTGCC ATGGGTATGG TTACTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT 94 2 0 

GTTTTGAGCT GACTTCGTCA GTTCTATCCA CAACCTCAAA GCAGTGCTTT GAGTAACCCG 9 4 80 

CGCCTACTTT CCTAGTTTGT TCTTTGATTT TTATTGAGTA TTATTCGCTT TTTACTCGTT 9 540 

TGACATAGTT TTCAATTGGG TAATTTAGAG GGTCCAAGGT CAACTCCTTG TCTTGGATCA 9600 

GTTGGGCTAG ATGGTAACCA ATGATAGGAC CAGTTGTGAG GCCTGATGAA CCTAGTCCAC 9660 

TGGCTGCATA G AC AC C AGTT AAGTCAGGCA CCTGCCCAAA GAAAGGAGAG AAATCACTGG 9720 

TGTAGGCACG GATTCCAACA CGCTCAGATT TTGAAGTAGC TTCAGCCAAA ATCAGATAGT 97 30 
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GAGTCAAGCT 


GGCCTCCTCC 


ATTTCTTCGA 


GCAAGGTTTC 


ATCTACCGTC 


AAATCAAATC 


9840 


CCATGTCATT 


TTCGTGGGT A 


GCGCCTAAGG 


ATAATTTCCC 


ACCTGCAAAG 


GGAATCAAAT 


99C0 


CCCACTCCCC 


TTCTGGCATG 


ACAACAGGGT 


AATCTTCCAT 


GTCTTGGGCA 


AG CTG AT AAT 


9960 


CTCG T AGTTG 


TCCTTTTTGA 


GGACGGACAT 


CCACTTCATA 


ACCTAAAGGC 


TCTAACATGT 


10020 


CCCCCAACCA 


AGCTCCCGTC 


GCCAAAATAA 


CCTGCTCAAA 


CTCCTCTTCA 


CCAATCTGGT 


10080 


AGCCTGATGC 


TAACGGTGTC 


AGAGTCACTT 


TTTCTTTGAC 


CAGCTTGACA 


TGACTGACTT 


1014C 


CCAGCAAACG 


AGTCACTAAA 


AGTTGGCCAT 


CTACTCTCGC 


TCCACCAGAA 


GCATAGAGCA 


10200 


GGCGGTCAAA 


TCCCTGCAAA 


CCAGGGAATA 


ATTCATTAGC 


TGAGGCTTGG 


TTCAGAATGG 


10260 


CTAATTGCCC 


TATCAAGGGA 


GATTCTTCTC 


TGCCCTGGAG 


GGCCAGTTGA 


TAAAGTTCTT 


10320 


CCAAATTGGA 


TTCATCCTTT 


TTCAAGAGAA 


AGACTCCCGA 


ACGCTGGTAA 


AAGTC G ATTT 


10380 


CTTGTCCTGA 


TTTCTCTAAA 


TCAGCTAATA 


AATC C AC AT A 


AAAATCAGCC 


CCCAAGCGCG 


10440 


CCATCTTGTA 


CCAGGCTTTA 


TTACGGCGTT 


TGGAAAACCA 


AGGACTGATA 


ATTCCTCCTC 


10500 


CGGCCTTGCT 


GGCTTGACCT 


TGCTCATGCT 


CAAAAACGGT 


CACCTCTACG 


TCACTTTCTC 


10560 


TCGAGAGGTA 


GTAGGCAGCT 


GTTGCTCCCA 


CAATTCCTGC 


TCCAATAATG 


GCAACTTTTT 


10620 


TCATTGTCTT 


CACTTTCTAA 


CTAGATATGA 


TGGAAAGGAT 


TGGTTGATGC 


CTGACTAGGC 


10680 


AAGATATCAA 


TAGACCACCC 


CTTATCTTCC 


TTCCATTGAC 


TAAGAAGTGC 


TCCCATTTTT 


10740 


TCTACAAAAA 


TCACTTCGAT 


ATAGTGACCT 


GGGTCCAATG 


CAAGCAACCC 


ATCAGATAGC 


10800 


AT AT CCTG AG 


CACTATGGTA 


GTAGATATCA 


CCAGTGATAT 


AGACATCTGC 


CCCCTTTGCC 


10860 


AAAGCATCCT 


TATAGAAAGA 


CTGCCCGCTT 


CCACCACAAA 


TTGCTACTCT 


TGAAATAGGC 


10920 


TTCTGCAAAT 


CATCCTCTTG 


ATAATGCACC 


ATTCGAAGGC 


TATCTAGGTC 


AAAGACTTGC 


10980 


TTGACCTGTT 


GGGCCAATTC 


CCAAAATGTC 


TGAGGCTGAA 


TATTCCCAAT 


ACGTCCAATT 


11040 


CCACGTTCTG 


GACCTGTTTC 


CTGCAGATAA 


GTCCTCTCCT 


CGATTCCTAC 


CATCTGACAA 


11100 


AACCAGTCAT 


TGAGCCCATT 


TTCAACGATA 


TCAATATTGG 


TATGGCTGAC 


ATAAACTGCG 


11160 


ATATCATGCT 


TAATCAGGTC 


GATGTAAATC 


TGATTTTGCG 


GACGGCTGGC 


AAGCAAGTCC 


11220 


TTGATAGGAC 


GAAAGATAGG 


CGCGTGCTTG 


ACGATAATCA 


AGTCCACACC 


CTTTTCAATG 


11280 


GCCTCTGCCA 


CTGTCTCTTC 


ACGAATATCG 


AGGGCAACCA 


TGACCCTTTG 


CATACCCTTG 


11340 


TCTAAAGTGC 


C AATTTG C AG 


ACCACGGCTG 


TCTCCCTCCA 


TAGAAAATTC 


CTGAGGGCAA 


11400 


AAGGCTTCAT 


AAGCTTGGAT 


CACTTCACTT 


GCTAACATGG 


ACCACCTCCT 


TGATAGCTTG 


114 60 


AATCTTATCT 


ACTAGAACTT 


GACGTTCTTC 


CAGATTTTTT 


TCTGGGATTT 


GTCCGAGGGC 


11520 


GAACTCTAGC 


TTCTCACCTT 


CTTTTTGCCA 


TTTTTGGACA 


AATACTGGAC 


TGACTTCTTT 


11580 
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GGACAAGAAG 




P. A ArATCACT 


GGCTGATAGC 


TTCATTTGTC 


CTGCTTCCAC 


1164C 


CACCAAAATv- 


1 <_A A AAAHL * 


Tfrr AG PTTC 


TTCTAAGATG 


CTTTCTCCTA 


CAATCTGGAA 


11700 


TCCATGATCC 


rp/->rr> ft J\ A 

1 G I A(o^_v_ 


Tirr.PAAf.Tr 


GTCTTCACGA 


TTATTGGGCT 


GG AGG AT C AA 


11760 


ACGCTCTACA 


TTAGCTAAL. I 




TTCTTCTAAA 


ATCCTAGCAA 


TCAAACGACC 


11820 


ACCCATGCCA 




TP.APAGACAC 


TTGGTCAGTC 


TCTTCAAAAG 


CTGCCAAGCC 


11880 


ATTGGCTAAA 


At. 1 I Uun 


TTTTP^PPTT 


TAGGCCGTGA 


GCCTCAACAT 


TTTTAACCGC 


11940 


AGACTGATAG 


GGACCTTCCA 




TGCAATAGCG 


CTTTTGATTT 


GGCCTCTCTC 


12000 


AACCAACTCG 


AT AGGC- AG A 1 


A AP.P ATGGTC 


ACTTCCCACA 


TCTAGTAAAA 


TAGCCCCCTG 


12060 


TGACACAAAG 


GAAGCTAU.UA 




CTTTGAAATC 


ATCTTCTCTC 


ACTTTCCAAA 


12120 


ACTCTATTAC 


CTCTTATTA I 




AATCTTCAAC 


TTCCCAGTAA 


TATAAGCACC 


12180 


TCT GG CG AAA 


G AAG I I 1L AA 


TP.T P A A AG 


TAATAAGTGA 


ATCCAATTGA 


AAGATTTTAA 


12240 


ACAATTTCCA 


AAAATGTCAA 


AAAATAAAAA 


AT AAAC AG TT 


TATTCAGAAA 


ATTCTTGACA 


12300 


TATAAAAACA 


CATGGTAGAA 


'PITA. ITTir.A 
1 A 1 AA i i /\^J/\ 


AAGTTAGAAA 


AAATAAAAGT 


TTCACTAAAA 


12360 


TTTGTATTTG 


AAGGTGGTGT 


1 LAUA 1 AAwA 


AATTTA.GTCA 


GACGAACCAC 


GAATTTGCTC 


12420 


TATGCTTTCT 


f^* Ik. Jk mil Hit k f* 

GGAATTTA Tt- 


iTA AP AP.TIAG 


GATACAGTCA 


TGGAACAAAC 


ATTGTTTGAA 


12480 


TTAGAACTAC 


TTCCAGAGGA 


SflTS TC A TT 
AL» A lAlLnl i. 


GTCACAGGTC 


TCCCTAAGTA 


TTGTTCTTTT 


12540 


ACTTGTTTAA 


TTACAGGT. v_La 




TTTATATAAA 


ATAAGTAGCT 


TTACTTACGC 


12600 


AATAGGCTAG 


rr*/~> ✓"-'T* /''/T^ f"*^* 

TGv_TGTG i w * 




TTAATAATTA 


GGAGTTTGTT 


ATGGATTTAT 


12660 


TAGAGAAAGA 


ATGTTTAAAA 


TTTP.ATAAAA 
1 Aj 1 OA 1 


ATTTCCAACA 


GGGTGATATT 


TGGAATTACT 


12720 


ATT ATT TAT C 


AGATAAGATG 




GGTGGAAAAT 


ACACATAAGC 


TCCCAAATAA 


12780 


AAGACGCTGT 


AAATATTTTT 


AAGATTGTGT 


ATAAACTATC 


y— a a/""T > A A AT 
LLAAL I AAA J 


AATTfiTACC^ 

V 4 L \j 1 *VVJ * — * 


12840 


TTAAAGTTGT 


TAAAAATTTA 


GAGGAATTAA 


AAAAAATTAA 


TTCCCCTAGG 


GAAATGAGCC 


12900 


CTACTGCTAA 


CAAATTTATA 


ACTCTATATC 


CTAAGTCAGA 


ATCTGAAGCT 


AAGAGTATGA 


1296C 


TTTGTAATCT 


7ACGAATAGA 


CTGTCAGAAT 


TTAAGGCTCC 


AAAAATACTA 


TCTGACTATC 


13020 


AATGTGGAAT 


GCATTCTCCA 


GTTCATTATA 


GATATGGGGC 


TTTTTTAAAA 


AAACAAGCTT 


13080 


ATGATGAAAA 


AAATAAAAAA 


GTCATCTATT 


TATTGCTAGA 


TGAAAAAAGG 


AAGAACTATG 


13140 


TAGAAGATAA 


, GAGACAAAAT 


TTCCCTAGTC 


TTCCTAGCTG 


GAAAATGGAT 


TTATTTTCAG 


13200 


AAGAAG 












13206 
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<i> SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 13104 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CCGGATCCAG 


C G AAAAAT AT 


GCTCTTTGAT 


GCTGTAAGTG 


GTCAAAAAGA 


TGCTAAAACA 


60 


GCTGCTAACG 


ATGCTGTAAC 


ATTG AT C AAA 


GAAACAATCA 


AACAAAAATT 


TGGTGAATAA 


120 


AAAATTTGTT 


CAAGGGGGGT 


GGAAATCAAA 


TCCCCCTTTG 


AATTTATCAA 


TAG AG AC AC A 


iao 


AATAATTTAG 


C TTTCTT AT A 


AAAAAGTAGT 


ATCCTATGAA 


AGGAGTTAAT 


ATGGAAAAGC 


240 


AACAACCTAG 


TAAAGCAGCC 


CTGCTGTCTA 


TCATTCCTGG 


GTTACGACAG 


ATTTACAATA 


300 


AACAAAAAGC 


CAAAGGTTTT 


ATCTTCCTTG 


GTGTAACCAT 


CGTATTTGTC 


CTTTACTTCC 


360 


TAGCACTTGC 


AACCCCTGAA 


TTGAGCAACC 


TCATCACTCT 


TGG TG AC AAA 


CCAGGTCGTG 


420 


ATAATTCCCT 


CTTTATGCTG 


ATTCGTGGTG 


CCTTCCATCT 


AATCTTTGTA 


ATCGTTTATG 


430 


TACTCTTTTA 


TTTCTCAAAT 


ATCAAAGATG 


C AC AT AC GAT 


TGCAAAACGC 


ATTAACAATG 


540 


GAATTCCAGT 


TCCACGCACA 


CTCAAAGACA 


TGATCAAAGG 


GATTTATGAA 


AATGG CTTCC 


600 


CTTACCTCTT 


GATCATTCCA 


TCTTATGTTG 


CCATGACCTT 


CCCGATTATC 


TTCCCAGTTA 


660 


TCGTAACCTT 


GATGATCGCC 


TTTACCAACT 


ACGACTTCCA 


ACACTTGCCA 


CCAAACAAGT 


720 


TGTTGGACTG 


GGTTGGTTTG 


ACC AACTTT A 


CAAACATTTG 


GAGCTTGAGT 


ACCTTCCGTT 


780 


CTGCCTTTGG 


TTCTGTTCTT 


TCTTGGACTA 


TCATTTGGGC 


TTTGGCAGCT 


TCTACTTTAC 


840 


AAATCGTAAT 


TGCTATCTTC 


ACAGCTATCA 


TTGCCAACCA 


ACCATTTATC 


AAAGGAAAAC 


900 


GTATCTTTGG 


TCTTATTTTC 


CTTCTTCCTT 


GGGCTGTCCC 


AGCCTT CATC 


ACTATCTTGA 


960 


CATTCTCAAA 


CATGTTTAAC 


GATAGTGTCG 


GTGCTATCAA 


CACTCAAGTA 


TTGCCAATCT 


1020 


TGGCTAAATT 


CCTTCCTTTC 


CTTGATGGAG 


CTCTTATTCC 


TTGGAAAACA 


GACCCAACTT 


1080 


GGACTAAGAT 


TGCCTTGATT 


ATGATGCAAG 


GTTGGCTCGG 


ATTCCCATAC 


ATCTACGTTC 


1140 


TGACCTTGGG 


TATCTTCCAA 


TCTATTCCTA 


ACGACCTTTA 


CGAAGCACCT 


TATATTGACG 


1200 


GTGCCAACGC 


TTGGCAAAAA 


TTCCGCAACA 


TCACTTTCCC 


AATGATTTTG 


GCTGTTGCGG 


1260 


CACCTACTTT 


GATTAGCCAA 


TACACCTTCA 


ACTTTAACAA 


CTTCTCTATC 


ATGTACCTCT 


1320 


TCAATGGTGG 


AGGACCTGGT 


AGTGTCGGAG 


GTGGAGCTGG 


TTCAACCGAT 


ATCTTGATCT 


1380 


CATGGATCTA 


CCGTTTGACA 


ACAGGTACAT 


CTCCTCAATA 


CTCAATGGCG 


CCAGCTGTTA 


1440 


CCTTGATTAT 


CTCTATCATT 


GTCATCTCAA 


TCTCTATGAT 


CGCATTCAAG 


AAACTACACG 


1500 
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CATTTGATAT GGAGGACGTC TAAGATGAAT AACTCAATTA AACTCAAACG TAGACTGACT 15 60 

CAAAGCCTTA CTTACCTTTA CCTGATTGG? CTATCAATTG TAATTATCTA TCCACTGTTG 1620 

ATTACCATTA TGTCAGCCTT TAAAGCAGGT AACGTCTCAG CCTTTAAACT AGATACTAAT 16 80 

ATCGACCTCA ATTTTGATAA CTTTAAAGGC CTCTTCACTG AAACCTTGTA CGGTACTTGG 1740 

TACCTCAACA CTTTGATTAT CGCCTTAATT ACCATGGCTG TTCAAACAAG TATCATCGTA 1800 

CTTGCTGGTT ATGCTTACAG CCGTTACAAC TTCTTGGCTC GTAAACAAAG TTTGGTCTTC 1860 

TTCTTGATCA TCCAAATGGT GCCAACTATG G CCGCTTTG A CACCCTTCTT CGTTATGGCG 19 20 

CTTATGTTGA ACGCCCTTAA CCACAACTGG TTCCTCATCT TCCTCTACGT TGGTGGTGGT 1980 

ATCCCGATGA ATGCTTGGCT CATGAAAGGC TACTTCGATA CAGTGCCAAT GTCTTTAGAC 204 0 

GAATCTGCAA AACTAGACGG TGCAGGACAC TTCCGCCGCT TCTGGCAAAT TGTTCTACCA 2100 

CTTGTTCGCC CAATGGTTGC CGTACAAGCT CTCTGGGCCT TCATGGGACC TTTCGGGGAC 2160 

TACATCCTCT CTAGTTTCTT GCTTCGTGAG AAAGAATACT TTACTGTTGC CGTAGGTCTC 2 22 0 

CAAACCTTCG TTAACAATGC GAAAAACTTG AAGATTGCCT ACTTCTCAGC AGGTGCTATC 2280 

CTCATCGCCC TTCCAATCTG TATTCTCTTC TTCTTCCTAC AAAAGAACTT TGTTTCAGGA 2 34 0 

CTTACAAGTG GTGGCCACAA CGGATAATTT ATCCCCGCCA CCCTTTTTCA TTTTATACTC 2 400 

TTCGAAAATC TCTTCAAACC ACGTCAGCTT TATCTCCAAC CTCAAAGTTG TGCTTTGAGC 24 60 

AACCTGTGGC TACTTTGCAC TTTCATTTTC ATTGATTATT ACCAATTGTC ACTGTAAATA 2 5 20 

ATATCCTTGT AGCAAGCAAT TTTTCTCCTA GACTTGAAAT AAAGCGCATT TCTCTATATA 25 80 

ATAATACTCA TATAGAAAAC ACCTTTTAGA AAGATACCTA TCCTTCCATA TCCATTTTCC 2 640 

TATTTTTCAA GTATTTCGGG GGTTCGTAAG CCCCTC7CCA AACGTTTCGA CCTCAACTCG 2 7 00 

TTTCAACTTC TCTTTACCAG TATCTTCCTT ATC AG CTTGT CTATGGTACC CATTGCTATC 27 60 

CAAAACAGCT CCC AGGAGAC CTATCCGCTA GAAACTTTTA TCGATAATGT CTATGAACCT 2 82 0 

CTG A C AG AT A AGGTTGTCCA GGATCTCTCT GAACATGCTA CAATTGTCCA TGGCACATTA 2 380 

ACTTATACTG GAACAGCTAG TCAAGCCCCT TCTGTTGTGA TTGGTCCAAG TCAAATCAAG 2 94 0 

GAATTACCTA AGGACTTGCA ACTGCATTTC GATACAAATG AGCTAGTCAT CAGCAAGGAA 3 000 

AGCAAGGAAC TGACCCGCAT CTCTTACCCA GCCATTCACA CTCACAGTTT CAAAAGCAAA 3 0 60 

GACAGCTTGA CCCAAGCAAT TTCTAAAGAC TGGTACCAAC AAAATCGTGT CTATATCAGC 3L2 0 

CTCTTCCTAG TTCTCGGTGC GAGCTTCCTC TTTGGTTTCA ATTTCTTTAT CGTCTCTCTT 3 130 

GGAGCTAGCT TTCTCCTTT A TATCACCAAA AGATCACGCC TCTTTTCATT TAATACCTTT 3 2 40 
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AAAGAGTGCT 


ACCATTTTAT 


CTTGAACTGT 


TTAGG ATTGC 


CGAeTCC ".AT 


TACACTTATT 


3300 


TTGGG ATT AT 


TTGGCCAAAA 


TATGACAACC 


CTGATTACTG 


T AC AAAAT AT 


TCTTTTTGTT 


3360 


CTGTATCTGG 


TC ACT ATCT T 


TTATAAAACA 


CATTTCCGTG 


ATCCAAATTA 


C CAT AAAT AG 


3420 


G AG ATTTTT A 


TGCCCGTTAC 


GATTAAAGAC 


GTGGCCAAGG 


CTGCTGGTGT 


TTCGCCTTCA 


3480 


ACCGTAACCC 


GTGTTATTCA 


AAATAAATCA 


ACCATTAGCG 


ACGAAACAAA 


AAAACGTGTT 


3540 


CGCAAAGCTA 


TGAAGGAACT 


CAACTACCAC 


CCAAACCTCA 


ACGCTCGTAG 


CTTGGT AAG C 


3600 


AG CT AT ACT C 


AGGTTATCGG 


ATTAGTTCTT 


CCTGATGACT 


CAGACGCCTT 


CTACCAGAAT 


3660 


CCTTTCTTTC 


CATCGGTTCT 


ACGTGGCATC 


TCTCAAGTCG 


CATCTGAAAA 


CCACTATGCC 


3720 


ATTCAGATAG 


CAACAGGGAA 


AGATGAGAAG 


GAG CGTCTC A 


ACGCTATTTC 


ACAAATGGTC 


3780 


TACGGCAAGC 


GTGTAGATGG 


GCTAATTTTT 


CTCTATGCCC 


AAGAAGAAGA 


CCCTCTCGTA 


3840 


AAACTCGTCG 


CAGAAGAACA 


GTTCCCCTTC 


CTTATCTTAG 


GTAAATCTCT 


ATCTCCTTTC 


3900 


ATCCCACTTG 


TCGACAACGA 


CAATGTTCAA 


GCTGGTTTTG 


ATGCGACTGA 


AT ATTTC ATC 


3960 


AAAAAAGGCT 


GCAAACGCAT 


TGCCTTTATC 


GGAGGAAGTA 


AAAAGCTCTT 


CGTGACCAAA 


4020 


GACCGTTTAA 


CAGGCTATGA 


ACAGGCGCTT 


AAACATTACA 


AACTTACCAC 


TGACAACAAT 


4080 


CGCATCTACT 


TTGCCGACGA 


GTTTCTGGAA 


GAAAAGGGCT 


AT AAA ITT AG 


CAAGCGATTA 


4140 


TTCAAGCACG 


ATCCACAAAT 


TGATGCTATC 


ATCACAACCG 


ATACCCTCCT 


AGCTGAAGGT 


4200 


GTTTGTAACT 


AT ATTGC C AA 


ACACCAGCTG 


GATGTCCCTG 


TTCTCAGCTT 


TGACTCGGTT 


4260 


AATCCCAAGC 


TCAACTTGGC 


AGCCTATGTC 


GATATCAATA 


CTTTAGACCT 


TGGTCGTGTT 


4320 


TCCCTTGAAA 


CTATTCTCCA 


GATTATTAAT 


GATAATAAAA 


ACAATAAACA 


AATTTGTTAC 


4380 


CGT C AATTG A 


TCCCCCACAA 


AATTATCGAA 


AAAT AAG AG A 


CTGGGCAAAA 


AGTC GTT AAA 


4440 


AG C AAAAACG 


CATACTATCA 


GGTATTGAAA 


AAACTTGATA 


CTATGCGTTT 


T ATTGTGGG A 


4500 


AGATTTACTT 


CCTTTTCTAC 


TG AAATTG AG 


TCTTTTCCCA 


AGATCTTTTT 


ATACTCAATG 


4560 


AAAATCAAAG 


TGCAAACTAG 


GAAGCTAGCC 


GCAGGTTGCT 


CAAAACACTG 


TTTTG AGGTT 


4620 


GTAGATGAAA 


CTGACGAAGT 


CAGTAACCAT 


ACCTACGGCA 


AGGTGAAGCT 


G AC GTGGTTT 


4630 


GAAGAGATTT 


TCGAAGAGTA 


TTAATCACTA 


ATTATCTATC 


TCAACAAATC 


TTCCTAGAAT 


4740 


ATGAACATTT 


TCCG AG AC AG 


AGACAAAGGA 


GCTTGGATCC 


ACTTGTGTCA 


TAATCTGTTT 


4800 


AAATT C ATT A 


AACTCTGCAC 


GTGTAATGAC 


AG TG ATT AAA 


ACTGCCTTTC 


TCTCGTGATT 


4360 


ATAGGTTCCT 


TCTGCATCGT 


GGATCATGCT 


TCCTCCGCGG 


TGCAATTTTT 


TATGGATTTT 


4920 


TTCAATTACC 


TTCTCTGGAT 


GATTTGTCAC 


AATCATGGCC 


TGCATACGCT 


TTTGCTTAGT 


4980 


AAAGACTGCG 


TCTGTCACAC 


GGCTAGAGAC 


AAAG ATCGT A 


ATCATAGAAT 


' AAAGAGCGTA 


5040 
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TTTCCAACCA AAGGTCAAAC CTGCTATCAG CATGATAGTT CCATTTACCA AGAAAGAAAT 5100 

ACTACCGACA TTCTTACCCG TTTTCTTACG AATAGTCAGG CTGACCATAT CCGTCCCACC 5160 

ACTGGAGATA TTGTTTCGAA GAGCAAAACC AATCCCCAAA CCCATAACAA CACCCCCAAA 5 2 20 

AAGGGAATTG ATAATGGGAT C CT CTGTCAA GGTTGCCACA GGGACAAACT GGATAAAGAA 5 2 80 

GGAACTCATA GATACCGTGA TAAAGGTAAA GACGGTGAAC TTATCGCCAA TCTGATACCA- 5 3 40 

AGCTAAGACC ATCAAAGGGA AGTTAATGGC GTAGAAGCTT AGCGAAATCG GAATATGAAA 5 4 00 

ACCAAACCAG TGATTACTCA AGGCAGAGAT AATCTGTGCC AGACCTCTTC CACCACTCGA 54 60 

ATACACATGC CCTGGTTGGA AAAAGAAATT AACTCCTACT GCTGATAAAA AACCATAGAC 55 20 

CAGAGAGGCC GAAATCTTCT CATCATACTT TTCTCCACAG ATACTTTGTA AGACACGTAA 55 30 

AATTTTTATC TCATAACCAA AGCGGCGCAG ATAATAGCGC CACCGCTTAA TTCGTTTTGT 5 64 0 

TTCTTTCATC TTCTTCTACT T G T AAGCTG A GTTCCTCTAG TTGTTTGAGA GCGACTGTTG 5700 

ATGGAGCTTG TGTCATTGGG TCAGTTGCCT TGTTGTTCTT AGGAAAGGCA ATGACTTCAC 5 7 60 

GGATATTTTC TTCTCCAGCA AGCAACATGA CAAAACGGTC AAGCCCGATA GCCAAACCAC 582 0 

CGTGTGGTGG GAAACCATAG TCCATGGCTT CAAGAAGGAA ACCAAACTGG TCATTGGCTT 5880 

CTTCAGTTCA GAAACCAAGA GCCTTGAACA TGCGTTCTTG AAGGTCTTTT TGGTTGATAC 59 4 0 

GAAGGCTACC ACCACCAAGC TCATAACCGT TCAAGACGAT ATCGTAAGCA ATGGCACGAA 6000 

CCTTAGCCAA ATCACCTTCT AATTCATGAG CAGTCTCTTC CTGTGGAAGT GTGAAAGGAT 60 50 

GGTGGGCGCT CATGTAGCGG CCTTCTTCTT CAGACCATTC AAACATCGGC CAGTCAACCA 6120 

CCCAAAGGAA GTTGAACTTA TCATTATCAA TCAAGCCAAG CTCTTTAGCA ATACGTCCAC 6130 

GAAGGGCACC CAGTGTTGCA TTAGCCACTT CAAGCGTATC CGCCACAAAG AGAACCAAGT 624 0 

CCTTATCTTC AAGAACAAGC CCTGTTGTCA ATTCTTCTTG GATACCAGTC AAGAACTTGG 6 300 

CAACTGGTCC GTTTAATTCT CCATCAACCA CCTTGACCCA AGCAAGACCT TTGGCACCAT 63 60 

ACTGTTTGGC TACTTCCGTC ATCTTGTCGA TGTCTTTACC TGAATAGTTG TCCGCAGCTC 64 2 0 

CTGTGACCAC AATCGCTTTT ACAGCAGGTG CTTCTGAAAA GACTTTAAAG TCTACACCTC 64 8 0 

GGACCACTTC TGTCAAGTCC TGAAGCAACA TGTCAAAACG AGTATCTGGC TTGTCAGAAC 65 4 0 

CGTAAAGAGC CATAGCATCA TCGTATTTCA TACGAGGGAA TGGTAGCGTT ACTTCGATGC 6600 

CTTTTGTTTC CTTCATCACG CGCGCGATCA AGCTTTCTGT AATATCTTGG ATTTCTTGCT 66 6 0 

CAGTAAGGAA GGACGTTTCC AAGTCGACCT GAGTAAATTC AGGCTGGCGG TCTCCACGCA 67 20 

ACTCCTCGTC ACGGAAACAT TTAACGATTT GGTAGTAACG GTCAAAACCA GCATTCATCA 67 80 
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AGAGCTGTTT CGTGATTTGT GGACTTTGAG GAAGAGCGTA AAAATGCCCC TTATTAACAC 68 4 0 

GAGACGGCAC TAAATAATCA CGCGCCCCTT CAGGCGTTGA CTT AG AAAGG AATGGTGTCT 6 9 00 

CCACGTCGAT AAACTCCAAC TCATCCAAGT AGTTGCGGAT ACAGTGGGTC ACCTTGGCAC 6 9 60 

GAAGTTTAAG ATTTTCCAAC ATTTCTGGAC GACGAAGGTC AAGGT AACGG TAACGCAAAC 70 2 0 

GTGTATCGTC ATTTGCCTCA ATGCCATCCT TAATCTCAAA TGGTGTTGTC TTAGCTGTGT 708 C 

TAAGCACAAT AAGAGCTGTC ACGTTTAACT CAACCGCACC ACTTCGCAAC TTATCATTGG 714 0 

CTTGTCACGC GCAGCGACCT GACCAGTCAC CTCAATAACA AATTCGCTAC GAAGGcTTTC 7200 

AGCTGTTGCC ATAACCTCTG CAGATACTTT TTCAGGGTTG ATAACCAACT GCATGATTCC 7 2 60 

TTCACGGTCA CGAAGATCGA TAAAGATCAA ACCACCAAGG TCACGACGAC GGCCAACCCA 73 2 0 

TCCTTTCAAG GTTATTTCTT CTCCCATCTC TTCCTCACGA ACACGACCAG CATACATACT 7 3 80 

ACGTTTCATT ATTTCTCTCC T C TITTATTC TGTTACTATT TTACCATAAA AGCGCAGCTC 74 4 0 

TTCATGAAAA TCATCAGAAA AGTTTGCCAG TCTTTAAAAG TCAGGTGAAA GCCCTAAAAA 7 500 

TTAGCGCTAA TACTCTTCGA AAATCTCTTC AAACCACCTC AGCGTCGCCT TACCGTATGT 75 6 0 

ATGGTTACTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG AGCTGACTTC 7 62 0 

GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTTGAGCAA CCTGCGGCTA GCTTCCTAGT 7 68 0 

TTGCTCTTTG ATTTTCATTG AGTATAATAC AAAAATCCGA TGAACTTCAC CGGACTCTTT 77 4 0 

TATTTTGAAT TTTTGCCTGC TTTACGCTTT TCAGCGATTT CGGCTGCCTT TCGAGGCAAG 7800 

ACAATTTCCG TTATGTAACC CGTCCCAAAA CCCAGTACAC CTGCAATAGC AGCAAAGACA 78 60 

ACTGCTAGAT AGTTATAGAA GAAATCGCCT TTGAAGGCAT AACCTAGCGC TCCAATGATG 79 2 0 

AAAAATAGAA CGACTGCCTG AATCACTGCT AATAAAATTA CTCGTTTCAT GTGACCTCCT 79 8 0 

GACTCTArTA TAGCATGAGA ATCATCAAAA AGCCGACTAA ATTATTCAAA GCGTGAAGAG 8 04 0 

AAATACTGTA G AC C AG AC C T TTTCTGCTAA TGTAAGCCAA ACCCAAACTA AAACCAAGCC 8100 

TAAAATAGAC AAAAAATTGT TGCACATCAC CTGGAAAATG AATC AAGGC A AATAGAAGAC 8160 

T AG AT AC C AG AAGAAAAATC AGGGTTCGTT TACTATTGTC CTCCTTAGCA AAGAGATAGC 3 2 20 

GTGCTAACAT CCCTCTAAAA AC AATCTCT T CCGTCAAAGG AGCAAAAATA ACCACAGCAA 82 90 

AGAATGAGAA AAGTGGTTGA GACAAGGTCA AGTCTGTCGC TATTTGCTGA TTTACTGAAG 8 340 

GAT CAT CTGG CAAGAAGAAT TGAACGACCA GAGATAAGAA CCAAACCAAG ACAGGAAGCC 8400 

AAATAAATCG ATTAAAGCCG CTCTTCTCAA TATGAACAGG AGCCTTCTGA TACCATTTGT 84 6 0 

AAATGCCGTA C AC AT AT ACT CCAGCCAAGG CCACATAGAG TAGAGTAACA GCATAGGGTG 8 52 0 

AAGCGCCTAA AGCAAGCGAC GCAGTCGCGA GCCCCTGAAT AAAGCCATAG ATAAATAAAA 8 5 80 
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AGGATAGAAG GGCTAGAAGA ATCCAGCCAA GGTTTTTAAG T AATTTC AT A GATAACTCCT 8 64 0 

TTATT TG AAA TAACGTTTTA CCATAGGTAA CTGCATCACA TTGATATAAA CATGGATGGC 870 0 

TCCTACAAGC AAGAAAGCTA GTAACTGAAT CTCTCCTGTC AAGAAAGAAA TGATAATAAG 87 60 

AAAAATATAT AAGGCTGGTA AGACATATTG GTG T AATTGG AATAAAATTC GAAAACTCTG 882 0 

TTCCAAATTA GCCTGACGCT CCCCTTCATC ATAAGAATTT ATATAGTTCA AGACATCCTT 888 0 

TGGTGTAGCG AAAAATTCCA AATCAAACTG ACGAACAATC GCAATGGTTT TAAAAAGAGA 8940 

TTTTTGAGCG ACTAAGAATA CCACAAAGAG TAAGAAAGAA AGGAAAAATG TTTGAGGGTT 9000 

TGTATGCAAT ATAATCACCT CACTTAATGA AATAAAAATA GCCAATGGAA TCGCTACACC 9060 

TGTAATATTA AAAGCAATGG TTCCAAACTC AAGATTCCGA TACATTTGCA CATAATAGGT 9120 

TTCATTCAGA TCGTCATCCA TTTCCTCTTC ATACAAAGAA TGAAATTTTC TCCTTTTCTT 9180 

TAAGAAATTG AAAGTCAAAA ACATACTAAT GAAACCTATC AGTAAACAAA TAGCTGATAT 92 4 0 

CCATGGCATC AAGGCTTTTA CATCTAAAAT AATTTCGTGG GATTCGACAC GTGCCTT AAA 930 0 

CATCCCTACA AACATGCCCA AGAACCCCCC AAGACAATAG ACATCAAAAA TAACAATCTA 9 3 60 

CGTTTCTTTT TCATATTCAT TCTCCTTTTT C ACTTGC TAG ATTTTTGGAT TTCTTTTCAA 94 20 

TCCATTCAAT TACTGGGATG AGAGCAAAGT AGACCCAAAC AAATTGGTCG CTTTGATAGG 9480 

GATTAAACCA GCTTAGGTCC ATCCCAATCA GT AG AAAT AC GCTGACTAAT AAAGCT ATG A 954 0 

CCACTACATA AT AAAT C ACT TTATACTTCT TCATCACTCG TCCTCCTCCA AACGAAATAC 9 600 

CGATTCGACT GTTTCGTTGA AAATTTGAGA TATTTTCAGG GCAATGATAA TGGATGGGGT 9 660 

GTACTCATCC CGTTCTAGTA GGCTAATGGT CTCTCTCGAA ACCCCTGCCA GTTTGG CT AG 97 2 0 

GTCCCTTTCA TTG AG AC CAT CGCGAGCTCG AAGCTCTTTT AGACGATTTT TTAGTTGCAT 9 780 

GTTACACACC TACTCTCCGT CAAATTCAAC GGTTTGGATA TCCTCAATAC GTTGCAACTT 9 84 0 

GAATTTTTCT TTTCCCGTAT TATCTACACG TCGTAGCTTT ACCCATTCCT CATCAACATC 9900 

CACAACTTCC CAGTTATCTG GCCCAATATA CACTCCCGTT ATAATTGGTT CCTTTCCAAT 9960 

CATTTCTTGT AATAATCTCG ACATTTCTGC GTTTCCTTTC TCTTTTCGCT CAAGTCTTTT 10020 

GATTTTATTC TCTAGTTTCT TGATTTTTTT AG AATT ATT A GAATAAAAGA AAAT CAT AAA 10080 

TAGTATAAAT CCTAGTACCC ACATTATAAC TCCTTTCTGC TTCCTATTTC TTAACTTGAA 10140 

TTCATTGTAA CATATCTTTT TCTTTTTGAC AAGTATAGTT GTCAAAAAAA TTATGATTTT 10200 

TGTCATTTTG CAAAAGAAAA AGGTCAGGAG TAGGTTCCTG ACCACTTTAT C TAT C ATT AA 10260 

TACTCTTCTA AAATCTCTTC AAACCACGTC AGCTTCACCT TGCCGTAGGT ATGGTTACTG 103 20 
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ACTTCGTCAG TTTCATCTAC AACCTCAAAA CCATGTTTTG AGCTGACTTC GTCAGTTCTA 103 SO 

TCCACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTCTATCCAC AACCTCAAAA 10440 

CCATGTTTTG AGCTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTTGAGCAA 10500 

CCTGCGGCTA GCTTCCTAGT TTGCTCTTTG ATTTTTATTG ACT AT AAAAT CCTACTTTTT 10560 

CAAAGATTTC TGAGAAGTTT TGGCTGATTG TCTCAAGTGA CACTTGCACT TCTTCTCGGG 10620 

TTTGGTT GTT CTTGACCGTC ACTTGTCCGC TTTCGACTTC GCTCTCTCCT AGGGTGATGA 10680 

GGGTCTTAGC CGCAAAGACA TCGGCTGACT TGAACTGAGC TTTT AGTTT A CGGTTGAGGT 10740 

AATCACGCTC TGCTTTGAAA CCTTGTTGGC GAAGAGCCTG TACCAATTCC AAGGCCTTGA 10800 

TATTTGCCCC TTCGCCCAAG ACTGCGATAT AGACATCTAG GGCGTTTTCG ATAGGGAGGG 108 60 

TCACACCTTG CTTTTCAAGG ATGAGAAGCA GGCGCTCTAC ACCAAGTCCA AAACCAAATC 10920 

CAGCAGTTTC AGGGCCTCCA AAGTAAGCAA CCAAACCATC GTAGCGACCA CCCGCACAGA 10980 

CGGTCAGGTC ATTGCCCTCA ATCTCTGTGA TAAACTCGAA AATGGTGTGG TTGTAGTAGT 11040 

CCAGACCACG CACCATATTG GTATCGATGA TGTAATCTAC TCCAAGATTT TCCAACATCT lliOO 

GACGCACAGC ATC AAAATG A GCTTGGCTTT CTTCATCAAG AAAGTCCAAG ATAGACGGCG 11160 

CATTCTCTAC TGCCACCTTG TCTTCTTTTT CCTTAGAGTC CAAGACACGA AGAGG ATTTT 11220 

CCTCCAAGCG ACGTTGGCTA TCCTTAGACA ACGTCTCCTT GAGCGGTGTC AAATAGTCAA 11280 

TCAAGGCTTG GCGGTAGGCT GCACGGCTCT CAGGATTTCC AAGAGTGTTG AGGTGCAATT 11340 

TGACACCTTG AATACCGATT TCCTTCAAAA AATGGGCTGC CATACCGATT GTTTCCACAT 11400 

CGGTAGCTGG ATTGCTAGAG CCAAAACACT CAACACCAAT CTGGTGGAAT TGGCGCAAGC 114 6 0 

GCCCTGCCTG TGGACGCTCA TAACGGAACA TAGCTCCCAT GTAGTAGAAC TTCCTTGGCT 115 20 

TTTGCACTTC TGGGGCGAAA AGTTT ATTTT CCACATAGGA ACGGACAACG GGTGCAGTTC 115 80 

CTTCTGGACG GAGGGTAATA TGACGGTCAC CCTTGTCATA AAAATCGTAC ATTTCCTTGG 11640 

TTACGATATC CGTTGTATCT CCGACAGAGC GACTGATAAC CTCGTAATGC TCAAAAATAG 11700 

GCGTGCGCAC TTCTG CAT AG TTGTAGCGTT TGAAAATCTC ACGGGCAAAG CCCTCAACGT 117 60 

ACTGCCACTT AGCAGACTCA GCAGGTAAAA TATCCTGCGT TCCTTTTGGT TTTTGT AATT 11820 

TCATAGGGAA TCCTCTTTAA ACTTAATAGT CTTATTTTAC CATAAATAGA GGGATTAAAA 11880 

CAGTAAGAAA AAAATTAGGA TTTAGATATC ATTTTTGACA TTAAGAATTG TCAAAAAAAT 11940 

AGCTAGCAAG GAAAGACCAA CAAATAGCAT CCAAGTCAAC TGTATATTCC ATACGGCTAC 12000 

TAGTGAAAAA CAAGCTGTTC CCACAGGTAT GG AT AAGGT A AACAATAGAC CTAAAAAATT 12060 

AC T ACT AC G A GCTAGAACCT CTGGAGCTAG ATTTTTCATG AGCATGGCAC TAATCTTTGG 12120 
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TTGAACTTTA CCAGACACAT ACAGAGTAAA GAAGAGAAAT AGCAAACCAA GCACGACTTG 12X90 

ATTGAATAAA TTAGCCAAAC CAACTAGACT AAGTCCTACG CTCTCCCACA TCATCAATCT 12 2 40 

AGGCAAGGAC TGCTTCCCAA AATAATCATT GCCCGTAAGG CTACTGATGA TGACTGATAC 12 300 

TAAAACACAG AATTGATTGA TAAATAGTGC CTCTGTATAA G AAAAATT C A AGAGAGAATG 123 60 

GCTCAAAAAG AAGATATTAT AAATTCCACC CAAAGCGCCA CCCAAGGAAT TAATAACCAA 124 20 

GACAGCAAAG AGCATAAAAC CAAAGTTTTT CTGTCCACTT TTAAGAAAAA CGAGACGTAA 12 4 80 

ATTTCGGTAA ATTGTTAGGA ACTGGTCTTT GATAGAAAGC TTCTCATTTT TTAAGTTTTC 12 540 

ACCATCAGCA GATGACATTG ACAGGCTCAA TTTGCTTTTT CCTAAAAAGA GGATAGTGGC 12 600 

TG AT ACT AGG AAAAAGCAGG CATTGATTCC CGCAACGAGA GAAAAATTGT TGACCGATAG 126 60 

AGCTAAGAGC CAGACTCCGA AAGCTTGACC ACCAATAGCT GAAATATAGG TGATGAACTG 12720 

TGAAAAAGAA TAAGCCTCCA TC AG AT CATC TTCAGCTACT TTTTCCTTAA TAAGAGGCAT 127 80 

ACGCAGGCCA CCTGCAAAAT CACTGATGAT ATCACTAATG ACATTGATCA AACACAGGCT 12 840 

AG AAAAGG CA AAGAGACTAG CTTCCTCAAC AACTAGGGCT GCTACAAAAA ATAGAACCGC 12 9 00 

C TG AAAC AAA CCGCTATAGA C CATC C ATTT GACCTTGTCC CTCGTGTAAT CTGCCCGAAT 12 9 60 

CCCTGCAAAA ACTGTAAAGA GGGTCGGAAG AATCATGACA ATATTCGCCA TAGCAACAGC 13 020 

AAAAGATGCT TGTGACAAGG TCGATGCATA GACGATAAAG ACCAGGTTGA AAATCGAAAC 13 0 80 

ACCAAAAGCA TTGAAGAAGC GTGG 13 104 
(2) INFORMATION FOR SEQ ID NO: 35: 

(x) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19250 base pairs 
(31 TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 35: 

CCGGGCAAAT AGTTTTGAAC T TTTC AT CAT TTTCTCCTTT AAAACTTTCT CTCCATTATA 60 

GACTCTTTTC AGAAAGTTGT CAACAGAATT TTCAGAATTT TTG AAAATT A TTTTTCAAAC 120 

AACATCTTTG CAAAAAATAT GAATATCGTA AGCGCGTCAT AACAAGGTAT CTATCATTCA 18C 

TGGAGCTCCT CCTGTATACT ATT ACT AAAG TAAATATTGG AGGATATTTT AATGCCACAA 2 40 

CCTATTGTTC CTGTAGAGAT TCCACAATCT CGTCGTTTTG ATTCTAAAAA GAGAAATGAT 300 

ATTCTrCTTA AAATTCGTAT TGGCAAGCTT GAAGTAAGTT TTTTTCAATC TCTCAATCTC 3 60 
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GAAATGATAG 


AACAGCTTTT 


GGATAAGGTG 


TTGCTCTATG 


AC AATTC AT C 


TATCTAGCCT 


420 


AGGCCAGGTC 


TATCTCGTGT 


GTGGGAAAAC 


TGATATGAGA 


CAAGGAATCG 


ArrCACTGGC 


480 


TTATCTCGTT 


AAAACCCACT 


TTGAATTGGA 


TCCTTTCTCC 


GGTCAAATCT 


TTCTCTTTTG 


540 


TGGTGGACGT 


AAAGACCGCT 


TTAAAGTCCT 


TT A CTGGG AT 


GGTCAAGGAT 


TTTGGCTACT 


600 


ATATAAACGC 


TTTGAGAACG 


GCAGACTGAC 


TTGGCCCAGT 


ACAGAAAAGG 


ATGTCAAAGC 


660 


TCTCGCACCT 


GAACAAGTAG 


ATTGGCTGAT 


GAAAGGCTTT 


TCTATCACTC 


CAAAAATATA 


720 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCTG 


CTTCTAAAAC 


ATTGTTAGAA 


ATCGATTTTA 


780 


CTGTCCTGAT 


CGATTTGTCC 


TGTT ATT ATT 


TCATTTTACT 


ATAAATCCAT 


CAGAAAGTCG 


340 


TGATTTCTAT 


TGAAATGAGG 


ACTTTCTTTT 


TATACTCATC 


TGCTTTCAAA 


AAGCACTCTA 


900 


GTCCATCTCC 


GATTAACGAT 


GGACTTTATC 


ACCTCCTTCT 


CCAGTCCTTG 


TATAACATCT 


960 


TGAAGTTGAT 


TCATGACATC 


TTCCAAAGTT 


CGAAAGGCTT 


TATTCTTAAA 


TCCACGTTTA 


1020 


CGAATCTCTT 


TCCACACTTG 


TTCAATGGGG 


TTCATCTCTG 


GTGTGTATGG 


AGGAATAAAT 


1080 


GCAAAGCCAA 


TATTAGTCGG 


AATCTT T AAG 


GTACTTGATT 


TATGCCATAT 


AGCATTGTCC 


1140 


ATAACGAGTA 


AAAGATAATC 


ATCTGGATAA 


GCTTGTGAAA 


GCTCCTATTC 


CTAAAGCCCC 


1200 


TTTATAACCT 


CTTGCGAGAG 


ACACTATTGA 


CTCAGCCCTT 


ACTTCATGCG 


GATGAAACCT 


1260 


CCTATCGGGT 


TCTAGAGAGT 


GATAGCCATC 


TGACCTACTA 


TTGGACTTTT 


TTGTCAGGTA 


1320 


AAGCAGAGAA 


ACAAGCGATT 


ACGCTTTACC 


ACCATGATCA 


GTGTCGAAGT 


GGTTCAGTAG 


1380 


TACAAGAATT 


CCTAGGAGAT 


TATTCTGGCT 


ATGTTC ATTG 


TGATATGTTG 


CCGCAGTAAC 


1440 


TTACGACTTT 


AGTCCTCTAG 


TTCTGCCTAT 


GCGATAGCAG 


TCCAAGGTTT 


AGGAGTAAGG 


1500 


CGACGCTAAG 


CTTGGTAAAC 


TGCGAACAGC 


TAGAAGCTTA 


TCGTCAACTG 


GAAGAAGCTG 


1560 


CACTTGTTGG 


ATGTTGGGCG 


CATGTGAGAA 


GGAAGTTTTT 


TGAAGTGCCC 


CCCAAGCAAG 


1520 


CAGATAAATC 


ATCCTTAGGA 


GCTAAAGGTT 


TAGCCTATTG 


TGATCAGTTA 


TTTTC CTTGG 


1680 


AAAGAGACTG 


GGAGGCTTTG 


CCAGCTGATG 


AACGGCTACA 


GAAACGTCAA 


GAACATCTCC 


1740 


AACCCCTACT 


GGAAGACTTC 


TTTGCTTGGT 


GCCGTCGTCA 


GTC AGTTTT A 


TCGGGTTCAA 


1800 


AACTAGGAAG 


GGCAATTGAA 


TACAGCCTCA 


AGTATGAAGA 


AACCTTTAAG 


ACCATTTTAA 


1860 


AAGACGGACA 


TCTGGTCCTT 


TCCAATAATC 


TAGCTGAACG 


CG C C ATT AAA 


TCATTGGTTA 


1920 


TGGGACGGAG 


TAAAAGAGTC 


CAGTGGACTC 


TTTTAGCCTA 


AGCTCAGTTT 


AAAAAAACGA 


1980 


GGGTCGTTAT 


TTTTAAAAAA 


GCGAGGGTGG 


TTATTTTCTC 


AAAGTTTTGA 


AGG AG CT AAA 


2040 


GCAACAGCTA 


TT ATT AT GAG 


TTTGTTGGAA 


ACAGCTAAAC 


GTCATCAATT 


ATAGTGCGTT 


2100 


GAATCTATAA 


CAGTACGCAT 


CGACTGCTAA 


AATATTTCTA 


TAAATCAATT 


TTCCTTTCCT 


2160 
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AATCG ATT T G 


TTCATATCTT 


ATTACAATCC 


ATTATAAATA 


GCCAGAAATA 


TCTATCCTAT 


2220 


CT TCT AG AAT 


GTCTTCCAAA 


CGAGGAAACT 


CTCGTAAACA 


AAGAGGTTTT 


AGAGGCCTAT 


2280 


TTACCGTGGA 


CTAAAGTTGT 


ACAAGAAAAG 


TGCAAATAAG 


AAATCTCCAC 


ATTAGGAACT 


2340 


AT AT AT GAG T 


TCTCTAGTCT 


GG AG ATTTTT 


CAATAGACTT 


CGTTATTGGG 


CGGTTACTTT 


2400 


C G AAACTTTG 


AAAACTTCAA 


AAAACGGATT 


TTTATCGCTC 


TGAACATCAA 


AAAAGAAAGG 


2460 




4 V* V* * A V- * ^— «^ 


AG CT T AG CTT 


TT CTTC AAC C 


CACTACAGTT 


GACAAAGAGC 


2520 


* 1 * A * ^p- 1 


ATCAAACATG 


AAGCGCAAAA 


AC AAGC C AAA 


AATCCGATAG 


AATGGCTATC 


2580 




CAAGTAAGAC 


ATTTCCATCA 


AATACGTTCA 


ATTTTACTCT 


TGTTCTACTA 


2640 


AdAATTAATC 


ATrTCGTTT ^ 

i \ i^l & 4 4 * 


GATTTATTAA 


AAATATACAA 


TTCAGCTTTT 


CCTCCAAACT 


2700 


ATTTTATCCA 


CTATCCCTGT 


ATAGCTCTGT 


ATTATCTTAA 


CAACTTTAGT 


AG AG AC AT TT 


2760 


TPfTfAACAT 


AATCCG G AAC 


CGGTAATCCA 


AAATCCTCAT 


CTTGTGCCAA 


GCTAACAGCA 


2820 


fiTT^T A A fTP. 

\J 1 L i. 1 


P TTP. AAG AAG 


AGAATTTTCA 


TCAATGCCTG 


C CAAAAT AAA 


TCCTGCCTTA 


2880 


tpt AJvnn A PT 


CAGGACGTTC 


TGTACTTGTA 


CG AAT AC AT A 


CAGCGGGAAA 


AGGATAACCT 


2940 


THAPTAC^AA 


AG AAACT AC T 


TT CTTC CGGT 


AAAGTTCCCG 


AATCAGATAC 


TACAACAAAT 


3000 


CP ATT CATTT 


GTAAACAATT 


ATAGTCATGG 


AATCCTAGTG 


GCTC ATG CTG 


AATCAC ACGT 


3060 


' T *TATCTAGTT 


TAAAACCGCT 


CTCTTGTAGC 


CTTTTCTTTG 


ATCTAGGATG 


GCAAGAATAT 


3120 


AAGA* r TGGCA 


TATTATACTT 


TT C AG CT AAT 


TGATTAATTG 


CTGTAAAGAG 


AGAAATAAAA 


3180 




TATCAATATT 


TTCCTCACGG 


TGAGCTGAAA 


GTAAGATATA 


ACCTCCTTTT 


3240 




AACGTTCATG 


GATATCTGAA 


GACTCAATAG 


CAGATAAATT 


TTTATGTAAC 


3300 


ACTTCTGCCA 


TAGGAGAACC 


AGTTACATAT 


GTGCGCTCTT 


TAGGTAAACC 


ACACTCATGT 


3360 


AAATACTTAC 


GTGCATGTTC 


AGAGTATGCT 


AAGTT AAC AT 


CTGAAATAAC 


ATCAACAATC 


3420 


CGACGATTAG 


TCTCTTCCGG 


TAGGCACTCA 


TCTTTACAGC 


GAriGCCAtjL 




3480 


AAAATTGGAA 


TATGTAAACG 


CTTGGCAGCA 


ATAGCTGATA 


AACAAGAATT 


TGTATCCCCT 


3540 


AAAATCAATA 


AAGCATCTGG 


TTTAATTTGA 


TTCATCAATT 


TGTATGAAGT 


ATT AAT AAT A 


3600 


TTCCCTACAG 


TAG C AC C AAG 


ATCATCTCCA 


ACAGCATCCA 


TGTATACGTC 


CGGAGTGTCT 


3660 


AACCCTAAAT 


TATCAAAGAA 


AATACCATTT 


AAATTGTAAT 


CATAGTTTTG 


TCCAGTATGT 


3720 


GCCAAAATAA 


CAT CAAAAT A 


CTTTCGACAT 


TTAGTGATAA 


CACTACTTAG 


ACGTATAATC 


3780 


TCTGGACGTG 


TTCCCACAAT 


AATCAATAAC 


TT AAGTT TG C 


CATTATCTTT 


AAAGTGAATA 


3840 


TC ACT AT AAT 


CTGTCTTAAT 


TTTCATTTAT 


TTC7CCACTT 


GTTCAAAAAA 


AGTATCTGGA 


3900 
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TGTCTAGGAT CAAATGACTC ATTAGCCCAC ATGACAGTAA TT AG ATTTT C TGTATCAGAA 3960 

AGATTAATAA TATTATGTGC ATAGCCCGGT ATCATATGTA TTGCTTCAAT CTTATCGCCC 4020 

GACACTTCAA AGTTCAGAAT AGGATACTCT TGACCGTTTT CATCCAGCCC TATCCTACGC 4080 

TCTTGTATTA AAGCACGACC AGAAACAACC ATGAAAAATT CCCACTTAGA ATGATGCCAA 4140 

TGTTGCCCTT TGGTAATGCC AGGTTTAGAA AT ATT AAC AG AAAATTGACC CGTATTTTCT 42 00 

GTTTTTAATA ATTCCGTAAA ACTACCTCGT TCATCTATAT TCATTTTTAG AGGAAACTTA 42 60 

AACTTATCTA CTGGTAAATA AGATAGGTAG GTAGAATACA ATTTCTTTTT AAACGATCCC 4 3 20 

TGAGGAATTT CAGGCATAAC TAAACTATCA GGCTGTTTTT T AAATGTTTC TAATAGAGAG 4 3 80 

ACAATCTCTC CTAAGGTTGC ACGATGAGTC GTTGGTACGT AGCAGTAGTT TCCTCATGGG 44 4 0 

CTAGGTAAGA TTTGTAATCC ATCTAGATTA CAACGATGAG GATTTCCTTC CAATGCAGTT 4500 

AGACACTCTT GTATCAAA7C ATCAATATAC AGCAACTCCA ATTCTACACT TGGATCATTT 4 560 

ACTTGAATAG GTAAATCGTG AG CT AG ATT A TAACAGAAAG TTGCTACAGC AGAATTGTAG 4620 

TTAGGACGGC ACCACTTCCC ATAAAGATTC GGGAAACGGT AAACTAAGAC AGGTGCTCCC 4 680 

GTTTTCTTTC CATATTCAAA GAAGAGTTCT TCCCCTGCTA GCTTAGATTG TCCATATATA 4 74 0 

GAGTTTGAAA ATCGGCCTTC TAAACTAGCT TGAGTAGAAC TTGAGAGTAG AACAGGACAA 4 800 

GTGTTTTCAT ACTTTTCTAA AATCTCCAAT AATCTACTTG AAAAACCGTA ATTTCCCTCC 4 860 

ATG AATT CAT CAGGATTCTC TGGACGATTG ACACCAGCTA AATGGAATAC GAAATCGGCC 4920 

TTCTTACAAT ATTCATCTAA TAAAATCGGA TCTGTATCAC GATCATACTG AAAAATCTCT 4 9 80 

CCAATCTCTA AATTAGGACG AGTCCTATCT CGTCCATCTT TCAAAGCTTC CAGAGTACAG 5040 

AT AAG ATTTT TTCCTACAAA TCCTTTCGCT CCTGTGATTA AAATATTTTT AATCATGCCC 5100 

CCTCCTTATT TTATATGCTC TTTTAATAGT TAACTCTCTC GACAATACAT GATACATTAT 5160 

ATATCCTTGA TAATTTTAAT GTATCTTAAA AGATTTTACA TCTCTTCGTC TGCTACCATA 5220 

TCACGAATTG CTGTCTGTAT TTCATCTAAT TCTAGCAACT TTCTTTTAAC TTGCTCTACA 5280 

TCCATCAAAT C GG T ATT ATT ACTATTGAAT TCTGTCAACA AATTTCTATT CGTACTACCA 5 34 0 

TCTTTGAAAT ACTTATCATA GTTAAGATTA CGATTATCAC TAGGAACTCT ATAAAAATCA 54 00 

CCCAAATCAA TTGCATTTGC GCACTCTTCG TTAGTTAATA GTCTTTCATA CCTTTTTTCT 54 60 

CCCTGTCTAA TACCTATAAT CTTAATATCT TCTTCTGAGC CAAAAATTTC TGATACAGCC 5520 

TTAGCCAACA CTTCAATCGT ACATGCTGGT GCTTTCTGAA CTAGTATATC TCCAGATTTC 5580 

CCTTCTTCAA ATGCAAATAA AACCAAGTCT ACTGCTTCTT CCAATGTCAT CACAAAACGT 564 0 

GTCATGCTAG GTTCAGTAAT TGTAAGAGCA TTTCCTTGCT TAATTTGCTC AATCCAAAGA 57 00 
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GGAACGACAG ATCCACGGCT ACACAGAACA TTCCCATAGC GAGTCACACA TATCTTTGTA 5760 

TCCTCAGGAT TTACCGTCCT GGACTTAGCA ACAGCAATCT TTTCCATCAT AGCC7TGGAT 5820 

GTTCCCATAG CATTGACAGG ATAAGCCCCC TTATCTGTAC AAAGACAGAT AACTTGCTTT 5880 

ACACCAGCTT CGATAGCCGC AGTGAGGACA TTCTCCGTTC CCAAAATGTT AGTTTT T AC C 594 0 

GCTTCTACAG GGAAAAATTC ACAAGAAGGT ACTTGTTTAA GAGCAGCAGC GTGAAAAACA 6000 

TAATCCACAC CATGCATAGC ATTTTTT AC C GAAGCTAAGT CACGCACATC TCCAAGGTAA 6060 

AAACGGATTT TCCCAGCCAC TTCTGGTACT TTTACCTGAA ACTCATGACG CATATCATCT 6120 

TGTTTCTTTT CATCTCGCGA AAATATACGA ATCTCTGAGA CATCTGTTTC TAAAAAACGC 6180 

TTGAGAACCG CATTCCCAAA TGAACCTGTC CCTCCTGTAA TTAGGAGAGT TTTTCCTGTA 62-30 

AATTGTGACA TATATTACAC TTCTCCTTCT AGTATGTCTG CAATTTTCTT AC AAGC CGTT 6 300 

CCATCTCCAT ATGGATTTGA AGCTTGACTC ATTGCTTGAT AAACTGAATC ATTTTCTAAT 63 60 

AATTCTTTAA AATGCCTATA AATATTATTT TCATCAGCAC CTACAAGTTT CAAAGTCCCT 64 20 

GCTTCAATTC CCTCTGGACG TTCAGTTGTA TCTCTCATAA CCAAAACAGG TTTT CCT AAA 64 80 

CTTGGAGCCT CTTCCTGAAT ACCACCACTA TCTGTTAAAA TTAAATAACT TCTTGATAAA 6540 

AAATTGTGAA AATCTAATAC TTCTAAAGGT TCGATCATCT TGATACGTTC ACAGCCACTT 6600 

AGTTCTTCCT CAGCAATTTG GCGAACACGA GG ATT C AT AT GGATAGGATA AATACCCTTC 6660 

ACATCTGAAT ATTCTTCAAT AA7CCTTCTA ATTGCTCTAA AC AT ATGTCT CATCGGTTCA 67 20 

CCAACATTTT CACGACGATG AGCTGTAATT AGAATAAACC TGCTTTCTCC TATCCATTCT 6*780 

AACTCAGGAT GCGTATAGTC CTCTTGAATT GTAGTTTGTA AAGCATCAAT CGCCGTATTA 684 0 

CCTGTCACAA ATATGCTCTC TGGAGTTTTT crTTCTCTT* AAACA77ATC 7T77CAAACT 5900 

TGTGTTGGTG T AAAATG AT A CTGAGCCAAA ACCCCAACTG CTTGACCATT AAACTCTTCA 69 60 

GGATATGGTG AAT AG AT AT C GTAAGTGCGC AAACCAGCTT CAACATGACC AATTGGAATC 70 2 0 

TGTAAATAAA AGGCCGCCAG TGAACTAGCG AAGGTCGTAC TTGTATCCCC ATGAACTAAC 7 080 

ACCAAATCAG GTTTTTCTGA CTCTAAAATA GCCTTCATTC CTTCCAAAAT GCCAATGGTC 7140 

ACATCAAATA AAGTTTGTTT ATCTTTCATA ATAGACAAAT CAAAATCGGG AATAATCCCA 7 200 

AATGTGTCCA AGACCTGATC CAACATTTGA CGGTGTTGGC CCGTAACGCA AACTAATGTT 72 60 

TCAATATTCT TACGTGTTCT TAACTCTTTG ACCAAAGGAC ACATCTTGAT G GCTTCTGG A 7 3 20 

CGAGTTCCAA AT ACT AC AAC TACTTTTTTC ATATATTTAC TTACTCCTAA CAAATAATGA 7 3 80 

ACGGTTCTTA AAAT AAATT A GATAACGGCT AATCCATAAC ACCACCTCAG ACATACTTGA 7 4 40 
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ACAAATAGCT AATCTTACTA AACTAAAATT AT C AG AC AAG ATAAATATTC CTAATCCCAA 7 500 

AGTTTGGACA AT CG AAGCT A ATATAGTTGT CATTGTAGTT TCTTTCACTT TATCAATAGC 7 560 

TCCTAAGACA GGCCATCCGT AAATCATAGA ATAAAAACTA GCAACAAAAG CGGGTAATAA 7 6 20 

GTACTTAAGA AAATCTG CTG AAACGGTATA TTTTTCACCA CCAATTATAG AAAGAATTTG 7 680 

ATTTGAAAAG AATAAAACTA TCAAAACTCC AAAGATAATA GGAATAAACA TAATCCGATT 7740 

AATACTCTTA AC CG ATTGT A TATCTTTAGT ACGTATCATA TGCGGATATA AACTATTCGC 7800 

TATAGGATTA TACAATGATT TTGCTGCTGA AAGCAGTTCC ATTGCTATCC CCCAAAAGGC 7 860 

TATCTCTTGA CTTTGTAAAT AAAAACCCGA AATGACTGTC GTAAAGACGC CAAAAATAGT 7 9 20 

AGTTGCAAAA TTGGATAAAA AATAAATAGA GGATTCCTTT AAATCTTTAA CCCAAACAGA 7 98 0 

CAGATAAGAA AATG AT AATT TAATTCCATA ATAATGAAGG AATCT AT AAG AAACT ACT GC 804 0 

AGCAACTAAA TTCCCAATTC CTTCCAATAT AGGAATCCAT AAAAT AG AAG AATC AT CTTT 8100 

TACTACAATA AATGTCAAAA TTGTAATGAT AGTTTTAGAA ATAATATAAG GAATTGCAAC 8160 

TGCATGCATC TTTTCAATTC CACGAAATAA AAAGTCAAAG ATAAAAATAT TGGTCACTGT 82 2 0 

AGCTAACAAA TAAAAAACTG AAAAAAGAAT ATTCTCTCTC ATTATTGGGA TTTGCCACAT 82 80 

CAATATGGTG TAAATTAGAA TCGAAATGAT AG AT AAAAAT ATTTTTTCAA CTAGAGTATC 83 40 

TCCAACTATC CTTCCAATCT TTGAGGGAGT ACT AC AAGC A TTTACAATAT TTTTTGTAGC 84 00 

TGATATCATG AAACCAAAAT CAATCACCAG TTGAACATAA GCTATTAACG CTTTAACATA 8 4 60 

AATAACCATT CCATACGCGT CTAGCGAAAG CACCCTTGTC AAATACGGGA GTGTTAATAA 3520 

AGGAAATAGT AATTTAACAA TATTCAGAAT AT AG AG AG AA CTTGTATTTT TTATAAATGA 8580 

AATTCTATCA ACTTTCACCA ACTAGTCCTT CCAAAAAAAG ATCTAAATAC TCC AAACT AC 8 640 

TTCTCGCTTT CAACACCAAT TCTGAAGGTA TTGTTATCGG TTTT AG ATG A AAAGTTTCAA 87 00 

GTTTCTTTAC AATACTATTA ACACTTGAAT CAAATAAAGA TTCACAACGT TGTAACTCTC 87 6 0 

CAATTGCTCC ATAATAACGT GCTGTTTTTT CTGGATGGCA TGCAATGGCA ATCACAGAT V 882 0 

TATTAAAACA TGTTGCCACT ACCCCAACAT GTAATTTACA AGTTAAAACC ACATCTACCA 'i860 

TTTTCAACAA TGATGTCATT TCTGCAGGAG AATGATACTT GAATTGAAAA CAATCCTCAG 894 0 

TTCTAACTAA TTTTCTAAAT TCCTGATAAT AAGCATCTTC ATAAGGTAGA ATGGAATCCG 9000 

AAGTTACTAC AACATAATAG TTAGGATTGT TTTCTAGAAA AAGACTAATT GATTCCGCAA 90 60 

ATTTTTC AAG AGCTTTTTTG GAATGATTAT AGTGAACAAG AATTATCTTC TT ATCTTT AG 912 0 

CTTCTCTTTT CAATTGACAC AGCTGCTCTG TTTTTTCTTC TCTTAATTTA CTTGAAATAA 9180 

TTAAATCAAA GGTTTCATGC ACTGGAGCCG AAGGCGACAA ATGCTTCAAA GAATCAAATG 9 2 40 
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ATTCTCGATC 
TCATCAAAGA 
AATATGAAGC 
AACGTGCATT 
GTTTTTTAGT 
ATGG ATT AT A 
CGTAAAGATA 
TTTTCACCAC 
AGGACGATAA 
AAAATAAATA 
ACTAAATATT 
TAATTCTTCC 
CGCTGATGCT 
AGTCCGTAGA 
AGAAGCTACT 
TACAAAATCT 
ACGGTGCCCC 
AACTACTCTT 
CAAGATTGAA 
AAACAACATT 
CAAGGAGAAC 
TGCCTCTGAA 
AGTTATT A C C 
TTTCTCTTCA 
ATAAAAAAAT 
ATGAAATAAT 
CCCCATATAC 
TACAGAAAAA 
TAGAAAGCAA 



ACGAACTGTA 
ATCGTTATTA 
CCAAATTCCC 
ATGCCCTTCC 
AAAACTTTGT 
AAAAGAAACT 
ATCACCGTAA 
TATTTTTTCA 
ACATCTATTG 
ACTTTTGAGA 
CCCAAAACAA 
ACAAAAGAAG 
TTATCAAAAA 
AATGTCATCC 
GTTAGGCTGG 
AATGAAGAAA 
CTAATAACAA 
TTCTTCGAAA 
AACACCTGGA 
ACCCAAAAAA 
ACACCAGGAA 
TATGCTGAAT 
CTAGAAATAA 
TTTTCCTGAT 
AAATAGCCTA 
TCTTCATTAT 
TCATTGAAAA 
TCATATATAC 
ACCCATATAA 



ATAAATTGAG 
GGCCCTGCAC 
AAAGGTAAAA 
CCAAAATATC 
TTTTGGCGAT 
TCATATCCTT 
TTACTTGAAC 
ACCTCCTAAA 
AACTACTTCT 
TTTTACTTGT 
AACTCCAAAA 
AGCCTACAGG 
AATCACCAAC 
CAT ATT CAT A 
TCAGTCCCGA 
AGGAATCAAA 
CTAATACATT 
AAGTAATCCC 
TTTTACGACT 
TAGTACGCTT 
GCATAAGTAC 
AGCTATTCGC 
AGCCCACTCC 
AATTTTGTAC 
CAGAATAACA 
TATAGAAGTT 
TTAATCCAAA 
TAACTTTTTG 
ATATAGACGG 



353 
CATGATTAAT 
CAATACCTAA 
AT CG TTT AAA 
CTCCCGGGA? 
ATTCTTTCAA 
TAGATTCTAA 
CATAATCCGT 
AATAAATATC 
CACTAAAAGC 
TTGAAAAGCT 
AACACCACCA 
TAACCCCAAA 
TAACCATCCA 
TGGAATGCTA 
CTCTGAAAAT 
TAAGTATATA 
TAATAGAAAT 
TAAAGATTGT 
TCCTGTTAGG 
TATAACTCGG 
TCCTAAATCA 
CGCTCTAACT 
TGTTAAAATC 
TTCTGAATGA 
AAACAAAATC 
ACTAGGCCTC 
CATAAAAAAA 
TAAAATAAAA 
AACATAATTA 



AATT CTCTTT 
TACTCCTATA 
TTGGATTAAA 
ATACAAAATA 
GTACATTTGA 
TAAATCATAG 
TGCACCATGT 
ATAATCAAAC 
AATAGTTGAG 
CTGAAATTTA 
TAGTAACCAA 
AATTTATTAA 
ATAGGAAAAA 
CTAGGCACAA 
ACTTCCCCTA 
CCTATAGTAT 
ACAGTTACTA 
GTGTATACTA 
ATCATTATCA 
GACAGCTTAT 
TCTATTATTC 
GCTAGTACTG 
CTACCCGCAT 
TAATGTACCT 
CAAATTATAA 
CACAGCAGAG 
TAAGATAAAA 
CCAGTAATTT 
GATATAAGAA 



ATACCATAAT 
GGCTTTTTAA 
TTATCACGAA 
GCATCTGCTT 
AAGAAATCTG 
ACAATCTCAC 
AACATAATTT 
TATACATAAT 
AAATT AC CG A 
ATCGCCATCC 
ACTTCCAAAA 
TAACAACCGT 
TTGATAGGAT 
CAGTTACAGC 
CTATATTCTT 
TCAAGTCGAA 
TTAAAAATAC 
AAACCAACGC 
AAA TT AGG T A 
CTGAATAAAA 
CTGAACTAGC 
TTTTAGAATC 
TGTACAAAAT 
TTCCATCACT 
AAATATATGA 
TTGTTTGAAA 
TCAGATACCA 
GAAAAATAAT 
AACCATTATT 



9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
1C920 
10980 
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CCAATTATCG AGAGTCCAGA ACAAGTAACA GAAAGCAAAT A?A^OTA ATGTCAC TAG 11040 

TGTCACTCTA CAAATATACT TTGTCTGCAT CTATATCTCC TTTATTACAC ACATTTCTTG 11100 

ATAACGATTC AATAATTTAC T AGCTTG AT A ACAAATATCA TAGAGTCCAT C TGTC AT ACT 11160 

GTTATTTATT TCAAAACGAT TGCATTCCTC AGATGTTAAA GACAGTACTT TATCTTTCCA 112 20 

TAGCAACACA GACTCTTCGT TGATAGGTAA GTAACTAATG TTTTTGGTCA CAT CT ACTT C 11280 

TTGCGT C AC T GTATCTGACG ATAAAATTTG TAATCCCGAT GCCTGAGCCT CTACTAGAGA 11340 

AACAGGCAAC CCCTCATATT TAGACGGAAG CAAAAAAACA TCCATCGCAG ATAATAAATC 114 00 

AG AAAT AT C A GTCCTTCTCC CTAAAAATAG CACATATGGG GTCAGATTTA GTTCTAAAGC 114 60 

TTTCTGTTTT AATTTCTGCT CATCCTCACC ATTACCAACT AGGAGTAAAA TAACATTTGG 11520 

TTTGATTAAA ATGAGTTCTT TTAAAACGTT AAAT AAAT AA CTTTGGTTTT TTTGATCTGA 11580 

T AGG C GAG CT ATATTTCCTA ATACGAACTT ATTTGACACA TCTAATTCTC TACGACATTT 11640 

TTCTCTAACA TCTGACAAAA ATTGATACTT TTTCAAATCA ATTGCATTAA AAATAATTTC 11700 

AATTTTTCCG TCTTTATACG CTTTCTCTCC ATATAACCAC TTAGCCGAAT CTTCCCCACA 117 60 

TGCAAACCAA TGAGTTGCTA AGATTTTTAC CAAAATTGTT ACTAATTTAC GCAATACTTT 11820 

TTGAAAACTG TTTTCTGTTA CATAAGCCAT ATGACTATGA ATAATTCTAA TTTTACAACC 11B80 

AATTATTTTA GATAAGATCA G AC C AATTGC AG ATTT AT AG CCATGGCAAT GAACTATATC 11940 

ATAATCTCCT TTCTTTATTA TTCTAGCAAG AGAGAGAAAC TGATGTAGAG GCTTTTTCCT 12000 

TAATAGACGC ACATGATAAA CCTTTGCACC CAATTCTTTC ATTTTATCCT CTAAAAATCC 12060 

TTGTTCTTTT CCAGGCACAA TAAAATCAAA TTGAATTTTT TTTCT ATC AA TGTGAGAATA 12120 

ATAGTTGAAT AGAAAACTTT CTACTCCACC ACTATCTAGT GTTGTAAATA GATGTAATAC 12180 

TTT AATCA V? CTTCTTCCTT AAGCTTAAGA TTCGCTTCTC TAATTCTATT TCTGTTTTTT 12240 

CTTTTTCTAA ACTAATTCTG TCCATGAAGT TATCACAATT CTTAATTAGC TGTTTCCTGT 12 300 

CAAGGTTTTG AATATACAAA GCCAAACAAT CTTTTTCCGA TTCATCCTTC ATAGGTAAAA 12 3 60 

CGAAACCAAA AC C ATTCTCT ATTG AC ACTT TTTCCATATA AGTATCTTCA CAAACTAAAA 12 4 20 

TAGGTTTATA CAACAATGCA GCAAAGTAGA GTTTATTAGA CAAAGCATAG TCTAGTAAGG 12480 

GAGTGTGATT CCCGTATAAA TTCAAAACAA CATCTGTATT CTTATAAAAA GACATGGTAT 125 40 

CTTTAGGCTG GAATGTGTCC ACCAAGTTAA CATTGCTGAT ATTTTTTTCT TGACAAAATT 12600 

CCCTTAATTC TCCTGCATTA GTACCTATAA AATTCAACTG AAATCGACTG TCATTTGCAA 1266 0 

AAAAATCGAT TATTTTTTTA TTTTGTTCTT GAAAACGAAT TAAACCAATG TAGGAAAGTT 12720 

GAATTGGAAA CGTACTATTA TTTTTT AACT GCTTTACCTC GTTT AATTCT ATCATATTGG 127 80 
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GTAGGTTATG GGTAGTAAAA TACTCTCCCA TTGGTAAAAA AAATTTATAG CCGTCTGAAG 12 840 

AAACGATATT CATTAAAGAA TTTTTCACCA ATTGTTTCTG AACCAAACGA TAAACCAAAA 12 900 

ATTTTTCATA ACTGTAATCA CGAATATCAT AAATATATCT ATTTTTAAAT GAAAAGAGAA 12 9 60 

G AAAATCT AC TAAAATGAAA GACACAATAC TATGTAACGG CAATATCATA TC AT AAT CAT 13 020 

TTTCTTTTAG CTTCTTTTTA ATTTCTTTTC TGAATTTTAC ATAACCTAAT ATCTTACTTA 13 080 

ATTTTCCTTT AC C AG AAAAA GAAATACGAT AG TAG TTTTG TTTTGTAATA ATCTCGTTAA 13 140 

TATTCTTATC CCAATATATA ACATCGTAAC TAATAGACAG TTTCTTCAAT AATTCTTTAT 13 200 

AAAAATTGAA GTAAGGAGTT AG AT AT AT AT TATCAGATAG TATAAACAGT ACTCTCATTA 13 260 

AATTATTCTT TCTTACTTTC CCTCTCTAAA CATGTCTCCA GTTCGAGCAT AAACTGCTCT 13 320 

TTTGAAAAGT GATTTTCATA GTAACAACGA GCTTTCTTTC CTAACTCTCT TTGTCTCTTA 13 380 

ATAGATAACA TACTAAATTT ACAAATATTT TTTGCCAATT GTTTTACATC TCGTTCGGGA 13440 

CTAACATATC C AC AATTTG C TTCTTCTACA ATTATTTTAG CATCTCCTGA AATTGCACCT 13 500 

ATAATTGGTT TGCCTGCCGC CATATAAGAk TGTACCTTCC C AGGT AT ACT ACGAGAAACT 13 560 

ATCGAGTCTC CTATTAAAGA AACTAACATA GCATCTGATT TTTTATACAA GGATGGCATT 13 6 20 

TCCTCCAAAG AACGTCTTCC AT AG AAGG AA ATATTCTTTA ACTCCAATTC ATGAGCTAAT 13 680 

CCTTTCATGC TTAACAATTC CGTACCATCT CCAACAAAAT GAAAATGAAT TTTCTTGCGT 13 740 

AAATTGGTAT TCTTCTCTAT CAAACTGGCA GCTTTCAAAA TACTTTCCAA ATTTTGTGCT . 13 300 

TTGCCAATAT TACCAGCAAA AGTTAGGTCA ACACTTTCTT TATTAACTAT AGATTCATCA 13 860 

GGGAT AAAAA GATCTTCTGC ATATTGTGGC AAATATGTAA TCTTTTCTTC GGATATGTCA 13 9 20 

AATTGCTTCA CAAAATAATT TTTAAATGAT CGACTACTGA C AAAT A T AT A ATCACTAGCT 139 SO 

CGGTAAACTT TTTTTG AG AT AAATTTAAAC AGCTTGAAAA TCAAGCCATC TTGTTTCACT 14 040 

CCACCTACGG TTAAACTATC TGGCCAAACA TCCATACAAT ATAGAAACAT CGGTTTCTTA 14 100 

TATTTTTTTT TATAAGCCAT ACCAGCCCAT GCCATCATAA CTGGAGACAA TTGCTTAACG 14160 

AATACACAGT CAAAATTCGA TCCATCTTTC GTTTTATACC TCCCCAATAA AACTCCTAAA 14 2 20 

GTAGAACTAA TTGCAAAGCT AAAATAATTC AACAATCGAA ATACAACACT TTTTTTTCTA 14 2 80 

GGGATTGTAT AAGAACG AT A TATCGTAACA CCTTCTATAA TCTCACGTCT TTTTTTATTA 14 340 

TGACGATAAT CTGCATATAT CTTCCCTTCA GGGTAATTAG GAATCCCAGC CAAAACAGAG 144 00 

ACTTCATGCC CTTTTCGAAC TAAATCTTCA C AAAT ATCTG ACAACCTGAA 7GGTTCTGGC 14 4 60 

TTATAATGTT GGCAAACAAA TAGTATTTTC ATTGTCCAAT TTAACTTTCT TTCTTACCAC 14 520 
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TACCCTCTAC AATACCTTTT CGTTTCAGTA CGTAAGGTAT TGTCTTAACT ATACATCTAA 14 530 

TATCCATTAT CAAAGACAGA TGTTTAACAT AGTAGCCATC TAACTCCGTC TTCATCTCAA 14640 

C AG AC AAAGT ATCACGCCCG TTAATTTGTG CCCATCCAGT TAACCCTGGC AAGATATCAT 14 7C0 

TTGCTCCATA CTTATCTCTC TCTGCAATCA AATCTAGTTC ATTTATACCC GCTGGTCTAG 14 7 60 

GACCTACAAT ACT CAT ATT A CCAACAAGAA TATTAAACAA TTGTGG T ACT TCATCCAAAG 148 2 0 

ATGTTTTTCG CAAGAAAGCC CCTACTTTTG TAATCyATTG CTCTGGATTA TATAAGTTTC 14 880 

GAG GCGCC AC ATTTTTACGT GCATCTATTT TCATAGACCT AAATTTCAAA ATATAGAAGT 14 94 0 

ATTCTTTATG AATACCAAAG CGTTTTTGCT TAAATATAAC CGGACCTTCT GAATCAAGTT 15000 

TAATCGCAAT TGCAATTATC AT AAAAACC G G AC AC AAT AT TATTATCCCT ATTAAAGATA 15060 

ATAATATATC ACCTAATCGT TTTATTATAC CGTACATAAA CAACCTCCAA CTATAAATTC 15120 

TATTTCCATT TTTCATTCTA TTTCCATTTG ACAAATTAAA TCAGGCAGTA CATGCAACTA 15180 

CAGAAACTCA ATATATATTT GGTCACTCAA TGATTTTCAG AAATATAATT CTTTTATCCT 15 24 0 

CTACGTCAGA TAAAACTTTT CTCCATCTAA ACAAAATTTA TTTGTTTCAG TAATATATGA 15 3 00 

GTTCTCAATA ATG AATT AG A AGGTCCAGTT CAATTATTCT TCCAAATAGA CCGAATATTA 15 3 60 

TTTGAAGACA TATCGGTTTC TGAAATTGCA ATCAGTACAT AAGCTAATAA ACTGATAAGT 15420 

ATG CTCTGT A AGAATGCCAG AGTTATATTG TAGTCCCCTT C CAT ACT AT A TTCATTTTAT 154 80 

TTTTTACCAT AATTT CC AT A GGAACCGTAA ACTCCATACT TATTAACCGA GATATCCAAT 15 540 

TTATTTAAAA CAACTCCTAG GAACAGTTTC CCTGTTTGTT TTAATTGTTG TTTCGCTTTT 15 600 

TGGATATCAC GTTT ATTCGC CTCACCTGTT GCTGTTACCA AGATGGACGC ATCACACTTT 15 6 60 

TGAGTGATAA TTGCC G CAT C AATAACAATT CCAATAGGCG GTGTATCAAT AATGATATAA 15720 

TCAAAATATT TACGCAATCT TTCAATCATA TCATTAAAAT TTTTACTTTG TAACAAGGCT 15 7 80 

GTAGGGTTTG GTGATACAGA TCCCCATTGA ACTACAAATA AATTTTCAAT ATTTGTATCA 15 8 40 

CATAAACCGT GAGATAAATC AGCTGTCCCA GATAAAAATT CTGTTAGCCC TGTAATT1 T*T 15900 

TCACGAGATT TAAAAACTCC TAACATAACT GAATTTCGAG TATCGCCATC GATCAAAAGA .5960 

GTTTTATAGC CTGCACGCGC AAACGACCAT GCTATATTTA TGGAAGTAGT T GTTTTTC CT 16020 

TCCCCAGGGT TAACAGAAGT AACGGAAATT ACTTTT AGTT TATCTCCGCT CAACTGTATA 16080 

TTTGT AC AC A AGGCATTGTA ATATTCTTCT GCCTTCTTAA TGAACTCCAG TTTTTTTTGT 16140 

GCTATTTCTA ATGTCGGCAT CCTTCTCTCC TATTTCAACT TACCCAAGTT TGGCACAACT 16 200 

CCCAAAAGTG TCATCTGCAA TGTATTTTCG ATATCTTCCG GACGTTTCAC ACGAGTATCC 162 60 

AAAAGTTCAA GATGAAGAAC TATAACACTA GTTCCAATCA CCCCTGCCAA AAAACCAATT 16 3 20 
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AGTCTATTGC GTTTAATATT TGGCGAAGAC GGGGATATCG CCGGCCTTGC CTCCTCCAGT 16380 
GTTGTCACGT CAGAAACACG AGTAATACTG ATAATTTTTT GAGCAGCTAC TTCTCTCAAA 
GAGTTAGCGA TACGGCTTGC CTCTTCAGGA ACTCGATCAT TAACTGAAAT AGAGACAATA 

CGGGTATCAA CTGGTACTGT CACTTTAATT TTATTAGCCA AACCTTTTGG CGTCAAATCT 16560 

AGTTTCAAAT CAGAAACAAC TTCCTCCAAA ACATCCTCCG AAAGG AT AAT CTCACGGTAG 16 620 

TCTTTTACCA GATAAGTTCC TGCCTGCAAA TCCTGATTTG TCAACCCCGG CTTGTCTCCT 16680 

TGATTGCGAT TCACTACGTA AATTCGCGTG GTACTCCTAT ATTCTGGCTT AACAATAAAA 167 40 

GTGCTATATG CAAAAGCCCC CGCACCTGTC ACAAGTGCCA CTATTAAAAT CATTAGCTTG 16800 

CGTTTCCACA AGCTTTTAAC TAATTGAAAT ACATCGATTT CTATCGTATT TTGTTCTTTC 16860 

ATCATTTCTC CTAAATTAGT TGATCCATTA CAATTTTTCG AGGATTGTCT ATAAAAAGTT 16920 

CCTGAGCCTT CCCTTCTCCG T ATTTTTGGG TAACAAGGTC ATATGCTTCT GCCATATGAG 16980 

GAGGTCTACC GTCT AG ATTG TGCATATCAC TTGC AAT G AC ATGAACCAAA TCCTGCTCTA 17040 

AAAAAT ACT G AGCTCTTTTT TTCATGAATT TATAACGTTC GCCAAAAAGT TTGGGTTTGA 17100 

GGACATGTGA ACTATTTACT TGCGTGTAAC AGCCCATATC GATCAGTTCT CGAACGCGTT 17 160 

TTTCATTATT TTCAAGAGCA TCATAGCGCT CAATGTGGGC AATGACTGGA GTAATTCCCA 17 220 

ACATCAAGAT CTTGCTCAAG GCGCTATGAA TATCGCGATA AGGAGTGTTC ATACTAAACT 172 80 

CTATCAAGGC AT AACG ACT A TCATTGAGGG TCGGAATCCG CTTTTTTTCC ACCTTATCCA 17 340 

GAACATCTGG TGTGTAATAA ATTTCACCCC CGTAAGCAAT GACCAAGTCA CTCGCCACTT 17 4 00 

CCTTAGCTAT TTCCCGAACC TGAACAAACT TTTCTGCTAT CTTCTCTTCC GGAGTTTCAA 174 60 

ACATGCCCTT GCGACGGTGA GAGGTAGAAA CAATGGTTCG CACCCCCTGT CTGTAGGATT 17 520 

CTGCCAAGAG AGCCTTCCTT TCCTCTCTTG ACTTGGG AC C GTCATCTACA TCAAAAACCA 17 5 80 

TATGCGAATG GATGTCTATC ATTTCATCTA CCCTCCATCA CATCCTGTAT AGCTGCTTTA 17640 
ACTACAGCTA AACTACTATC ATCTATTTCC ATCACATAGA GGTTACTGTC TGGC ATTGC A 17700 
TAAGAAGGAA GATCCATCCG ACCTGTCCCT TTTAAATCTT GAGAATTTAC TTTATAATTC 17760 
CCTCCACTTT CTAACTGAGC ATTGACCAAA TTTATCATGG TCTCAAGTGG CAT ATTTGTT 17 820 

TGGATAGAAT CTTGCAAGCT ATTAATGATC GT ACT AT AAT TTTTCAGCAC TTCGGTTGAC 17880 
GTTAATTTTT GAAGGATAGC CACAATCACC TTTTGTTGAT GGCGCCCGCG GTCACGATCG 17 94 0 

CCATCTGCTA GGGAGTAGCG CTCACGAACA AAACCGACAC CCTGTTCTGA ATCAAGATGA 18000 
ACATTGCCTG CAGGGTAATA CTTTCCATTC GTATGGGCAG TAAATTCTTG ATCATTATAA 180 60 
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ACATCAATTC 


CACCCAACAA 


ATCAATCAAT 


TTCAAAAACG 


AAGTGAAGTT 


CAATCGCACA 


13120 


TAGTAATTGA 


TATCCACTCC 


ATAGAGATTT 


TCTAAGGTGT 


GAATGGACGA 


ATCAACTCCA 


18180 


TAAATGCCCG 


CATGAGTCAA 


TTTATCTTTT 


TGATTATTTC 


CACCATCTGC 


GATTGGTACA 


13240 


TACGCATCAC 


GTGGCGTTGT 


GGTCAAGAGG 


ATTTTCTTGG 


TATCTCGATT 


GACAGTCATC 


18300 


AGGATGTTGA 


CATCTGATCG 


CGACACCGAA 


CTAATAGGAC 


CATAGGTGTC 


AATTCCACTA 


18360 


AC AT AG AT AT 


TGAAAGACTG 


ACTCTTAGAC 


GTCTTAGGAG 


CTTCTACTTT 


TTTAGTGAAT 


18420 


CCCTTAGTAT 


AAATCTTTTT 


TATCTTCGAT 


GCGTAGTCTG 


GATACTCTGA 


CTCGATGATG 


18480 


TTTTCAAAGA 


C ACT ATTT AG 


GACAATGGCC 


TTAGTCTCCC 


CTGCAATCAA 


ACTCTTGTAA 


13540 


GCTGCCAAGT 


AAGACGAACT 


CTGCTTGACC 


GTCAAATCGG 


TATTCTGACT 


TGACTTGATA 


18600 


TCAGCTAGTA 


ATTTCTGAAT 


ATTTTCATTA 


TTAGTCCCAG 


TCGGTGCTGT 


CACACTCGTC 


18660 


AGTTGCGTAA 


CATTTTCGAT 


CTCACTATCT 


GCTAAAACAG 


CGACACTGAT 


TGAATATTCT 


18720 


GAGTAATTAG 


AAGTCGCATT 


TAAACGATTG 


GTCAGTCCAA 


CAAACTGCTG 


TACTGCAAAG 


18780 


AGCGACACAG 


AGCTGACAAG 


GATAGAGAAC 


ACCAACAGAA 


AAATAGTAAA 


CTTTTCAGCT 


18840 


TTTTTATAGA 


TAATCAAGAG 


TAGCCCTACC 


AAGGC AACT A 


GT AGG ACT AA 


CGCAGTTACC 


18900 


ACT AG ATT AA 


GAT AT CT AAA 


AGCAAGGATA 


TTGTACTTAA 


AG ATT AAG AA 


CAATAAAAAA 


13960 


CAAACTAACA 


ATAAATAAAT 


AGTC AG C AAA 


ACTATATTAA 


CACTTCCCTT 


CACTTTCTCT 


19020 


GAACGTGATT 


TTTTAAAACG 


TCTACTCATG 


ATTAATACCT 


ATACATTGAA 


CATTATACGA 


19080 


TT AT AT C ACT 


TTTTTACGGT 


AATGTCTACA 


CCTTTATTTT 


TACTATCTGC 


ATCTTTAAGT 


19140 


ATCTT ACT AG 


ACTTCCCGCG 


AAACAAAAAT 


ATAGTAAAAT 


GAAATAAGAA 


CAGAACAAAT 


19200 


CGTTCAGGAC 


AGTCAAATCG 


ATTTCTAACA 


ATGTTTTAGA 


AGCAGAGGTG 




1925C 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21706 base pairs 

( B) TYPE: nucleic acid 
<C) STRAND EDN ES S : double 
( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AAAGTTGAAA GACTGCTAGC TGTTTTTGAT AC C AATCGTT TCCAACTACA G AG C AAAC AG 6C 

TATACAAAGT TTCTTTTTGG ATGTAAGCTT CTTGATGGAC AATTCCAAGA AAATCAAGAA 120 

ATTGCTGACC TTCAATTTTT TGCCATTGAC CAACTGCCGA ACTTATCTGA AAAACGCATT 180 

ACCAAGGAGC AAAT AG AG CT TCTTTGGCAG GTTT AT C AAG GTCATAGGGG GCAAT ATCTT . 2 40 
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GACTAAGAAG ATGATTATCG TATTTCTAAA TCCATTTTTA ACAACTAGCA TGGTATAATA 300 

ATATGCAGGA AAATTTTGAA TTATGAGGAA G ACT AG AT G A ATTTATGGGA TATTTTCTTT 3 60 

ACGACTCAGG CAACCGAGCC GCCCAAATTT GACCTTTTTT GGTATGTTAG CCTATTTACG 4 20 

CTCTTAGCCT TAACCTTTTA TACAGCCCAT CGCTATCGTG AAAAGAAGGT TTACCAACGA 480 

TTTTTCCAAA TCTTGCAGAC TGTTCAGTTA ATCCTTCTTT ATGGTTGGTA CTGGGTCAAT 54 0 

CATATGCCAC TGTCAGAAAG CCTACCCTTT TACCATTGCC GTATGGCTAT GTTTGTGGTA 600 

CTCTTGCTTC CTGGTCAATC C AAAT AT AAA CAATACTTTG CATTATTGGG AACATTTGGG 660 

AC AT T AGC AG CCTTTGTTTA TCCAGTGCCA GATGCTTACC CTTTTCCACA TATCACCATT 72 0 

CTATCCTTTA TCTTTGGTCA TTTAGCACTC TTGGGGAACT CTCTAGTTTA TCTATTGAGA 7 80 

CAGTATAATG CGCCATTGCT GGATGTGAAG GG AATTTTTC TCATGACCTT TGCCCTAAAT 84 0 

GCCTTGATTT TTGTGGTCAA TTTGGTGACA GGTGGCGATT ACGGATTTTT GACAAAACCG 900 

CCATTGGTTG GGGATCACGG TCTAGTAGCT AATTATTTAC TTGTTTCAAT TGTGCTGGTA 960 

GCTACTATCA GTTTGACTAA GAAAATCTTA GAATTCTTTT TAGCTCAAGA AGCAGAAAAA 102 0 

ATGATTGCAA AGGAAGCTTA ACACAGAGCT TTCTTTTTTG CTCTTAGAGA GTTTTTACAA 1080 

GCAGCTTATA AAATAAGAAT TTCTGAATAG ACAAACTCAA AAAATGGCTG GGAAATTTAG 1140 

GAAAAAAGCA AGCACGATTA AATTTTTTGT GTTATAATAT TTTGTG AAT A GCTATGCCTA 12 00 
TGTTTACCTA TGGAATAATA CGAAGTGCGA AACTTGGAAG AT AG AG AGG A AGCGATGTAA * 12 6 0 

TGGCTAGAGA AGGCTTTTTT AC AGGTCT AG ATATTGGAAC AAGCTCTGTC AAGGTGCTTG 13 2 0 

TGGCCGAGCA GAGAAATGCT GAATTAAATC TAATTGGCGT GAGTAATGCC AAAAGTAAAG 13 8 0 

GTGTAAAGGA TGGAATTATT GTTGATATTG ATGCAGCAGC AACTGCTATC AAGTCAGCCA 14 4 0 

TTTCCCAAGC GGAAGAAAAG GCAGGCATTT CGATTAAATC AGTGAATGTC GGCTTGCCTG 1500 

GTAATCTTTT GCAGGTAGAA CCAACTCAGG GGATGATTCC ACTAACATCT CATACTAAGG 15 60 

AAATTACGGA TCAAGATGTT GAAAATGTTG TCAAATCAGC TTTGACAAAG AGTATGACAC 162 0 

CTGACCGTGA AGTCATTACC TTTATTCCTG AAGAATTTAT TGTGGATGGT TTCCAACGGA 16 80 

TTCGTGACCC ACGTGGCATG ATGGGGGTTC GCCTTGAAAT GCGTGGTTTG CTTTATACAG 17 4 0 

GACCTCGTAC TATCTTGCAC AATTTGCGTA AG ACGG TTG A GCGTGCAGGT GTTCAGGTTG 1800 

AAAATGTTAT CATTTCACCA CTAGCAATGG TTCAGTCTGT TTTGAACGAA GGGGAACGTG 18 60 

AATTTGGTGC TACAGTGATT GATATGGGGG CAGGTCAAAC GACTGTCGCT ACAATCCGTA 19 2 0 

ATCAAGAACT CCAGTTCACA CATATTCTCC AAGAAGGTGG AGATTATGTA ACT AAAG ATA 19 BO 
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TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TACCCGAAGG CTTGAAACTG AATTACGGGG 204 0 

AAGCCTATCC GCCTCTTGCA AGCAAAGAAA CCTTCCAAGT AGAGGTTATT GGAGAAGTAG 2100 

AAGCAGTCGA AGTGACGGAA GCCTACTTCT CAGAAATTAT TTCTGCACGA ATCAAGCACA 2160 

TCCTTG AAC A AATCAAGCAA GAATTAGATA GAAGGCGTCT ATTGGACCTC CCTGGTGGTA 22 20 

TTGTCTTAAT CCGTGGGAAT GCCATTTTAC CAGGTATGGT TGAGCTTGCT CAGGAAGTCT 22 80 

TTGGCGTCCG TGTCAAGCTT TATGTTCCAA AT CAAGTTGG TATCCGTAAT CCAGCCTTTG 2 3 40 

CGCATGTGAT TAGTTTATCA GAATTTGCCG GTCAATT AAC AG AAGTT AAT CTTTTGGCTC 2 4 00 

AGGGAGCGAT AAAAGGTGAG AATGACTTAA GTCATCAGCC AATTAGTTTT GGTGGGATGC 2 4 60 

TGCAAAAAAC AGCTCAGTTT GT AC AAT C AA CGCCTGTTCA ACCAGCTCCT GCTCCAGAAG 2 52 0 

TAGAGCCGGT GGCGCCTACA GAACCAATGG CGGATTTCCA ACAAGCTTCA CAAAATAAAC 2 53 0 

CGAAATTAGC AGATCGTTTC CGTGGATTGA TCGGAAGCAT GTTTGACGAA TAAAGAGGAA 2 64 0 

AAATAAATTA TGACATTTTC ATTTGATACA GCTGCTGCTC AAGGGGCAGT GATTAAAGTA 2700 

ATTGGTGTCG GTGGAGGTGG TGGCAATGCC ATCAACCGTA TGGTCGACGA AGGTGTTACA 2 7 60 

GGCGTAGAAT TTATCGCAGC AAACACAGAT GT AC AAGC AT TGAGTAGTAC AAAAGCTGAG 2 32 0 

ACTGTTATTC AGTTGGGACC T AAATT G ACT CGTGGTTTGG GTGCAGGAGG TCAACCTGAG 2880 

CTTGGTCGTA AAGCCGCTGA AGAAAGCGAA GAAACACTGA CGG AAGC TAT TAGTGGTGCC 2 94 0 

GATATGGTCT TCATCACTGC TGGTATGGGA GCACGCTCTG CAACTCGACC TGCTCCTGTT 3000 

ATTG CTCGT A TCGCCAAAGA TTTAGGTCCG CTTACAGTTG GTGTTGTAAC ACGTCCCTTT 3060 

GGTTTTGAAG GAAGTAAGCG TGG AC AATTT GCTGTAGAAG GAATCAATCA ACTTCGTGAG 3120 

CATGTAGACA CTCTATTGAT TATCTCAAAC AACAATTTGC TTGAAATTGT TGATAAGAAA 313 0 

ACACCGCTTT TGGAGGCTCT TAGCGAAGCG GATAACGTTC TTCGTCAACG TGTTCAAGGG 32 4 0 

ATTACCGATT TGATTACCAA TCCAGGATTG ATTAACCTTG ACTTTGCCGA TGTGAAAACG 3 3 0C 

GTAATGGCAA ACAAAGGGAA TGCTCTTATG GGTATTGGTA TCGGTAGTGG AGAAGAACGT 3 3 60 

GTGGTAGAAG CGGCACGTAA GGCAATCTAT TCACCACTTC TTGAAACAAC TATTGACGGT 3 4 20 

GCTGAGGATG TTATCGTCAA CGTTACTGGT GGTCTTGACT TAACCTTGAT TGAGGCAGAA 34 30 

GAGGCTTCAC AAATTGTGAA CCAGGCAGCA GGTCAAGGAG TGAACATCTG GCTCGGTACT 3 54 0 

TCAATTGATG AAAGTATGCG TGATGAAATT CGTGTAACAG TTGTTGCAAC GGGTGTTCGT 3 600 

CAAGACCCCG TAGAAAAGGT TGTGGCTCCA CAAGCTAGAT CTGCTACTAA CTACCGTGAG 3 660 

ACAGTGAAAC CAGCTCATTC ACATGGCTTT GATCGTCATT TTGATATGGC AGAAACAGTT 3 7 20 

GAATTGCCAA AACAAAATCC ACGTCGTTTG G AACC AACTC AGGCATCTGC TTTTGGTGAT 37 30 
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TGGGATCTTC GCCGTGAATC GATTGTTCGT ACAACAGATT CAGTCGTTTC TCCAGTCGAG 3 840 

CGCTTTGAAG CCCCAATTTC ACAAGATGAA GATGAATTGG ATACACCTCC ATTTTTCAAA 3900 

AATCGTTAAG TAAATGAATG TAAAAGAAAA TACAGAACTT GTTTTTCGAG AAGTTGCAGA 3 9 60 

GGCTAGTCTG AGTGCTCATC GAGAGAGTGG TTCGGTCTCT GTCATTGCAG TTACCAAGTA 4020 

TGTAGATGTA CCGACAGCGG AAGCCTTGCT TCCGCTAGGT GTCCATCATA TCGGTGAAAA 4 080 

TCGTGTAGA? AAGTTTCTGG AAAAATATGA AGCTTTAAAA GATCGAGATG TGACTTGGCA 4140 

TTTGATTGGT ACCTT GC AAA GACGTAAGGT GAAAGATGTC ATTCAATACG TTGATTATTT 4 200 

CCATGCATTG GACTCAGTAA AGCTAGCAGG GGAAATTCAA AAAAGAAGTG ACCGAGTCAT 4 2 60 

CAAGTCTTTC CTTCAAGTAA ATATTTCTAA AGAAGAAAGC AAACACGGTT TTTCGAGAGA 4320 

GGAACTGCTG GAAATCTTGC CAGAGTTACC CAGACTAGAT AAGATTGAAT ATGTTGGTTT 4 3 80 

AATGACGATG GCACCTTTTG AGGCTAGCAG TGAGCAGTTG AAAGAGATTT TCAAGGCGGC 44 40 

CCAAGATTTA CAAAGAGAAA TTCAAGAGAA ACAAATTCCA AATATGCCTA TGACCGAGTT 4 500 

AAGTATGGGA ATGAGTCGTG ATTATAAAGA AGCGATTCAA TTCGGTTCCA CTTTTGTTCG 4 5 60 

TATAGGTACA TCATTTTTTA AGTAGGAGAG AACCATGTCT TTAAAAGATA GATTCGATAG 4 62 0 

ATTTATAGAT TATTTTACGG AGGATGAGGA TTCAAGTCTC CCTTATGAAA AAAGAGATGA 4 680 

GCCTGTGTTT ACTTCAGTAA ATTCTTCACA GGAACCGGCT CTCCCAATGA ATCAACCTTC 4740 

ACAGTCGGCT GGCACAAAAG AGAACAATAT C AC C AG ACT T CATGCAAGAC AACAGGAATT 4 800 

GGCAAATCAG AGTCAGCGTG CAACGGATAA CGTCATTATA GATGTTCGTT ATCCTAGAAA 4 8 60 

ATATGAGGAT GC AAC AG AAA TTGTTGATTT ATTGGCAGGA AACGAAAGTA TCTTGATTGA 4 920 

TTTTCAGTAT AT C AC AG AC C TCCAGGCTCG TCCTTCTTTC GACTATTTGG ATGGAGCTTG 4 9 80 

TCATGTTTTA GCTGGAAATT TGAAAAAGGT AGCTTCTACC ATGTATTTGT TG AC AC C AG T 5 04 0 

GAACGTTATT GTAAATGTTG AAGATATCCG TTTACCAGAT GAAGATCAAC AGGGTGAGTT 5100 

CGGTTTTGAT ATGAAGCGAA ATAGAGTACG ATAATGATTT TTTTAATTCG TATGATTTAT 5160 

AATGCAGTGG ATATTTACTC CCTGATTTTG GTAGCCTTCG CTGTCATGTC TTGGTTTCCA 5 22 0 

GGTGCCTACG AATCCAGTTT AGGTCGTTGG ATTGTAGCGT TGGTGAAACC AGTGCTTGCT 5 2 80 

CCCTTGCAAC GCCTGCCTTT ACAGATAGCG GGTCTTGATT TATCTGTTTG GGTTGCGATT 5 340 

GTTTTGGTTC GATTTTTAGG AGAAAACCTA GTGCGTTTTC TGGCGATGAT AGGATGAATA 5400 

AACCGATTTA TCAGCATTTC TCCATAGAAG ATCGTCCATT TCTTGACAAG GGAATGGAAT 54 60 

GGATAAAGAA GGTAGAAGAT AGCTATGCTC CTTTTTTAAC TCCTTTTATC AATCCTCATC 552 0 
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AGGAGAAGCT 


ATTAAAGATT 


TTGGCCAAAA 


CCTATGGTCT 


TGCTTGTAGC 


AGTAGTGGGG 


5580 


AATTCGTCTC 


GAGTGAGTAT 


GTTCGAGTTT 


TATTATACCC 


AG ATT AT TT C 


CAACCAGAGT 


5640 


TTTCAGATTT 


TGAAATATCT 


CTCCAGGAAA 


TTGTGTATTC 


CAATAAATTT 


GAACATTTAA 


5700 


CGCATGCTAA 


GATTTTAGGG 


ACAGTCATCA 


ATCAATTAGG 


GATTGAACGG 


AAACTTTTTG 


5*760 


GAGATATCCT 


AGTAGATGAA 


GAACGGGCGC 


AGATTATGAT 


TASTCAGCAG 


TTTCTTCTTC 


5820 


TCTTTCAAGA 


TGGACTAAAG 


AAAATTGGTC 


GTATACCTGT 


TTCGCTGGAG 


GAACGTCCTT 


5880 


TCACCGAGAA 


AATAGATAAG 


CTAGAACACT 


ATCGAGAACT 


GGATTTATCT 


GTGTCTAGTT 


5940 


TTCGATTAGA 


TGTTCTTTTA 


TCAAATGTTT 


TGAAACTATC 


TAGGAATCAA 


GCAAACCAGT 


6000 


TGATTGAAAA 


GAAACTTCTC 


CAAGTAAATT 


ATCATGTGGT 


AGACAAATCA 


GATTACACTG 


6060 


TTCAAGTTGG 


AGACTTGATT 


AGTGTGAGAA 


AATTTGGTCG 


CTTGAGATTA 


CTTCAAGATA 


6120 


AGGGACAAAC 


GAAAAAAGAG 


AAGAAAAAAA 


TAACCGTCCA 


CTTATTATTA 


AGTAAGTGAG 


6180 


GAATAGAATG 


CCAATTACAT 


CATTAGAAAT 


AAAGGACAAG 


ACTTTTGGAA 


CTCGATTCAG 


6240 


AGGTTTTGAT 


CCAGAAGAAG 


TCCATGAATT 


TTTAGATATT 


GTGGTTCGTG 


ATTACGAAGA 


6300 


TCTTGTGCGT 


GCGAATCATG 


ATAAAAATTT 


GCGTATTAAG 


AGTTTAGAAG 


AGCGTTTGTC 


6360 


TTACTTTGAT 


GAAATAAAAG 


ATTCATTGAG 


C C AGTCTGT A 


TTGATTGCTC 


AGGATACAGC 


6420 


TGAGAGAGTG 


AAACAGGCGG 


CGCATGAACG 


TTCAAACAAT 


ATC ATT CATC 


AAGCAGAGCA 


6480 


AGATGCGCAA 


CGCTTGTTGG 


AAGAAGCTAA 


ATATAAGGCA 


AACGACATTC 


TTCGTCAAGC 


6540 


AACTGATAAT 


GCTAAGAAAG 


TCGCTGTTGA 


AACAGAAGAA 


TTGAAGAACA 


AGAGCCGTGT 


6600 


CTTCCACCAA 


CGTCTCAAAT 


CTACAATTGA 


GAGTCAGTTG 


CCTATTGTTG 


AATCTT C AG A 


6660 


TTGGGAAGAT 


ATTCTCCGTC 


CAACAGCTAC 


TTATCTTCAA 


ACCAGTGATG 


AAGCCTTTAA 


6720 


AGAAGTGGTT 


AGCGAAGTAC 


TTGGAGAACC 


GATTCCAGCT 


CCAATTGAAG 


AAGAACCAAT 


6780 


TGATATGACA 


CGTCAGTTCT 


CTCAAGCAGA 


AATGGCAGAA 


TTACAAGCTC 


GTATTGAGGT 


6840 


AGCCGATAAA 


GAATTGTCTG 


AATTTGAAGC 


TCAGATTAAA 


C AG G AAGTGG 


AAGCTCCAAC 


6900 


TCCTGTAGTG 


AGTCCTCAAG 


TTGAAGAAGA 


GCCTCTGCTC 


ATCCAGTTGG 


CCCAATGTAT 


6960 


GAAGAACCAG 


AAGTAGCTCC 


AATGCATCCG 


ATAGGTCCAA 


CACCAGCTAC 


AGAAACTGTT 


7020 


GATTCAATAC 


CGGGATTTGA 


AGCACCGCAA 


GAATCTGTTA 


CAATTTTATA 


AGAAATATTC 


7080 


TGAGAACAAT 


ATCTTATCCT 


TATATTTCCA 


GCGAGCAGGA 


GATGGTGTGA 


GTCCTGTAAT 


7140 


CCCTATTGAT 


AAGATTATCC 


TCTCAAAAAC 


TCAAGTCTGA 


AGCTAGTAAG 


ATTTGACGTT 


7200 


TCCCACGTTA 


CGGGATAAGA 


GGGAGAAAGA 


CTAAATCTTT 


TTCCGAATAA 


AGGTGGTACC 


7260 


ACGATTTTCG 


TCCTTTTTGG 


AAGTCGTGGT 


TTTTAATTTG 


TTATTATTTA 


TAAAGGAGAT 


7320 
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ACCATGAAAC TCAAAGACAC CCTTAATCTT GGGAAAACTG AATTCCCAAT GCGTGCAGGC 7 3 80 

CTTCCTACCA AAGAGCCAGT TTGGCAAAAG GAATGGGAAG ATGCAAAACT TTATCAACGT 7 44 0 

CGTCAAGAAT TGAACCAAGG AAAACCTCAT TTCACCTTGC ATGATGGCCC TCCATACGCT 7 500 

AACGGAAATA TCCACGTTGG ACATGCTATG AACAAGATTT C AAAAG AT AT CATTGTTCGT 7 560 

TCTAAGTCTA TGTCAGGATT TTACGCACCA TTTATTCCTC GTTGGGATAC TCATGGTCTG 7 62 0 

CCAATCGAGC AAGTCTTGTC AAAACAAGGT GTCAAACGTA AAGAAATGGA CTTGGTTGAG 7 680 

TACTTGAAAC TTTGCCGTGA GTACGCTCTT TCTCAAGTAG ATAAACAACG TGAAGATTTT 77 4 0 

AAACGTTTGG GTGTTTCTGG TGACTGGGAA AATCCATATG TG AC CTTG A C TCCTGACTAT 7 800 

GAAGCAGCTC AAATTCGTGT ATTTGGTGAG ATGGCTAATA AGGGTTATAT CTACCGTGGT 786 0 

GCTAAGCCAG TTTACTGGTC ATGGTCATCT GAGTCAGCAC TTGCTGAAGC AGAGATTGAA 792 0 

TACCATGACT TGGTTTCAAC TTCCCTTTAC TATGCCAACA AGGTAAAAGA TGGCAAAGGA 7 980 

GTTCTACATA CAGATACTTA TATCGTTGTC TGG AC AACG A CTCCATTTAC CATCACAGCT 804 0 

TCTCGTGGTT TGACGGTTGG TGCAGATATT GATTACGTTT TGGTTCAACC TCCTGGTGAA 8100 

GCTCGTAAGT TTGTCGTTGC TGCTGAATTA TTGACTACCT TGTCTGAGAA ATTTGGCTGC 8160 

GCTGATGTTC AAGTTTTGGA AACTTACCGT GGCCAAGAAC TCAACCACAT CGTAACAGAA 82 2 0 

CACCCATGGG ATACAGCTGT AGAAGAGTTG GTAATTCTTG GTGACCACGT TACGACTGAC 8280 

TCTGGTACAG GTATTGTCCA TACAGCCCCT GGTTTTGGTG AGGACGATTA CAATGTTGGT 8 340 

ATTCCTAATA ATCTTGAAGT CGCACTGACT GTTGATGAAC CTGGTATCAT GATGAAGAAT 84 0 0 

GCTGGTCCTG AATTTGAAGG TCAATTCTAT GAAAAGGTAG TTCCAACTGT TATTGAAAAA 84 60 

CTTGCTAACC TCCTTCTTGC C C AAGAAG AA ATCTCTCACT CATATCCATT TCACTGGCCT 35 2 0 

ACTAAGAAAC CAATCATCTG GCGTGCACTT CCACAATCGT TTGCCTCAGT TTCTAAATTC 8 5 80 

CGTCAAGAAA TCTTGGACGA AATTGAAAAA GTGAAATTCC ACTCAGAATG GGGTAAAGTC 8 64 0 

CGTCTTTACA ATATGATCCG TGACCGTGGT GACTGGGTTA TCTCTCGTCA ACGTGCTTGG 37 0 0 

G GTCTTCC AC TTCCTATCTT CTACGCTGAA G ATGGT AC AG CTATCATGGT AGCTGAAACT 87 60 

ATTGAACACG TAGCTCAACT TTTTGAAGAA TATGGTTCAA GCATTTCGTG GGAACGTGAT 88 2 0 

GCCAAAGACC TCTTGCCAGA AGGATTTACT CATCCAGGTT CACCAAACGG CGAGTTCAAA 8880 

AAAGAAACTG ATATCATGGA CGTTTGGTTT GACTCAGGTT CATCATGGAA TGGAGTGGTG 8 94 0 

CTAAACCCTC CTGAATTGAC TTACCCAGCC GACCTTTACC TAGAAGGTTC TGACCAATAC 9000 

CGTGGTTGGT TTAACTCATC ACTTATCACA TCTCTTCCCA ACCATGGCGT AGCACCTTAC 9060 
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AAACAAATCT TGTCACAAGG TTTTGCCCTT GATGGTAAAG GTGAGAAGAT GTCTAAATCT 9120 

CTTGGAAATA CTATTGCTCC AAGCG ATGTT GAAAAACAAT TCGGTGCTGA AATCTTGCCT 9180 

CTCTGGGTAA CAAGTGTTGA CTCAAGCAAT GACGTGCGTA TCTCTATGGA TATCTTGAGC 92 40 

CAAGTTTCTG AAACTTACCG TAAGATTCGT AACACTCTTC GTTTCTTGAT 7GCCAATACA 9 3 00 

TCTGACTTTA ACCCAGCTCA AGATACAGTC GCTTACGATG AGCTTCGTTC AGTTGATAAG 93 60 

TACATGACGA TTCGCTTTAA CCAGCTTGTC AAGACC ATTC GTGATGCCTA TGCAGACTTT 94 20 

GAATTCTTGA CGATCTACAA GGCCTTGGTG AACTTTATCA ACGTTGACTT GTCAGCCTTC 94 80 

TACCTTGATT TTGCCAAAGA TGTTGTTTAC ATTGAACGTG CCAAATCACT GGAACGCCGT 9 540 

CAAATGCAGA CTGTCT7C7A TGACATTCTT GTCAAAATCA CCAAACTCTT GACACCAATC 9600 

CTTCCTCACA CTGCGGAAGA AATCTGGTCA TATCTTGAGT TTCAAACAGA AGACTTCGTC 9 6 60 

CAATTGTCAG AATTACCAGA AGTTCAAACT TTTGCTAACC AAGAAGAAAT CTTGGATACA 9 7 20 

TGGGCAGCCT TCATGGACTT TCGTGGACAA GCACAAAAAG CCTTGGAAGA AGCTCGTAAT 97 80 

GCAAAAGTTA TCGGTAAATC ACTTGAAGCA CACTTGACAG TTT ATC C AAA TGAACTTGTC 99 4 0 

AAAACTCTAC TCGAAGCAGT AAACAGCAAT GTAGCACAAC TTTTGATCGT GTCTGAGTTG 9900 

ACCATCGCAG AAGGACCAGC TCCGGAAGCT GCCCTTAGCT TCGAAGATGT AGCCTTCACA 99 60 

GTTGAACGTG CTACTGGTGA AGTATGTGAC CGTTGCCCTC GTATCGACCC AACAACAGCA 10020 

GAACGCAGCT ACCAGGCAGT TATCTGTGAC CACTGTGCAA GCATCGTAGA AGAAAACTTT 10080 

GCGGAAGCAG TCGCAGAAGG ATTTGAAGAG AAATAAGATT GAAAAGTCTA GGCAAAATTC 1014 0 

AATTTGAGAA GAAAAGACAA CTAATTTTAT AGTCTATTAA ACGCATTGTA TCACGTTTTT 10 2 00 

GAATACCTGA TATG ATGCGT TTTTTATTTA TTTTAAAAAT TTGCGAGGTA TCACTTTTTA 102 60 

TACTCAACAA GAATCAAAGA G AAACTT AG C AAGCTAACAG TAGTAAGATA AAATAGGAAT 103 2 0 

TTGATATTAG GGATAAGATT GGTAAATAGT GTAATATTTT TACAACAATA AATTTATATA 10 3 80 

GTTATTTCTG GTTT C TG AAA AGTATTATAT TTTATTTCAT ATTATACAAA TTTTTATTTT 10440 

AT AAT ATC AG AACATACTTT TTTT AAAAGC AAATATGATA CAATTTTATT TGAAAAAAAT 10500 

AAAAAAGGAG ATTTTATTAT AAAATTAAAA ACACTTGCTT TAATTAGTGG TATCGTCGGT 1056 0 

CTTGTGGGAG GAATTTTACT TCTTATTGGT CCTTTTGTCT TGTTGGGAAT AGCGGTAAAC 1062 0 

ACAGCTGCTA CAACTCTTAA TGGAGGAGCT ACTGCAGGGG CTTTTTCAGG TGTAGCCTTA 10 6 80 

CTCTTGAATG CCTTGAAGAT TGCAAATCTT GTTCTTGGTA TCATTGCTAT TGTTTACTAT 10740 

AAAGGAGATA AGCGTGTAGG TGCAGCTCCG TCTGTACTAA TG ATTGTTT C TGGTGGAGTT 1080C 

AGTCTCATTC TATTCCGTTC TTAGGATGGG TTGGGGGGAT TTTTGCTATT ATCGGAGGAT 10 8 60 
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CTCTATTCCT 
AAAAGAACAA 
GCATGTGGAA 
ACATCCAATA 
ATTGAATCAA 
TATTTTACTA 
ACTAAACTAA 
GAAGATAGAA 
TCCTTTTTGG 
AGGGGAAACC 
GCCACGGTGA 
TGTCTGAAAA 
TCGAATTTGA 
CTCTTGAAGC 
GTCACTACGG 
AGCAAGTTCA 
ATGAGCACTC 
ATGCTGAAAA 
CTCCAGCTCT 
CCCTTGTAAA 
ACTTCCCACC 
TTGGAAAATA 
AGTATGATAA 
ACTTACATGG 
GTATTTGGCC 
GGTCCACTTC 
TT AG CTG AG A 
GAGCAGCACC 
ATTCACATTG 



TTCAACATTG 
AAAAGT TT AT 
CATAAAGTTC 
GTTCAAAAGA 
TACGCACTTA 
AAGCTGAGGA 
AAGAGTCAAC 
TAGAAACAAC 
TTTTGACTAG 
TCGGAAAATT 
GTCTGAATGG 
AGGTACACAA 
CCAAGCTTAC 
TTCTGACCAA 
TGG TTTG ACT 
CATCTGGCGT 
AGCTCACACA 
CTTGAAAGTG 
TAAAGATGGT 
ACACATCAAA 
ATTGGTATTC 
AAAAATTGTA 
GGAATAAAAA 
GTTGGGAAGA 
TTATAGGGCC 
CTGATAGTTC 
TTCGGCAGGC 
TAATTGGGCC 
AGTTC AG CCA 



AAGAAATTCA 
CGGTATAGGA 
AAAGAATACT 
AACAATCACT 
TGCAGATTAT 
AGGTTTCAAA 
TCAAAAATTA 
CAAAAACAAT 
CTTTTTTGTG 
TAAAGGAGAA 
AACAAAGCTA 
CAACCGATTG 
ACTTCAGTAT 
TTGTGGCTTC 
GGTAAAAACA 
CGTTCATACG 
CACCGTCGTT 
ACTTTGGAAC 
AAAAACGTAT 
GGTTTGTCAG 
GAATTCGACG 
AGTCTAGAAT 
ACAAGATTAT 
AGAAGCTTTA 
TGAACGGATT 
AGATTATCAG 
TTTGGAGAAG 
AAAAACGAGT 
GCAAGGTACG 



365 
AATCAGAAGA 
GTAGCTCTAT 
TCAACAAGTA 
TTCGATACAC 
ATAGATCTTT 
GGCATAGCTA 
TTCGATGCGC 
GTGATTGCCA 
AAAAATTGTG 
TCCATCTAAT 
ACCTTTTCAC 
ACGCTGGTAA 
TGAAACGTGC 
CAGTTGAAAA 
AAGCTGAAGC 
ATCTATTCCC 
ACGCTTCACT 
GTGCTCTTCC 
TCGTAGGAGC 
ATGACGAGAT 
AAAAATTCAA 
TGATTTCTAG 
GTACTGGCCT 
CCGATAGGCA 
CAATTTAATG 
GGTGGAAATC 
AGAGATTACA 
CAATATGGGA 
ACTTTGTCTC 



ATAAAAGGTA 
TATCTCTTTC 
ATGATGAGAA 
CGGTTGTAAC 
ATAAAAATAT 
TGGAAAATAA 
AGAAAAAAAG 
AACATTGTCA 
TAAAATAGAA 
GGTAAAATTG 
TGGTTGGGCT 
ATTGATCAAA 
TATCAAAACA 
ATCATGGCGC 
TGCTGAACAA 
TCCAAACATG 
TGACGACTCA 
ATTCTGGGAA 
TCACGGTAAC 
CATGGACCTG 
CGTCGTTTCT 
CCTTTT7ATC 
ACAAGCAACC 
ATGGTTCTTT 
AAAAAAGTCT 
TTCAGGATCA 
ATCTGGCTAA 
CCTATCTGTC 
AGGTGACGGA 



TTTT AG CATG 
TCTTCTAGTT 
GACAGTAGCA 
AGACGATGCG 
TTTTGATGAT 
TGACTCGTTT 
GTTAAATAAT 
AACAGTCCTT 
TAGATAAACG 
GTTTTTGCTC 
GATGTTGATT 
GAAGCTGGTA 
ACTAACTTGG 
TTGAACGAAC 
TTTGGTG ATG 
GACCGTGATG 
GTTATCCCAG 
GATAAAATCG 
TCAATCCGTG 
GAAATCCCTA 
GAATACTACC 
TTAGTATGGA 
AGCTTCAACC 
AGGAGCAAAA 
CTGGTCTGGA 
GTATGTTTTT 
GGAACTGGCT 
TTTTGGGG AT 
CTATCAGAGA 



10920 
10980 
11040 

moo 

11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
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CAGCTGAATA TTAGTAAGGC ACTTGCGACG ACTTCTTATG TCTATAAGGG AACGCGATTT 12 660 

GAACGTAAAG CTTTTGCGAG TTTTCCAGAT GATCTCTTGG TTCAATCTTT TACTAAGGAA 12720 

GGGTTGGAAA CTCTAGATTT TACT AT AG AA CTATCCTTGA CCTGTGATTT GGCTTCTGAT 12 780 

GGAAAGTATG AGCAGGAAAA ATCTGATTAC AAGGAGTGTA AGTTGGATAT TACTGATTCT 12 8 40 

CATATCTTGA TGAAGGGAAG AGTTAAGGAT AATGATCTGC GGTTTGCTAG TTATCTAGCT 12 900 

TGGGAAACGG ATGGAGATAT TAGAGTTTGG TCAGATAGGG TTCAGATATC AGGAGCCAGT 12 960 

TATGCCAATC TCTTCTTGGC CGCTAAGACG GATTTTGCCC AAAATCCTGC TAGCAATTAT 13020 

CGCAAGAAAC TAG ATTT AG A GCAACAGGTG ATAGACTTGG TGG AC AC AGC TAAAGAAAAG 13 080 

GGCTATACCC AATTGAAATC AAGGCATATC GAGGACTACC AAGCCT T ATT CCAGCGTGTT 13 140 

CAATTGGATT TGGAAGCTGA TGTTGACGCA TCC AC T AC AG ATGATTTGTT AAAAAATTAT 13 200 

AAGCCACAAG AAGGGCAGGC TTTGGAGGAG CT G TTCTTCC AGTATGGACG GTATTTATTG 13 2 60 

ATTAGTTCGT CCAGAGACTG CCCAGATCCT CTACCAGCTA ACCTACAGGG AGTCTGGAAT 13 320 

GCGGTCGACA AT CCTCCTTG GAATTCGGAC TATCACTTAA ATGTCAATCT GCAGCTGAAT 13 380 

TATTGGCCAG CCTATGTTAC CAATCTCCTA GAGACGGTCT TTCCAGTCAT CAACTATGTA 13440 

GATGATTTGC GTCTCTATGG TCGTCTAGCG GCTGTAAAGT ATGCAGGAAT CGTCTCTCAG 13 500 

AAAGGTGAGG AGAATGGTTG GTTGGTTCAT ACTCAAGCGA CTCCCTTTGG TTGGACGGCA 13 560 

CCTGGTTGGG ATTACTATTG GGGTTGGTCA CCAGCTGCCA ATGCGTGGAT GATGCAAACC 13 620 

GTTTATGAAG CCT ATTT ATT TTATAGGGAC C AAG ACT ATC TCAGGGAGAA AATTTATCCC 13 68C 

ATGTTGAGGG AAACGGTTCG TTTTTGGAAT GCCTTTTTAC ATAAGGATCA GCAGGCGCAG 13 740 

CGTTGGGTGT CTTCTCCGTC TTATTCCCCA GAACATGGGC CGATTTCGAT TGGCAATACC 13 800 

TATGACCAAT CTCTCATTTG GCAGTTATTT CATGATTTTA TTCAGGCTGC TCAGGAATTC 13860 

CGACTCGATG AGGACTTGTT GACTGAGGTT AAGGAGAAGT CTGATTTACT AAATCCTTTG 13 920 

CAAATCACTC AATCTGGTCG AATCAGGGAG TGGTATGAGG AGGAAGAGCA GTATTTTCAA 13 980 

AATGAGAAAG TGGAGGCCCA GCATCGGCAC GCTTCCCATC TAGTGGGACT CTATCCTGGC 14 040 

AATCTCTTTA GCTACAAGGG ACAAGAGTAT ATTGAAGCGG CGCGTGCTAG CCTCAATGAT 14100 

CGTGGAGATG GCGGCACAGG CTGGTC C AAG GCTAATAAGA TCAATCTCTG GGCGCGTTTG 14160 

GGAGATGGCA ATCGAGCCCA TAAATTATTG GCAGAGCAGT TAAAGACATC CACCTTGCAA 14 220 

AATCTTTGGT GTAGCCATCC TCCTTTTCAG ATAGATGGTA ATTTTGGTGC TACTAGTGGC 14 2 80 

ATGGCAGAAA TGTTACTCCA GTCTCATGCA GCTTATCTGG TACCTCTAGC TGCCCTACCT 14 3 40 

GATGCTTGGT CAACAGGTTC TCTTTCAGGC TTAATGGCAC GTGGACATTT TGAAGTGAGC 144 00 
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ATG AG CTGGG AAGATAAAAA ACTCTTACAG TTGACCATTT TATCAAGGAG TGGAGGAGAT 14 4 60 

TTGCGAGTTT CTTATCCAGA TATTGAGAAG AGTGTGATTA AAATGAATCA AGAAAAAATA 14 520 

AAAGCGAAAT GCATGGGGAA AG ATTGT ATT TCGGTGGCAA CAGCAGAAGG TGATCTTGTT 145 8 0 

CAATTTTATT TTTAAGAAGA TGTTATAAGG CAGTAATTTG AAACTGCCTT TTAATAAGGA 14 640 

TTTAAGAATA TAAGCAGTTT TCAACTAGTT GAAAAAACGT TATAATGATA ATAGGAAGTA 147 0O 

ATACTCAATG AAAATCAAAG AGCACAAACT AGGAAGCTAG CCGCACGTTC CTCAAAACAG 147 60 

TGTTTTGAGG TTGCAGATGG AAGCTGACGT GCTTTGAAGA GAGATTTTCG AGGAGTATAA 14 820 

TTTGTTTGAT AGAGGGTGGG TCTGATGGCT TATATTGAGA TGAAACACTG TTACAAGCGT 14 880 

TATCAGGTTG GGGACACGGA GATTGTGGCC AATTGTGATG TGAATTTTGA GATTGAAAAG 14 9 40 

GGGGAGCTGG TTATTATCCT TGGTGCTTCA GGTGCAGGCA AGTCAACAGT TCTTAACCTT 15000 

CTTGGGGGAA TGGATACCAA TGATGAAGGG GAAATCTGGA TTGATGCTGT TAATATTGCG 15060 

GATTATAGTT CCCACCACCG CACCAATTAC CGTAGAAATG ATGTGGGGTT TGTTTTTCAG 15120 

TTTTATAATC TAGTTTCTAA TCTGACAGCT AAGGAAAATG TGGAACTGGC TTCTGAAATT 15180 

GTG AC AG AT G CCTTGAATCC TGATCAGGCC TTGACAGATG TAGGTCTGGC TCATCGTCTC 152 40 

AATAACTTTC CAGCCCAGCT TTCTGGAGGG GAGCAACAGC GAGTCTCCAT TGCACGCGCG 15 3 00 

GTAGCCAAAA ATCCTAAAAT TCTCCTTTCT GATGAACCGA CTGGAGCCTT GGATTATCAC 15 3 60 

ACGGGCAAGC AGGTTTTGAA AATTCTCCAA GACATGTCTC GTCAAAAGGG AGCGACGGTC 15420 

ATCATCGTGA CTCATAATGG AGCTTTGGCG CCCATTGCTG ATCGCGTGAT TCAAATGCAC 15480 

GATGCCAGTG TCAAGGATGT GGTGCTCAAC CAGCATCCTC AGGATATTGA CAGTTTGGAG 15540 

TACT AGC ATG ATCAAGCGAA AAACTTATTG GAAGGACTTA GTTCAGTCCT TCACAGGCTC 15600 

CAAGGGGCGT TTTTTATCCA TCTTGATCCT GATGATGTTG GGATCTCTAG CCTTAGTAGG 156 60 

CCTCAAAGTA ACCAGTCCCA ACATGGAGGC GACAGCTAAT GCTTATTTAA CAACTGCTCA 15720 

AACCTTGGAT TTGGCAGTCA TGTCTAACTA TGGCTTGCAT CAAGCAGACC AAGAAGAACT 157 80 

AAAACAGACG GAGGGCGCAG AGGTCGAGTT TGGCTATTTG ACAGATGTGA CTATGGATAA 15 840 

TGGGCAGGAT GCCATTCGGC TGTACTCCAA ACCAGAGCGA ATTTCAACCT TTCAGCTAAG 15 900 

AAAGGGACGA CTTCCTCAGT CAGACAAGGA AATCGCTTTG GCCACTCATT TGCAAGGCCA 15 9 60 

ATACAGCGTG GGACAGGAGA TTAGTTTTAA AGAAAAAGAA GAGGGTCATT CCTCTTTAAA 1602 0 

AG AC CAT ACT TATACCATTA CTGCTTTTCT GGATTCGGCT GAAATCCTCT CCCAGCGAGA 16 080 

TATGGGCTAC GCAGGAAGTG GAAGTGGGAC TCTGACAGCC TATCGGGTGA TTTTACCTAG 16140 
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TCAATTTGAT 


CAGAAAGTCT 


ACAATATAGC 


TCGTTTGAAA 


TATCAAGATT 


TAGCGGGTTT 


16200 


AAATCCCTTT 


TCATCAGCTT 


ATGAAGAAAA 


ATCCAAGCAA 


CATCAAGAAG 


AGCTTGAACA 




AATTTTATCA 


GAT AATGG C A 


AGGTACGTCT 


GCAACTTTTG 


AAAAAAGAAG 


GACAAGAGTC 


16320 


TCTAGACAAG 


GGGCAAGAGA 


CCCTTGACAA 


GGCTCAGACT 


AATTTGCAGG 


AAGGCAAGCG 


16380 


TCGTTTAGCA 


GCTGCTCAAG 


CTCGTATACA 


GGCTCAAGAA 


AGTCAACTAG 


CCTTGTTTCC 


1 5440 


TCAAGTTCAG 


AGAGAGCAGG 


CTAGTGCTCA 


ACTTACCCAA 


GCCAAGCAGG 


AATTGGGCAA 


1 6500 


GGAAGAGGAC 


AAACTAAAGC 


AAGCTGAACA 


AAATCTAGCC 


CAAGAAAAGG 


AAAAATTAGA 


16560 


AAAACATCAG 


CAAGTCTTGG 


ATGATTTGGC 


GGAGCCAAGG 


TATCAGGTTT 


ATAATCGTCA 


16620 


GACCATGCCA 


GGTGGTCAGG 


GCTATCTTAT 


GTATAGCAAT 


GCTTCATCCA 


GTATTCGACC 


1668 0 


AGTGGGCAAT 


ATCTTTCCTC 


TGGTACTTTA 


TGCCGTAGCA 


GCCATGGTGA 


CCTTTACGAC 


16740 


CATGACTCGC 


TTTGTAGACG 


AAGAGCGAAC 


TCATGCAGGG 


ATTTTTAAGG 


CCTTGGGTTA 


16800 


TCGTAGTAAG 


GATATTATCG 


CCAAGTTTCT 


CCTTTATGGA 


CTACTAGCTG 


GGACTGTCGG 


168 60 


AACGGCTCTA 


GGTAGTATAC 


TTGGTCATTA 


TTTGCTAGCC 


AGTGTAATTT 


CAAGTGTCAT 


16920 


TACAAAAGGC 


ATGGTGGTGG 


GAGAAACTCA 


GATTCAGTTC 


TATTGGACCT 


AT AGCTT AC T 


16980 


AGCTTTTGTC 


TTGACCTTGT 


TGGCGAGTGT 


GTTACCAGCC 


TATCTGGTGG 


CTTGGAGGGA 


17040 


ACTTCATGAC 


GAAGCAGCCC 


AGCTTCTACT 


TCCTAAACCT 


CCTGTCAAAG 


GAGCTAAAAT 


17100 


CTTATTGGAG 


CGTATCGGTT 


TTATCTGGCG 


TCGTCTCAGT 


TTT ACT C AT A 


AGGTAACAGC 


17 160 


CCGCAACATC 


TT TCGTT AT A 


AGC AG AG AAT 


CTTGATGACA 


ATCTTTCGTG 


TGGCAGGTTC 


17 220 


TGTAGCTCTG 


CTCTTTGCAC 


GTTTGGGAAT 


CCAATCTTCT 


GTAGCAGGAG 


TTCCGTCTAA 


17 2 30 


ACAGTTTCAA 


CAAATCCAAC 


AGTATCAGAT 


CCTTGTCTCT 


GAAAATCCTA 


GTGCGACCAA 


17 3 4 0 


TCAGGACAAG 


GTAGAGCTAG 


CAGAAGTGTT 


GAAAGGGCAG 


GAGATACTAG 


CCTACCAGAA 


17400 


AATCTATTCT 


AAAGCGCTAT 


ACAAGGATTT 


CAAAGGCAAA 


GCTGGTCTTC 


AAAACATTAC 


17 4 60 


TCTTATGATC 


ATAGAGAAGG 


AAGATTTCAC 


TCCCTTTATC 


CATCTTCAAC 


ATCATCAGCA 


17520 


GGAGCTGACA 


TTAAAAGATG 


G CATC GT TAT 


TACAGCTAAA 


CTCGCCCAGC 


TGG C AGGTGT 


■ 7580 


CAAGGTTGGG 


CAGACTTTAG 


AAATTGAAGG 


TAAGGAACTA 


AAGGTCGTTG 


CTATTACTGA 


17640 


GAACTACGTT 


GGTCACTTTA 


TTTATATGAG 


TCAGGCTAGC 


TATGAGCAAC 


TTTACGGACA 


17700 


GCTACCCCAA 


GCCAACACTT 


ATCTGGTCTC 


ATTAAGGGAT 


ACCACTGCAA 


CTAGTATCGA 


17760 


AAGTCAGGCG 


GGCTTGCTTA 


TGAATCAATC 


TGCGGTGTCC 


AGCGTTGTCC 


AAAATGCTTC 


17820 


AGCCATTCGA 


CTCTTCGACT 


CTATCGCTAG 


CTCACTCAAT 


CAGACCATGA 


CCATCTTGGT 


17880 


CATCGTATCG 


GTTCTATTAC 


CTATTGTCAT 


' CCTTTACAAT 


CTG AC C AAT A 


, TCAACGTAGC 


17940 
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PGTGAACTCT 


CC ACT ATCAA 


GGTTCTTGGT 


TTTCATAATA 


ATGAAGTCAC 


18000 


Li- 1 L 1 ftLn * 1 


TArPGTGAGA 


CG ATTGTGC T 


GTCCCTTGTG 


GGAATCGTAC 


TTGGTCTGAT 


18060 


Awv. 1 ulj 1 * » 


TATTTACACC 


AATTTTTGAT 


TCAAATGATT 


TCGCCTGCGA 


CTATTCTCTT 


18120 


l i A 1 LHjCAU 




AAGT CT ATGT 


AATCCCAGTG 


GCAGCAGTAA 


GCATCATTTT 


18180 




r.r.T'TCTTC G 


TCAATTATTA 


TCTGAGAAAG 


GTTGATATGT 


TAGAAGCCCT 


18240 


tj AAA 1 L_ iljin 


rz a cit a AfV^T A. 


GTTATTTTTA 


GCTGATTGAA 


CTTCTATTTA 


CTAATATTCA 


18300 


AAAft H-L L V_L. 


^TTTT a A An A 
olll LAAnUrt 




CTTTGTGACA 


GAGGATTTTT 


TCTATAGGGC 


18360 


I . i AOLAUC l 


(jLAAL 1 UL uu 


PTTPGAACTT 


TGGCTCAGAA 


TTGATATTAT 


CCACGTATTC 


18420 


AALu i ALiLLrA 


nlLulnl I Vi 1 


PACVTATCGAG 


GACAAAGACT 


GCGCGTGCTA 


ATAGGTGCCA 


18480 




AAuAuOLiLA * 


AATfCJCGCCC 


GAAAGAATGG 


TCAAAGTACT 


C TG AAAGC AT 


18540 


AA i ubLA * I La 




TAnCACCGCA 


CCAACGTTTT 


TGAGCAAAAG 


GTAGGTCCAT 


18600 


I OAAALAb I L 


AA 1 AL UnL L, 


TGTTGTCCAC 


TCCAGCCAAT 


TCTTCATTAA 


AACGACGTGT 


18660 


i. I bAu i I UALi 


LAUn 1 L>L\_ 1 vj 


T ATPCATAGA 


AGGAACGACA 


CTCAACACTT 


TTTTCTTGCC 


18720 


A 1 LAAAA L t_A 


LiLLAuAuAl 1 


^TTTAGAAAG 


ATCTGTTGTA 


GTAAGAGAAA 


AATCAAGCGC 


18780 


L IIUI LViL v_L» 


AL 1 1 0 > AO i I 


GTTTACCTGT 


AAAGCTCACA 


GGATTTCCGA 


G AAAAGTT AC 


18840 


La 1. ftoOA i AL> 


1LLAA1L 1 1 I 


TTTCTTCCAT 


TTTAGCTGAA 


ACACTCGGAA 


TTTTCCAATG 


18900 


All 1 L> Al_ L- Ljsj 




ATAGAAAAAA 


CGCCAGCTCA 


TGTGAGAATG 


ACGTTTTTCA 


18960 


i rtw 11 I A i I 




TTCAGCAATC 


TTGTCAAGGT 


TGTATTTCAT 


CATCCTGTAG 


19020 


• ALiL «. V_» i LUL 


£»*T*H"> £*T» "^"J 1 A^ G C 


TTGTTCTGC G 


ATAGAGTCAG 


TAAAGATTTG 


AGCGTAGATT 


19080 


Ouun IVj 1 i i. Li 


TGTCTTG AG A 


AACAGTTTTC 


ATTGGACCCT 


CATCCACACT 


TGATTCTACA 


191-30 


AAunVj 1 bn 1 vj 




TTGGCGAAGT 


TTTTCAACCA 


AGGTCTTGAT 


TTGTTCAGGA 


19200 


GTTCCTTCTT 


CTTCAGTATT 


GATTTCCCAG 


ATGTAAGCAC 


TTGGGACACC 


ATAGGCTTTA 




CAGAAGTATT 


TGAATGCTCC 


TTCGCTGGTT 


ACAATGAGTT 


TCTTTTCAGC 


AGCGATCTTA 


19320 


TTAAATTTAT 


CCTTACTTTC 


TTTATCAAGT 


TTGTCTAACT 


TATCAGTATA 


TTCTTTGAGA 


19 380 


TTTTTTTCAT 


AGAATTCTTT 


ATTGTTAGGG 


TCTTTGGCGC 


TCAATTGTTT 


GGCGATATTT 


19440 


TTAGCAAAAA 


TAATACCGTT 


TTCAAGGTTA 


AGCCAAGCGT 


GTGGGTCTTC 


TTTTCCTTTT 


19500 


TCATTTTGAC 


CTTCAAGGTA 


GATAACATCA 


ACGCCGTCGC 


TGACTGCGAA 


GTAGTCTTTG 


19560 


TTTTCAGTTT 


TCTTGCCATT 


TTCTACCAAT 


TTTGTAAACC 


AAGCATTGCC 


ACCTGTTTCA 


19620 


AGGTTGATAC 


CGTTATAGAA 


AATCAAATTA 


GCC7CAGAAG 


TTTTCTTAAC 


GTCTTCAGGA 


19630 
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AGTGGTTCCT ATTCGTGTGG GTCTTGCCCA ATCGGAACGA T ACT ATG AAG GTCAATTTTG 197 4 0 

TCACCAGCAA TATTTTTAGT AATATCAGCG ATGATTGAGT TTGTAGCAAC AACTTTTAGT 19800 

TTTTGACCAG AAGTTGT ATC TTTTTTTCCG CTAGCACATG CTACAAGAAT GATTGCAGAA 19860 

AGAAAGAGAA CGAGTAATGT ACCTAATTTT TTCATTAGAT CCTCCAATTT AT T AGGGCTT 19920 

TGCCCCTTAT TTTAACAAAT GTTTATTTTT CAGTTTCAAA TATCGTTGTT TGGGAGCGAT 199 80 

AAAGAAGCTA ATGAGAAAGA AACTAGCAGC TGTAAGCACG AT ACT AG AAC CTGCCGCAAC 20040 

ATTAAAACTA TAGCCAATAA AGAGTCCCAA AACTGAAGCA GTAGCTCCGA AGGTTGAGGA 20100 

AAGGAAAATC ATACTTTTCA G ACT ATT AG C AT AC AG AT AA GCAGTTGCAG CTGGGGTAAT 20160 

CAGCATGGCT ACAATCAGGA TAGTTCCGAC ACTTTGCATG GCTOTCACAG ACACGAGAGT 202 20 

CAGGAGTACC ATGAGAAGGT AG TGAT AG AA ATTGACAGGC ATTCCCATGG CTTTAGCCAA 20280 

GAGTTCATCA AAGGAACTTA TCAAGAGTTG CTTGAAGAAA ATCCAGATTA AC AAG AGG AT 2 034 0 

AGCTGCCCCC ACACCCATAG TAATAAACAT ATCCGTATCT TGGACGGCCA GGATATTACC 20400 

AAAAAGG AT A TGGAAAAGGT C AGTTG AACT TTTAGCGACA CCAATCAAGA TGATACCGAG 204 60 

GGCTAAGAAA GAAGAAAAGG TAATGCCCAT GGCGGTATCG CTTTTGATAA TCGAGTTTCC 20520 

TTTGATGTAG GTAATGATGA TGGCAGCTAG CAATCCAAAG ACAATGGCTC CGATAAAGAA 2 0 580 

GTCAAGGCCC AAGATGAAGG AT AGGGCT AC ACCTGGTAAG ACAGCATGTG AAATGGC AT C 2 0640 

TCCCATGAGT GACATCCCGC GTAGAATAAT GAAACATCCC ACAGCTCCAG CTACAATCCC 20700 

GACGACAATA GCTGTTATCA AGGCATTTTG TAGGAAATGG AATTTTTGC A ATCCATCGAT 207 60 

AAATTCTGCA ATCATAGGTC ACCTCCATTG AAAAAGAGTT GATTACCGTA AGCTTCTTTT 2 0820 

AGATTGGTTT CGGTAAAAGT TTCTTTTGTT GGACCAAAGG CAATCACTTC TCGATTGACA 2 08 80 

AGTAAGACTT GATCGAAGTA GTGGGGAATC TTGCTGAGGT CGTGGTGAAC GATGAGAACC 20940 

CTCTTCCCAG CTTTTTTCAA ATCTCTCAGC GTATTCATGA TGATTTCCTC ACTGACAGAG 21000 

TCAATCCCAG CAAACGGTTC ATCCAAGAGG ATATAGTCGG CTTCCTGCAC CAAACATCTC 210 60 

GCAATCAAGA CCCGCTGGAA TTG AC CTCC A GACAGTTCAC TAATTTGACG TTCAGCGTAG 2112 0 

TCAGCTAGGC CGACGATTTC AAGGGCCTCT TGCACTTTCT TCCAATGTTT AGCCTTTAAA 21130 

CTTCGAAAGA GAGGAATAGA GGG AAAT ACT CCTAACGAGA CGCATTCCTT GACCTTGATG 21240 

GGAAAGTTGT AGTCGATATT GATTTTTTGT TCGACATAGG CAATTCGGTG TAAGGATTTT 213 00 

TTAACTTCCT TGTCATCGAG AAATGCCTGA CCTTGATGTG GGATAATTCC CAACATACCT 2136C 

TTTAATAGTG TTGATTTCCC AGCGCCGTTT GGACCAATGA TGCCGGTAAT TGTTGGTCCA 214 20 

TGGAGCACTA GTGAAATATC CTTAAGTGCC AACGTTT CTT TGTAGGAGAC ACTGAGGTTT 214 8.0 
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TCGATACGTA TCATAAACTT GTATTCCTCC TGTCTCTTAA TATACATTAA AAAAAAAATT 21540 

AAGTCAAGTT AATTTTTGAA AAAATTAAAA TAATAACTCA AAAATAGATT CTAAAGATAA 21600 

CTTTCAGGAT AAATTTCTAA AT T AT AAAAC GCATAGTATC AAGTGTAAAA AACTTGGAAT 21660 

TATGCGTTTT AT C ATGG AAA GATTTTTTAT AAT AG CT AAA AAATAA 217C6 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6171 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37: 

GATCCCCAGG AAAAACCGAG GTTTTCCCAA TCAATCGTTA CTGTCATATT CCACTCCTTA 60 

TTCTAAAAAC CTATTTCTTA TATTCTACAC TATTTTTCTA AAATAGCAAG TATATTTTGT 120 

AATTTTCAGA AAATTTCTCC AATAAAAACC AACTCTTAGA ACTGATTCTT CATTTCACTT 130 

ATTTATCTTC AGTAACTACT TCCTGAAGAT AAG CGTC AAA AACTTCTTCA TCTGAAATCC 2 40 

TGTCAGAAAT GAAGCTTCCA TTGCTAGTGC GTTCTGACAA GTTCAAGTCT TGCAATCGCC 3 00 

TTT C AT AG AT TGTTCCTTTA TTGGATTGGA CAAGCAGAGT TTGGTCGTTC ACATCCACTT 3 60 

CCGTACTGAA GAAATCGCCA AC AAATC CTT GCTCTGCAAC TGCTCCTGCC AAGAAGACAC 4 20 

GATGCGGTTT GTTTTTCAAC TCACGCAAGA CTTGTAATCC TCGTTTGGCA CGGCTGGTTG 4 80 

CTAGAATTTC CTCAATGGAA ACACGTTTCA AGCTTCCACG CTGGGTCAAG AGGTAGAAGG 54 0 

ACGAAGTATT ACAGATAAAG CCAGATTGGA GCACATCATC TTCTTTCAAA TTCATAGCCT 600 

TGACACCTGC TGCCTTAGCA CCGACAACCG GAACCTCTTC GATATTGAAA CGCAGGGCAT 6 60 

AACCATTTTG ACTAACCAAG AC AAC AT CAT CTAGTTTAAT CGGACCCACT GCTACAATCT 7 20 

GATCTGTATC GTCTTTGAGC TTAGCATACT TGACAGACTT AGATCTATAG GTCCGCCATG 7 80 

GAGTGAATTC TTTTCGCTCT ACCCGTTTGA TTTGACCAAG GCGAGTCACT GCAAAGTAGG 84 0 

TTGTCGCATC GTCAAACTGA TCCAGTACTT CCACATAAAG GATTTCTTCA TTCGTTTCAA 9 00 

AGTTTGTGAT GGTTTGGCTC AG ATGCTCT C CGATGTCCTT CCAACGAATA TCTGCCAACT 9 60 

CATGGATTGG TCTGTAGATG ACATTTCCAA GACTTGTGAA CATCAAGAGG TGCTGGGTTG 10 20 

TCTTGGCAGA TTGAACAAAA ATCAAACGGT CATCATCACG CTTGCCAATT TCTTCCAAGG 1080 

TGGAAGCCGC AAAGGAACGT GGACTGGTAC CCTTCATGTA ACC7GCCTTG CTCACGCTGA 114C 
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CGTAGGTATC TTCCTCAGCG ATAAGACTAG CTGTATCAAT CTCAATTGCT TTCGCAGTGT 12 00 

CTTCTAAAGA ACTCAAACGA CGAGTTGCAA ATTTCTTCTT GACCTCACGA AGTTCTTTCT 12 60 

TCATGAGATT GTACATAGTC CTTTCATCAC CGATAATAGC CGCCAGCATA GCAATCTTCT 13 20 

CACGAAGCTC TGCTTCTTCT TCCTGCAAGA CAACCACATC GGTATTGGTC AAACGGTACA 13 80 

GTTG C AAAGT T AC GAT AG C C TCAGCCTGTT CTTCCGTAAA AT C AT AGCT A ACTTTGAGGT 14 4 0 

TTTCCTTGGC GTCCGCCTTA TTCTCAGAAG CACGGATAAG AGCAATGACT TCATCCAAAA 1500 

TCGAAATCAC ACG AATC AAA CCTTCGACGA TATGGAGACG TTTCTCAGCC TTTTCTTTGT 1560 

CAAAGCGTGA ACGCGCCAAA ATCACTTCTC GACCGTGAGC GATATAGCTA GACAGGATTG 1620 

GAACAATCCC AACCTGACGA GGTGTGAAAT TGTCAATCGC CACCATATTA AAGTTGTAGT 1680 

TG ATTTGT AG GTCGGTGTAC TTAAATAAGT AGTTGAGAAC AAGCTCAGTA TTAGCGTCTT 17 40 

TCTTAAGTTC GATAGCGATA CGAAGACCAT CACGGTCAGA CTCATCACGA ACCTCAGCAA 1800 

TCCCAGCTAC CTTGTTATTA ACACGAACAT CATCGATTTT CTTGACTAGA TTGGCCTT AT 18 60 

TG ATTTC AT A AGGAATCTCA AT AAT AACG A TTTGTTCCTT ACCACCTTTT AGCTTTTCAA 19 20 

TTTCAGTCTT GGAACGAACA ACCACGCGCC CTTTCCCAGT CTCATAAGCT TTCTTGATTT 19 80 

CATCACGACC CTGAATAATA GCCCCTGTAG CGAAGTCTGG TCCAGGCAAC AATTCCATGA 204 0 

GTTTATCAAT CTTTGCAGTT GGGTGGTCAA TCATGTAAAC TGCAGCATCT ATGACCTCAG 2100 

CTAAATTATG GGGAGGAATG TCTGTGGCAT AACCAGCCCA AATCCCAGTC GAACCATTGA 2160 

CCAAGAGGTT TGGAAAGGCT GCTGGCAAGA CCGTTGGTTC TTTCTCCGTA TCGTC AAAGT 222 0 

TCCATGCAAA AGGAACTGTC TTTTTCTCGA TATCCTGAAG AAGGTAGCCT GCAATTTCAG 228 0 

ACAAACGTGC CTCAGTATAA CGCATAGCCG CAGGAGGATC TCCGTCCATA GAACCGTTAT 2 340 

TACCGTGCAT TTCAACTAGA ATCTCACCAT 77TTCCAGTT rTGTGACATA CGAACCATGG 24 00 

CATCATAGAT AGAAGAATCC CCGTGTGGGT GGAAATTCCC CATGATGTTC CCGACTGACT 24 60 

TGGCCGACTT ACGGTAGCTC TTGTCAAAAG TATTGCTATC CTTATTCATA GAATAAAGAA 2 520 

TACGGCGCTG AACCGGCTTC AACCCATCAC GAATATCTGG CAAAGCCCGG TCTTGAATAA -!5 80 

TGTACTTGGA GTAGCGACCA AAGCGCTCTC CCATGATGTC CTCCAGGGAC ATGTTTTGAA 2 640 

TGTTAGACAT AAGATACAAA GCCCATAAAA TACCAAGTGA AAATAGAAAA TTCTTGAAGT 27 00 

AAGCAAACTC ACAAGAGAAT TTATCTTTTT CACACAGTAT CTAGGGCGTG TTCAACTCCT 2 7 60 

TTCAAAGAAT GTAGAGTAGG TTTTTATGCA GTAAAAGATA TTTTACGGGA ATTCCTCCCG 2820 

TGTTCAGTTA CGATAAGTAA CCAAACTATC CTGTTTGTAT TTTTCAATAT GAAAATCTGG 2 8 80 

TTTTCCAAAA TT AGTCTT AG TTTGTGTCTT AGCCGCTCCC TTAACCGCCT CTTTGAGATA 2940 
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AGCACTCATA GCAGATTCTT CATTAATAAT CCTCCAATTT TTTCAAACCA AGATTTTCAA 3000 

ACTGCTTTTT CACATAGTCA TTCACATCCG ACTCTAATTT CCAGTTTACT AACATATTAT 306 0 

TTTCTTTCAT TAAAACACTG TCGTTTCTTC TAGCGTAAAC TTGACATTAT CTTCAATCCA 3120 

TTTACGGCGT GGTTCTACCT TATCTCCCAT GAGAACATTG ACGCGGCGTT CGGCGCGCGC 3180 

TAAATCTTCA ATTGTGACAC GGATGAGGGT ACGTGTTTCT GGGTTCATGG TTGTTTCCCA 3 240 

GAG CTGGT C C GCATTCATCT CACCAAGTCC TTTGT AT CGT TGGAGGGTAG CGCCTTTACC 3 300 

GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA CTTCTTCTTT 3 3 60 

CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA CATGACCTGC 3 420 

CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT GCATATGGGC 3 480 

ACCGTCGGTA TCCGCATCGG TC ATGAT AAT . GATCTT ATCA TAGTTGGCAT CTTCAATAGA 3 54 0 

CAAGTCTGCT CCAACACCCG CACCAATGGT ATAAATCATG GTATTGATCT CTTCATTTTT 3 600 

GAGGATATCC GCCATCTTGG CCTTGGCTGT ATTGACAACC TTACCACGAA GAGGTAGAAT 3 660 

AGCCTGGAAC TTGCGGTCAC GACCTTGTTT GGCAGAACCA CCGGCAGAGT CCCCCTCAAC 3 7 20 

TACATAGAGT TCATTCTTAG CAGGATTCTT AGATTGGGCT GGGGTCAATT TCCCAGACAA 3 7 80 

CAAGCCCTTA TCTTTCTTGT TTTTCTTCCC ATTTCGGCTC TCATCACGCG CCTTACGTGC 3 84 0 

TGCTTCACGA GCATCACGGG CCTTGATACC CTTGCGGATG AGGTTAGAAG CTAATTCCCC 3 900 

ATTTTCCATA AGGAAAAAGG TCAACTTATC ACCCACTATT CCATCCACAA CTGGGCGAGC 3 9 60 

TAGGGGGCTT CCTAGTTTAT CCTTGGTCTG TCCTTCAAAC TGCAAGTCTT CTTCAGGAAC 4 02 0 

TAAGATAGAA AGAACGGCCG CTAGTCCCTC ACGATAGTCT GAACCTTCAA GGTTTTTATC 40 80 

TTTTTCCTTG AGAAGACCTG TTTTACGTGC ATAGTCATTC ATGACCTTGG TAATGGCAGA 4140 

CTTGAGTCCT GTCTCGTGCG TTCCACCGTC CTTGGTGCGA ACGTTATTGA CAAAAGATAG 4 2 00 

AATCTTATCT GAGAATCCGT CATTGTACTG GAGGGCTACT TCCACTTGAA AACCATTGTC 4 2 60 

TTCCCCTTCA AAGTAAAGAA CTGGCGTCAA GATTTCCTTA TCTT CGTTG A GATAAGAAAC 4 3 20 

AAAATCTTGT ACTCCATTCT CATAGTGGAA CTCAATCGCT TCATTTGTTC GCTTGTCCGT 4 3 80 

TAAAGACAAG GTCACATTTT TCAAGAGAAA GGCTGATTCA TTAAGGCGCT CTGAAATCCT 4 4 40 

ATTGTACTTG AAATCTGTCG TAGAAAATAT AGTCGCGTCA GGCATAAAAG TAACTTTGGT 4 500 

GCCTGTTTTA GACTTGGGTG CTGTACCGAT TTTCTTCAAA GTCGTGACAG GTTTTCCACC 4 5 60 

ATTTTCGAAA CGTTGCTTGT AAACTGCGCC ATCACGGGTA ATTTCAACTT CTAACCAGCT 4 620 

AGAAAGGGCG TTAACAACGG AAGAACCCAC TCCGTGAAGT CCACCTGATG TCTTATAGCC 4 680 
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ACCTTGACCG AATTTCCCTC CCGCATGAAG AATCGTAAAG ATAACCTCAA CAGTTGGAAT 4 74 0 

TCCCATAGCG TGCATACcTG TCGGCATCCC ACGTCCATGG TCTTGAACCG TTAGACTACC 4300 

GTCTTTATTG AT AGTT AC AT CAATACGATC ACCAAACCCA GACAAGGCTT CATCGACTGC 4860 

ATTATCAACG ATTTCCCAAA CTAGGTGATG AAGACCAGCG CCATCGGTCG ATCCAATATA 4 920 

CATCCCTGGA CGTTTTCGGA CCGCATCCAA CCCTTCTAGC ACCTGAATAG CATCATCATT 4980 

ATAATTGTTA ATATTGATTT CCTTTTTTGA CACAAGGAAC CTCCTATTCG TTCATCTTTA 504 0 

CTATTCTACA GGTTTTCCAA GGATTTTGCA AAATTTTTCT TTCTCCGATG TGACAATTTC 5100 

AGCAGAGATT CTCTGCTTTT CTTTCCCAAT TCATGATATA ATAGGAGTAT GATTACAATA 5160 

GTTTTATTAA TCCTAGCCTA TCTGCTGGGT TCGATTCCAT CTGGTCT CTG GATTGGACAA 52 2 0 

GTATTCTTTC AAATCAATCT ACGCGAGCAT GGTTCTGGTA ACACTGGAAC GACCAACACC 5280 

TTCCGCATTT TAGGTAAGAA AGCTGGTATG GCAACCTTTG TGATTGACTT TTTCAAAGGA 5 340 

ACCCT AG C AA CCCTCCTTCC GATTATTTTT CATCTACAAG GCGTTTCTCC TCTCATCTTT 5 400 

GGACTTTTGG CTGTT AT CGG CCATACCTTC CCTATCTTTG CAGGATTTAA AGGTGGTAAG 54 60 

GCTGTCGCAA CCAGTGCTGG AGTGATTTTC GGATTTGCGC CTATCTTCTG TCTCTACCTT 552 0 

GCGATTATCT TCTTTGGAGC TCTCTATCTT GGCAGTATGA TTTCACTGTC TAGTGTCACA 5580 

GCATCGATTG CGGCTGTTAT CGGGGTTCTG CTCTTTCCAC TTTTTGGTTT TATCCTGAGT 564 0 

AACTATGACT CTCTCTTCAT CGCTATTATC TTAGCACTTG CTAGTTTGAT TATCATTCGT 5700 

CAT AAGG ACA ATATAGCTCG TATCAAAAAT AAAACTGAAA ATTTGGTCCC TTGGGGATTG 57 60 

AACCTAACCC ATCAAGATCC TAAAAAATAA AATCCCAGTT CTGTACTGCC CCCAAACAGT 58 20 

TAGACAAATA ATTTATCCAA AGGATTTAGT TCTGT ACTG C ACAGGACTAA GTCCTTTTAG 5880 

TTTTACCTTA ATTCGTTTGT TGTTGTAGTA ATCAATATAG TCTATAATGG CTTGTTCCAA 5940 

TTGATTAAGT GATTTAAATG TTTTCTCATA GCCATAAAAC ATTTCGGATT TTAAAATGCC 6000 

AAAGAAAGAT TCCATCCTAC CCTTGTCTTG GCTGTTGCCC T T ACGTG AC A TGGATGCTTG 6060 

AATTCCCTTA CTCTCTAGGA ACCGATGATA AGAATCGTCT TGGTATTGCC AGCCTTCGTC 6120 

ACTATGGAGA ATCCTATTCT CGTAGTGCTT CTCTGTGAAT GCCTGTTCCA A 6171 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18475 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(Dl TOPOLOGY: linear 
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(Xil SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
TATTACAAAT AAAAAAACGG AGGAGTGCTT TAT G AAAGCC TATACTTATG TTAAACCAGG 60 

ACTTGCTTCT TTTGTTGATG TAGACAAACC AGTTATTCGC AAGCCAACAG ACCCTATTGT 120 

GCGTATTGTA AAAACCACTA TTTGTGGAAC AGACCTCCAT ATTATCAAAG GGGATGTTCC 130 

TACTTGCCAA AGTGGTACCA TTCTTGGCCA CGAAGGGATT GGGATTGTTG AAGAAGTTGG 2 40 

GGAACGAGTT TCCAACTTCA AAAAAGGTGA CAAGGTCTTG ATTTCTTGCC TCTGTGCCTG 300 

TGGTAAATGC TACTACTGTA AAAAAGGAAT TTATGCTCAC TGTGAAGACG AAGGGGGCTG 3 60 

GAT TT TCGGT CACTTGATTG ATGGTATGCA GGCTG AAT AT CTACGTGTCC CTCATGCAGA 4 20 

TAATACTCTT TACCATACTC CAGAAGACTT CTCAGATGAA GCTTTGGTT A TGCTGTCAGA 4 80 

CATTCTGCCT ACTGGATATG AAATTGGTGT CTTAAAAGGG AAAGTAGAAC CTGGTTGCAG 54 0 

CGTAGCCATT ATTGGTTCAG GTCCAGTTGG ATTGGCTGCT CTTTTAACAG CCCAATTCTA 600 

TTCACCAGCT AAATTG ATT A TGGTAGACCT AG ACG AT AAC CGCTTGGAAA CTGCCCTATC 6 60 

ATTCGGTGCG ACTCATAAGG TTAATTCTTC AGACCCTGAA AAAGCCATTA AAG AAATTT A 7 20 

TGATTTGACA GATGGTCGTG GTGTGGATGT CGCTATCGAA GCTGTTGGTA TTCCTGCAAC 7 80 

ATTTGATTTC TGTCAAAAGA TTATCGGTGT AGACGGAACG CTTGCCAACT GTGGTGTGCA 840 

TGGT AAACC A GTTGAATTCG ATTTAGATAA ACTTTGGATT CGCAACATCA ATGTAACAAC 900 

TCGTTTCCTA TCTACAAATA CGACTCCACA ATTGTTGAAA GCACTTGAAA GTCATAAGAT 9 60 

TGAACCGGAA AAATTGGTAA CTCACTATTT CAAACTCAGT G AAATTG AAA AAGCCTACGA 1020 

AGTCTTCAGT AAGGCAGCAG ACCACCATGC CATTAAGGTC ATTATCGAAA ACGATATCTC 108 0 

AGAAGCCTAA GTAGTAAAAA TATTTTTGTA CATAAGTAAA TAGAAATTCA GTCATCCATC 1140 

AGATGGCTGG ATTTTTTATC AAAAAATTAA GAAATGAGCA TATTTCTTTC CTTGTCTGGC 1200 

GGAATTGGTT AT AAT AT ACG GTACAAAGGA ATGAATGAAT ATGTATCGTG TTATAGAAAT 12 60 

GTACGGAGAT TTTGAACCGT GGTGGTTCTT AGAAGGTTGG G AAG AAG ATA TTGTAGCAAG 13 2 0 

TAGAAAATTT GACCAGTATT ATGATGCTCT CAAATACTAC AAAACTTGCT GGTTTAGATT 13 80 

GGAACAAGAA TCGCCTCTTT ATAAAAGTAG AAGCGACTTG ATGACCATTT TTTGGGACCC 1440 

GGAAGACCAA CGC TGGTGTG ATGAATGTGA TGAGTATTTA CAACAATACC ATTCTTTGGC 1500 

TCTTTTGCAG GATGAGCAGG TTATCCCACA CGAAAAACTA CGCTCAGGCT ATGAAAAACA 1560 

AACCAGTCAG GAAAGGAATC GTTCTTGCCG TATGAAATTA AAATAGAGAA AAGTAACTTT 1620 

TTTGGAGTTG CTTTTTTTAT TTTTCTAACT CTTTGCGAAT AGT AT AGGTG AGGAGGTAAG 1680 
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TATGGTTCAA GAAATTGCAC AAGAAATCAT TCGTTCAGCT CGGAAAAAAG GGACGCAGGA 1740 

TATCTATTTT GTCCCTAAGT TAGACGCCTA TGAGCTTCAT ATGAGGGTAG GAGACGAGCG 1800 

CTGTAAAATT GGTAGCTATG ATTTTGAAAA GTTTGCAGCC GTTATCAGTC ACTTTAAGTT 18 60 

TGTGGCGGGT ATGAATGTGG GAGAAAAAAG ACGTAGTCAA CTGGGTTCCT GTGATTATGC 192 0 

CTATGACCAT AAGATAGCGT CTCTACGTTT ATCTACTCTA GGCGATTATC GGGGGCATGA 19 80 

G AGTTT GGTT ATCCGTTTGT TGCACGATGA GGAGCAGGAC CTGCATTTTT GGTTTCAGGA 204 0 

TATTGAAGAA TTACCCAAGC AGTACAGGCA ACGGGCACTC TATCTTTTTG CTG GTCCGGT 2 100 

TGGGAGTGGT AAGACGACCT TGATGCATGA ATTGTCCAAG TCACTCTTTA AAGGACAGCA 2160 

AGTTATGTCC ATCGAAGATC CTGTCGAAAT CAAGCAGGAC GACATGCTTC AGTTGCAGTT 22 2 0 

GAACGAAGCA ATCGGCCTAA CCTATGAAAA TCTAATCAAA CTTTCCTTGC GTCATCGACC 22 80 

AGATCTCTTG ATTATCGGAG AAATTCGTGA CAGCGAGACG GCGCGTGCAG TGGTCAGAGC 2 340 

TAGTTTGACA GGTGCGACAG TCTTTTCAAC CATTCACGCC AAGAGTATCC G AGGTGTTT A 24 00 

TGAGCGTCTG CTGGAGTTGG GTGTGAGTGA AG AAG AATTG GCAGTTCTTC TGCAAGGAGT 24 60 

CTGCTACCAG AGATTAATCG GGGGAGGAGG AATCGTTGAC TTTGCAACCA GACATTATCA 25 20 

AGAACACCAA CCAGCCAAGT GGAATGAGCA AATTGACCAG CTTCTTAAAG ATGGACATAT 2 580 

C AC AAGTCTT C AGGCTG AG A CGGAAAAAAT TAGCTACAGC TAAGCAAAAA AATATCATCA 2 64 0 

CCCTATTTAA CAATCTCTTT TCTAGCGGTT TTCATCTGGT GGAGACTATC TCCTTTTTAG 2 7 00 

ATAGGAGTGC TTTGTTGGAC AAG C AGTGTG TGACCCAGAT GCGTGTGGGC TTGTCTCAGG 27 60 

GGAAATCATT CTCAGAAATG ATGGAAAGTT TGGGATGTTC AAGTCCTATT GTCACTCAGT 2 82 0 

TATCCCTAGC TGAAGTTCAT GGCAATCTCC ACCTG AGTTT GGGAAAGATA GAAGAATATC 2 8 80 

TGGACAATCT GGCTAAGGTC AAGAAAAAAT TGATTGAAGT AGCGACCTAT CCCTTGATTT 294 0 

TGCTGGGTTT TCTTCTCTTA ATTATGCTGG GGCTACGGAA TTACCTGCTC CCACAACTGG 3 000 

ATAGTAGCAA TATTGCCACC CAAATTATCG GTAATCTGCC CCAAATTTTT CTAGGCATGG 30 60 

TAGGGCTTGT TTCCGTGCTT GCCCTTTTAG CACTCACTTT TTATAAAAGA AGTTCTAAGA 3120 

TGAGTGTCTT TTCTATCTTA GCACGCCTTC CCTTTATTGG AATCTTTGTG CAGACCTACT 3 180 

TGACAGCCTA TTATGCACGT GAATGGGGGA ATATGATTTC ACAGGGAATG GAGTTGACGC 3 2 40 

AGATTTTTCA AATGATGCAG GAACAAGGTT CCCAGCTCTT TAAAGAAGTC GGTCAAGATC 3 3 00 

TGGCTCAAAC CCTGAAAAAT GGCCGTGAAT TTTCTCAGAC GATAGGAACC TATCCTTTCT 3 3 60 

TTAGGAAGGA ATTGAGTCTC AT C AT AG AGT ATGGGGAAGT TAAGTCCAAG CTGGGTAGTG 3 4 20 

AGTTGGAAAT CTATGCTGAA AAAACTTGGG AAGCCTTTTT TACCCGAGTC AACCGCACCA 3 4 80 
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TGAATTTGGT GCAGCCACTG CTTTTTATCT TTGTG GC ACT GATTATCCTT TTACTTTATG 35 4 0 

CGGCAATGCT CATGCCCATG TATCAAAATA TGGAGGTAAA TTTTTAAAAT GAAAAAAATG 3 600 

ATGACATTCT TGAAAAAAGC TAAGGTTAAA GCTTTTACAT TGGTGGAGAT GTTGGTGGTC 36 60 

TTGCTGATTA TCAGCGTGCT TTTCTTGCTC TTTGTACCTA ATCTGACCAA GCAAAAAGAA 372 0 

GCAGTCAATG AC AAAG G AAA AGCACCTGTT GTTAAGGTGG TGGAAAGCCA GGCAGAACTT 3780 

TATAGCTTAG AAAAGAATGA AGATGCTAGC CTAAGAAAGT TACAAGCAGA TGGACGCATC 334 0 

ACGGAAGAAC AGGCTAAAGC TTATAAAGAA TACAATGATA AAAATGGAGG AGCAAATCGT 3 9 CO 

AAAGTCAATG ATTAAGGCCT TTACCATGCT GGAAAGTCTC TTGGTTTTGG CACTTCTCAC 39 60 

TATCCTTGCC TTGCGCTTGT CCGGCTCTGT CCAGTCCACT TTTTCAGCGG TAGAGGAACA 4020 

GATTTTCTTT ATGGAGTTTG AAGAACTCTA TCGGGAAACC CAAAAACGCA GTCTAGCCAG 4 080 

TCAGCAAAAG ACTAGTCTGA ACTTAGATGG GCAGACGCTT AGCAATGGCA CTCAAAACTT 4X40 

GCCAGTCCCT AAAGGAATTC AGGCCCCATC AGGCCAAAGT AT T AC ATTTG ACCGAGCTGG 4 200 

GGGCAATTCG TCCCTGGCTA AGGTTGAATT TCAGACCAGT AAAGGACCGA TTCCCTATCA 4 2 60 

ATTATATCTA GGAAATGGAA AAATTAAACG CATTAAGGAA ACAAAAAATT AGGGCAGTGA 432 0 

TTTTACTGGA AGCAGTAGTC GCTCTAGCTA TCTTTGCCAG CATTGCCACC CTCCTTTTGG 43 80 

G AC AAATTC A AAAAAATAGG CAAGAGGAAG CAAAAATCTT GCAAAAGGAA GAAGTCTTGA 44 4 0 

GGGTAGCTAA GATGGCCCTG CAGACGGGGC AAAATCAGGT AAGCATCAAC GCAGTTGAGA 4 500 

TTCAGGTATT TTCTAGTGAA AAAGGATTGG AGGTCTACCA TCCTTCAGAA CAGTTGTTGG 4 560 

CAATCAAAGA GCCATAAGGT CAAGGCTTTT AC CTTGTT AG AATCCCTGCT TCCCCTCATT 4 620 

GTCATCAGTG GGGGATTACT CCTTTTTCAA GCTATGAGTC ACCTCCTCAT TTCAGAAGTT 4 680 

CGCTACCAGC AACAAAGCGA GCAAAAGGAG TGGCTCTTGT TTGTGGACCA ACTTGAGGTA 4 740 

GAATTAGACC GTTCGCAGTT CGAAAAAGTA GAAGGCAATC GCCTATACAT GAAGCAAGAT 4 800 

GGCAAGGACA TCGCCATCGG TAAGTCAAAG TCAGATGATT TCCGTAAAAC GAATGCTCGT 4 860 

GGTCGAGGTT ATCAGCCTAT GGTTTATGGA CTCAAATCTG TACGGATTAC AGAGGACAAT 4 920 

CAACTGGTTC GCTTTCATTT CCAGTTCCAA AAAGGCTTAG AAAGGGAGTT CATCTATCGT 4 980 

GTGGAAAAAG AAAAAAGTTA AGGCAGGTGT TCTCCTCTAC GCAGTCACCA TAGCAGCCAT 504 0 

CTTTAGTCTT TTGTTGCAAT TTTATTTGAA CCGACAAGTC GCCCACTATC AAG ACT ATGC 5100 

TTTGAATAAA GAAAAATTGG TTCCTTTTCC T ATGC CT AAA CGAACCAAAG ATAAGGTTGA 5160 

GCAAGAAAGT GGGGAACAGT TTTTTAATCT AGGTCAGGTA AGCTATCAAA AC AAG AAAAC 5 220 
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TGGCTTAGTG ACGAGGGTTC GTACGGATAA GAGCCAATAT GAGTTTCTGT TTCCTTCAGT 5280 

CAAAATCAAA GAAGAGAAAA GAGATAAAAA GGAAGAGGTA GCGACCGATT CAAGCGAAAA 5 34 0 

AGTGGAGAAG AAAAAATCAG AAGAGAAGCC TGAAAAGAAA GAGAATTCAT AGTCAATTCA 5400 

ACT AT AATG C GTTGAATCCA GAATAGTCCA CTGTAGTTTC TAGAAAATTG CTGGAAATGG 54 60 

ATGTTAAGCT CCAATTCATT TGTTTATATC TTATTTCAGT TT ACT AT ACT TTGTGCTAAA 5520 

TTAAAGATAT GAAACATGAT TTTAACCACA AAGCAGAAAC TTTCGATTCC CCTAAAAATA 5 580 

TCTTCCTCGC AAACTTGGTA TGTCAAGCAG CCGAGAAACA GATTGATCTT CTATCAGACA 5 64 0 

AAGAAATTTT AGATTTCGGT GGTGGCACGG GTCTATTAGC CTTGCCCCTA ACCCCTAGCC 5700 

AAGCAGGCTA AGTCAGTCAC TCTTGTAGAC ATTTCTGAGA AAATGTTGGA GCAACCTCCT 57 60 

TTGAAAGTGG AGCAGCAACC AATCAAGAAT ATCCAGTTTT TGCACCAACA TTTACCGAAA 58 20 

AATCCCTTGG AGAAAGAGTT TGATTCCCTT GCTGTTAGTC GGGTTCTT C A TCATATGCCT 53 90 

GATTTGGATG CGGCTCTCTC ACTGTTTCAT CAACATTTGA AGGAAGATGG GAAACTCATC 5 940 

ATTGCTGATT TTACCAAGAC AGAAGCTAAT CATCATGGAT TTGATTTAGC TGAACTGGAA 6000 

AACAAGCTAA TTGAGCATGG TTTTTCATCT GTGCATAGTC AGATTCTCTA TAGTGCTGAA 6 060 

GACCTGTTTC AAGGAAATCA CTCAGAATTC TTTTT AATAG TAGCCCAAAA ATCACTCGCC 6120 

TAGTCAGGGA GTGATTTTTC TATAAGGATG GAAAAAAGAA GGGAAATTTG GTAAGATAGG 6L80 

AATATGGATT TTGAAAAAAT TGAACAAGCT TATACCTATT TACT AG AG AA TGTCCAAGTC 62 40 

ATCCAAAGTG ATTTGCCCAC CAACTTTTAT GACGCCTTGG TGGAGCAAAA TACCATCTAT 6300 

CTGGATGGTG AAACTGAGCT AAACCAGGTC AAGGAGAACA ATCAAACCCT TAAGCGTTTA 6360 

GCACTACGCA AAGAAGAATG GCTCAAGACC TACCAGTTTC TCTTGATGAA GGCTGGGCAA 642 0 

ACAGAACCCT TGCAGGCCAA TC AC CAGTTT ACACCGGATG CTATTCCTTT GCTTTTGGTG 64 80 

TTTATTGTGG AAGAGTTGTT TAAAGAGGAG GAAATTACTA TCCTCGAAAT GGGTTCTGGG 654 0 

ATGGGAATTC TAGGCGCTAT TTTCTTGACC TCGCTTACTA AAAACGTCGA TTACTTGGGA 6 600 

ATGGAAGTGG ATGATTTGCT GATTGATCTG CCACCTAGCA TGGCAGATGT AATT GGTTT G 6660 

CAGGCTGGCT TTGTCCAAGG AGATGCCGTT CGCCCACAAA TGCTCAAAGA AAGCGATGTG 6720 

GTCATCAGTG ACTTGCCTGT CGGCTATTAT CCTGATGATG CCGTTGCGTC GCGCCATCAA 67 80 

GTTGCTTCTA GCCAAGAACA TACTTACGCC CATCACTTGC TCATGGAACA AGGGCTTAAG 6840 
TACCTCAAGT C AG ACGG AT A CGCTATTTTT CTAGCTCCGA GTGATTTGTT GACCAGTCCT 6 900 

CAAAGTGATT TGTTAAAAGA ATGGCTGAAA GAAGAGGCGA GTCTGGTTGC TATGATTAGT 69 60 

CTGCCTGAAA ATCTCTTTGC TAATGCCAAA CAATCTAAGA CTATTTTTAT CTT AC AG AAG 7G2 0 
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AAAAATGAAA TAGCAGTAGA GCCTTTTGTT TATCCACTTG CTAGCTTGCA AGATGCAAGT 7 080 

GTTTTAATGA AATT7AAACA AAATTTTCAA AAATGGACTC AAGGTACTGA AATATAAAAT 714 0 

AG ATTTTG TT ATAATAGTTG AAAACGCTTA AAAAGGGGTA TCATGTTATG ACAAAAACAA 7 200 

TTGCAATCAA TGCAGGAAGT TCAAGTTTGA AATGGCAATT ATACTTAATG CCAGAAGAAA 7 2 60 

AAGTATTCGC GAAAGGTTTG ATTGAACGTA TCGGTTTGAA AGATTCAATT TCAACTGTAA 7 3 20 

AATTTGACGG CCGTTCTGAA CAACAAATTT TGGATATTGA AAATCATATA CAAGCCGTTA 73 8 0 

AAATTTTATT GG ATGACTTG ATTCGTTTCG ATATTATCAA GGCTTATGAC GAGATTACAG 7 4 40 

GTGTTGGACA TCGTGTTGTT GCTGGTGGAG AATATTTCAA AGAATCAACA CTTGTTGAGC 7 500 

CAGATGTTTT AGAAAAAGTT GAAGAGTTGA GTTTGTTGGC TCCTCTACAC AACCCGGCCA 75 60 

ATGCAGCAGG TGTTCGTGCC TTCAACGAAT TGTTGCCAGA CATTACCAGT CTAGTTGTTT 7 62 0 

TT G AT ACT T C CTTCCACACA ACTATGCCAG AG AAAGCTT A TCGCTACCCT CTACCAACAA 7 680 

AAT ATT AC AC AG AAAAC AAG GTTCGTAAAT ACGGTGCTCA TGGTACAAGT CACCAGTTTG 7 7 10 

TAGCAGGAGA AGCTG C AAAA CTCTTGCGAC GTCCATTAGA AGACTTGAAG TTAATTACCT 7 800 

GTCATATTGG TAACGGAGGC TCAATTACAG CTGTGAAAGC CGGCAAATCT GT AG AC ACTT 7 8 60 

CTATGGGGTT CACTCCTCTT CGTGCTATTA TGATGGGAAC GCGTACAGGG GATATTGATC 7920 

CAGCTATCAT TCCTTATTTA ATGCAATATA CAGAGGATTT TAACACACCA GAAGATATCA 7980 

GTCCTGTTCT TAACCGTGAA TCAGGTCTTT TGGGAGTTTC TGCTAATTCT AGCGATATGC 304 0 

GCGATATAGA AGCAGCTGTA GCAGAAGGGA ATCACGAGGC TAGCTTGGCT TATGAAATGT 8100 

ATGTTGACCG TATCCAAAAA CATATCGGTC AGTACCTTCC AGTGCTAAAT GGAGCAGATG 8160 

CCATTGTTTT CACAGCAGGT GTCGGTG AAA ATGCAGAGAG TTTCCGTCCT GATGTAATCT 32 20 

CAGCGATTTC GTGGTTTGGT TGTGATGTTG ATGATGAAAA CAATGTCTTT GGCGTTACAG 32 80 

GAGACATCTC AACAGAGGCA GCTAAAATCC GTGTCTTGGT TATTCCAACA G AT G AAG AAT 8 3 40 

TAGTCATTGC CCGTGACGTT GAACGCTTGA AAAAATAAGT GAAACTAAAA AAATATTCAA 8 400 

TACAAGGAGT TGGGAAAGTT ATTTTTCCAG CTTCTTTTTC TGATGAAATT GTCCAAAACC 84 60 

TTGCTATGAT TGGCTTTTTT GAAAAATATG GTATAATAGT AGTAATTTAA TAG ATGG AG T 8S20 

TGAGTTTTGA AGAAAAACTT TCGTGTAAAA AGAGAGAAAG ATTTTAAGGC GATTTTCAAG 3580 

GAGCGGACAA GTTTTGCTAA TCGCAAATTT GTGGTCTACC AATTAGAAAA CCAGAAAAAC 8 64 0 

CGTTTTCGAG TAGGTCTATC AGTTAGCAAA AAACTGGGGA ATGCCGTCAC TAGAAATCAA 8700 

ATTAACCGAC GGATTCGGCA TATTATCCAG AATGCAAAAG GGAGTCTGGT AGAAGATGTC 87 6 0 



WO 98/18931 



PCT/US97/19588 



380 

GACTTTGTTG TCATTGCTCG AAAAGCAGTC GAAACCTTGG GAWt^ TAGA GATGGAGAAA 8 92 0 

AATCTACTCC ATGTATTAAA ATTATCAAAG ATTTACCGGG AAGGAAATGG GAGTGAAAAA 8880 

GAAACTAAAG TTG ACT AGTT TGCTAGGACT GTCTCTGTTA ATCATGACAG CCTGTGCGAC 8940 

TAATGGCGTA ACTAGCGATA TTACAGCCGA ATCGG CTG AT TTTTGGAGTA AATTGGTTTA 9000 

CTTCTTTCCG GAAATCATTC CCTTTTTATC GTTTG AT ATT AGT ATCGG AG TGGGGATTAT 9060 

TCTCTTTACG GTCTTGATTC GTACAGTCCT CTTGCCAGTC TTTCAGGTGC AAATGGTGGC 9120 

TTCTAGGAAA ATGCAGGAAG CTCAGCCACG CATTAAGCCG CTTCGAGAAC AATATCCAGG 9180 

TCGAGATATG GAAAGCAGAA CCAAACTAGA GCAGGAAATG CGTAAAGTAT TTAAAGAAAT 92 4 0 

GGGTGTCAGA CAGTCAGACT CTCTTTGGCC GATTTTGATT CAGATGCCGG TTATTTTGGC 9 300 

CCTGTTCCAA GCCCTATCAA GAGTTGACTT TTTAAACACA GGTCATTTCT TATGGATTAA 9 3 60 

CCTTGGTAGT GTGGATACAA CCCTTCTTCT TCCGATTTTA GCAGCAGTAT TCACCTTTTT 9 4 20 

AAGTACTTGG TTGTCCAACA AAGCTTTGTC TGAGCGAAAT GGCGCTACGA CTCCGATGAT 94 80 

GTATGGGATT CCAGTCTTGA TTTTTATCTT TGCAGTTTAT GCGCCAGGTG GAGTCGCCCT 9 54 0 

ATACTGGACA GTGTCTAATG CTTATCAAGT CTTGCAAACC TATTTCTTGA ATAATCCATT 9 600 

CAAGATTATC GCAGAGCGCG AGGCCGTAGT ACAGGCACAA AAACATTTCG AAAATAGAAA 9 660 

AAGAAAAGCC AAGAAAAAGG CTCAGAAAAC GAAATAAATA AGGAGGAATC TGGTAGTGGT 97 20 

AGTATTTACA GGTTCAACTG TTGAAGAAGC AATCCAGAAA GGATTGAAAG AATT AG AT AT 97 80 

TCCAAGAATG AAGGCTCATA TCAAAGTCAT TTCTAGGGAG AAAAAAGGCT TT CTTGGT CT 9 840 

ATTTGGTAAA AAACCAGCCC AAGTGGATAT TGAAGCGATT AGTGAAACGA CTGTTGTCAA 9900 

AGCAAATCAA CAGGTAGTAA AAGGCGTTCC GAAAAAAATC AATGATTTGA ACGAGCCTGT 9 960 

GAAGACGGTT AGTGAAGAAA CCGTTGACCT TGGTCATGTG GTTGATGCTA TTAAAAAAAT 10020 

AGAGGAAGAA GGTCAAGGTA TTTCTGATGA AGTCAAGGCT GAAATCTTAA AACATGAAAG 10080 

ACATGCCAGC ACTATCTTAG AAGAAACTGG TCACATTGAG ATTTTAAATG AACTTCAAAT 10 140 

CGAGGAAGCG ATCAGGGAAG AAGCAGGCGC TGATGACCTT GAAACTGAGC AAGACCAAGC 10200 

TGAAAGTCAA GAACTAGAAG ACTTGGGCTT GAAAGTTGAA ACGAACTTTG ATATTGAACA 102 60 

AGTAGCTACG GAAGTAATGG CTT ATGTTC A AACGATTATT GATGACATGG ATGTTG AGG C 103 20 

TACACTTTCA AATGATTATA ACCGTCGTAG CATCAATCTA CAAATTGACA CCAACGAACC 10 3 80 

AGGTCGTATT ATCGGCTACC ATGGTAAAGT CTTGAAGGCC TTGCAACTGT TGGCTCAAAA 10440 

TTATCTTTAC AACCGCTATT CC AG AACCTT CTACGTTACA ATCAATGTCA ATGATTATGT 10 500 
CGAACACCGT GCAGAACTCT TGCAGACCTA TGCGCAAAAA TTGGCGACTC CTCTTTTCGA " 1C560 
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AG AAGGGC G C AGTCATAAAA CAGATCCAAT GTCAAATAGC GAACGCAAGA TTATCCATCG 10 6 20 

TATTATTTCA CGTATGGATG GCGTGACTAG TTACTCTGAA CGTGATGAGC CAAATCGCTA 1068 0 

TGTTGTTGTA GATACAGAAT AAGTAAAATC AGGTTTATCC TGATTTTTTG CTAGTTAGAG 10740 

GAGGTTAAAC TGATGTTGAA TAAGATAAGA GACTATTTAG ACTTTGCTGG TTTGCAGTAC 10800 

CGTAATCCTG ATAAAGCGGG AGCAGAGCGA GAGAAGATGC TGGCATTCCG CCACAAAGGA 10 8 60 

CAAGAGGCCC GAAAGGTTTT TACAGAACTG GCCAAAGCCT TTCAAGCAAG CCATCCAGAA 109 20 

TGGCAACTCC AACAGACTAG CCAGTGGATG AATCAGGCCC AGCGTTTGAG ACCACATTTT 109 80 

TGGGTTTATC TACAGAGAGA CGGACAAGTG ACAGAACCTA TGATGGCCTT ACCTTTGTAT 11040 

GGGACATCTA CTGACTTTGG AATTTCTTTG GAAGTCAGTT TCATCGAACG TAAGAAGGAT 11100 

GAGCAAACAC TGGGCAAGCA GGCCAAACTT TTAGACATTC CAACCGTTAA AGGGATTTAT 11160 

TATCTAACCT ACTCTAATGG TCAAAGTCAA CGGTGGGAGG CGAATGAAGA AAAGCGTCGT 11220 

ACTTTACGCG AGAAGGTGAG AAGTCAAGAA GTTCGAAAAG TTTTAGTGAA GGTAGATGTT 112 80 

CCTATGACAG AAAATTCGTC TGAAGAAGAA ATCGTAGAAG GCTTATTGAA GTCTTATTCT 11340 

AAAATTCTTC CCTATTATCT AGCTACGAGA AAATAAGATA ATTTGTAAAA CATC AT AAAT 11400 

CATACAGTCC AAGAGTGAAC AGTCCGCTGT GTAATTCTTG GTCTTTTTGT TTGCGCTTTC 11460 

GC ATT AT AT A ATAAACTTAC AAAAACAATT CAAAAGGAGA ACAATTATGG AACTCGTTTC 11520 

AAGTCTTCTA AAT TGGTTTT CTAGCAATAT TTTGCAGAAT CCCCCATTTT TCGTAGGTTT 11580 

ATTG GTGTTG ATAGGATATG CACTTTTGAA AAAACCTGCC CATGACGTTT TTTCAGGGTT 11640 

TGTTAAAGCA ACAGTAGGGT ATATGTTGCT TAACGTGGGT GCTGGTCGTT TGGTTACAAC 117 00 

CTTTCGTCCA ATCTTAGCAG CTCTTAACTA CAAATTCCAA ATTCGTCCAG CGGTTATCGA 11760 

CCCTTACTTT GGACTTGCTG CAGCAAACAA CAAAATTGTA GCAGAGTTTC CAGATTTTGT 11820 

TGGAACTGCA ACTACAGCTC TATTGATTGG TTTTGGAATA AATATCTTGC TCGTAGCTCT 11880 

TCGAAAGATT ACGAAGGTAA GAACCCTCTT TATTACTGGT CACATCATGG TACAACAAGC 11940 

TGCAACAGTA TCTCTTATGG TTCTATTCTT AGTACCACAA TTGCGCAATG CTTACGGTAC 12 000 

AGCAGCGATT GGTATCATCT GTGGACTTTA CTGGGCAGTT AGTTCAAATA TGACTGTTGA 12 060 

GGCAACTCAA CGCTTGACTG GTGGTGGCGG ATTTGCGATT GGTCACCAAC AGCAATTTGC 12120 

AATCTGGTTT GTAGATAAAG TAGCAGGACG CTTTGGTAAG AAAGAAGAAA GTTTAGACAA 12180 

TCTTAAATTA CCTAAGTTCC TCTCAATCTT CCACGATACA GTTGTTGCAT CTGCTACCTT 12 240 

GATGCTCGTA TTCTTCGGAG CCATTCTTTT AATCTTGGGT CCAGACATT.A TGTCTAATAA 12 300 
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AGAAGTCATC ACTTCAGGAA CTCTATTCAA TCCTGCTAAA CAAGATTTCT TTATGTACAT 12 3 60 

TATCCAAACA GCCTTTACCT TCTCAGTTTA CTTGTTCGTT TTGATCCAAG GTGTCCGAAT 124 20 

GTTCGTATCT GAGTTGACAA ACGCCTTCCA AGGTATTTCA AACAAATTGT TGCCAGGTTC 12 4 80 

ATTCCCAGCG GTTGACGTTG CAGCTTCTTA TGGATTTGGT TCTCCAAATG CTGTCTTGTC 12 540 

AGGATTTACC TTTGGTTTGA TTGGTCAATT GATTACAATT GTTTTGCTCA TCGTCTTTAA 12 600 

AAATCCGATT CTTATTATTA CAGGATTTGT ACC AGTGTTC TTTGACAATG CAGCCATTGC 12 660 

GGTCTACGCT GATAAACGCG GCGGATGGAA AGCGGCTGTT ATCCTTTCCT TTATATCAGG 12720 

TGTCCTTCAA GTTCCTCTAG GAGCTCTTTG TGTGGCCCTT CTCGATTTGG CATCTTATGG 12780 

TGGCTACCAT GGAAATATCG ACTTTGAATT CCCATGGCTT GGATTTGGAT ATATCTTCAA 12 94 0 

ATACCTTGGT ATTGTTGCTT ATGTACTTGT GTGTCTCTTC TTGCTTGTTA TTCCTCAACT 12900 
TCAATTTGCC AAAGCAAAAG ATAAAGAGAA ATATTACAAC GGTGAAGTTC AAGAAGAAGC 
TTAGTATCTA GAAAAGGAGA AATAAAATGG TT AAAGT ATT AGCAGCGTGC GGAAATGGAA 

TGGGTTCATC AATGGTTATC AAGATGAAGG TTGAAAATGC TCTCCGTAAG CTTAATCAAA 13080 
CAGATTTTAC AGTCAATTCA TGCAGTGTCG GTGAAGCTAA AGGTTTAGCA GTAGGATATG 
ACATCGTAAT CGCTTCTCTT CATTTGATTC AAGAATTGGA AGGGCGAACT AATGGGAACT 

TAATTGGGCT T G AT AACTTG ATGGATGATA AAGAAATCAC CGAAAAACTC AGTCAAGCAC 132 60 

TACAGTAAAA GCTTGGAGGG GGCTGGACAG AAACTG AG AG TTATCGTTTC TGTCCTTCTC 13320 
CCTCTTTAAA TAAAGGAGGC AGATATGAAT TTAAAACAAC CTTTAATTGA CAATGACTCG 
ATCCGACTAG GTTTAGAGCC TAACAATTGG AAAGAAGCAG TCAACGTACC AGTAGATCCC 
T T AATTG AAA GTGGGGCAAT TTTGCCAGAG TATTACGATG CTATCATTGA ATCGACTGAA 
GAGTATGGGC CTTACTATAT CTTGATGCCA GGTATGGCTA TGCCCCACGC TAGACCTGAA 

GCAGGTGTGC AAAGTGATGC CTTTTCATTG ATTACCTTAC AAAATCCTGT TGTATTTTCA 13 620 
GATGGGAAAG AGGTATCTGT TTTGTTGGCA CTAGCAGCAA CAAGTTCAAA AATTCACACA 
AGTGTAGCCA TTCCACAAAT TATTGCCCTA TTTGAATTAG AAGATTCTAT TGCACGTTTA 
CAGGCTTGCC AGACTAAAGA AG ATGTCT TG GCTATGATTG AAGAATCTAA GGATAGCCCT 

TATCTCGAAG GATTGGATTT GG AAAGTT AG AAAGAGGAAT AAAGAAATGA CAAAAAGAAT 13860 

ACCTAATTTA CAAGTTGCAT TAG ACC AT TC AGACTTGCAA GGAGCGATTA AAGCAGCTGT 13 920 

TTCTGTTGGT CAGGAAGTAG ATATTATCGA AGCTGGAACT GTTTGCTTGC TTCAAGTTGG 13 980 

• AAGTGAACTG GCTGAAGTCT TGCGT AGCCT TTTCCCAGAT AAGATTATTG TGGCAGACAC 14040 

AAAATGTGCT GATGCTGGTG GAACAGTTGC TAAAAATAAT GCGCTTCGTG GAGCAGACTG 14100 



12960 
13020 



13140 
13200 



13380 
13440 
13500 
13560 



13680 
13740 
13300 
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GATGACTTCT 


ATCTGTTGTG 


CAACCATCCC 


TACTATGGAA 


GCAGCTCTAA 


AGGCTATCAA 


14160 


GACTGAACGA 


GG AG AACG AG 


GCGAAATCCA 


GATCGAGCTT 


TATGGCGATT 


GGACTTTTGA 


14220 


ACAAGCTCAG 


CTTTGGCTAG 


ATGCAGGTAT 


CTCACAAGCT 


ATTTATCACC 


AATCTCGTGA 


14280 


TGCTCTTCTT 


GCTGGTGAAA 


CTTGGGGTGA 


AAAAGACCTT 


AATAAGGTTA 


AAAAACTCAT 


14340 


TGACATGGGC 


TTCCGTGTAT 


CTGTAACAGG 


TGGTCTAGAT 


GTAGATACTC 


TCAAACTCTT 


14400 


TGAAGGTATT 


GATGTCTTTA 


CCTTTATCGC 


AGGTCGTGGA 


AT T AC AG AGG 


CTGTGGATCC 


14460 


AGCAGGAGCA 


GCGCGTGCCT 


TCAAGGATGA 


AATCAAACGA 


ATTTGGGGGT 


AAATCATGGT 


14520 


ACGTCCAATT 


GGAATTTATG 


AAAAGGCAAC 


CCCAACACAC 


TGTACTTGGC 


TAGAACGTTT 


14580 


AAATTTTGCC 


AAGGAGTTAG 


GCTTTGATTT 


TGTCGAGATG 


TCTATTGACG 


AACGTGACGA 


14640 


GCGTTTAGCA 


AGACTTGACT 


GGAGTAAGGA 


AGAACGCTTG 


GAAGTTGTCA 


AAGCAATCTA 


14700 


TGAAACTGGT 


GTTCGTATTC 


CTTCTATCTG 


TTTTTCACGC 


CATCGTCGCT 


ACCCATTGGG 


14760 


TTCAAAAGAT 


CCAGTTCTAG 


AGGAAAAATC 


TCTAGAACTC 


ATGAAAAAAT 


GTATCGAATT 


14820 


AGCTCAAGAC 


TTGGGAGTTC 


GTACGATTCA 


ATTAGCTGGT 


TACGATGTTT 


ACT ATG AGG A 


14880 


AAAGTCACCC 


CAGACACGCC 


AACGTTTTAT 


CAAAAATTTG 


AGAAAAGCCT 


CTCACTGCGC 


14940 


TGAAGAAGCT 


CAGGTGGTAC 


TTGCTATTGA 


AATTATGGAT 


GATCCTTTCA 


TCAGTAGCAT 


15000 


CGAAAAATAT 


TTGCCTATAG 


AAAAAGAGAT 


TGACTCTCCC 


TTCCTCTTTG 


TATATCCAGA 


15060 


TATTGGTAAT 


GTGTCTCCAT 


GGCATAATGA 


TATCTATAGT 


GAGTTTTATC 


TTGGTCATCA 


15120 


TGCCATCGCA 


GCTCTCCATC 


TCAAGGATAC 


TTATGCAGTG 


ACAGAAAGTT 


CAAAGGGCCA 


15180 


GTTCCGAGAT 


GTACCTTTCG 


GGCAAGGTTG 


TGTCAAATGG 


CAAGAAGCTT 


TCGATATTTT 


15240 


AAAGGAAACC 


AATTATAATG 


GACCTTTCCT 


AATCGAAATG 


TGGTCTGAAA 


ATTGTGAAAC 


15300 


AGTAGAAGAA 


ACACGCGCAG 


CCATTCAAGA 


GGCGCAAGCT 


TTTCTCTATC 


CACTCATTAA 


15360 




T TC A TGT AAG 


ATGAATCAAG 


TAATCAATGC 


TATGCCTAAA 


CGAGTCTGTG 


15420 


ATGCCAATCA 


ATCATTGCCA 


AAACATGGAC 


TTGTCAAATT 


TACCTGGGGG 


AATGTATCTC 


15480 


AAGTTAATCG 


CGAACTCGGT 


GTCATTGTTA 


TCAAACCATC 


AGGCGTGGAT 


TATGACGAAT 


15540 


TGACACCTGA 


AAACATGCTA 


GTGACTGATC 


TAG ATGGT AA 


GATCCTAGAA 


GGGGATTTAA 


15600 


GACCATCTTC 


CGACCTCCCA 


ACTCATGTGC 


AATT AT AT AA 


GACTTGGTCA 


GAAATTGGTA 


15660 


GTGTGGTTCA 


CACCCATTCG 


ACAGAAGCTG 


TTGGTTGGGC 


TCAGGCAGGT 


CGTGATATTC 


15720 


CTTTCTACGG 


AACAACCCAT 


GCAGATTATT 


TCTACGGTTC 


AATCCCTTGC 


GCCCGTAGTT 


15790 


TGACCAAGGA 


CGAAGTAGAA 


GTGGCCTATG 


AAAAAGATAC 


TGGCCTGGTT 


ATCGTAGAAG 


15840 
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AGTTTGAACA 


TCGCGGACTT 


AACCCGGTTG 


AAGTACCAGG 


AATTGTTGTA 


CGCAATCACG 


15900 


GTCCATTCAC 


CTGGGGCAAA 


AATCCAGAGA 


ATGCTGTTTA 


TCACTCTGTC 


G T ACT AGAGG 


15960 


AAGTATCAAA 


GATGAATCGC 


TTTACAGAAC 


AAATCAATCC 


AAGAGTTGGA 


CCTGCTCCCC 


16020 


AGTACATACT 


AGAAAAACAC 


TACCAACGTA 


AACATGGACC 


AAATGCTTAT 


T ATGG TC AAA 


16080 


AGTAAGAACG 


ATGAAGGAGG 


AGAAAAAGAT 


AAATTTAGCT 


CCTCTTTTTA 


CATTTGATTT 


16140 


TTATTGAGAG 


TAAAGTTGGA 


GTTGAAGTAA 


TTTTAAAAGA 


TTTTTTAGAA 


ATAGCGCTTG 


16200 


AT AT AT AT AT 


GGTAAAATAA 


AAAGAATTGC 


TGTGATATCA 


ATAGATTTGG 


GGGATTTTTT 


16260 


AATATGGTAC 


TCGATAAGGC 


AAGTTGTGAT 


TTGCTTCAAT 


ATTTGATGGA 


TCAAGAAACG 


16320 


TCCAAAACGA 


TTATGGCCAT 


TTCGAAAGAT 


TTGAAAGAGT 


CAAGAAGGAA 


AATTTATTAT 


16380 


C AC ATTG AC A 


AAATCAATGC 


TGCTCTGGGT 


GACGAGGCGC 


TTCACATCAT 


TAGTATTCCA 


16440 


CGAATTGGTA 


TTCACTTAAC 


GGAAGAGCAG 


AGAGATGCTT 


GTTGTAAACT 


ATTATCGGAA 


16500 


GTAGATTCGT 


ACGATTATAT 


CATGAGTGCG 


CATGAACGTA 


TGATGATAAT 


GTTACTATGG 


16560 


ATACGTATTT 


CTAAAGAACG 


TATTACGATT 


GAAAAATTGA 


TAGAGTTAAC 


AG AGGT ATCT 


16620 


AGGAATACTG 


TTCTCAATGA 


TTTGAATAGT 


ATTCGTTATC 


AACTAACTTT 


GGAACAATAT 


16680 


CAGGTGATCT 


TGCAAGTGAG 


CAAGTCACAG 


GGATACAACC 


TTCATGCCCA 


CCCTCTTAAT 


16740 


AAAATTCAGT 


ATCTTCAATC 


GCTTCTATAT 


C AT ATTTTT A 


TGGAAGAAAA 


TGCCACTTTT 


16800 


GTATCTATTT 


TAGAAGATAA 


GATGAAAGAG 


AGGTTAGATG 


ATCAGTCTTT 


GCTTTCTGTT 


16860 


GAAATGAACC 


AATTTTTTAA 


GGAACAGGTT 


CCTTTAGTTG 


AACAAGATTT 


AG GG AAG AAA 


16920 


ATAAACCATC 


ATGAAATAAC 


TTTTATGTTG 


CAGGTTCTAC 


CTTATTTCCT 


GTTAAGCTGT 


16980 


CATAATGTTG 


AACAGTATCA 


AGAAAGACAT 


CAGGATATAG 


AGAAAGAATT 


TTCTTTGATA 


17040 


AGAAAAAGAA 


TAGAGTATCA 


GGTGTCTAAG 


AAATTAGGAG 


AACGGTTGTT 


TCAAAAGTTT 


17100 


GAAATTTCTT 


TGTCAGGACT 


TGAAGTTTCT 


CTTGTAGCTG 


TTCTCCTCCT 


CTCCTATCGT 


17160 


AAAGATTTGG 


ATATTCATGC 


AGAAAGTGAT 


GATTTTCGGC 


AATTAAAACT 


TGCTTTAG*\A 


17220 


GAATTTATCT 


GGTATTTTGA 


ATCACAAATC 


CGAATGGAGA 


TTGAGAACAA 


GGATGATTTC 


L7280 


TTACGAAATT 


TGATGATCCA 


CTGTAAACCC 


TTGTTATTTA 


GAAAGACTTA 


CGGTATTTTT 


17340 


TCTAAAAATC 


CTCTAACAAA 


ACAAATTCGA 


TCCAAGTATG 


GAGAATTATT 


TTTAGTCACT 


17400 


AGAAAATCTG 


CGGAAATTTT 


AGAAGGAGCA 


TGGTTTATTC 


GGCTAACAGA 


CGATGATATT 


17460 


GCCTATTTGA 


CG ATT CAT AT 


TGGAGGATTT 


TTAAAATATA 


CACCATCATC 


TCAAAAAAAT 


17520 


ATGAAAAAAG 


TTTATCTCGT 


TTGTGATGAA 


GGTGTTGCGG 


TTTCGAGACT 


TTTCCTGAAA 


17530 


CAATGCAAAC 


TTTATTTTCC 


AAATGAGCAA 


ATTCACACTC 


TATTTACAAC 


AGAACAATTT 


17640 
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AAGAGTGTGG AAGATATTGC ACAAGTTGAT GTAGTGATTA CTACTAATGA TGATTTGGAT 177 00 

AGCAGATTTC CGATTTTAAG GGTTAATCCT ATCCTTGAAG CAGAAGATAT TTTG AAAATG 17 7 60 

CTAGACTATC TTAAACACAA TATATTTCGT AATAAGAGCA AAAGTTTCAG TGAAAATCTT 17S20 

TCTAGTCTTA TTTCGTCTTA TATTGTAGAC AGCAAGTTGG CTAGTAAGTT CCAAGAAGAG 17 880 

GTTCAAACAC TTATAAATCA AGAAATAGTA GTTCAAGCTT TTTTGGAAGr TATTTGAAGG 17 940 

ACAGTCCAAT GATGAACACA AACCTGTGTk TTTCsTGGTC TTTTtTAGTG TTTTGAAGGG 18000 

TGGkATACTA ATCTCAAAGA TAACAATTAT ATCCAAAGGA GGCAACATAT GCCAAACGTC 18060 

AAAGAAATTA CAAGAGAGTC ATGGATTTTA GCCACTTTCC CAGAGTCGGC AACATGGTTG 13120 

AACGAAGAAA TCGAAGAAGA AGTCCTACCT GAAGGCAACT TTGCCATGTG GTGGCTAGGC 18130 

AACTGTGGTA CTTGGATTAA GACACCAGCT GGTGCTAACG TTGTCATGGA CCTTTGGTCA 18240 

AACCGTGGAA AATCAACCAA AAAAGTGAAA G AT ATGGT TC GTGGGCACCA AATGGCAAAT 18300 

ATGGCAGGTG TTCCTAACCT GCAACCAAAC TTGCGTCTTC AGCCAATGCT TATCGATCCA 193 60 

TTTCCTATCA ACGAACTAGA CTATTACTTA GTTTCACACT TCCACAGTGA TCATATCGAC 18420 

C CAT AC AC AG CTGCAGCAAT TCTCAATAAT CCTAAGTTAG AGCATGTTAA GTTGG 18475 
( 2 ) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCAGGATTTG GTACCGTTGC AAGTGGT GTG CCTTTCCTCC TAAAGGAAAA TGGAGGAAAA 60 

ATCAATCAAT CAGCACATTC AGATATCAAA GTTGCTAAGG TATTGGTCAA GGATGAAGAT 12 0 

GAAAAAAATC GCTTGCTTGC AGCAGGGAAT GACTTTAACT TTGTAACCAA TGTGGATGAT 180 

ATTTTATCAG ACCAGGATAT TACTATCGTA GTGGAATTGA TGGGCCGTAT TGAGCCTGCT 24 0 

AAAACCTTTA TCACTCGTGC CTTGGAAGCT GGAAAACACG TTGTTACTGC TAACAAGGAC 3 00 

CTTTTAGCTG TCCATGGCGC AG AATTG CT A GAAATCGCTC AAGCTAACAA GGTAGCACTT 3 60 

TACTACGAAG CAGCAGTTGC TGGTGGGATT CC AATTCTTC GTACTTTAGC AAATTCCTTG 42 0 

GCTTCTGATA AAATTACGCG CGTG CTTGG A GTAGTCAACG GAACTTCCAA CTTCATGGTG 4 80 

ACCAAGATGG TGGAAGAAGG CTGGTCTTAC GATGATGCTC TTGCGGAAGC ACAACGTCTA 540 
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GGATTTGCAG AAAGCGATCC GACGAATGAC CTAGATGGGA TTGATGCAGC CTACAAGATG 600 
GTTATTTTCA GCCAATTTGC CTTTGGCATG AAGATTGCCT TTGATGATGT AGCCCACAAG 660 
GGAATCCGCA ATATCACACC AGAAGACGTA GCTGTAGCTC AAGAGCTTGG TTACGTAGTG 7 20 

AAATTGGTTG GTTCTATTGA GGAAACTTCT TCAGGTATTG CTGCAGAAGT GACTCCAACC 780 
TTCCTACCTA AAGCGCACCC ACTTGCTAGT GTGAATGGCG TKSTGAACGC TGTCTTTGTA 84 0 

GAATCTATCG GTATTGGTGA GTCT ATGT AC TACGGACCAG GTGCGGGTCA AAAACCAACT 900 
GCAACAAGTG TTGTAGCTGA TATTGTCCGT ATCGTTCGTC GTTTGAATGA TGGTACTATT 9 60 

GGCAAAGACT TCAACGAATA TAGCCGTGAC TTGGTCTTGG CAAATCCTGA AGATGTCAAA 102 0 

GCAAACTACT ATTTCTCAAT CTTCGCTCTA GACTCAAAAC GTCAGGTCT? GAACTTGGCT 1080 

GAAATCTTCA ATGCTCAAGA TATTTCCTTT AAGCAAATCC TTCAAGATGG CAAAGAGGGT 1140 

GACAAGGCGC CTGTCGTTAT CATCACACAC AAGATTAATA AAGCCCAGCT TGAAAATGTC 12 00 

TCAGCTGAAT TCAAGAAGGT TTCAGAATTC GACCTCTTGA ATACCTTCAA GGTGCTAGGA 12 60 

GAATAAGATG AAGATTATTG TACCTGCAAC CAGTGCCAAT ATCGGGCCAG GTTTTGACTC 13 20 

GGTCGGTGTA GCTGTAACCA AGTATCTTCA AATTGAGGTC TGCGAAGAAC GAGATGAGTG 13 80 

GCTGATTGAA CACCAGATTG GC AAATGG AT TCCACATGAC GAGCGTAATC TCTTGCTCAA 14 4 0 

AATCGCTTTG CAAATTGTAC CAGACTTGCA ACCAAGACGC TTGAAAATGA CCAGTGATGT 1500 

CCCTTTGGCG CGCGGTTTGG GTTCTTCCAG CTCGGTTATC GTTGCTGGGA TTGAACTAGC 15 60 

CAACCAACTC GGTCAACTCA ACTTATCAGA CCATGAAAAA TTGCAGTTAG CGACCAAGAT 162 0 

TGAACGGCAT CCTGACAATG TCGCTCCACC CATTTATGGT AATCTCCTTA TTGCAAGTTC 1680 

TGTTGAAGGG CAAGTCTCTG CTATCGTACC AGACTTTCCA GAGTGTGATT TTCTAGCTTA 17 4 0 

CATTCCAAAC TATGAATTAC GTACTCGCGA CAGCCGTAGT GTCTTGCCTA AAAAATTGTC 1800 

TTATAAGGAA CCTGTTGCTC CAAGTTCTAT CGCCAATGTA GCGGTTGCTG CCTTGTTGGC 18 60 

AGGAGACATG GTGACCGCTG GGCAAGCAAT CGAGGGAGAC CTCTTCCATG AGCGCTATCG 192 0 

TCAGGACTTG GTAAGAGAAT TTGCGATGAT TAAGCAAGTG ACCAAAGAAA ATGGGGCCTA 19 80 

TGCAACCTAC CTTTCTGCTG CTGGGCCGAC AGTTATGGTT CTGGCTTCTC ATGACAAGAT 204 0 

GCCAACAATT AAGGCAGAAT TGGAAAAGCA ACCTTTCAAA GGAAAACTGC ATGACTTGAG 2100 

AGTTGATACC CAAGGTGTCC GTGTAGAAGC AAAATAAAGA ATAGAAGATA GGATGGGGAA 2160 

ACTCTTGACC AGAGGGGTTC ATATCCTTTT TGTGAAAAGA AGTTTATACT CAATGAAAAT 2 22 0 

CAAAGAGCAA ACTAGGAAGC TAGCCGCAGG CTGCTCAAAA CAGTGTTTTG AGGTTGCAGA 22 3 0 

TAGAACTGAC GAAGTCAGCT CAAGACACTG TTTTGAGGTT GCAGATAGAA CTGACGAAGT 2 34 0 
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CAGTAACCAT ACTACGGTAA GGTGACGCTG ACGTCCTTTC AAGAGATTTT CGAAGAGTAT 2 400 

TAG TT AAAAA CGTGATAAAG GAGAAATAAA GATGGCAGAA ATTTATCTAG C AGGTGG TTG 2 4 60 

TTTTTGGGGC CTAGAGGAAT ATTTTTCACG CATTTCTGGA GTGCT AG AAA CCAGTGTTGG 2 5 20 

CTACGCTAAT GGTCAAGTCG AAACGACCAA TTACCAGTTG CTCAAGGAAA CAGACCATGC 2 5 80 

AGAAACGGTC CAAGTGATTT ACGATGAGAA GGAAGTGTCA CTCAGAGAGA TTTTACTTTA 2 640 

TTATTTCCGA GTTATCGATC CTCTATCTAT CAATCAACAA GGGAATGACC GTGGTCGCCA 2700 

ATATCGAACT GGGATTTATT ATCAGGATGA AGCAGATTTG CCAGCTATCT ACACAGTGGT 2 760 
GCAGGAGCAG GAACGCATGC TGGGTCGAAA GATTGCAGTA GAAGTGGAGC AATTACGCCA 
CTACATTCTG CCTGAAGACT ACCACCAAGA CTATCTCAGG AAGAATCCTT CAGGTTACTG 

TCATATCGAT GTGACCGATG CTGATAAGCC ATTGATTGAT GCAGCAAACT ATGAAAAGCC 2 94 0 

TAGTCAAGAG GTGTTGAAGG CCAGTCTATC TGAAGAGTCT TATCGTGTCA CACAAGAAGC 3 000 

TGCTACAGAG GCTCCATTTA CCAATGCCTA TGACCAAACC TTTGAAGAGG GGATTTATGT 3060 

AGATATTACG ACAGGTGAGC CACTCTTTTT TGCCAAGGAT AAGTTTGCTT CAGGTTGTGG 312 0 

TTGGCCAAGT TTTAGCCGTC CGATTTCCAA AGAGTTGATT CATTATTACA AGGATCTGAG 3180 

CCATGGAATG GAGCGAATTG AAGTTCGTTC TCGTTCAGGC AGTCCTCACT TGGGTCATGT 32 4 0 

TTTCACAGAT GGACCGCGGG AGTTAGGCCG CCTCCGTTAC TGTATCAATT CTGCTTCTTT 3 300 

ACGCTTTGTG GCCAAGGATG AGATGGAAAA AGCAGGATAT GGCTATCTAT TGCCTTACTT 3 3 60 

AAACAAATAA AACAGACAGT GGGGCTTCCC ACTTTCTTCA TTTCTAGAAT ATGAATAGAA 34 2 0 

CCGATTTATC AAACACCTAT TATCTTACTT CAAACCCTAC ATCAAGGAAT CAATTTTAGC 34 80 

CCCCTTGTTC AAGCTGTTAG AAGCTGTTTT TGAGCTCTTG GTTCCCATGG TGATTGCTGG 3 54 0 
GATTGTTGAC CAATCTTTAC CTCAGGGAGA TCAAGGTCAT CTCTGGATGC AGATTGGCCT 
GCTCCTTATC TTTGCAGTAA TTGGCGTTTT AGTGGCCTTG ATAGCTCAAT TTTACTCAGC 
AAAGCCAGCA CTACGTTCTG CTAAGGAATT GACAAACGAT CTTTATCGTC ATATTCTTTC 
CTTGCCCAAG GACAGCAGAG ACCGTCTGAC AACTTCTAGT TTGGTCACTC GCTTGACTTC 

GG AT AC CT AC CAGATTCACA CTGGTATCAA TCAATTCCTG CGTCTCTTTT TACGAGCGCC 3840 
CATTATCGTT TTTGGTGCCA TTTTTATGGC TTATCGAATC TCAGCTGAGT TGACTTTCTG 3 900 

GTTCTTAGTC TTGGTTGCCA TTTTG ACC AT TGTCATTGTA GGGTTATCTC GATTGGTCAA 3960 
TCCTTTCTAC AGTAGTCTCA GAAAGAAAAC GG AC C AACTG GTTCAGGAAA CGCGCCAGCA 
ATTGCAAGGG ATGCGGGTTA TTCGTGCTTT TGGTCAAGAA AAACGAGAGT TACAGATTTT 



3600 
3660 
3720 
3780 



4020 
4080 
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TCAAACCCTT 


AACCAAGTTT 


ATGCTAGATT 


ACAAGAAAAG 


AC AGGTTTCT 


GGTCTAGTTT 


4140 


ATTAACACCT 


CTGACCTATC 


TGATTGTCAA 


TGGAACTCTT 


CTCGTTATTA 


TCTGGCAAGG 


4200 


CTATATTTCA 


ATTCAAGGAG 


GAGTGCTCAG 


TCAAGGTGCT 


CTCATTGCTC 


TTATCAATTA 


4260 


CCTCTTACAG 


ATTTTG GTGG 


AATTGGTCAA 


GCTAGCCATG 


TTGATCAATT 


CCCTCAACCA 


4320 


GTCCTATATC 


TCAGTCAACC 


GAATCGAGGA 


AGTCTTTGTT 


GAGGCTCCAG 


AGGATATCCA 


4380 


TTCAGAGTTA 


GAACAAAAGC 


AAGCTACCAG 


AG AT AAGGTT 


TTACAAGTCC 


AAG AA TTG AC 


4440 


CTTTACCTAT 


CCTGATGCGG 


CCCAGCCTTC 


TCTGAGATAC 


ATTTCCTTTG 


ATATGACTCA 


4500 


AGG AC AAATT 


CTAGGTATCA 


TCGGGGGAAC 


TGGTTCTGGT 


AAAT C AAG CT 


TGGTGCAACT 


4560 


CTTACTTGGA 


CTTTATCCAC 


T AG ACAAGGG 


GAACATTGAC 


CTTTATC AAA 


ATGGACGTAG 


4620 


TCCTCTTAAT 


TTGGAGCAGT 


GGCGGTCTTG 


GATTGCCTAT 


GTACCTCAAA 


AGGTCGAACT 


4680 


CTTTAAAGGA 


ACCATTCGTT 


CCAACTTGAC 


TCTAGGTTTC 


AATCAAGAAG 


TATCTGACCA 


4740 


GGAACTCTGG 


CAGGCCTTGG 


AGATTGCGCA 


AGCTAAGGAT 


TTTGTCAGTG 


AAAAGGAAGG 


4800 


ACTCTTG G AT 


GCTCTAGTTG 


AGGCAGGGGG 


GCGAAATTTC 


TCAGGTGGAC 


AAAAACAAAG 


4860 


ATTGTCTATC 


GCCCGAGCAG 


TCTTGCGCCA 


GGCTCCGTTT 


CTCATCCTAG 


ATGATGCAAC 


4920 


CTCGGCACTG 


GATACCATTA 


CAGAGTCCAA 


GCTC TTG AAA 


GCTATTAGAG 


AAAATTTTCC 


4980 


AAACACGAGC 


TTAATTTTGA 


TCTCTCAACG 


AACCTCAACT 


TTACAGATGG 


CGGACCAGAT 


5040 


TCTCCTCTTG 


GAAAAAGGTG 


AGTTGCTAGC 


TGTTGGCAAG 


CACGATGACT 


TGATGAAATC 


5100 


CAGCCAAGTC 


TATTGTGAAA 


TCAATGCATC 


CCAACATGGA 


AAGGAGGACT 


AGAATGAAAC 


5160 


GACAAACTGT 


AAACCAGACG 


CTCAAACGTT 


TAGCCGTAGA 


TTTAGCAACC 


CATCCTTTCC 


5220 


TCCTTTTCCT 


AGCCTTTCTA 


GGAACTATTG 


CCCAAGTTGG 


CTTATCAATT 


TACCTACCTA 


5280 


TTCTCATTGG 


CCACCTCATT 


GACCAAGTCC 


TAGTGGCTCG 


TTCATCACCA 


GTTTTTTGGC 


5340 


AGATTTTTCT 


CCAGATGCTC 


TTGGTGGTAA 


TAGGAAATAC 


TCTGGTACAA 


TGGGCCAATC 


5400 


CTCTCCTCTA 


TAATCGTCTA 


ATCTTCTCTT 


AT AC C AG AG A 


TTTACGGGAG 


CGAATCATCC 


5460 


ATAAGCTCCA 


TCGTTTACCG 


ATTGCCTTTG 


T AG AT AGG C A 


AGGTAGTGGA 


G AG ATGGTT A 


S520 


GTCGTGTAAC 


CACGGACATC 


GAACAGTTGG 


CAGCTGGCTT 


GACCATGATT 


TTTAACCAAT 


5580 


TTTTCATTGG 


TGTTTTGATG 


ATTTTGGTCA 


GTATTCTAGC 


CATGCTCCAA 


ATTCATCTCC 


5640 


TCATGACTCT 


CTTAGTCTTG 


CTGTTGACGC 


CACTGTCCAT 


GGTGATTTCA 


CGCTTTATTG 


5700 


CCAAGAAATC 


CTATCATCTC 


TTCCAGAAGC 


AAACAGACAC 


GAGGGGAATT 


CAGACTCAGT 


5760 


TGATTGAAGA 


ATCGCTTAGT 


CAGCAGACTA 


TAATCCAGTC 


CTTCAATGCT 


CAAACAGAAT 


5820 


TTATCCAAAG 


AT T GCGTG AG 


GCTCATGACA 


ACTACTCAGG 


CTATTCTCAG 


TCAGCCATCT 


5880 
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TTTATTCTTC AACGGTCAAT CCTTCGACTC GCTTTGTAAA TGCACTCATT TATGCC CTTT 594 0 

TAGCTGGAGT AGGAGCTTAT CGTATCATGA TGGGTTCAGC CTTGACCGTC GGTCGTTTAG 6000 

TGACTTTTTT GAACTATGTT CAGCAATACA CCAAGCCCTT TAACGATATT TCTTCAGTGC 6060 

TAGCTGAGTT GCAAAGTGCT CTGGCTTGCG TAGAGCGTAT CTATGGAGTC TTAGATAGCC 6120 

CTGAAGTGGC TGAAACAGGT AAGGAAGTCT TGACGACCAG TGACCAAGTT AAGGGAGCTA 6180 

TTTCCTTTAA ACATGTCTCT TTTGGCTACC ATCCTGAAAA AATTTTGATT AAGGACTTGT 62 4 0 

CTATCGATAT TCCAGCTGGT AGTAAGGTAG CCATCGTTGG TCCGACAGGT GCTGGAAAAT 6 3 00 

CAACTCTTAT CAATCTCCTT ATGCGTTTTT ATCCCATTAG CTCGGGAGAT ATCTTGCTGG 6 3 60 

ATGGGCAATC CATTTATGAT TATACACGAG TATCATTGAG ACAGCAGTTT GGTATGGTGC 64 2 0 

TTCAAGAAAC CTGGCTCACA CAACGGACCA TTCATGATAA TATTGCCTTT GGCAATCCTG 64 80 

AAGCCAGTCG AGAGCAAGTA ATTGCTGCTG CCAAAGCAGC TAATGCAGAC TTTTTCATCC 654 0 

AACAGTTGCC ACAGGGATAC GATACCAAGT TGGAAAATGC TGGACAATCT CTCTCTGTCG 6600 

GCCAAGCTCA GCTCTTGACC ATAGCCCGAG TCTTTCTGGC TATTCCAAAG ATTCTTATCT 66 60 

TAGACGAGGC AACTTCTTCC ATTGATACAC GGACAGAAGT GCTGGTACAG GATGCCTTTG 67 2 0 

CAAAACTCAT GAAGGGCCGC ACAAGTTTCA TCATTGCTCA CCGTTTGTCA ACCATTCAGG 67 80 

ATGCGGATTT AATTCTTGTC TTAGTAGATG GTGATATTGT TGAATATGGT AACCATCAAG 684 0 

AACTCATGGA TAGAAAGGGT AAGTATTACC AAATGCAAAA AGCTGCGGCT TTTAGTTCTG 6 9 00 

AATAAGCCAT TCTCTTTTGA AAGTTTATGG ACGAAAAAAG TTGCCTTCGA GTGACTTTTT 69 6 0 

TGTTACAATA GCTAGAAAAA TTGTTCACTG TAATACTCAA TGAAAATCAA AGAGCAAACT 7 020 

AGGAAGCTAG CCGTAGGTTG CTCAAACCAC ACCTTTCAGG TTGTAGATAA GACTGACGAA 7 080 

GTCAGTTCAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAG CTCAAAACAC 7 140 

TCTTTTGAGG TTGC AG AT AG AACTGACGAA GTCAGCTCAA AACAGG 7 186 

(2i INFORMATION FOR SEQ ID NO: 40: 

<i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 14273 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4C: 

CTGAAAATTC TAAAAAATTT ATAAGTAAGG AATTAATTAG TTATTTTTGT GATAAAGTTT 6 0 
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ATGATGAAAT ATTTGTTCAA GAGGTAGTTC CCCACGTTTT TCTGCCATAT GAATCTGACT 120 

TACTTCTTAT TTTACCAGCT ACGGCAAATG TGATTGGCAA AATTGCTAAT GGTATTGCTG 180 

ATGATTTAGT TACAGCAACT GTTTTAAACT TTAATAAAAA AATAATTTTT TGTCCCAATA 24 0 

TGAACTCTAC TATGTGGGAC AATCACATAG TTCAAAGAAA TGTATCAATT CTAAAGGAGT 300 

TGGGACATAT ATTTTTATTT GAGTCTAAAA AAAC AT AT G A GGTAGGATTG CGTAAAGCAA 3 60 

TAGATTCAAC ATGTTCAATG TTACAACCAC AGTCGTTAGT AAAAGAACTT ATCAAATTAG 4 20 

AAAATATTGT CCTTGAAGAG GGACATTAAA AACTACTGAG AATATTAATG AGGGGAAAAA 48 0 

ATGGAAAATT CATCAATCGA TGTAGATATG CTGTTGGAAG AATTGACACA AGAAGCAATG 54 0 

GTCGTTGTTG CTGTTG AT AA GGACTGTTAA TTT AAACTT A TGGCAATATA TGAAAGGTTA 600 

CTGGATGTTT TAAATTATGC AGGCAGTAGC CTTTTATTAT ATACAAATGG ATAAAGTAAG 6 60 

GATAATACAA TGATTAATAA AAAAATACAA CAAGTTGTTT TGGAATCATT ACAGAATTTT 7 20 

TTGAATGGGA ACTTCATTTC GCCTTGTGTA GTCTATGATT TTGGCTTGCT GGAAACTGTA 7 80 

CTTGATGAAT TTAAAAATCA AATTCCTGTA ACATTCAATT ACCAACTTTT TT ATG CCGTT 84 0 

AAAGCAAATT CAAATGAGAA GATACTTGAA TTCTTAGTAG AT AAAATT G A TGGAGTTCAT 900 

G TGGCGTC AT TATCTGAATT AGATGTGGCT AAAAAATTTT TCCCACCAAC T C AAATTTCT 960 

GTTAATGGTC CCGCATTTTC TTATGAAACT TTATATAATC TGATTAAAAA ACAATATAAA 1020 

GTTGATATTA ACTTTTTGGA ACATCTTCAA C AATTTTC CC CAAAAGAATC TGTTGGAATA 1080 

AGAGTAACGG AGCC AG ATG A ACTTAATAAT CGTATGAGTC GATTTGGAAT AAATATTTGC 1140 

AGTGATAATT GGACTAGTAA TTTACAAAAT CCTTTAATTA CACGACTGCA TTTT C ATTTT 12 CO 

GGAGAAAAAG ATGATAAATT TATTCTTAAG TT AG AT AAAA TATTATTTAA GTTACAAGAA 1260 

ATTAATAAAC TTAGAGAGGT TAG AG AAAT A AATCTTGGAG GCGGTTTTAT GAAATTATTT 13 20 

ATGGAAAATC GTTTGAAAGA ATTTTTTCTA TCACTTATGG AAATCTATAA AAAGTACGAT 13 80 

ATTGATAGTA CTGTGACTAC AATAATAGAA CCAGGTACTG CAATTACTTC ATTTTCTGCC 14 4 0 

TATATGATTA CTAGCCCAGT TAATGTTAGT GAGGTGAATG AGCAGCAGGT TATCACGTTA 15 00 

GACACATCAA TATACACCAA T AC ATT ATGG TTTGTTCCGC ATATTATTAC AACGTTAAAT 15 60 

TCAAGTAGTA AAGAGCGTTA T AGT ACT ATT CTCTATGGTA ATACCTGTTA TGAACATGAC 162 0 

AAGTATAAAA TGAAAGTTTC GCTTCCAAGG TTAACTCAAA ATAGCAGTAT AGTGTTTTTT 1680 

CCTGTAGGAG CTTATATAAA AAGCAATCAT TCAAATTTAC ATCGTAATGA TTTTATGCGG 17 40 

GAGGTATATT TGTGGACAAA AAACTTGACA TATTAGATAA AGTTAAGGAA TATTTAGGAA 1800 

ATAAAACTAC TCAAATTCTG GATAATCAAT ATAAAGAATT TTTCAAACTT AATGATATAA 1360 
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GGCGAGCGTT TCGTATTTCA GAAAAAGTAT TAAACAATTC TTTTAATTTT ACGAGTAAAG 1920 

AATTT AATGA TTTAATTAAT AACGAAAATT ATTTATTCGA ATATGCATGT AGAATTAGAG 19 SO 

AGGAATGGAG AAAAAAATGC TTTAATCATT CTTATCGTTT TCTATGCTCA CCTATAATTA 204 0 

CAGATGATTT TCTTAACACG AAGACATTGA GAAGTAGCCA AATTGAATAT AAATATGAGC 2100 

GATATTTATC GAAAAGTTCG ATAGGCGATA GAGCGGTTGA TGGCTTTGTT TCCTTCAATA 2160 

CTTTAACAGC T AATGGT AT G TCTGCTATTA AACTATGTCT TG AG AT ATT A AACTCTATTT 22 20 

TCTTCAAGAA GAAGATTGAT TTATTATATT CAACCGGATA TTATGAAACA AGATTTTTAT 2 2 80 

TAAATAATCT TGCTAAATCA GGTATTAGTT GCTATGAGGT AAGTAATTGT GAATTGGATA 2 34 0 

AAGATAAATT TTATAATGTA TTCATGATGG AACCCAATCG AGCCGATTTA AC AT T AC AAA 2 400 

AAACTGATTT CAAG AT AGT A GAATATTTTG TTAAGTATAA AAATAATTCA ATAAAAGTCG 24 60 

T T ATTTT AG A TATTTCATAT CAAGGTTCTA ATTTTAAATT AG TAG AATTT TTAGAGAAAT 2 520 

TTAAATTTGC GAATGTAATT ATTTTTGTGG TACGATCTTT G AT AAAAT T A GATCAAATCC 2 580 

GATTAGAATT GACAAATGGG GGAATAATAG AAGTGTTTAT TCCTAATCAT TTGAGAAAGT 2 64 0 

TGAAAAATTT TATTGAAGAG GAATTCAATA AATTT AG AAA TTCTCACGGA GCTAATCTAA 2700 

GCCTCTATGA ATACTGTTTG CTTGATAATT CTTTAACTTT AAAAAATGAT TGGAACTATT 27 60 

CTGATTTAGT TATGAAATTT ACGAGTAATT TTTATGCTGA TATAAAAGAC TTGTTCATGG 2 82 0 

AAAATTCTGA TATTGAAATC ATCCATGAAG AGGGAGTACC TTTTGTATTT TTAGATTTAA 2 880 

TAGGTGAAGG TAAAAAAGAA TATGAAATGT TTTTTCAATG GTTAAACTTC TTTTACAAAC 2 94 0 

AGCTTGGAAT CACATTGTAT GCTAGAAATA GTTTTGGGT T TCGGAATCTA ACAGTAGAGT 3 000 

ATTTTGGAAT TATTGGCACA GAAACATATA TATTTAAGAT TTGTCCAGGT GTTTATAAAG 3 060 

GGTTAAGTTA TTATTTGATG AAATTTTTAT TAAAATCTTT TTCAAATGAA TATTTAAAAA 312 0 

CTACTGATGA GGTTAATAGA TGAAAAATTT GATAAAGTTG CTAATAATTA GATTGATTCT 3 180 

TAACTTAGCA G AC AGT GT AT TTT AT AT AGT AGCATTGTGG CACGTTAGCA ATAATTATTC 3 24 0 

TTCGAGCATG TTCTTAGGAA TATTTATTGC AGTAAATTAT CTACCGGATT TGTTACTAAT 3 3 00 

CTTTTTTGGA CCAGTTATTG ACAGAGTAAA TCCGCAAAAA ATTCTTATAA TATCAATTTT 3 3 60 

GGTTCAATTA GCAGTGGCTG TAATATTTTT ATT ATT ATT A AACCAAATAT CATTTTGGGT 3 4 20 

GATAATGAGT CTAGTGTTTA TTTCAGTAAT CGCTAGCTCC ATAAGTTACG TGATAGAAGA 34 80 

TGTGTTGATT CCTCAAGTGG TAGAATATGA TAAGATTCTA TTTGCAAATT CTCTTTTTAG 3 54 0 

TATTTCGTAT AAAGTATTAG ATTCT ATTTT TAATTCATTC G CATC ATTTT TACAGGTGGC 3 6 00 
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AGTACGATTT ATTTTATTGG TTAAGATAGA TATAGGCATA TWP^ACTTG CTCTATTTAT 36 60 

ATTGTTGTTG TTAAAATTTA GAACTAGCAA TGCGAATATA GAAAACTTCT CTTTCAAATA 37 20 

TTACAAGAGA G AAGTG TT GC AAGGTACAAA GTTTATTTTA AATAATAAAT TATTATTTAA 37 80 

AACCAGTATT TCTTTAACGC TTATAAACTT TTTTTATTCA TTTCAGACAG TAGTTGTACC 384 0 

GATTTTTTCT ATTCGATATT TTGATGGTCC GATTTTTTAT GGTATTTTTT TAACTATTGC 3900 

TGGTTTGGGT GGTATATTGG GAAATATGCT AGCGCCAATC GTAATAAAAT ATTTAAAATC 3 9 60 

GAATCAAATT GTTGGTGTAT TTCTTTTTTT GAACGGCTCA AGTTGGTTAG TAGCAATTGT 4 0 20 

TATAAAAGAC TAT AC TTT AT CACTTATTTT ATTTTTCGTT TGTTTTATGT CTAAAGGAGT 4 080 

CTTCAATATT ATTTTTAATT CGTTGTACCA ACAAATACCT CCACATCAAC TTCTTGGTAG 4140 

GGTAAATACT AC C ATTG ATT CTATTATTTC TTTTGGAATG CCAATTGGTA GTTTAGTTGC 4 200 

AGGAACCCTT ATTGATTTGA ATATTGAATT AGTGTTAATT GCTATTAGCA TACCTTATTT 4 2 60 

TTTCTTTTCT TATATTTTTT ATACGGATAA TGGATTGAAA G AATTT ACT A TATATTAGAA 4 3 20 

ATGTTTATGT TCATTCAAAA GCATAATGAC TATAACTGAA AAAGAAAACT GATATCTTTA 4 3 80 

AGGTTGTTCT TCTTGGTGGT GAGATTCGTG AGACAACCCA AGCTTTTCTC GGAAAGATTA 4 44 0 

CCAATGCTTT GATGGATAGG ATGTACTTTA GCAAGATGTT TTTAGTGGTA ACGGTATCGT 4 500 

GGATCGACGT GTAATAACCT CTTCTTTCGA GGAGTATTTT ACTAAAAAAC TAGCCTTGGA 4 5 60 

GCGTTCCCCA GAAACGGACT TACTCATTGA CTCTTCAAAG ATTTGGGGAG AAGATTTTGC 4 6 20 

TTCATCTGTT CCTTGAAAAA AGTCACAGCA GTC ATC AC AG ACGATAGTAC TGAACAAAAC 4 6 80 

TATGAAGAGT TAGAAATTTA TACGCACCTG ATTGT AT AAA CGATCTGGAA ATAGATAAGA 47 4 0 

AGTTGATTAG TATTGACCTA GGTGGTACAA ATATTAAGAT TACTGTTCTT TCAAATGACG 4 800 

GTGAGATTGA AACTTTGTGG AGTATTACAA CAGATACAAG TGAGAAAGCT TCTCAAATTA 4 8 60 

TATCGGACAT CATCAGTTCT ATTAAAAATA AATTGACCGA ACGGAATATT CCTGATAGCG 4 920 

ACCTTCTTGG AATCGGTATG GGAAGTTGCT CATCATACTT TCCTTGTAAA TCATAGGGGC 4 9 80 

TATAAACTCT CCGTCTACTT GTCCTGCAAC AATTGAAGTC TGCTCAAAAC GCCGTCCGCT 3040 

AATCTTTTCA TAGACTTTCT CCCTTTTAGG AGCCTAGCTT TCT AGTTTGT TCTTTGATTT 5100 

TTATTGAGTA TACCACTATT TTACTCCCTC TGGCAAGGGA CTTTGTCTAT GTGGAGGGAT 5160 

TGGGCTCCTA TGTGGTGG AG CTTTTCTGTT CTTTCTGAAA TATGGTATAA TAGCACTAAT 52 20 

CAATTTCTAG G AAAAT AG AT ACAGAAAGGG GCTGAAAGAT GTCTCATATT ATTGAATTGC 5 2 80 

CAGAGATGCT GGCAAACCAA ATCGCGGCTG GAGAGCTCAT TGAACGTCCT GCCAGTGTGG 5 340 

TCAAAGAGTT GGT AG AAAAT GCCATTGACG CGGGCTCTAG TC AG ATT ATC ATTG AG ATTG 5 40 0 
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AGGAAGCTGG TCTCAAGAAG GTTCAAATCA CGCATAACGG TCATCGAAT7 GCCCACGATG 54 60 

AGGTGGAGTT GGCCCTGCGT CGCCATGCGA CCAGTAAGAT AAAAAATCAA CCAGATCTCT 5 520 

TTCGGATTCG GACGCTTGGT TTTCGTGGTG AAGCCTTCCC TTCTATTGCG TCTGTTAGTG 5 530 

TCTTGACTCT GTTAACGGCG GTGGATGGTG CTAGTCATGG AACCAAGTTA GTCGCGCGTG 5 64 0 

GGGGTGAAGT TGAGGAAGTC ATCCCAGCGA CTAGTCCTGT GGGAACCAAG GTTTGTGTGG 5 7 00 

AGGATCTCTT TTTCAACACG CCTGCCCGTC TCAAGTATAT GAAGAGCCAG CAAGCGGAGT 5160 

TGTCTCATA? CATTGATATT GTCAACCGTC TGGGCTTGGC CCATCCTGAG ATTTCTT TT A 5820 

GCTTGATTAG TGATGGCAAG GAAATGACGC GGACAGCAGG GACTGGTCAA TTGCGCCAAG 5 880 

CAATCGCAGG GATTTACGGT TTGGTCAGTG CCAAGAAGAT GATTGAAATT GAGAACTCTC 594 0 

ACCTAGATTT CGAAATTTCA GGTTTTGTGT CCTTGCCTGA GTTGACTCGG GCTAACCGCA 6000 

ATTATATCAG CCTCTTCATC AATGCCCGTT ATATTAAGAA CTTCCTGCTC AATCGTGCT A 60 60 

TTTTGGATGG TTTTGG AAG C AAGCTTATCG TTGGACGTTT TCCACTCGCT GTCATTCACA 6120 

T C CAT ATCG A CCCTTATCTA GCGGATGTCA ATGTGCATCC AACTAAGCAA GAGGTGCGGA 6180 

TTTCCAAGGA AAAAGAACTG ATGACTCTGG TTTCAGAAGC TATTGCAAAT AGTCTCAACG 6 240 

AACAAACCTT GATTCCAGAT GCCTTGGAAA ATCTTGCCAA ATCGACCGTG CCCAATCGTG 6 300 

AG AAG GTGG A GCAAACTATT CTCCCACTCA AAGAAAATAC GCTCTACTAT GAGAAAACTG 6 3 60 

AGCCGTCAAG ACCTAGTCAA ACTGAAGTAG CTCATTATCA GGTAGAATTG ACTG AT G AAG 64 20 

GGCAGGATTT GACCCTGTTT G CC AAGG AAA CCTTGGACCG ATTCACCAAG CCAGCAAAAC 6 4 80 

TGCATTTTCC AGAGAGAAAG CCTCCTAACT ACGACCAGCT AGACCATCCA GAGTTAGATC 6 5 40 

TTCCTAGCAT CCATAAGCCT TATGACAAAC TGGAGCGAGA AGAAGCATCC AGCTTCCCAC 6 600 

AGTTGGAGTT TTTCGGACAA ATGCACGGGA CTTATCTCTT TGCCCAAGGG CG AG AT GG AC 6660 

TTTACATCAT AGATCAGCAC GCTGCTCAGG AACGGGTCAA GTACGAGGAG T ACCGTG AAA 6 720 

GCATTGGCAA TGTTGACCAA AGCCAGCAGC AACTCCTAGT GCCCTATATC TTTGAATTTC 67 80 

CTGCGGATGA TGCCCTGCGT CTCAAGGAAA GAATGCCTCT CTTAGACGAA CTGGGCCTCT 684 0 

TTCTAGCAGA GTACGGAGAA AATCAATTTA TTCTACCTCA ACATCCTATT TGGATGGCAG 6900 

AAGAAGAGAT TGAATCAGGC ATCTATGAGA TGTGCGACAT GCTCCTTTTG ACCAAGGAAC 69 60 

TTTCTATCAA GAAATACCGA GCAGAGCTGG CTATCATGAT GTCTTGCAAG CG ATCTATCA 702 0 

AGGCCAATCA TCGTATTGAT CATCATTCAG CTAGACAACT CCTCTATCAG CTTTCTCAAT 7 0 80 

GTGACAATCC CTATAACTGT CCTCACGGAC GTCCTGTTTT GGTGCATTTT ACCAAGTCGG 7 14 0 
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ATATGGAAAA GATGTTCCGA CGTATTCAGG AAAATCACAC CAGTCTCCGT GAGTTGGGGA 7 200 

AATATTAAAA GTATAAAAAA GTCTGGGAAA AATTTTCAAA ATCAAAAAAA CGCATAAAAT 7 2 60 

CAGGTGTTCA AAAACCTTGA TTTTATGCGT TTTATCATGG AAATAGTTAC TTCATTTTTT 7 320 

CCTAATTCTT TTCGAAACTC TTTTTAAACG ACGTCAGTTT TATCAGTAAT CTCAAAACAG 7 380 

TGTTTTGAGC TAATTTTGCC AGTTTTGTCT GTAACATCGA AGTTCTCTTT TACCACTCTG 74 40 

CGACTGGTTT CCTAGTTTGC TCTATGATTT TCACAGAGCA TTAAATTGCG ATTTTGCCAA 7 500 

GTTTCTTTAT TCGXCTAAAA GTAGAGTCTG TTCTATGCGT CTAATGTACG AATCAGGTTG 7560 

ACCATTTCAA TACCTCCTTG TGCACACTCA GAACCCTTAT TTCCTGCTTT AGTACCAGCT 762 0 

CGTTCTATGG CTTGTTCAAT TGTATCTGTC GTTAGCACAC CAAACATAAC AGGAATTTCG 7 680 

CTATTTAAAC TGATTTGGGC GATTCCCTTA GATACCTCGC TACATACATA ATCATAATGA 77 4 0 

CTTGTATTCC CTCTAATGAC AGCTCCCAAG C AG AT AATTG CATCATATTT TTT ACT TTTT 7 800 

G C C ATTTTTG ATGCAATCAG TGGTATTTCA AAAGCTCCTG GAACCCAGGC TACCTCTATA 7 860 

TCTTTCTCGT TTACATTCTC TCTTTTGAGA TTATCTAGTC CTCCAGATAA TAATTTTGAA 7 920 

GTTATAAATT CATTAAATCT CGCTACAACA ATACCTATTT TAATATTGTT TGCTACTAAA 7 980 

TTACCTTCAT AAGTGTTCAT TTATTTTTCC TCCATATTTA AAATGTGACC CATTCGATTT 8040 

TTCTTTGTTT CTAAATAAAA ACTATCGTAA GGATTGGCTT CT ATTTCG AT TGATATTCTA 8100 

CTGGAAATGG TAATTCCATA TTTTTCTAAC TGTTCAACCT TGTCAGGATT ATTTGTCAGT 8160 

AAATGAAGTG ACTGAAGTCC CACATCTTTA AGCATTTTTG CTCCAATATG ATATTCTCTT 92 20 

AAATCACCTT CAAAGCCTAA TGCAAGATTG GCATCAAGCG TATCCATGCC TTGATCTTGT 8280 

AAATGATAGG CTTTTAATTT ATTGATAAGT CCAATTCCTC GTCCCTCCTG TCCCAAGTAA 3 3 40 

AGTAAGACAC CCGAACCATT CTCAACAATC ATTTTCATAG CCTTATCGAA TTGCTGTCCA 84 00 

CAATCGCAAC GTAAAGAGCC TAAAACATCT CCTGTTAAAC ATTCGGAGTG GACCCGACAT 8 4 60 

AATACATTGG CTTCATCCTC TATATTTCCC ATAATAAGAG CAAGATGATG TTCCCCATTT 8 520 

AGTTTATCTA TATAGCTAAT TGCTTTGAAA TTACCGTATC TAGTAGGCAT ATTGACAGTT 8 5 80 

GAAACTCGTT CTACCAGCTG ATCATATACT TTT CT AT ATT CTTGTAATTC TTTGATGGTA 8 640 

ATTACTGGAA TGTTGTGTTT TTTCGAGAAC TGAATTAAAT CATCTGTTCT CATC ATT TTG 8 7 00 

CCATCATGAT TCATTATTTC ACAACATAGG CCACACTCTT TTAGTCCAGC TAATTTTAAT 87 60 

AAATCAACAG TTCCTTCTCT GTGTCCATTT CTTTCTAGCA CACCACCTTT TTTTGCAATT 33 20 

AAAGG AAAC A TGTGTCCTGG CCTGCGAAAA TCAGAGGGTG TTATATCTTC AGCTACACAC 8880 

ATACGTGCGG TCAGTCCTCT TTCCTCGGCA GAAATACCTG TGGTCGTTTC TTTATAATCA 894 0 
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CCACGAGAAT GATTGTGAGT GGAAAGGTCT TAAATGATGT CTAACAAAAA TAAGGAAATT 12 540 

CTGATTTTTG CG A TTCTCT A TACAGTCCTC TTTATGTTTG ATGGCGTTAA ATTCCTGCCT 12600 

TCTTTAATGC CATCTGCCAT TGCAAATTAT CTTGTTTATG TAGTTTTAGC TCTATATGGC 12 660 

TCCTTCTTGT TCAAGGATAG ATTGATCCAA CAATGGAAGG ACATTACAAA GACTAAAAGA 12720 

AAATTCTTCT TTGGAGTCTT AACAGGATGG CTCTTTCTC A TTCTGATGAC TGTTGTCTTT 12 780 

GAATTTGTAT CAGAGATGTT GAAGCAGTTT GTGGCACTAG ATGGACAAGG TCTAAATCAG 12 840 

TCTAATATTC AAAGTACCTT TCAAGAACAA CCACTACTGA TAGCTCTTTT TGCTTGTGTC 12 900 

ATTGGACCTC TGGTAGAAGA ATTATTTTTC CGTCAGGTCT TATTGCATTA CTTGCAGGAA 12 9 60 

CGGTTCTCAG GTTTACTAAG CATTATTCTG GTAGGACTTG TTTTTGCTCT GACTCATATG 13020 

CACAGTTTGG CTCTATCAGA GTGGATTGGT GCACTTGGTT ACTTAGGTGG AGGCCTTGCC 13 080 

TTTTCTATTA TTTATGTGAA AGAAAAAGAG AATATCTACT ATCCCCTACT TGTTCACATC 1314 0 

TTAAGCAACA GCCTCTCCTT AATCATTTTA GCTATCAGTA T AG T AAAATG AAATGAGAAC 13 200 

AGGACAAATC GATTTCTAAC AATGTTTTAG AAGTAGAGGT GTACTATTCT AGTTTCAATA 13 2 60 

TACTGTAATA TGTGATGAAA ATGCCAGTAA TGATACCGAG AAAAAAGCTG ACAAACTTTT 13 320 

CCCAGCTTTA TTTGTTATAG TCAAAGAGAA TGACTTGTTC CTGTGCATCT ACATGAGCAT 13 380 

GGACCCCAAA GGGTACAATT GCTCTTGGAG TTGCGTGGCC GACATTCAGA TTATAGACAA 13 440 

TCGGGATATT GCTGTCAATG ATATCCAATA GTGCCTCTTT ATAGTCGTCA TGGAAAGTTT 13500 

CATCCATAGG TTTTCCGACC AAGAGTCCAT TGATCACCGC GAATATGCCA GTGTCCTTTA 1356C 

AAGTTAGCAA CATCTTTTTG AAGTCTTCTG GCTTAGCCTT TTCTTCGCTT GTTTCGACCA 13 620 

AGAGGATTTT CCCTTCCCAC TCTCAC.Vv~? CAOGGAAJVAG TTT^TATTT"? T^GCACACTT 1 3 530 

CCGTGCTATC TGCGTATCGA GAGTTGTCAA AGATATCGTA GAGGGATTCG AGCCAACCAC 13 740 

CGACGATTTT CCCCTCGAAC TGGGCACTTC CTTGCAACAA GTCAAAACCT GTATTTGTAT 13 800 

GACTGACACG AGGTGTTCCC AGGGCCGTGG G ACT AAAATC AGTTCGTTC C TCATACCAAA 13 860 

CGTCACTAGG GCGGATTTCT GAAATTCTTC CCGTCTCAAT CAATTCTTTA AAGTAGTGAA 13 920 

GGCTATAGGC TAGCATTTCT TTGTCTAATT CACAAATCTC TGCTAAAAAG GATTGACCAT 13 980 

AAAAAGTCTT GATTCCTAAT TTATGCAACA TGAGGTGGTT CATGGTTGTA TCCGAGAAGC 1404 0 

CAAGAAAAAT TTTTTGCTTG ATAACCTTTT GGAGTTGGTC ATTTTCAAAA AGATAAGGTA 14100 

GCAAGCGATA GGTATCGTCT CCACCGATGG CACATAGGAT CATGTCGATG CTATCATCAG 14160 

AAAAGGCATG AATCAAATCC TCTGCACGAG CTTCACGATG GTCCTTGATA AAGTCTAATC 14220 



WO 98/18931 



PCT/US97/19588 



398 

CTTTTAACGA ATGGG GC AAA AAGATGGGAT TGGTCCCAGA TCCTTGAGAC GTT 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GTGAAGTGCG GCAAAAGGTG CAAGTGATGA GCTCAGCTTC TTT AGCTCTT GACATTGCCC 60 

TTGGCTCAGG TGGTTATCCT AAGGGACGTA TCATCGAAAT CTATGGCCCA GAGTCATCTG 12 0 

GTAAGACAAC GGTTGCCCTT CATGCAGTTG CACAAGCGCA AAAAGAAGGT GGGATTGCTG 180 

CCTTTATCGA TGCGGAACAT GCCCTTGATC CAGCTTATGC TGCGGCCCTT GGTGTCAATA 24 0 

TTGACGAATT GCTCTTGTCT CAACCAGACT CAGGAGAGCA AGGTCTTGAG ATTGCGGGAA 300 

AATTGATTGA CTCACCTGCA GTTGATCTTG TCGTAGTCGA CTCAGTTGCT GCCCTTCTTC 3 60 

CTCGTGCGGA AATTGATGGA GATATCGGAG ATAGCCATGT TGGTTTGCAG GCTCGTATGA 4 20 

TGAGCCAGGC CATGCGTAAA CTTGGCGCCT CTATCAATAA AACCAAAACA ATTGCCATTT 4 80 

TTATCAACCA ATTGCGTGAA AAAGTTGGAG TGATGTTTGG AAATCCAGAA ACAACACCGG 540 

GCGGACGTGC TTTGAAATTC TATGCTTCAG TC CGCTTGG A TGTTCGTGGT AATACACAAA 600 

TTAAGGGAAC TGGTGACCAA AAAGAAACCA ATGTCCGTAA AGAAACTAAG ATTAAGGTTG 6 60 

TAAAAAATAA GGTAGCTCCA CCGTTTAAGG AAGCCGTAGT TGAAATTATG TACGGAGAAG 7 20 

CAATTTCTAA GACTGGTGAG CTTTTGAAGA TTGCAAGCCA TTTGGATATT ATCAAAAAAG 7 80 

CAGGCGCTTG CTATTCTTAC AAAG ATG AAA AAATTGCCCA ACCTTCTGAC AATGCTAAGA 84 3 

AATACTTGGC AGAGCACCCA GAAATCTTTG ATGAAATTGA TAAGCAAGTC CGTTCTAAAT 9 00 
TTGGCTTGAT TGATGGAGAA GAAGTTTCAG AACAAG AT AC TGAAAACAAA AAAGATGAGC 9 60 

CAAAGAAAGA AGAAGCAGTG AATGAAGAAG TTCCGCTTGA CTTAGGCGAT GAACTTGAAA 1020 

TCGAAATTGA AGAATAAGCT GTT AAAG C AG TGGAGAAATC CGCTACTTTT TCGATTTTTG 10 80 

ATTCAAGTTT TT AG ATT AT A TATAGTAGCT TGAAATAAGA TATGAACAAC TCTATTAGGA 114 0 

AAGTCAAATT AATTTCTAGA AATGTTTTAG CAGCTACAGC GTACTATTCC AAACTCAACC 12 00 

AACTATAATA GATCGAAACT AGAATAGTAC ATATCTACTT CTAAAACATT GTT AAAAAT C 12 60 

GATTTGACTT TCCTTATTTC ATTCCGCTAT ATATAGTTTG CTGTTTCTTG TCGCTCCTCT 13 20 

GGAAAGCTGA TATAATAGCT TTATGAATAA AAAACGAACA GTGGACCTGA TACATGGTCC L3 8C 
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GATTCTTCCC TCGCTCTTAA GCTTCACCTT TCCAATTTTG CTATCAAATA TTTTTCAACA 1440 

GCTCTATAAC ACTGC7GATC TCTTG ATTGT TGGACGATTT CTTGGTCAAG AATCCTTGGC 1500 

TGCAGTAGGA GCGACGACAG CGATTTTTGA CCTG ATTGT A GGTTTTACAC TTGGTGTTGG 15 60 

CAATGGCATG GGGATTGTCA TTGCTCGTTA TTATGGGGCT CGGAATTTCA CTAAAATCAA 16 20 

GGAAGCAGTA GCAGCCACCT GGATTTTAGG TGCTCTTTTG AGCATTCTAG TTATGTTGCT 1680 

GGGCTTTCTT GGCTTCTATC CTCTCTTGCA ATACTTAGAT ACTCCTGCAG AAATTCTTCC 174 0 

TCAATCTTAT CAATATATTT CTATG ATTGT GACCTGTGTA GGTGTCAGCT TTGCTTATAA 18 00 

TCTTTTTGCA CCCTTGTTGC GGTCTATTGG TGACAGTCTA GC AGCCCTCG GATTTCTGAT 19 60 

TTTCTCTGCC TTGGTTAATG TGGTTCTGGA TCTCTATTTT ATTACCCAAT TGCATCTGGG 1920 

AGTTCAATCC GCAGGACTTG CTACCATTAT TTCGCAAGGT TTATCAGCGG TTCTCTGCTT 1980 

TT ATT AT ATT CGTAAAAGTG TGCCAGAACT CTTGCCACAG TTTAAACATT TCAAATGGGA 2040 

CAAAAGCTTG TACGCGGATC TCTTGGAGCA AGCTTTGGCT ATGGGCTTGA TGAGTTCAAT 2100 

TGTATCTATC GGCAGTGTGA TTTTACAGTT TTCTGTT AAT ACATTTGGTG CAGTGATTAT 2160 

TAGTGCCCAG ACGGCAGCTC GACGCATTAT GACCTTTGCC CTTCTTCCTA TGACCGCTAT 2220 

TTCTGCATCA ATGACGACCT TTCCTTCTCA GAATCTAGGA GCTAAGCGAC CTGACCGTAT 22 80 

TGTTCAAGGT CTTCGAATCG GCAGTCGTTT AAGTATATCC TCGCCAGTTT TTGTTTGTAT 2 3 40 

TTTCCTCTTT TTTGCCAGTC CACCTTTGGT TTCCTTCTTG GCTAGTTCGA CAGATGGTTA 24 00 

CTTGATAGAA AATGGAACTC TCTATCTGCA AATCAGTTCA ACCT7TTATC CCATTTTGAG 24 60 

CCTCTTGTTG ATTTATCGCA ATTGCTTGCA GGGCTTGGGG CAAAAGATCC TTCCTCTAGT 25 20 

TTTT AGCTTT ATTGAACTAA TCGGAAAAAT CGTTTTTCTC GT"!"~7 "A ?T.\ 77C7TTCGGC 2 530 

AGGATATAAG GGTGTTATCC TTTGTGAACC TCTTATCTGG GTTGCCATGA CAGTTCAACT 264 0 

GTACTTCTCA TTATTCCGTC ATCCCTTGAT AAAAGAAGGC AAGGCAATCT TGGCAACCAA 2 700 

AG7GCAATCC TAGTTGGATT TACTGAATAA AATCCATTTC CTCTAGTGAA AATCGAAAAA 27 60 

ACTTGTGTTC TCTTCTTTAG TTTGGTGTTG AAAATAGTTT AACAGACTTT TGACTTCTTT 2820 

TATATGATAT AAT AAAGT AT AGTATTTATG AAAAGGACAT ATAGAGACTG TAAAAATATA 28 80 

CTTTTGAAAA TCTTTTTAGT CTGGGGTGTT ATTGT AG AT A GAATGCAGAC CTTGTCAGTC 2940 

CTATTTACAG TCTCAAAATA GTGCGTTTTG AAGTTCTATC TACAAGCCTA ATCGTGACTA 3000 

ACATTGTCTT CTTTGTAAGG T AG AAAT AAA GGAGTTTCTG GTTCTGGATT GTAAAAAATG 3 0 60 

AGTTGTTTTA ATTGATAAGG AGT AG AAT AT GGAAATTAAT GTGAGTAAAT TAAGAACAGA 312 0 
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TTTCCCTCAA 


GTCGGCGTGC 


AACCATATAG 


GCAAGTACAC 


GCACACTCAA 


CTGGGAATCC 


3iao 


GCATTCAACC 


GTACAGAATG 


AAGCGGATTA 


TCACTGGCGG 


AAAGACCCAC 


AATTAGGTTT 


3240 


TTTCTCGCAC 


ATTGTTGGGA 


ACGGTTGCAT 


CATGCAGCTA 


GGACCTGTTG 


ATAATGGTGC 


3300 


CTCGGACGTT 


GGGGGCGGTT 


GGAATGCTGA 


GACCTATGCA 


GCGGTTGAAC 


TGATTGAAAG 


3360 


CCATTCAACC 


AAAGAAGAGT 


TCATGACGGA 


CTACCGCCTT 


TATATCGAAC 


TCTTACGCAA 


3420 


TCT AG C AG AT 


GAAGCAGGTT 


TGCCGAAAAC 


GCTTGATACA 


GGGAGTTTAG 


CTGGAATTAA 


3480 


AACGCACGAG 


TATTGCACGA 


ATAACCAACC 


AAACAACCAC 


TCAGACCACG 


TTGACCCTTA 


3540 


TCCATATCTT 


GCTAAATGGG 


GCATTAGCCG 


TGAGCAGTTT 


AAG CATC ATA 


TTGAGAACGG 


3600 


CTTGACGATT 


GAAACAGGCT 


GGCAGAAGAA 


TGACACTCGC 


TACTGGTACG 


TACATTCAGA 


3660 


CGGCTCTTAT 


CCAAAAGACA 


AGTTTGAGAA 


AATCAATGGC 


ACTTGGTACT 


ACTTTGACAG 


3720 


TTCAGGCTAT 


ATGCTTGCAG 


ACCGCTGGAG 


GAAGCACACA 


CACGCCAACT 


GGTACTGGTT 


3780 


CGACAACTCA 


G G C G AAATGG 


CTACAGGCTG 


GAAGAAAATC 


GCTGATAAGT 


GGTACTATTT 


3840 


CAACGAAGAA 


GGTCCCATGA 


AGACAGGCTG 


GGTCAAGTAC 


AAGG AC ACTT 


GGTACTACTT 


3900 


AG AC G CT AAA 


GAAGGCGCCA 


TCGTATCAAA 


TGCCTTTATC 


CAGTCAGCGG 


ACGGAACAGG 


3960 


CTGGTACTAC 


CTCAAACCAG 


A CGG AAC ACT 


GGCAGACAAG 


CCAGAATTCA 


CAGTAGAGCC 


4020 


AGATGGCTTG 


ATTACAGTAA 


AATAATAATG 


GAATGTCTTT 


CAAATCAGAA 


CAGCGCATAT 


4080 


TATTAGGTCT 


TGAAAAAGCT 


TAATAGTATG 


CGTTTTCTTG 


T GG AG AT ATT 


TCCTTCAATT 


4140 


TTGCTACTAT 


ATTAAACAAA 


AATCAAAAAG 


CAAACTAGAA 


AGTTATCCTC 


AAATAAAATC 


4200 


TAAATTTGAC 


AATGTAAACC 


GAGTCGGATA 


GCTTTAAGTA 


CTGTTTTGAG 


GTTGAAGATA 


4260 


CGATTTTTGA 


TAGGAACTCA 


TCAATTTTAG 


ATTTTTAAGC 


AGCATCAATA 


AATTGCTTCC 


4320 


^TGTT^^TG^C 


ATAATTTTTT 


TATTTAAAAA 


ATTATGACma 


GAGTGTGCTA 


TTCTTTTTAT 


4380 


GAGAGGTGTA 


TGAATATGAT 


AAATGTATGT 


GATAAATGTA 


TGTGATGTTG 


GAAAAAGAAT 


4440 


AAAAGAACTT 


AGAATATCTT 


CAAATCTTAC 


TCAAGATAAG 


ATTGCTG AGT 


ATTTCTCTVT 


4500 


GAATCAAAGC 


ATGATTGCCA 


AAATGGAAAA 


AGGTG AAAGG 


AATATCACGA 


ATGGATTTAA 


4560 


GTAATAAAGC 


TTC AAAT CTT 


AGAAAAAAGT 


TGGGAGCTGA 


TGGTGAATCG 


CCGATAGATA 


4620 


TTTTTAAATT 


GGTACAAAAG 


ATAGAAAATT 


TGACGCTGGT 


ATTTTATGGA 


CTCGGAAAGA 


4680 


AT ATT AGCGG 


AGTCTG TT AT 


AAAGGAACTC 


AGTTCAGTCT 


CATTGCAGTC 


AATTCAGACA 


4740 


TGCCATTAGG 


AAGGTAAAGA 


TTTTCTTTAG 


CACATGGACT 


GTATCATCTT 


TATTATGATG 


4300 


AGGTGAAGAA 


CACTTCACTC 


AGTCTTATCT 


TGATTGGTGA 


AGGAGATGAA 


ACTGAAAGAA 


4860 


AAGCGGATCA 


GTTTGCTTCT 


TATTTTTTAA 


TTTTCCCATC 


TTCACTGTAT 


AGGATGGTTG 


492C 
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AGGAAATCAG AGAAAATGCC AATAGAACTC ATCTTGAAGT AG AAG AT ATT ATAAAATTGC 4 9 80 

GTCAGTTTTA TGGTATCAGT CATAAAGCTA TGTTATATAG ATTGAGGAAT GATGGATACC 504 0 

TTG ATGC AG A AGAAATTAAA AATATGGATA TTAGTGTTAT AGAGACAGCT TCAAGATTAG 5100 

GCTATGATAC AAGTTTATAT CGTCCTTTGT CAGAAAGTAA AAAAGAAATG GCATTAGGAT 5160 

AATATATTAA TTCAACTGAA CAACTTTTAG AAAATAACAG AATTTCGCAA GGCAAGTATG 5220 

AGGAACTGTT ACTACATCCT TTCAGATATG AT ATTG T AT A TGGGCTAGAT GAAGAGGGGG 52 30 

GAGTTGTCGT TTGACTAGTC GTGTATTTAT TGATGCAGAT TGTATTTCAG TATTTTTATG 534 0 

GGTTGGCACT GAACATCTTT TAGAAAAGCT CTATTTGGGT AAAATTGTTA TTCCACAAGA 54 00 

GGTGTATGAT GAAATCAATA TACCTACAAT TCCCCATTTA AAATCTAGGA TAGATCAGTT 54 60 

GGTAGCTAAG GGTTCAGCTG AGATTGTGAG CATAGACATT GGAACTGAAG AATACGCATT 5 520 

ATATAGAGAT TTAACAAGAA AT CATG AT AG TAACAAGATT ATTCGTAAGG GAGAAGGGGC 5 580 

ATCTATTTCC TTAGCGAAAA AGCATAATGG GATATTAGGA AGTAATAACC TAAGAGATGT 5 64 0 

T AAATCAT AT GTAGAAGAAT TTTCTTTAGA ATATATGACA ACAGGAGATA TACTGATTGA 57 00 

AGCGTTTAAA GCGTAATTTA TTACTGAATA AGAGGGCAAT CATATCTGGA ATAATATGCT 5~? 60 

TAAAAAGAGA AGGAAAATTG GTGCAAATTC ATTTTCAGAC TATCTTCGTG GAAGTATTCA 5820 

TCAAAATAGA CAAAAATAAA TTTGGATAAA TCGAACTCAC TATTCAGGAG GCATATGAGC 5 880 

AATTCGAAAA AGAAAAGTGT CAAATTGAGC CTATAGGAGT AG AAG TG AAA TAGTAAGTCC 5 94 0 

TGCATAGTCG ATG AG AG AAA AGTTCTCCTT GAAGTTTTCC TGAACTATCA GTCGCATGTC 60C0 

AAACGATATG TAGGGTAATG TGAGAGGGGA TAGCGAGTAG TTTTTCGTTA TTTTATCAAA 6060 

AAAC7TATAT TTTATTATAC CCAATCATAA AATA7AA7AA AAATGATA'JA ATAACGAAAA 6\2Z 

AACATGAATG TC AAAAAG AT AATGTCAATT TTTCAATCCT TTTATGTTGA TGTC AG T ATT 6180 

GAGGAACTGA CTTTGACTTT ACCAATCAGT TTTGTAAAAA GGTTTGAGTA TACTCAAATG 6240 

ACTTTTCATA AGGAATCATT TTT ATTG ATT AAAGAAAAGA GAAGGGGGAG TTTGAGTTCA 6 3 00 

TTTGTTACTC AGGCTCGCAC TATGGGTGAA AAAGCCAATA TGGATGTTGT TTTGGTGTTT 6 3 60 

TCGAAGTTAT CAGACAGTGA AAAAAAGCAA T T ACTTC AAG CTAGAGTTCC GTTTGTAGAC 64 20 

TTTAAGGGAA ACCTCTTCTT CCCTCCATTG GGACTAGTAC TCAATGCGAA TGATACTGAA 64 80 

GTCCCTAAGG AATTAACACC TAGCGAACAA TTAACGTCCA TTGCCTTTTT ATTGACAAAA 654 0 

GGTCAAAAAG TAGTAGATGT TGATTTGCTT TCACAAGTCA CTGGACTTCC AAACTCAACA 66 00 

ATTTATACGT GTTTGAGGAC TTTTAAAGCT TTATATTCCT TAAACAAGCA AAATAAGCTT 66 60 
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T AC AC AT AT A 


CGGTGTCAAA 


GAAAGAATTA 


TTCTTAAAAT 


CCGTGTCATG 


TTTATTTAAT 


6720 


CCCATCAAAA 


AACGGATTTT 


ATTGCCAGAT 


GGCGATATAA 


AG C AG AT AAA 


AT CTGTTTCT 


6780 


AACCTTCTAT 


ATGGTGGTGC 


TTATGCTTTC 


TCGCATTCAA 


CTTTTTTAGC 


TGAAACGGAT 


6840 


GAAAATATTA 


GCTATGTCAT 


ATGGCAGAGA 


AAATTCAATC 


AGTTATCCTT 


GCCACTTTCT 


6900 


CAGCATGTTT 


TAAAATGAAA 


GATGCTAGAG 


ATATGGAAAT 


ATCGTCCTTT 


TGT ATCTG AG 


6960 


TTTTGGAATG 


ATTTTAAAAA 


TAATCATGAT 


AAACAATTTG 


TAGATCCGAT 


TTCTCTTTAT 


7020 


TTGACCTTAA 


AAGATGATGA 


TGACCCACGT 


AT AG AG G AAG 


AGAGTGAAGC 


ACTAGAAAAT 


7080 


ATGATATTAC 


AGTATCTGGG 


AGAAGATGAT 


GCCAGCTAAT 


ACGAAAGTTA 


TTTTTCAAGA 


7140 


AATGTTTGCG 


CATTTTCAGA 


ACTATTATGT 


TCTGATTGGG 


GGAACTGCTA 


CCTCTATCGT 


7200 


ATTGGATTCG 


CAAGGATTTA 


AAAGTCGCAC 


AACAAAAGAT 


TATGATATGG 


TCATCATTGA 


7260 


TGAAGTAAAA 


AATAAGGAAT 


TTTATACTAC 


CTTCAATCAT 


TTTTTAGAAT 


TGGGAGAGTA 


7320 


TCAAGGAAGT 


CAGAAAGATG 


AGAAAGCGCA 


GCmTTCGA 


TTTACAACAA 


CTAATCCTGA 


7380 


GTTTCCTTCT 


ATGATTGAAC 


TATTTACTAT 


CTTACCAGAA 


TATCCATTAA 


AG AAGG ACGG 


7440 


TCG AG AAATT 


CCCTTACATT 


TTGACCAAGA 


TGCTAGTTTA 


TCAGCCTTAT 


TATTGGATGA 


7500 


AG ATT ATT AT 


AATATATTGG 


TGCATGAAAA 


AGAAACCATT 


CAGCGGTATT 


CGGTATTGAG 


7560 


TAATTGTGGT 


TTATACTCTT 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCC 


ATCTACAACC 


7620 


TCAAAACAGT 


GTTTTGAGCA 


GCCTGCAGCT 


AGCTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


7680 


GAGTATTAAT 


TATTTTTAAG 


GCTAAAGCTT 


GGCTGGATAT 


GAGGGAGCGC 


TCTGCCACAG 


7740 


GTGCTCAAGG 


TTTAAGTAAG 


TCCATTAAAA 


ACCATTTGAA 


TG ACCTT AC C 


CGTTTGACAG 


7800 


CTTCCTTGCT 


AGGAGATGAA 


AAGTTATCGG 


CTATAACATC 


AAGTAGTGCG 


GTAAAAGCAG 


7860 


ACATGCACCG 


CTTTGTGATA 


GAATTAGAGC 


CTGTGAAGTC 


AACTATTCTT 


CAAAATAATG 


7920 


ACATTTCATT 


GGATCAAAAT 


GAAATTTTTG 


AAATTCTGAA 


AAATTTTCTC 


G AT GGT T AAA 


7980 


ATAATTGTAG 


CGAGATGGCT 


ATATTGAATT 


CGTCTATATC 


TGGAAACTAG 


AAAAAACTTC 


804G 


AATTTCAGGA 


GAAAATGAAG 


TCAATCTTCC 


C AC AATC AAA 


CGTATAGTAT 


CAAGGTTTTT 


8100 


CAAGACCTGA 


TATT ATGCGT 


TTTTTGCTTT 


TCAAAACTTT 


TTGCCCAGTC 


TTCGTTTTTA 


8160 


TCCTCTAGTC 


ACTTGATTTG 


TTTCAGGTGG 


TTTTTTAGTA 


TAGTAGAATG 


AAACGAGAAC 


8220 


AGGACAAATT 


GATCAGGACA 


GTCAAATCGA 


TTTCTAACAA 


TGTTTTAGAA 


GCAGAAGTGT 


8280 


ACT ATT CT AG 


TTTCAATCTA 


CTATAGTTAA 


ATCTGCGGTC 


AAGTCTACTG 


GTGAATCTAT 


8340 


GATTGTAATA 


CTCTTCCAAA 


ATCTCATCAA 


CCACGTCAGT 


CTTGCCTTGC 


AGTCTGTATC 


8400 


TTACTGACCA 


AGCTAGTGAT 


GGATTTAGAA 


, TAGGTGATTT 


GGACCGTCCT 


' ATTAGCTAGG 


8460 
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AAATGCTGCT 


CATAGTCCTT 


TGCTGAGGCT 


AGGGTGTTTC 


AACATTCAAC 


ACTCAACTGG 


3520 


TTGATCTAGT 


TGATAGGAAG 


GGAGTTACTA 


TAAAATACTC 


AGGCTTCCAT 


CATATTTTTT 


3580 


GAAACGATTG 


TGTAATCAAA 


ATGTACCAAT 


ATTGTACTAT 


TGGTACAGAA 


GATGTTGTGA 


3640 


ATGGATAAAT 


ATATCATAAC 


TGCTATCTCA 


AAAAGATTTC 


ATATGTCTGT 


GC AT AT AT AA 


8700 


TAGACTTCCT 


GCAAAACTAG 


AATCCTAGTT 


CATGATTGAT 


AATACCAGCA 


ATCAAATTCA 


8760 


TTCGTAATCC 


AAAGCG TTT A 


CGATGATTTC 


GATAGGTTGT 


TGAAAACATT 


TTAAACGTTT 


3820 


CTACTTTGCC 


AAAGATGTTC 


TCAACCTTGC 


TTCTCTCCTT 


AGATAGCGCA 


TGGTTATAGG 


883C 


CTTTATCTTC 


AGCTGTTAGC 


GGCTTGAGTT 


TGCTGGATTT 


ACGTGGAGTT 


TGTGCTTGAG 


8940 


GACATATCTT 


CATGAGCCCT 


TGATAACCAC 


TGTCAGCCAA 


GATTTTACCA 


GCTTGTCCGA 


9000 


TATTTCTGCA 


ACT C ATTTTG 


AACAACTTCA 


T AT C ATG AC A 


ATAGTTCACA 


GTGATATCCA 


9060 


AAGAAACAAT 


TCTCCCTTGA 


CTTGTGACAA 


TCGCTTGAGC 


CTTCATAGCG 


TGAAATTTCT 


9 120 


TTTTACCAGA 


ATCATTCCCT 


AATTCTTTTT 


TTAGGGCGAT 


TGATTTTTAC 


TTCCCTCGCA 


9180 


TCAATCATTA 


CCGTGTCCTC 


AGAACTAAGA 


GGAGTTCTTG 


AAATCGTAAC 


ACCACTTTGA 


9 24 0 


ACAACACTTA 


CTTCAACCCA 


TTGG CTCCG A 


CCCATTAACT 


TGCTTTCGTG 


AATACCAAAA 


9 300 


TCAGCCGCAA 


TTTCTTCATA 


AGTGCGGTAT 


TCTAGGCTTA 


ATTTAGGTTT 


TCGTCCACCT 


9 3 60 


TTTGCGTGTT 


T AAGTTG AT A 


AGCTGTTTTT 


AATACACCTA 


ACATCTCTTT 


AAAAGTCGTG 


94 20 


CCCTGAACAC 


CAACAAGACG 


CTTAAATCGT 


GTATCAGTTA 


ATTGTTT ACT 


TGCTTCATAA 


9480 


TTTCGCAGGG 


AGTCTATTGA 


CTCTTTGGTA 


GGTGTCAATG 


TTTTTTTCAT 


CTATCCCGAG 


9540 


AATTATTTTC 


CCGCCATTTG 


TATTTGCAAA 


TGCTGAGTAG 


GTTTCCCAGA 


AAGACTCTGG 


9600 


AACATTCTTT 


TTAGCTTTTT 


TCTATTCTAA 


ATCAACCCCT 


TCAAATTTTA 


ACT CC AT ATT 


9660 


TTTCCTTTAC 


ATCTGTTTTT 


TGTGGTTCTG 


GTATTTGTTC 


AAGTTGAGTG 


ATAATATAGC 


9720 


GAATTGAATT 


TCGAGAGTTT 


TTACTCAGTT 


AATTTCTTTT 


TTAACCCACT 


TTAATTCCTT 


9780 


TTTTAACACG 


GGTTAAAAAA 


GAAATTAAAG 


TGGGTTAATT 


TTTCTTGA 




9823 


(2) INFORMATION FOR SEQ ID NO: 4 2 











(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 33 69 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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CCGCGAAAGA 7ATTTTTGAA CAAGAGTTTG GACGTGAGGT CeW.'TTAT AATAAAGTAG 60 

AAGTTGACGA GTTTTTAGAC GATGTCATCA AGGACTATGA AACCTATGCT GCCTTGGTCA 12 0 

AGTCACTTCG TCAGGAAATT GCGGATTTGA AGGAAGAATT AACTCGTAAA CCGAAACCTT 180 

CACCAGTTCA AGCAGAACCC CTTGAAGCGG CAATTACAAG TTCTATGACG AATTTTGATA 2 40 

TTTTGAAACG CCTGAATAGA TTGGAAAAAG AAGTT TTTGG TAAACAAATT TTAGATAACT 3 00 

CAGATTTTTA AGTAGTTATT TGAGATGTGC AATTTTTGGA TAATCGCGTG AGGAGAATTG 3 60 

TTTCTCATGA GGAAAGTCCA TGCTAGCACA GGCTGTGATG CCTGTAGTGT TTCTGCTAGG 4 20 

CGAAACCATA AGCCTAGGGA CGAGAAATCG TTACGGCAG? TGAAATGGCT AAGTCCTTCG 4 30 

ATAGGCCAGA GTAGGCTTGA AAGTGCCACA GTGACGGAGT CTTTCTGGAA ACAGAGAGAG 5 40 

TGGAACGCGG TAAACCCCTC AAGCTAGCAA CCCAAATTTT GGTCGGGGCA TGGAGTACGC 600 

GGAAACGAAC GTAGTATTCT GACTGCTATC AGCTACAGCT GTTAGTGGTA GACAGATGAT 6 60 

TATCGAAGGA AGTCGTCCTA GTCACTTCTG GAACAAAACA TGGCTTATAG AAAATTCCAT 7 20 

ATAGGTTGGG GCTCACAAAT TTTCTCAACC TCATTTTTTA AAGTGG AC AT ATAGAAAGGT 7 80 

CTTGCAAGAC TGTAACATGA AAAAAGAATT TAATTTAATT GCAACTCTCC CAGCAGGGCT 840 

TGAGGCTGTC GTTGGTCGTG AAGTCCGAGA GTTGGGCTAC GATTGTCAGG TTGAAAATGG 900 

ACGTGTTCGT TTTCAAGGAG ACGTGAGAGC TATTATCGAA ACCAACCTTT GGCTTCGGGC 9 60 

AGCAGATCGT ATCAAAATTA TCGTAGGAAC GTTCCCAGCT AAGACTTTTG AAGAGCTATT 1020 

TCAGGGAGTT TTCGCTTTGG ATTGGGAAAA TTATTTACCA CTTGGAGCTC GGTTCCCGAT 1080 

TTCAAAAGCT AAATGTGTTA AGTCCAAACT TCACAATGAG CCCACTGTTC ACCCTATTTC 114 0 

TAAGAAAGCT GTTCTCAAGA AATTCCAGAA ACACTATGCT CGCCCAGAAG GGGTTCCTCT 1200 

GATGGAGAAT GGCCCAGAGT TTAAGATTGA CCTCTCTAT? CTCAAAGATG TGGCAACTGT 12 60 

CATGATTGAT ACGACCGGGT CTAGCCTCTT TAAACGTGGT TATCGTACCG AAAAAGGTGG 1320 

CGCTCCTATC AAGGAAAATA TGGCAGCAGC CATTTTACAA CTTTCTAACT GGTATCCACA 1380 

CAACCCTTTG ATTGATCCGA CCTGTGGTTC GGGGACTTTC TGTATTGAGG C AG TT AT GAT 1440 

TGCTAGAAAG ATGGCGCCAG GTCTTCGTCG CTCTTTTGCA TTTGAGGAAT GGAACTGGAT 1S00 

CAGCGATCGC TTGATTCAAG AAGTGC G C AC AGAAGCGGCT AAAAAAGTAG ACCGTGAGCT 1560 

TGAGCTGGAT ATCATGGGCT GTGATATTGA TGCTCGCATG GTGGAAATTG CTAAGGCCAA 1620 

TGCTCAGGTA GCTGGTGTTG CAGGAGACAT TACTTTTAAG CAGATGCGCG TGCAGGATTT 16 80 

ACGTTCCGAT AAAATCAATG GAGTAATCAT TTCCAATCCG CCTTATGGTG AACGTTTGTC 17 40 

AGATGATCCA GGGGTGACCA AGCTCTATGC TCAGATGGGG CAAGTATTTG C AC CG CTG AA 1800 
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AACTTGGAGC AAATTTATCC TGACTAGTGA TGAAGCTTTT GAAAGCAAGT ATGGTAGCCA 1860 

AG C AG AT AAG AAGCGTAAGT TATACAACGG AACCTTGAAA GTGGATCTAT ATCAATATTT 19 20 

TGGTCAGCGT GTCAAACGGC AAGAGGTAAA ATAGAAAGGG ATACTCATGA GTAAAAAAAG 1980 

ACGAAATCGT CATAAAAAAG AAGGTC AAG A ACCGCAATTT GATTTTGATG AAGCAAAAGA 2 040 

G CT AAC AGTT GGTCAAGCTA TTCGTAAAAA TGAAGAAGTG GAATCAGGAG TCTTGCCTGA 2100 

GGATTCCATT TTGGACAAGT ATGTTAAGCA ACACAGAGAT CAAATTGAGG CGGATAAGTT 2160 

TGCGACTCGT CAATACAAAA AAGAGGAGTT CGT7GAAACT CAGAGTCTGG ATG ATTT A AT 22 2 0 

TCAAGAGATG CGTGAGGCTG TAGAGAAGTC AGAAGCTTCT TCGGAGGAAG TTCCATCTTC 2 2 30 

TGAAGACATC TTACTACCCT TGCCTCTGGA CGATGAGGAG CAAGGCTTGG ATCCTCTATT 2 34 0 

GCTAGATGAT GAAAATCCAA CAGAAATGAC TGAAGAAGTG GAAGAGGAGC AAAACCTTTC 24 00 

TCGTCTGGAT CAAGAGGACT CAGAAAAGAA AAGTAAAAAA GGCTTTATTT TGACCGTTTT 24 60 

GGCGCTTGTA TCAGTAATTA TTTGTGTCAG TGCTTATTAT GTCTACCGTC AAGTCGCTCG 2 52 0 

TTCGACTAAG GAAATTGAAA CTTCTCAATC AACTACAGCC AATCAATCGG ATGTGGATGA 2 580 

TTTTAATACA CTTTATCACG CCTTTTACAC AGATAGCAAT AAAACGGCTT TGAAAAATAG 2 640 

CCAGTTTGAT AAACTGAGTC AACTCAAGAC TTTACTTGAT AAGCTGGAAC GTAGTCGTGA 2700 

ACATACGCTT GCCAAATCTA AATATGATAG TCTAGCAACC CAAATCAAGG CTATTCAAGA 2760 

TGTCAATGCT CAATTTGAGA AACCAGCTAT TGTGGATGGT GTGTTGGATA CCAATGCCAA 2 82 0 

AG C C AAATCG GATGCTAAAT TTACGCATAT TAAAACTCCA AATACGGAGC TTGATAAAGT 2 8 80 

GCTAGATAAG GCTATCAGTC TTGGTAAGAG CCACCAAACA AGTACTTCTA GCTCAAGTTC 2 940 

AAGTCAAACT AGCAGCTCAA GTTCAAGTCA AC C AAG T T C A AATACGACTA GTGAGCCAAA 3000 

ACCAAGTAGT TCAAATGAGA CTAGAAGTAG TCGCAGTGAA GTCAATATGG GTCTCTCGAG 3 060 

TGCAGGGGTT GCTGTTCAAA GAAGTGCCAG TCGTGTTGCC TATAATCAGT CTGCTATTGA 312 0 

TGATAGTAAT AACTCTGCCT GGGATTTTGC GGATGGTGTC TTGGAACAAA TTCTACCGAC 3180 

TTCACGTTCA CGTGG CT AT A TCACTGGAGA CCAATATATC CTTGAACGTG TCAATATCGT 32 4 0 

TAACGGCAAT GG TT ATT AC A ACCTCTACAA GCCAGATGGA ACCTATCTCT TTACCCTTAA 33 0 0 

CTGTAAGACA GGCTACTTTG TCGGAAATGG CGCTGGTCAT GCGGATGACT TAGATTACTA 3 3 60 

AGCAGTCGG 3 3 69 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9713 base pairs 
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(3) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
( D} TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 3: 
AAGTTTACAA TTTAAATGAA TTAACAATTT TCCCAACTAA AACCACTCCA GTTACCGCAA 60 

CGTTTGTACT GAATGTACTA AATCGCATTC CATCAACTTC ATCTGTTTCG TCAACTTGAA 120 

CAGATACTAA TTGAAGATTT AATACTTCTG CTGCCATAGC TAGCTCCTCC TATTTAAATT 180 

TTTGGGATTA AGTACTTTAT CCACCCTCAT AT ACTCTCT C CACCAGTAAA ATGCAAGCAA 240 

TGATACAAAA TAGATTTAAC TATTTTATAT AGCGAAAACT TACAAATTTT TAAGAAATAA 300 

TTTTTGCATT CTTAAAGATA AAATAGGAAC TTTTAGTAAT AAAT AT T AAA ATAAATAAAA 3 60 

TAATAGATAC TATAAAATTT GGAAGTATTA ACCCCAAAAG ATTCATATCA TCTATTAAAA 420 

TATCCTCTAA AGAGTAGTAT ATTAAAGCCA TAATTTTAAT GTTAAGTAAA AATGC AATT A 4 80 

ATGAAGTAAC AAATGTCAAA AATATAGCCT CACCAACTTT AATCTTAACC ATCTGGTAAT 540 

TAGAAGTTCC T AAAATTT C A AATTGCTGAA TCTCAATCCT TTCTTGATGC GATGACAAAA 600 

AT GC AATTG A AATAATATTT GCAAGTACTA TCAAAATTGG TCCTCCTACA TAGACAATAA 660 

ATGCT ACTTT TAGCTCTAAA TCACTGTCAT CTTGAAATTG AGATAGTATA TTCT G AG AAA 720 

TCATTTGAAA ACTAGAAATT AGTAATATAG CTCCTGTAAT TGCAGCACTG ATAGATTTTA 7 80 

TATAAGACTT ACAATATAGT AAATTCCACT TCGAAACAAT GAACATAAAA TTATTTCTAA 840 

ATATAATTAT AGAAAGTAGT TTGATAAAAC ATGACTGTAT AAAAGGAGAT AATTGATAAA 900 

TAATCACAAT ATCTAAGATT ACAATATTGA ATATTATCTG GGCCTTCGCT AAAATTGTGC 9 60 

TATCTTGGAA AATTTGTTGC AAAGAAAGCA ACCAGATAAC ACTAAAACCA GCCAATAGCA 1020 

GTATTCTTTT TACTATTGAA AGAACATGCC TTATTTTAGA ACTCTTCCTA TTTCTAATCT 1080 

TCTTGAACGT ATAAAAGCAA CCACTTAGAA AGGCTAAAAA TGAAATCAAC ACTACTGTAA L140 

TGATACATCC AAC AG C ACT C GTTTGAAATT GGATATCAGG TAATATATTT TCCCCGAAAA 1200 

AGTATTGTAA AAAATAATAA TAATTTGACG TAACAAATAT AGAGCATAGA TATGCAATAA 12 60 

AACTAATAAT CGAGGAAATG ATAAAAATCT GTCCCCCCAC AAG AAATG AT AGTTGAAGGC 1320 

GACTTGCTCC CAACACCTCC AGAAGTTCGT AATCATCTCT AAAAATTTCA AC C AAC AT AT 1380 

TTATTATGTT AGAGAGCACA AAGAATAATG TTACTCCTCC GAATACTATC GGAAACATAA 14 4 0 

AAATTGGTTT AGGATCTGGA AGTCCGACAA ATACTTGCGA ATTATTCTCA ACATTAATTA 1500 

CCCCATTAAC AGCCAATCCC ATAACTAAAC TCGAAACAAA AATTACTGGT GAAACGCCTA 1560 
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ACCATTGTTT 


CTTATTATGT 


AAAAATTGAT 


AGTAAACTAA 


TCTGAGCATC 


TCTATTCCTC 


1620 


CGTAGTTGAT 


TGTACCTCTA 


AGATTTTATA 


CAACTCTTCC 


CCGCTAGGTC 


TATGAAGTTC 


1680 


TTTGAAAATT 


TTTCCATCTT 


TCAATATTAA 


TGCACGATCA 


CTTTTCGAGC 


CCAATTCTAT 


1740 


ATCGTGCGTT 


ACCATAATTA 


CACACTTACC 


CGCCCCTACT 


AACTCTCTCA 


AT AATT C AAA 


1800 


AATTACTTCA 


CGAGAAACGC 


TGTCTAAAGC 


CCCAGTTGGC 


TCATCAGCAA 


ATATTATATC 


1860 


ACTATCAGCA 


ATAACCGCTC 


TAGCTATAGC 


AACCTTCTGT 


TGTTCTCCAC 


CAGACAGAGT 


1920 


TCCAACAAAA 


TCCTTTAAGC 


C AG C ATT AAA 


CTTCATTCTT 


TTGAGTAAGT 


TTTCTACATT 


1980 


TTTAATAGTT 


AATTTTTTTT 


GTGATAATCG 


CAAAGGAAGT 


GCTATATTTT 


CTATTACCGG 


2C40 


CAGGGAAGGT 


ATTAAATTGT 


ATGCTTGAAA 


TATAAAAGAT 


ACTTCGTTAC 


GTCTTATACT 


2100 


TGACAATTTT 


GCATTTCTGA 


TTTTATAGGG 


GTTGATTCCA 


TTTAAAATTA 


CTTCCCCACT 


2160 


TGTTGGTTCA 


AG C AAACT AG 


AAATACATTT 


TAATAAAGTT 


GACTTTCCAG 


AACCACTAAT 


2220 


TCCTAGAATA 


CTT AT AAATT 


CTCCTCTCGA 


AGCAGAAAGA 


GAAACATTTT 


TCAGCACTTG 


2280 


CAACGTTTTA 


TTATTTCCTA 


GTAAAAATTG 


ATG AT AC AGC 


CCTTTCACTT 


TTAATATATA 


2340 


ATCTTTATCC 


ATATTCTTGC 


CTCCAATCAC 


TTAATTTTCA 


AAAGTGTTCC 


ATTTTCCAAT 


2400 


TTATATATAT 


CAGTGTATCT 


CTTGTCATTT 


AAGTCATAAT 


GATGTGAAAC 


TTCAATAAAT 


2460 


GAAATACCTA 


AATTGAACAG 


AATATCATGT 


ATGGAATTTG 


AATTATCATT 


ATCTAAATTA 


2520 


GCTGATATTT 


CGTCAAATAA 


GTACACTTTA 


TTATTTCTAA 


TCAGACCTCT 


AGCTAAAGCT 


2580 


ATTTTTTGTT 


TTT G AC CT C C 


AGACAAATTA 


CTACCATTTT 


CACCACATTC 


ATAATTTAGT 


2640 


ATATCTATCT 


TTTCTAATTC 


TTCATATAGA 


TTTACCTTTT 


TTAACACCTC 


AATTATCTGA 


2700 


TCATCTGAAA 


AATATTCATT 


TTGAAATAAA 


GTTACCTTCT 


CACGAATAGT 


AGTGTCAAAA 


2760 


ATATATGGTG 


TCTGATCAAC 


TGTTGGTATT 


GAATCTGAAC 


TCTTTTTCCC 


ATGTGATAAC 


2320 


AAATT T AC AT 


AA C C TTTTn 






i I AAA i i 1 AA 


ft ft TrrTHTTT 


2 8 B 0 


TTCCCACTAC 


CAGAAGTTCC 


TGTTAATAAT 


ACCCTAAATG 


GTGACTTAAA 


TGAGAAGTCA 


2940 


ATACTTAATT 


T ATTT T CT GG 


TGTAATAGAA 


TATACAACAT 


CTTTCATGTG 


TATCTCATCT 


3000 


ATTGATGAAG 


TATACAGTCC 


GTTATTATCA 


TGTTCAGCGT 


CTATAAAATT 


CTTCTCTCCA 


3060 


CTTAAGTATT 


TTAAAAACGG 


TTTCCTTAAA 


TCTTTGGTTG 


TATTTATCTT 


ATTTAATGAA 


3120 


TAGGCAATTG 


ATTGTATCGG 


CCCTAAAACT 


TTATCGTTTG 


CTAAGAAAAT 


ACCTATCAGT 


3180 


TC ACT AAAAG 


AAAGGCTTTT 


ATG AT AAATT 


ACAAAATAAC 


ATCCTACAAC 


CAAGGGAACT 


3240 


AG AAAG C AAA 


AACCTGAAAT 


TAGTACTGCA 


ACCAATTTTG 


AAAGAACCTC 


TG AT CGTTT C 


3300 



WO 98/18931 



PCT/US97/19588 



408 



AAATTAAAAG 


TAGAATCTTC 


TACTTTATCC 


AACTTTTTAT 


CCGACAAACT 


AATTATTTCT 


3 3 60 


TTAGTAACAG 


AATAAGATTT 


TAATGTCTTA 


AAACCATTAA 


AAATTTCTTT 


TATTATGTGA 


3420 


GTATACTCTG 


CATTGCTGTT 


AGAGTACTCA 


TTAGCTGAAT 


T AG AC AAC AT 


CTTCTTCATA 


3480 


AAGACAGGTA 


CTATAATCGG 


CAATGCTGAT 


AATACAATAA 


ATATTATTGA 


nACTAGGAAG 


3 540 


TTTAAATAAA 


GCATAAAACT 


TAGAGAGACG 


ATGAACAACA 


AT AT TG AAG A 


AATTATTTCA 


3600 


AAAATTTGTC 


TAAAATAGTT 


TTCTTCGATT 


AATCTCAAAT 


CATTTGACAA 


AACTGAAATA 


3660 


ATAGATGAGT 


AATCTTTAAC 


CATTTCAGAA 


GAAAGATACT 


GTTCTCTAAA 


ATATCCTTGT 


3720 


TTAATTTTTA 


CATTTATATC 


TTTAGTTATT 


GATGCTTCCG 


TTACTTCTAA 


ATAGTAATTT 


3780 


GATATATAGA 


TTGCTGACCA 


ACCCAGAATA 


CTTATAGCAC 


CAAATCTTAG 


AACGTCAGAA 


3840 


AATGAGGAAG 


TCTGATTTAA 


ACTACCTGCA 


TATACAATAA 


TTCCTGAGAG 


CAAGACACCA 


3 900 


TTAAACGAAG 


ATAGAAATAT 


TAAAATCCCC 


ATTAATATAA 


GTTTAGTCTT 


TTTTATAAAT 


3 960 


TTTAAATAAT 


TCATAAGTTA 


TTCCTTCCCA 


CTTCTTCAAA 


GAAATAATTT 


AAAGTATCAA 


4020 


TCATTAAGAG 


AACATCTGAT 


GGAGTAAAAC 


CTCCATGACC 


AGCTGCTTTG 


TTTAAATACA 


4080 


ACAAACTTTT 


AACTCCAATA 


GAATTTAATT 


TCTTTGACCA 


CTCTATCACT 


TCGTTATTAT 


4 140 


TAATATATGG 


GTCTTTCTCA 


CCCAAAATAT 


TAACTATAAC 


AGTATTTGAG 


TCTCGTGCCT 


4200 


TTTCAATATT 


TTGCATAGCC 


GAATATGACT 


TTATATAAGC 


CTTTACTTCA 


GGGTCTCTAA 


4260 


TATCTCCCCA 


CTCTGCTATT 


TCGGTCTTAG 


AAAGAGGATC 


ATTTCGATTC 


TGAAGTGTAT 


43 20 


CATAAGGATT 


TATAAATGGC 


GAAAATAAGA 


GAATGCTTTG 


CAATAAATTT 


TTTTCCTCGT 


4 380 


TCAACACCGC 


ACCAGCAATT 


ATTCCACCTG 


CACTAGAAGT 


TATTAAACCT 


AATCGCTTAC 


4440 


TGTCAATTAC ATCATTTTCC 


CTTAAATAAT 


TTACTCCCTC 


AATAAAATCT 


CTGATAGAAT 


4500 


TCCATTTCTT 


TAACGCCTTT 


CCTGAGCGAT 


ACCATTCACC 


ACCCAAATAG 


CCTCCACCTC 


4 560 


TTACATGAAC 


TATAGCATAA 


ATAAAACCTG 


CATCTATTAT 


AGATAACATA 


ATTTCATCTA 


4 620 


AATCAGAATT 


AT C ATTCTT A 


CCATAAGCCC 


CAT AG AC ACT 


TAGAATACAT 


TTTTTTCTTC 


4680 


TTGGGAGCTC 


ATCCGTATCT 


TCACTTTTCC 


AAAATAAAGA 


AATCGGTATG 


CTTACATCAT 


4740 


AACTGTCTTT 


TTTAGTCCAA 


ATCACCTTAG 


AAAAATATTT 


AGTATTATTC 


GATTTTATGA 


4800 


TGGGTCTTTC 


AAATTCAGTT 


TTTAATGTAT 


TTTCTATTAA 


ATCAAAACTA 


AGTATTTTTT 


4860 


CGTAAAAAGT 


TCTCCTCTCT 


AAAAACAGAA 


GAACACGATC 


AGAAAATGAA 


TTTTCATAAA 


4920 


GTGTTGTCTT 


TTCATCAAAT 


GTT AT CTT AT 


TAACACTCAA 


CTCCCTCAAA 


CTATTATTTT 


4980 


TAAATGTAGC 


AAG AT AAAAG 


ACGGAATTCG 


CTGCCTTTGA 


ACAGTCTAAA 


AGGATATAAC 


5040 


GTCCTATACA 


GTGAACTCTT 


CTAGCCCTAT 


CTT GAT AT GG 


TATAGTAATA 


GAAACTCTGT 


5100 
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CTCCCGAAGA AGTTTCCCTT AGAATTAGTT GATCTTTCTT TTCTTCAGTT GAAGAGAGCC 5160 

CAAGAAAGTA CTGTGCTTTT TCTGTACTAA ATACACCCAT ATCTCTAGGT GTTGGGGCTA 5220 

CCGTTTCTGT GTAAGAGTGT CTAACAAAAC CCGTCCGGTC GAAACTGTAT AGAAAAATCC 5280 

TGCCTTTCTG AAAGT CT ACT GACTTTACAA AACAATTATT GCTATCAATG TGGACTATTT 534 0 

TTAATCGAAA AGAGCATTCG TTTTCTTCAA ACAGTTCCTC TTCTGTAAAG CTATCAAAAG 54 00 

ATTTATAGAA TAACTTACTT GGCCTCCCGT ACTCTTTGGA GCGAGTATAC ATAACACCGA 54 60 

ATTTACCCAA ATAGAACGAA CTTTCTACTG AAATATCTTC AATCATAAAT AACTCTTCCA 5520 

TAGTATATTT TTTTATTCCA AT T AAATT AG TCGTACGCAG TGAGGATACA ACCAAAACTA 5530 

TATAACTCTC ATCAGATGAA ATCCTAACAT CCTGTAAGAT ACTATCATCT GGC AAAGT AT 564 0 

ATTTTTCCAC ATCAAAGACA ATTTTAACTG AATTTGAATT GTCTAAACTG GAAGAACTAA 5 7 00 

CCTTAGGAAT CCAGTCATTA TCTTCGACAT ACCATTCCTT TATTACACCA GTATTGGGTA 57 60 

TACTCCAATT ATCAAATTGG TACCAATATC GCCCTCTCCT AAATATCAAA GAATTCCATT 5 820 

TTTTTAATTC CTGAAATGAT GAAGAGATAG ACCTCTTATA GTGTGTTTTT TCCTGTATTG 5880 

TATTTAAAAA TATTTCATTA CTCTGATTCA CAAGTATGAC CCCTTAATAA TGGTATCTAA 5 94 0 

AT ATT AT ATT TGAGGAAGAA TCGTCAATTT ATTATCCATT ATTGATACCA ATCCAATTGC 6000 

AACACCCGCA AATCCCGAAG CAATATCTGT TGTTATCTTT AAAC C ATT AT CTCCCGCAAT 6060 

AACAAATCCT TCTTCAATTA CACACAAATA TCT AT AAAGT TGTTCAATTA ATTTCTTTTG 6120 

TCCTGAAAAG TTATCATCGA TATCACTATA TATATTATTA GCAACTTCAA GACCACAAAA 6130 

TCCGTTAAAT AAACCTGGTA AT AC ACAAAA AACTACATCA GTTGCCCTCT CTAAAGAAGT 624 0 

TAAATATTTT AAGTATTTGC TTGACAAGAT TTCTTTATTT CTATTAATAA GTAAAAGCAG 6 3 00 

GCCAGCACTT CCAGTTGCTA CATATGGTAG TAATCTATGA CCTTGG CTGT ACTGCAATGA 63 60 

ATTATTACTA TCTACTTTAT AAGCAACTAA TTCTTTATCT ACAGCCAATT CTAGACCATT 64 20 

TT TAT AG ATA CTTTCACCAG TTAATTTATA AGCTTCACCG AAGAGCCAAG CTACCCCTGC 64 80 

GTGACCATAT AGTAATCCAC CAAAATTCTC ATAAGGATCG TTACTCTGAA CATCACTAGC 6 540 

GCCAACTTTA CAAAAAGTTT CTGGATTTTC TATATAATTT AAAGT AT ATT CTCTAAGCCT 6 600 

AATTAGTATT TCTTCTCCTA GTTTATTATC AATTCCCCCT TTACTAAGAA AATACAGTCC 6 660 

AACCAGTAAA ATTCCAGCCT GCCCACTATA TAAATTTTTA TTTTGTGAAT TCTC AAAT AT 67 20 

CTCTATAAAA TGAGTTGTAA AAAGTTCAAC TGCCCGATCT ATCTCCCCAA ATT CAT AAAT 67 30 

GAGGCAGATT GTACCAATTT TACCATCAAA AAGACCAGAA AGGGACGATT TCTTAAAATT 6 84 0 
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ATTTACTGCC 


TCATTAATAA 


CCTGTGTTCG 


AATCTCATAA 


TAGTCATCAA 


ACTTGAAATT 


69C0 


TTTTACTTTC 


TTAGCTAGTT 


GTTGATAACT 


CCAAAGGATA 


GCTAAATCTG 


AAAACGCAAT 


6960 


TCCTTGATTA 


AAATTCAGAC 


CATAATAATG 


AACTGGGAAG 


AATCTTGATT 


GAAATTCTTT 


7020 


ACGCCACTGT 


CCATAAGTTA 


GCGTAAACCC 


TCTCAATAAT 


TTTATAATAA 


AATCTTGTAT 


7080 


ATCTTGCTCA 


CTCTCGATAG 


TTCTAATCTC 


ATGCATGGGT 


TTTAAAACTT 


TTTTCCTGGA 


7140 


AATATTCTCA 


ATCTGTGGAC 


ATTTAGAATC 


TAGATATGAC 


AATAAACTTT 


CTACATAATC 


7200 


TATATCTTCT 


CTTGT AT AAC 


CCAAAGACTC 


AAATAGTTTT 


TTTCCTTCTA 


TCCTGGTTTG 


7260 


ACTTACATAG 


TTGTATCTCA 


AATCCGATGT 


AGTTACTAGT 


GGCATGTATA 


AATAATGAGC 


7320 


TATTTGTCTA 


ATACCATACC 


AATCTATCTC 


ACTGGGAAGT 


GTTTCTCGCC 


ATGCTCT AAA 


7380 


ACCAGGGGCT 


G C AACTTT AT 


GT AC AACTTT 


TTCATCATTT 


GAAAAGACAG 


CCTGTTCCCA 


7440 


GTCTATTATA 


CTAATCTCAT 


CTTCATCCTT 


AACCAAGATA 


TTTCCTAAAT 


GTAAATCTTG 


7500 


ATGATATACA 


4 TTTTCAGAAT 


GAAACTTATT 


CGTTAAATCG 


ATGAGTTTTT 


CT ACT AT CTT 


7560 


TGAAACTCTC 


AATAGATAAT 


CTTTGGTCTT 


ATCAACAACT 


TCATATAAAG 


GAAAATTATT 


7620 


GGTAACCCAT 


CTATTTAGTG 


GAACGCCCTT 


CATATGTTCA 


ATTCCTAAGA 


AGGTGTGCTC 


7680 


CC AG AT CTT A 


CCGTGCCAGT 


ATATTTTAGG 


CGTCTCACTC 


CATTCATTTA 


GAATTTTTAG 


7740 


TGCTTTGCAC 


TCCGAAGCTA 


ATTTCTCTGA 


AGAATAAGTA 


CCATCAAATC 


CTAGACCTGT 


7800 


ATACGGTCTA 


GCCTCTTTTA 


AAATTATTTT 


TTTCCCATCT 


TCTTTTAGCC 


TAGCATTATA 


7860 


TATCCCACCA 


CTGTTTGAAA 


ATCTAATTGC 


ATTATCTATA 


ATAAAGCGAA 


AGTCTCCCTG 


7920 


»j< m r^rip ^ rn £ 


TTCTTGTCAA 


GCCATTTATT 


CAAAAAGTCA 


GGGGGCACTA 


TACCTTTTGG 


7 9 80 


AATTTTAAAT 


ACTGGTAAAC 


GTTCATCTTT 


AACAACTTCA 


TCGCCAACAA 


TTAATTCATC 


3040 


AATAGCAACC 


TTCTTTTCAT 


CATCCCTTGA 


CGGCCTAAAC 


ACACCATACC 


TCAGATATAT 


8100 


TCCTGCTTCA 


TCCCAACGTT 


TATCGCTTAA 


AATATATGGC 


CC ATT AT ATT 


CCTTTAACGC 


8160 


ACTTTCTAAC 


CTTTGCAAAA 


CCGACTCTAA 


TTCATTTTGA 


TTTGGATAAC 


ATGTAATAAA 


8220 


TTTACCAGAA 


AATCCTCGAC 


TAACCAATTT 


CCCGTTTCGC 


ATGATAAATT 


TGTCTTCTGT 


8280 


ACT AAG AT GT 


TTAAATGGAA 


TTCGCATTTC 


ATGGCAAATT 


TTTGCTACAT 


CTTGT AACAA 


8340 


TTCATGTGAA 


CTGTTATACT 


CTGAACTAAT 


GTGTATTTTC 


CACCCTTGTC 


TTTCAACAAA 


8400 


TTTTCCAATA 


GGGTATTGAT 


AAACCCACTC 


ATCATTATTC 


ATTACTTCGT 


GCCAATTAAA 


8460 


AGGCAGACTT 


ACTTGGTACT 


TTATGCTAGT 


ATCTGTACTA 


TAATCATTAT 


TAGTGAAAAA 


8520 


GAAAGGATGC 


TCCAAATTGA 


AATTATAATC 


CAT AAC AAAA 


TCTCCAAGAA 


ATTTTATCAA 


8580 


ACTTAATATA 


TCTATAGCTA 


GACAGACTTA 


TTTAAATAAA 


AAGGGAGAAT 


CCTTTGG ATT ' 


8640 
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CTCCCCATAT AAGCACTAAC ATTCCAACGT GCACATATTC GAACGACATC CATAACTCCA 8700 

CAGAATCTCT AAAGTTTACA ATTTAAATGA ATTAACAATT TTCCCAACTA AAAGCACTCC 37 60 

AGTTACCGCA ACGATTTGTA CTGAATGTAC TAAATCGCAT TCCATCAACT TCATCTGTTT 9 82 0 

CCTCAACTTG AACAGATACT AATTGAAGAT TTAATACTTC TTCTGCCATA GCTAGCTCCT 8880 

C CT AT TT AAA TTTTTGGGAT TAAGTACTTT ATCCACCCTC ATTATACTCT CTCCACCAGT 894 0 

AAAATGCAAG CAATTATACA ATGTTGTCAC ATAGAAAATA ATGTTTCCGT AACTTTTCAA 9000 

AGTAACTTCC ATCTCTCTCC CAAAACTGGA ACTTACTTTT AGAAGTTACC T AAAAAT C AG 9060 

GTCACCTATT TTAAAAAAGC AGCAAACTAT AAACTAGTAG GTTCCACACC AAATGTAGTC 912 0 

CCATACTGCC CCATAAGTCA GATTTATAGC GCACCATACC TAAAAACATC CCAAGTGAAA 9180 

CATACAAACA CCAAGCTAGA ATGGTTCCTG TATGATGTGC TAAGGCAAAT AAAACACTTC 92 4 0 

TCAAAGCAAC TCTGATATCT AATTTTCTGA CCAAATTCCA T AAAATTT CT CG AT AC AG AA 9 3 00 

ATTCTTCAAC CATACTCGCA TTGATTAAGA ACAATAAAAA TGAAAACCAA CGAATTTGAT 9 3 60 

GTTGAAGGCC AATTAAGTTT GCTTGATTCG TGCTTCCTTG AGCATGAATC AGACTAAAAC 94 2 0 

AT AG ACTT AT AATCAGTAGG CTAACAAATT CAACACCAAG CCATTTCATC CTAGATTTCA 9 4 80 

TATTGACCTT ATGCGCTTGT TTGCGTTGGC CATACATCCA TAAAAAAGAA ATGACTGACC 9 540 

AACCATAGAG AATCTGTAGT AT AGTTtnA CT CACCGATACA AAGAAATTTC AATAAGTATA 9 600 

GAG r T ACCAA TAsGACATTT ACTTGTTGGA AT AT AT AAA C TGGAATTATT CTTTTCATAG 9 6 60 

TTACCTCCGA AATAAATCTT CATAATCTAA ATCTAATACC TGCACAATCC TTT 9713 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8657 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

AAAGAAATTG TCAGAGAGTG GCTAGATGAA GTAGCAGAGC GGGCTAAGGA CTATCCAGAG 60 

TGGGTGGATG TTTTCGAGCG TTGCTACACC GATACCTTGG ACAATACGGT TGAAATCTTA 12 0 

GAAGATGGTT CAACTTTTGT CTTGACTGGG GATATTCCTC CCATGTGGCT TCGAGATTCG 130 

ACAGCCCAAC TCAGACCCTA CCTTCATGTA GCTAAAAGAG ATGCCCTCCT GCGTCAGACC 240 

ATTGCAGGTT TGGTCAAACG TCAGATGACC TTGGTACTCA AGGATCCCTA TGCTAACTCC 300 
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TTCAACATTG AGGAGAACTG GAAAGCGCAC CACGAGACTG ACCACACAGA CCTTAACGGC 360 

TGGATCTGGG AGCGCAAGTA TGAGGTGGAT TCGCTTTGCT ATCCTTTGCA GTTGGCTTAT 4 20 

CTCCTCTCGA AAGAGACTGG CGAGACTAGT CACTTTGATC AGATTTTTGT CGCAGCGACT 4 80 

AAGGAAATTC TCCATCTGTG GACGGTGGAA CAAGACCACA AGAACTCTCC TTATCGTTTT 54 0 

GTCCGAGATA CGGACCGTAA GGAAGACACC TTCCTAAATG ATGCCTTTGG ACCTGACTTT 600 

GCAGTGACAG G7ATGACTTG GTCAGCTTTT CGTCCGAGTG ATGACTGTTG CCAGTATAGT 660 

TACTTGATTC CGTCAAATAT GTT TGCTGT A C7AGTCTTGC GTTATGTGCA AGAAATCTTC 7 20 

GCAGCATTAA ACCTAGCTGA TAGCCAGAGT GTTATTGCTG ATGCCAACCG TCTTCAGGAT 780 

GAAATCCAAG AAGGAATCAA AAACTACGCT TACACCACCA ACACCAAGGG CG AAAAG ATT 84 0 

TACGCTTTTG AAGTGGATGG CCTAGGAAAT GCCAGCATCA TGGATGATCC AAATGTACCA 900 

AGTCTACTAG CTGCGCCCTA TCTGGGCTAC TCTTCGGTCG ATGATGAAGT GTATCAAGCT 9 60 

ACTCGTCGTA CCATTTTGAG CTCTGAAAAT CCATACTTCT AC C AAGG AG A ATACGCAAGC 1020 

GGTCTCGGCA GTTCTCATAC CTTCTATCGC TATATCTGGC CAATCCCCCT TTCTATCCAA 1080 

GGCTTGACAA CAAGAGATAA GGCAGAGAAA AAATTCTTGC TGGATCAGCT GGTTGCCTGC 114 0 

GATCGTGCTA CAGGTGTCAT GCACGAAAGC TTTCATGTAG ATGATCCCAC CCTCTACTCT 1200 

CGTGAATGGT TCTCCTCGCC TAACATGATG TTCTGTGAGT TGGTCTTGGA TTACTTGGAT 12 60 

ATTCGCTAAG GCGCTCGCTT TAGCTCAACC GATTCTTATC AGAATCACAA GTTTACATTT 13 20 

AAAACGTTAA AATTTAAATT TAGAATGAGG TTTTACTTCA TGGAAAATGT TGTTGTACAT 1330 

ATTATCTCAC ATAGTCACTG GGATCGTGAG TGGTACTTGC CTTTTGAAAG CCATCGTATG 14 4 0 

CACTTGCTCG AATTGTTTGA CAATCTCTTT GATCTCTTTG AAAATGACCC TGAGTTCAAG 1500 

AGTTTCCACT TGGATGGACA AACT ATTGTC C TTG AT G ACT ACTTACAAAT TCGCCCTGAA IS 60 

AATCGCGACA AGGTCCAACG CTACATTGAC GAGGGCAAAC TTAAAATTGG TCCCTTTTAC 16 20 

AT CTTGC AGG ATGACTACTT GATCTCCAGT GAAGCCAATG TCCGCAATAC CTTGATTGGV 1680 

CAACAAGAAG CTGCCAAATG GGGTAAATCA ACCCAGATTG GCTACTTTCC ACATACCTTT 174 0 

GGAAATATGG GACAAGCGCC TC AAATTCT T CAAAAATCAG GCATTCACGT GGCGGCCTTT 1800 

GGTCGTGGTG TGAAGCCGAT TG GAT TTG AC AACCAAGTCC TTGAAGATGA GCAGTTTACG I8 60 

TCTCAGTTTT CAGAAATGTA CTGGCAGGGT CTGCATCCTA GTCGTGTTTT AGGTATTCTC 1920 

TTTGCCAACT GGTACAGTAA CGGGAATGAA A7TCCACTTG ACAAAGATGA GGCCTTGACC 19 80 

TTCTGGAAAC AAAAATTGTC AGATGTGCGT GCCTACGCTT CGACCAACCA ATGGTTGATG 204 0 

ATGAACGGCT GTGACCACCA GCCTGTACAG AAAAATCTGA GCCAAGCCAT TCGTCTGGCA 2100 
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AATGAACTCT TCCCGCATGT AATCTTTGTT CATAGTTCTT TTGATGAATA TGTTCAAGCT 2160 

GTAGAAGGTG CGCTTCCTGA ACACTTATCA ACTGTTACAG GCGAGTTGAC CACTCAGGAA 222 C 

ACAGATGGCT GGTACACACT TGCCAACACT TCTTCATCCC GCATTTACCT AAAACAAGCC 2 280 

TTCCAAGAAA ATAGCAACCT CCTAGAGCAA GTGGTAGAAC CCTTGACTAT TATCACTGGT 2 340 

GGACACAACC AC AAGG AC C A GTTGACCTAT GCTTGGAAAA CACTTTTGCA GAATGCGCCA 2 4 CO 

CATGATAGTA TCTGTGGCTG TAGCGTGGAC GAAGTTCACC GCGAGATGGA AACGCGTTTT 2 4 60 

GCCAAGGTCA ACCAAGTAGG AAACTTTGTT AAAAGTAACT TGCTCAACGA GTGGAAGGGT 2 520 

AAAATTGCTA CGGATAAGGC TCAAAGTGAC TATCTCTTTA CTGTCATTAA CACAGGCTTG 2 530 

CATCATAACG TCGATACTGT CAGCACAGTG ATTGATGTGG CGACTTGTGA TTTCAAGGAA 2 64 0 

TTGCACCCAA C AG AAGGCT A CAAAAAGATG GCTGCTCTTA TCTTGCCAAG TTACCGTGTG 2 7 00 

GAGGACTTGG ATGGTCGTCC TGTAGAGGCT ACAATCGAAG ACCTCGGAGC T AAT TTTG AG 2 7 60 

TATAATTTAC CAAAAGACAA GTTCCGCCAA GCTCGTATTG CTCGTCAAGT GCGCGTGACC 2 92 0 

ATTCCAGTTC ACCTAGCGCC GCTTTCTTGC ACAACCTTCC AATTGCTGGA AGGAAAACAA 2 880 

GAACACCGTG AGGGT ATTT A CCAAAACGGA GTGATTGATA CACCATTCGT AACCGTGACT 2 9 4 0 

GTGGATGACA ACATCACAGT CTATGACAAG ACAACTCACG AAGCCTATGA AGACTTTATC 3000 

CGCTTTGAAG ACCGTGGGGA CATCGGAAAC GAGTATATCT ATTTCCAACC AAAAGGAACA 3060 

GAGCCAATCT TTGCAGAGCT TAAGGGCCAC GACGTCTTCG AAAAC A C AG C TTGCTATGCT 312 0 

AAAATCTTGC TCAAACATGA ATTGACCGTG CCTGTCAGTG CGGATGAAAA GCTAGAAGAA 3L80 

GAGCAACAAG GTATCATCGA GTTTATGAAG CGTGAGGCTG G ACGGTC AG A AGAATTGACA 3 24 0 

AACATTCCTC TGGAAACTGA GTTGACTGTC TTCGTTGACA ATCCACA/L'.T CCGCT7C.^\G 3 3 00 

ACTCGCTTTA CTAACACTGC CAAGGATCAC CGTATCCGTC TCTTGGTCAA GACTCATAAC 3 3 60 

ACGCGTCCAA GCAATGATTC TGAAAGTATC TATGAGGTGG TGACACGACC AAACAAACCA 3 4 20 

GCTCCTTCAT GGGAAAACCC TGAAAATCCT CAACACCAAC AAGCTTTTGT CAGTCTGTAT 3 4 80 

GACGATGAAA AAGGGGTGAC TGTATCCAAC AAGGGATTGA ATGAATACGA AATCCTTGGG 3 54 0 

GATAACACCA TTGCCGTGAC CATTTTGCGT GCATCAGGTG AGCTAGGTGA CTGGGGCTAC 3 600 

TTCCCAACGC CAGAAGCACA ATGCTTGCGG GAGTTTGAAG TCGAGTTTGC ACTTGAATGC 3 6 60 

CACCAAGCCC AAGAACGCTT CTCAGCCTAT CGTCGTGCCA AAGCCTTGCA GACACCCTTT 37 2 0 

ACCAGCCTTC AGCTTCCTAG ACAGGAAGGA AGCGTGGTTG CGACTGGTAG CCTCTTGAGC 37 80 

CATTCTGTTC TCAGCA7ACC GCAAGTTTGT CCAACAGCCT TTAAGGTAGC TGAAAATGAA 3 84 0 



WO 98/18931 



PCT/US97/19588 



414 

GAAGGCTATG TGCTTCGTTA CTACAATATG TGTAGTGAAA ATGTACGTGT GCCAGAAAGT 3 900 

CAACATCTCT TCCTTGACCT ACTTGAACGA CCATACCCAG TTCATTCAGG ACTATTGGCT 3 9 60 

CCACAAGAGA TTCGTACAGA ATTCATCAAA AAAGAAGAAA TTTAATTTCA AAAAGTAAAC 4 020 

ATCAAAAGAA AGGAGGGGCG AAAAAGTAAG AACTAACTGC TGATTCGCCC CTTTTATGGT 4 080 

AAAAACAATG ACCATTGCAA CGATTGATAT CGGAGGGACT GGGATTAAGT TTGCCAGTCT 4140 

GACTCCTGAT GGGAAAATAC TGGATAAGAC AAGTATTTCA ACGCCTGAAA ACTTGGAGGA 4 2 00 

TTTACTAGCG TGGCTAGATC AACGCTTGTC AG AAC AGG AT TACAGTGGGA TTGCTATGAG 42 60 

CGTTCCAGGT GCAGTCAATC AAGAGACAGG TGTGATTGAT GGCTTCAGTC CGGTGCCCTA 4 3 20 

CATCCATGGC TTTTCTTGGT ATGAGGCGCT TAGCTCTTAT CAGCTACCTG TCCATTTAGA 43 80 

AAATGATGCC AACTGCGTTG GACTCAGTGA ACTACTAGCT CATCCAGAGC T TG AAAATGC 44 4 0 

AGCCTGTGTC GTGATTGGGA CAGGGATTGG CGGAGCCATG ATTATCAATG GT AG ACTTC A 4 500 

TCGAGGTCGC CACGGTCTGG GTGGAGAATT TGGCTACATG ACAACCCTTG CCCCTGCTGA 4560 

AAAACTTAAT AACTGGTCGC AACTAGCATC AACTGGGAAT ATGGTACGAT ACGTGATTGA 4620 

AAAAT CTGGT CATACTGATT GGGACGGTCG CAACATTTAC CAAGAGGCCG C AG CTGGT AA 4 6 80 

TATCCTTTGT CAAGAAGCCA TTGAGCG C AT GAACCGCAAT CTGGCGCAAG GCTTGCTCAA 4 7 40 

TATCCAGTAT CTGATCGATC C AGGTGTC AT CAGTCTGGGT GGCTCTATCA GTCAAAATCC 4800 

AGATTTTATC CAAGGTGTCA AGAAGGCTGT TGAAGACTTT GTCGATGCCT ACGAAGAATA 4860 

CACGGTCGCA CCAGTTATCC AGGCCTGCAC CTATCACGCA GATGCCAATC TCT AC GGTGC 4 920 

TCTTGTCAAC TGGCTACAGG AGGAAAAGCA ATGGTAAGAT TTACAGGACT TAGTCTCAAA 4 980 

CAAACGCAAG CTATTGAGGT TTT AAAAGGT CACATTTCTC TACCACATCT CGAACTGGCT 5 04 0 

GTCACTCAGT CTGACCAAGC ATCTATCTCT ATCGAGGGTG AGGAAGGTCA CT AT C AATTG 5100 

ACCTACCGCA AACCTCACCA ACTTTATCGT GCCTTGTCCT TGTTGGTAAC AGTTCT AG C A 5160 

GAAGCTGATA AAGTAGAGAT TGAGGAACAA GCAGCTTACG AAGATTTGGC TTACATGGTT 5220 

GACTGTTCTC GAAATGCGGT GCTGAATGTG GCTTCTGCCA AGCAGATGAT TGAGATATTG 52 80 

GCTCTCATGG GCTACTCAAC CTTTGAGCTT TACATGGAAG ACACTTACCA GATTGAAGGG 53 40 

CAGCCTTACT TTGGCTATTT CCGTGGAGCT T ATT C AG C AG AGGAGTTCCA GGAAATCGAA 54 00 

CCCTATGCCC AACAGTTTGA CGTGACCTTT GTACCATGCA TCCAGACCTT GGCCCACTTG 54 60 

TCGGCCTTTG TCAAATGGGG TGTCAAGGAA GTCCAGGAGC TCCGTGATGT AGAGGACATT 55 2 0 

CTTCTCATTG GCGAAGAAAA GGTTTATGAC TTGATTGATG GC AT GTTTGC CACGTTGTCT 5580 

AAACTGAAGA CTCGCAAGGT CAATATCGGG ATGGACGAAG CCCACTTGGT TGGTTTGGGA 5 64 0 
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CGCTACCTGA TTCTGAACGG TGTTGTGGAT CGTAGTCTCC TCATGTGCCA ACACTTCGAC 5700 

CGCGTGCTGG ATATTGCTGA CAAATATGGT TTCCACTGCC AGATGTGGAC TGATATCTTC 57 60 

TTCAAACTCA TGTCAGCGGA TGGCCAGTAC GACCGTGATG TGGAAATTCC AGAGGAAACT 582 0 

CGTGTCTACC TAGACCG7CT CAAAGACCGT GTGACTCTGG TTTACTGGGA TT ATT AT C AG 5880 

GATAGCGAGG AAAAATACAA CCGTAATTTC CGCAATCATC ACAAGATTAG CCATGACCTT 59 4 0 

GCATTTGCAG GGGGAGCTTG GAAGTGGATT GGCTTTACAC CTCACAACCA TTTTAGCCGT 6000 

CTAGTGGCTA TCGAGGCTAA TAAAGCCTGC CGTGCCAATC AGATTAAAGA AGTCATCGTA 6060 

ACGGCTTCGG GAGACAATGG TGGTGAAACT GCCCAGTTCT CTATCCTACC AAGCTTGCAA 612 0 

ATCTGGGCAG AACTCAGCTA TCGCAATGAC CTAGATGGTT TGTCTGCGCA CTTCAAGACC 6180 

AAT ACTGGTC TAACGGTTGA GGATTTTATG CAGATTGACC TTGCCAACCT CTTACCAGAC 62 4 0 

CTACCAGGCA ATCTCAGCGG TATCAATCCC AACCGCTATG TTTTTTATCA GGATATTCTT 63 00 

TGTCCGATTC TTGATCAACA CATGACACCT GAACAGGACA AACCGCACTT CCCTCAGGCT 6 3 60 

GCTGAGACGC TTGCTAACAT TAAAGAAAAA GCTGGAAACT ATGCCTATCT CTTTG AAACT 64 20 

CAGGCCCAGT TGAATGCTAT TTTAAGTAGC AAAGTAGATG TGGGACGACG CATTCGTCAG 64 80 

GCCTACCAAG CGGATGATAA ACAAAGTTTA CAACAAATCG C C AG AC AAG A ATTACCAGAA 6 54 0 

CTTAGAACCC AAATTGAAGA CTTCCATGCC CTCTTTACCC ACCAATGCCT GAAAGAAAAC 6600 

AAGGTCTTTG GTTTGGATAC AG TTG AC AT C CCTATGGGCC GACTCTTGCA ACGCATCAAA 6 660 

CG AG C AG AAA GCCGTATCGA GGTTTATCTG GCTGGTCAGC TTGACCGCAT CGACGAGCTG 67 2 0 

GAAGTTGAAA TCCTACCATT TACTGACTTC TACGCAGACA AGGATTTCGC AGCAACTACA 67 80 

GCCAACCACT CC CAT AC CAT TCCGACAGCG TCGACGATTT ATACGACTTA ATATTCTTCG 63 40 

AAAATCTCTT CAAACCACGT CAGCTTCCAT CTGCAACCTC AAAACAGTGT TTTGAGCAAC 69 00 

CTGCAGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAAAAC AAGAACACCT 6 9 60 

TGCTTGGCGC AGGCTGTTTC GCGTGAAACA GAAGAATTAT CTGGTTTCAA ATGCTACAGT 7 0 20 

TAGAC AAACT TATGATAAAA TAGCAGAAAG TGAATGTTTC CTAAGAGCAA TTCGACGTAT 7 0 80 

TATGCTACAC TTAAAATTAG TAAAACAAGA AATAGAAGCT GAAAAGCCAG C AT CTGT AG A 7140 

AGCTTGGATC ATTTCCGTCA AATTTAAAAA AGGTTGCTAC CGACATATAT AGATTCCAAA 72 00 

AACAAAAACG TTAGCGGAAC TAGCAGATGT GATTTTATGG ACTTTTGATT TTGCAAATGA 72 60 

TCATGCTCAC GCATTTTTCA TGGATAATGT TGAGTGGAGT CATGCAGATT CTTACTTTCG 7 3 20 

TAGCTTTGTT AGTGACGATG TTGAAGAACG TTACACAGAA AATGTCTATC TGGATAGCCT 7 3 30 
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AAGTGTCAAA CAAAAATTTA AGTTTATTTT CGACTTCGC7 GWPG^.TGGC GTTTTGAATG 74 40 

CCAAGTGCTG AGAGAAATCG AGACAGAGGA CGAAGAAGCT TATCTCGTAC GTTCGGTTGG 7 500 

AACGTCGCCA GAACAATATC CAGATTATGA TGGTTTTGAC TATGAAGAAT GGTAAAATTG 7 560 

AAATCAGTCT GTGTAGGCTT AGTATTTCAA TAGACTTCCT G CAAAACT AG AATCCTAGTT 7 620 

CATGATTGAT AATACCAGCA ATCAAATTCA TTCGTAATCC GAAGCGTTTA CGATGATTTC 76 80 

GATAGGTTGT TGAAAACATT TTAAACG7TT TTACTTTGGC AAAGATGTTC TCAACCTTGC 7 74 0 

TTCTCTCCTT AGATAGCGCA TGGTTATAGG CTTT AT CTTC AGCTGTTAGT GGCTTGAGTT 7 8 00 

TGCTGGATTT ACGTGAAGTT TGTGCTTGAG GACATATCTT CATGACCCCT TGATAACCAC 7 3 60 

TGTCAGCCAA GATTTTACCA GCTTGTCCGA TATTTCTGCA ACTCATTTTG AACAACTTCA 7 92 0 

TATCATGACA ATAGTTCACA GTGATATCCA AAGAAACAAT TCTCCCTTCA CTTGTGACAA 7 9 80 

TCGCTTGAGC CTTCATAGCG TGAAATTTCT TTTTACCAGA ATCATTCGCT AATTCTTTTT 8 04 0 

TT AG GGCG AT TCATTTTTAC TTCCGTCGCA TCAATCATTA CCGTGTCCTC AGAACTAAGA 8100 

GGACTTCTTG AAATCGTAAC ACCACTTTGA ACAAGAGTTA CTTCAACCCA TTGGCTCCGA 8160 

CGGATTAAGT TGCTTTCGTG AATACCAAAA TCAGCCGCAA TTTCTTCATA AGTGCGGTAT 8 2 20 

TCTAGGCTTA ATTTAGGTTT TCGTCCACCT TTTGCGTGTT TAAGTTGATA ACCTCTTTTT 82 80 

AATACAGCTA ACATCTCTTT AAAAGTCGTG CGCTGAACAC CAACAAGACG CTTAAATCGT 8 340 

GTATCAGTTA ATTG TTT ACT TG CTTC AT AA TTTCGCAGGG AGTCTATTGA CTCTTTGGTA 84 00 

GGTGTCAATG TTTTTTTCAT CTATCCCGAG AATTATTTTC CCGCCATTTG T ATTTG C AAA 84 60 

TGCTGAGTAG GTTTCCCAGA AAGACTCTGG AAGATTCTTT TTACCTTTTT TGTATTCTAA 3 5 20 

ATCAACCCCT TCAAATTTTA ACTCCATATT TTTCCTTTAC ATCTGTTTTT TCTGGTTCTG 3 5 80 

GTATTTCTTC AAGTTGAGTG AT AAT AT AGC GAATTGAATT TCGAGAGTTT TTACTCACTT 8640 

AATTTCTTTT TTAACCC 3657 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11384 base pairs 
O) TYPE: nucleic acid 
(Ci STRANDEDNESS : double 
{ D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TCTATTTTGG GTATAGACTT ACCTATAAAG AAAAATATCT ATACACTGCC TTACTAGCTA 60 

TACTGAACGA CTCAACAAAA ACGATATATA TTGATGATAT AAAT AC AGC A ACATTTTTTA 120 
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C AATG AT ATT 


CCTAATTCGT 


CTTTAAAAAA 


AATTGACTAT 


ATCGCACCTT 


180 




TTCATTTAGT 


ACGTACGTTC 


GACAACGTTC 


T AAAGT AATT 


CCTAAAATTT 


240 




AT^AAAATCA 


AGTTTTTTAT 


TAGAGAATAT 


AGATGTTTCT 


GGTTACACTG 


300 


rp x ft ft A (TWIPT 




TTAACAAAAC 


ATAGAACAAT 


CAAAATTAGT 


AAAAACTAAC 


360 


1 (ju 1 1 L»rV 1 \_ 1 




TACCTAACAA 


AACCACGCGC 


CTTGCCTGCT 


G ATGGAAAG A " 


420 






^CAAAGAAAA 


AATCAAAAAG 


AATGGCCAAA 


GAGTTTATTA 


480 


TGCTAGTGTT 




wflAC CAAC T 


AACGGGCAAA 


AAAGCCCGTA 


CAACTGTTAC 


540 


ALjI_AA*_1_AW *. 




T T AAAGT AAA 


AGCGCGTGAT 


GCGATCAATA 


CTTTTGCTGC 


600 


TAATVjCjL I A L 




AC AAGCCGAC 


AATTACAACA 


TAT AATG AG C 


TTGTAAAAGT 


660 


TTGGTCiGGA 1 


7v / *m"t in sra 

Avj 1 1 ALftftUrt 


AT APAGTT AA 


GCCAAATACT 


CGCCAATCCA 


TGGAGGGATT 


720 


/-•/-•rrtrri ft ^ » f^T*/ - * 


LAI 1 Ini 1 "jv 


CTGTATTTGG 


CGATTACAAG 


CTATCTAAAC 


TT ACT AC G C C 


780 


rp K ^^W^*p^l^ IV ft 

TATTCTTCAA 


/-^a/^/-»ft arTii 


ArAAATGGGC 


TGACAAGGCA 


AATAAAGGCG 


AAAAAGGGGC 


840 


ATTTGCTAAv- 


1 AL ILl 1 I vj<— 


TrfATAACAT 


GAATAAGCGT 


ATTTTGAAAT 


ATGGCGTAGC 


900 


I A 1 l_ v_ AOLi 1 A 


A 1 Ak_ Ai\ 1 t\\-t\ 


ACCCAGCTAA 


TGATGTCATC 


GTTCCACGCA 


AACAGCAAAA 


960 


ft **"• ft ft ft ft Mil 

AGAAAAGGCT 


Iji- 1 VJ 1 V_rtAA I 


APTTAGACAA 


CAAAGAATTA 


AAACAGTTTC 


TTGATTATTT 


1020 


AGATGCTCTG 


ft fT\f ft ft t^f"* ft ft 

(jAI LAA i. V.AA 


ATTATGAGAA 


CTTATTTGAT 


GTTGTTCTGT 


ATAAGACTTT 


1080 


Ai ibVjLLAL 1 


rrTTr.rrr.T a. 


TT ACT G AGG C 


TCTGGCTCTT 


GAATGGTCTG 


ATATTGACCT 


1140 


AvjAAAvjv_Ovj i 




TCAATAAGAC 


ACTAAACCGC 


TAT C AGG AAA 


T AAACTC AC C 


1200 


I AAA 1 LAAbL 




GTGATATACC 


AATAGACAAA 


GCCACATTAC 


TTTTACTGAA 


1260 


n <"> ft AT^Rpl ft ft 
AL.AA i flLrtnn 




AAATTC AGT C 


TTGGAAATTA 


GGCCGATCTG 


AAAC AGTTGT 


1320 


ATTCTCTGTA 




AATATGCTTA 


TGCTTGTAAC 


TTACGCAAAC 


GCCTAAATAA 


1380 


GCATTTTGAT 


GCTGCTGGAG 


TAACTAACGT 


ATCATTTCAT 


/"* /"**T» T* *T* f"* 

GGTTTCCtiLL 




14 4 0 


TACTATGATG 


CTCTATGCTC 


AGGTTAGCCC 


GAAAGATGTT 


CAGTATAGAT 


TAGGCCACTC 


1500 


TAATTTAATG 


ATCACTGAAA 


ATACTTACTG 


GCATACTAAC 


CAAGAGAATG 


CAAAAAAAGC 


1560 


CCTCTCAAAT 


TATGAAACAG 


C T AT CAAC AA 


TTTATAAAAA 


ATAAGGGTGA 


CCCATTTCCG 


1620 


GGCTACCCTC 


TTACTATACC 


AAAAATTAGT 


AGGGGTAGTA 


AAAAGGGTAT 


TAAATTATAA 


1680 


AAAGCACTAA 


GGGAAAGCGC 


CCCAAAGTGC 


TTATTTCAAA 


GGCTTTATAG 


CCTATAATCA 


1740 


CATAAAGAGA 


TTATTTTTTA 


AGG TTGT AG A 


ATGATTTCAA 


TCCACGATAT 


TCAGCTACTT 


1300 


CACCAAGTTG 


GTCTTCGATA 


CGAAGCAATT 


GGTTGTATTT 


AGCGATGCGG 


TCTGTACGTG 


L860 
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AAAGTGAACC AGTCTTGATT TGTCCTGCGT TAGTTCCAAC TGCAATATCA GCGATTGTTG 1920 

AATCTTCAGT TTCACCTGAA CGGTGTGATA CAACAGCAGT GTAACCAGCT TCTTTAGCCA 1980 

TTTCGATAGC TTCAAAAGTT TCAGTAAGAG TACCGATTTG GTTAACTTTG ATAAGGATTG 204 0 

AGTTAGCAGC ACCTTCTTGG ATACCACGTG CAAGGTAGTC AGTGTTTGTT ACGAAGAAGT 2100 

CGTCACCAAC AAGTTGTACT TTCTTACCAA GACGTTCAGT AAGAGCTTTC CAACCATCCC 2160 

AGTCGTTTTC AT CC AT AC C A TCTTCAATAG TGATGATTGG GTATTTGTTA ACCAATTCTT 222 0 

CAAGGTAGTC GATTTGTTCT GCAGATGTAC GAACAGCAGC ACCTTCACCT TCAAATTTAG 22 90 

TGTAGTCGTA AACTTTACGT TCTTTATCGT AGAATTCTGA TGAAGCACAG TCAAATCCGA 2 3 40 

TAAATACGTC TTTACCTGGT ACATATCCAG CAGCTTCAAT CGCAGCAAGG ATAGTTTCAA 2400 

CACCATCTTC AGTTCCTTCG AAACGAGGAG CGAATCCACC TTCGTCACCT ACGGCAGTTT 2 4 60 

CCAAACCACG TGATTTAAGG ATTTTCTTAA GAGCGTGGAA GATTTCAGCA CCGTAACGAA 2 5 20 

GGGCTTCTTT AAATGTTGGC GCACCAACTG GCAAGATCAT GAACTCTTGG AAAGCGATTG 2 5 80 

GAGCGTCAGA GTGAGAACCA CCCTTGATGA TGTTCATCAT TGGAGTTGGA AG AACTTT AG 2 640 

TGTTGAATCC ACCAAGATAG CTGT AAAGTG GGATTTCAAG GTAGTCAGCA GCAGCACGAG 2700 

CTACAGCGAT AGACACACCG AGG ATTGC AT TCGCACCCAA TTTACCTTTG TTAGGAGTAC 2 7 60 

CGTCAAGTGC GATCATAGCA CGGTCAATAG CTTGTTGATC ACGTACATCG TAGCCAATGA 2 82 0 

TAGCTTCAGC AATGATGTTG TTTACGTTGT CAACAGCTTT TTGTGTACCA AGACCACCCT 2 880 

AACGAGATTT GTCACCGTCG CGAAGTTCAA CTGCTTCGTG TTCACCAGTA GAAGCTCCTG 2 94 0 

ATGGAAC CAT ACCACGTCCG AAAGCACCTG ATTCAGTGTA AACTTCTACT TCAAGTGTTG 3 000 

GGTTACCGCG TGAGTCTAGG ACTTCGCGAG CGTAAACATC AGTAATAATT GACATTTTTT 3 060 

ACTCTCCTTA TGAGTTAAAT TTTTTACACC TCTATAATAC CTTAAAACCC CTCCTTTTTC 312 0 

AAGAAAAAAC GTTATCTTTG TGC AACTTT T CCTT AACTTT AT AAAGT AAT CGCTTTCTTT 3180 

TGTCTGTTTT ATTCTAACTT TTATGATATA CTGTTTTCAT G AC AG ATTT A TCAAAACAAT 3 2 40 

TACTTGAAAA AGCTCATGGT GGGTTAAAAA TAAATCCGGA TGAGCAAAGA CGCTATCTTG 3 3 00 

GTACTTTTGA GGAAAGAGTT CTTGGATATG TAGATATTGA CACAGCAAAT AGCCCTCAGT 3 3 60 

TAGAAAAAGG CTTTTTATTT A TT TT AG AAA ACCTTCAGGA AAAAGCAGAG CC ACT ATTT G 3 420 

TGAAGATTTC ACCAACTATC G AATTTG AT A AGCAAGTTTT CTACTTAAAA GAAGCAAAAG 3 480 

AAACTGATAG TCAAGCCACC ATAGTATCTG AAGAGCATAT TACTTCTCCT TTTGGCCTGG 3 54 0 

TTATTCATAG CAATGCACCA GTTCAAGTAG AAGAAAAAGA CCTTCGACTT CCTTTTCCAA 3 600 

AACTT TGGG A AGTTAAAAAG GAAGAACCAG CCAAAACATC CTTATGGAAG AAATGGTT T A 3 660 
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GCTAAATCTT GCACATATTT AATAAGTGCC CAATATTGGC AGCCGTGCGC TCCAGATAGA 372 0 

AACTGGCATT TTTC AAAC T A TCTTCTAAAG GTTCACTTTT CTCCAAAATA GAAAAGACAG 37 80 

CTTGGATATT TTCAAATGGT AGGGGAGGTA AATCTTCAGC AAGACTACCG CAAATAGCAA 3 84 0 

TAACAGGAAC TCCAACAGGG GTTCTTTTTG CAACACCTAT AGGCGCTTTC CCAGCAAAGC 3 900 

TTTGACTATC AAGTCTTCCT TCTCCAACAA CAACCAAGTC AGCATCTGAA ACTTTCTTAT 3 960 

CAAAGTTCAT TAAGTCCAAG CAGGTATCAA TTCCAGACAC GATACTTGCC TCAGCAAAGG 4C20 

CACACAAACC ACC AG CAAGG CCTCCACCTG CTCCTGCTCC TTTAATTTCT AATGTTGCAG 4 080 

GTGAGAATTT TTCATAAAAA TCTTGGATCG CCTCATCTAC GACTGCAAAC ATAGTCGGAT 4 140 

GTAGACCTTT TTGATTGCCA AAAGTGTAAG TCGCACCTTG ATGACCACAT AAGGGACTCA 4 200 

CGACATCTGC TAAAATATGA ATTTGAACAC CTTCAGGAAT TTTATAGCAA TTTTCTGTTG 4260 

AAACAGAAGC TAAGTTTAAT AAGCATTGAC CGGAAGCAGG CAAGACATTT CCATCCCTAT 4 320 

CATAAAATTG ATAACCTAAA C C AG C AG C AA TCCCCAGTCC TCCATCATTA CTGGCCGTGC 4 3 80 

CACCAACACC GATATAAATA TCTTTAATCC CTTTAGAGAT GAGATGAAGA ATCAACTCTC 4 440 

CAATACCACA AGTTTGGATT TGAAGTGGAT TTCGTTTCTC TAGCCGAATT TTTCCAAGAC 4 500 

CAACCAAGTC AGCTACTTCA AATAGTGCCA GTTCCCCTTT TT G AAAAT AG CGCATGGCTT 4 560 

CTTTTTGTCC AAAAGGGTCT GTCACTTGGA TCCATTTTTC TTTTAGGTCA AGAGAATGTC 4 62 0 

GGATAGCATC TACAGTACCT TCTCCCCCAT CACCAACAGG GCAGAGGAGA CATTCTACAT 4 680 

CTGCTATCCA TTGTTGGAAG CCTCTTTTTA TTGCTTCAGC TACCTGTTGA GCTGTCAAGC 4 74 0 

TTTCCTTAAA CCAATCCCCT GCAATTACAA TCTTCATATT TTCCCTCATT CTAAACAGTC 4 800 

AATCAAAGGG AGAACTTCTA AAAAATCCCT CTTGTCAACA TGATGTGGTA TTTCTTTTTT 4 860 

GAGCACTTCT TTGGCACAAA AGGCGATTCC TAACTTCGCC GACTTCAACA TTAATAGATT 4 920 

ATTAACCCCA TCACCGATTG CCACCGTTCT TTCTTTAGAA AGTTTTAGTT TCTTTCTCCA 4 980 

TTTTTCCAGA GTCTCTTTTT TGACCTGGGG A CTT AT AATT TGTCCAACTA ATTTTC CTG T 504 0 

TAAAAGACCT TCTTTGACTT CAAGCTAGTT GGCAGTGAAA TAGGCAATAC CAAGGGATTT 5100 

TGCTAATCTC TCCAACTATT GGTGTAAATC CACCAGACAC CAGACCAACT AGGATGCCAT 5 160 

TCTTTTGGAG AATAGAGATG AACTCTGGGA CATTTACCGA TAG AT G AAT T GAGTTGAAGA 5 220 

CGTTATCAAA G AC C AAAAT A GGAAGACCTT CCAACAAGGA CACTCTTTTT CTTAAACTGC 5280 

TTTCAAAGAC CAACTCTCCT CGCATTGCTC GACTTGTAAT CTGCGAAATT TCCGCCTCAT 534 0 

GACCTGCCTC TCTCCCTAAA AGATCAATCA CTTCTTCTAC GATTAAGCTT CCATCTACAT 5 4 00 
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CCAAAACACA CAAGCCTTTT ACTTGAGACA TCAGTTCTCC TCTCTAAACA GCCTAAAAAT 54 60 

CGTATGAAGT CAT C AT ACG A TTTTATCTAT TAATTAACTA AACTATGGTA CAAGTCAAGG 55 20 

TATGACTTGC AGGCTGTATC CCATGAGAAG TCACTCTCCA TAGCTTGTTT TTGTAGGTTT 5580 

CTCCAAATGT CTGGATGGTT TCTATACAAG TCCAATGCTG TTTGGAAAGT CCAATTTAAC 5640 

CAATAAGGAG ATAGATTGTC AAAGCTAAAG CCAGTACCGC TT CCTTCG AT TGGATTGAAA 5700 

GCGCGAACTG TATCTCGCAA GCCTCCAACT TCATGGACCA ATGGCAAGGT TCCATAACGC 5760 

ATAGCCATCA TTTGAGACAA GCCACACGGT TCAAAACGAC TTGGCATGAG GAAGAGGTCA 5820 

CAAGCAGCGT AGATTTCCTG AGCAAGTTTG ACATCAAAAG TGATATTTGT TGATAGCTTG 5 880 

TCTGGGTAAA TCTGAGCAAA CCATGAGAAA GCTCCTTCAA AGGCTGGATC GCCAGTTCCC 594 0 

AAAAGAACAA TCTGAACATC TTCTTGCAAG ATATGGTGAA GACTTTCGAC CACCACATCA 6000 

AAACCTTTTT GACGTGTCAA ACGAGAAACA ATTCCCACCA GTGGAACGTC TGCTCTAACA 6060 

GGCAAGCCAA CTCTTTCTTG CAATTTTGCC TTATTTTTGG CTTTCCCAGA CAAATCTTCC 6120 

TGATTGAAAT GATAGTCTAA AAGAGCATCC GTCTGAGGAT TATAAAGATC AGCATCAATC 6180 

CCATTCACGA TACCAGATAC TTTACCAGAC TCCATTTTAA GAATCTGATC CAAATTACAT 6240 

CCAAACTGAC TAGTCATAAT TTCATGAGCA TAGCTAGGTG AAACGGTTGA AACACGGTTC 6300 

GCATAGAGAA TACCTGCCTT CATCCAGTTC AG AC AG TTGT TCCATCGAAG GGTGCCATCA 6 3 60 

GCGTAACGTT CAAAGCCAAC TCCAAACAAA TCACCCAACA TTCCTTCTCA AAATTGTCCT 642 0 

TGGAATTCTA AATTATGAAT GGTTAAAACT GTTTCAATGT CCTCATAGGC TTGAATCCAA 64 80 

CGGTATTTTT CCTTCAACAA GAAAGGAATC ATAGCTGTAT GGTAGTCATG AACATGGACA 6 54 0 

AGATCAGGAA TAAAGTCAAT CCTTTCCATA GCCTCAATGG CACCCAGTTG GAAAAAGGCA 6600 

AAGCGTTCTC CGTCATCAAA ATCACCGTAA ACATGACCAC GGAAGAAATA ATATTGATTG 6 660 

TCAATAAAGT AGAAGGTTAC ACCATTTAAT ACTGTTTTCT TAATTCCACA ATACTGTCTG 6720 

CGCCAACCAA CGCTCACCTC AAAATGAAGC ACATCTTCAA TCTGATTTCC AAATTTAGCC 6 7 80 

TCTACCATAT CATAGTAGGG TAAAATCACT GCAACTTCGT GCCCAGCTTT TACCAGTGAT i".84 0 

TTTGGAAGAG CGCCAATGAC GTCTCCCAAA CCACCTGTTT TTGAAAAGGG TGCACCCTCT 6900 

GCTGCTACAA ATAAAATTTT CATGAATGAA TATCCTCTGT TACTTTAGCA CCTTTCTTAA 6 960 

CCACAACTGG ATCTTCTGCA GTTCCTCGAA TCACAACACC ATGCTCAACT TCAACCCCTT 7 020 

TGTCCAAGAT AG C AT ATT C G ACCTGAGCCC CTTCTCCAAT AACAACACGA GGGAAGAGCA 7 080 

GGCTATCTTT AACCAAGCTA TCCTTATGGA CATGAATATT ACGTGATACA ACAGAATTAG 714 0 

CTACTTGACC TTCAATAATA CTACCAGAGG CAAACTGAGA AGTGCTTACC TTAGATGTAT 7 2.C0 
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TAG C AT AGT A AGTTGG CTCT TCGTTTTTGA CCTTTGTATA AATCTTTTGG TTTGGTGAGA 7 2 60 

AAAGAGAATA GAATTTTTGT CATTCAAGCA TATCGATATT CGCTTGATAA TAAGATTTAA 7 3 20 

CAGAGTGAAT ATTGGCTAGA TAGCCCGTGT ACTCGTAGGC GAAAGCTCCC TCTTTTACAG 7 3 80 

CCAAATCCCG TAAAACATAG CCCAATTTC? CTGGATGTTC T TTTTT AGCT TCTTCTTCCA 74 40 

AGTGTT C AAT CAACCAAGGT GTATCAACGA CAAAGATATC TGTAGACATA TTGAACGTTT 7 500 

CAGCTGTTGA CTTGCTATCA AAGAGTTTAT GAGAAAGAAC ATGGTCTGTT TCATCTACAT 7 5 60 

CCAAGATTGC ATTTACTTCT GAAATATCTT TCTTAGCTAG T TTTTT AT AA ACTACAGTGA 7 6 20 

TAGGCTCTTT TGTTGTACTA TGTAGGTGGA AAACTTGGTT CAAATCAATG TTAATAAGAA 7 6 80 

CATCGCAGTT GAGGGCAACC GTTTGGTTTG AGCCAGAACG TTTCAAATAA GTAAGAAGCT 77 40 

GTTGGTAGTA TTCTTTTCCA ACTGTACTAC TTTCTACACG GGTATTGTAA ATTCCTACAT 7 800 

AGTAATGGCT AAGAAGGGTT GATAAGCCCC ACTCGCGTCC TGAACGAATA TGGTCAAATA 7 8 60 

CTGAGCTGAT ATTATCCTGC TGGAAAATAC CAAAGACACT ACCAACACCT GCATTAGCAA 79 2 0 

GCCTTGAAAG TGGGAAGTCA ATCAAACGAT ATTTCCCACC AAATGGCAAA CTTGCTACTG 7 9 80 

GACGGTGGTC CGTCAATGTC GACATATTGT GAAAACCAAC TGTATTTCCT AAAATGGCAG 80 4 0 

AATATTTATC AATCTTCATC TGTTGCTACC CCCACTACTT CATTATATCC TACAACTTGT 8100 

ACTTCATCTG TTCCATCAAT TTCGACACCG TCAGAAATAA TCGCACCTTC ACCAATAATG 8160 

GCACGTTTAA TCTTAGCTCC TTGACCAATG ATAGCTCCAC TCATGATAAC TGAATCAAGG 8 220 

ACTTCCGCTC CTTCCCGAAC TTGCGCGCCT GTTGAAAGCA TAGAATGTTT AACAGTTCCA 82 80 

TCAACGAAAC ATCCGTCTAC AACTAATGAG TCTTCCACAT GAGCATTTCC CCCGAGGAAG 8 340 

TTTGGTGGTG AAATCAAGTT TCTTGAGTAA ATCTTCCATT GACGGTTACG ACTATCCAAC 8 4 00 

GCATTTTCTG GAGAAATATA CTCCATGTTC GCTTCCCAAA GTGACTCAAT AGTACCAACA 84 60 

TCTTTCCAAT AACCACTAAA TTCGTAAGCA TAAACACTTT CACCTGACTC AAGGTAATTT 85 20 

GGAATGACAT TTTTACCAAA GTCTGACATG CCAACCTTGC TCTTTTCAGC AGCGACTAAC 8580 

ATATTACGAA GCCGTTGCCA AT C AAAAATG TAGATTCCCA TAGAAGCTTT TGTACATTTA 9 64 0 

GGTTGAGCTG GTTTTTCTTC AAATTCAACA ATACCATTGT TAGCATCTGT GTTCATGATA 87 00 

CCAAAACGGC TTCCTTCTTT AAGAGGGACG TCTAAAACTG CTACTGTCAA GCTGGCATTA 9760 

TTATCCTTAT G AG ACTG GAG C AT AT C AT C A TAGTCCATTT TGTAGATGTG ATCCCCAGAC 8 8 20 

AAAATCAAGA CATACTCAGG ATTG AC ACTG TCGATATAGT CGATATTTTG GTAAATAGCG 8880 

TGACTAGTCC CCTCAAACCA ACGATTTCCT TCACTTCCAG AATAAGGTTG AAG AAT AG AG 894 0 
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ACACCTGAAT 


TAATACCGTC 


TAGTCCCCAG 


CTTGAACCAT 


TCCCAATATG 


GTTGTTGAGA 


9000 


GCAAGTGGTT 


GATACTGTGT 


AACGACCCCA 


AC ATTGTG AA 


TCCCTGAGTT 


GGCACAGTTT 


9060 


GATAGGGCAA 


AGTCAATGAT 


ACGGTAGCGC 


CCACCAAATT 


GCACAGCTGG 


TTTTGCGATG 


9120 


CTTTGAGTGA 


GTTTACCGAG 


ACGAGTTCCT 


TGCCCACCAG 


CAAGAATCAA 


AGCTAACATT 


9130 


TCATTTTTCA 


TTTTCTACTC 


CTTTTTGGTT 


TTTATTTGTG 


ACGGTTTTAG 


TAGATTTCAA 


9240 


GCGACGTTTG 


ATTTTCCATA 


CACTTGCTCC 


CATAGCCGGT 


AGGGTAAAGG 


TTAAGGTCTG 


9300 


CTC AT AATCT 


TTCCATAGTC 


CTTCTTGCGT 


TTGAACAGTT 


TGATTATGTT 


CTTTCCAAAC 


9360 


GCCTCCCCAC 


TCTTCCAACT 


CAGTATTCCA 


TACTTCTTCG 


TAAATTCCTG 


CAACGGGTAG 


9420 


TCCGATTGTA 


AAATCTTTCC 


GCTCAACAGG 


TACCATATTA 


AAGATACAGA 


CTAACATTTC 


9480 


TCCCTTTTTA 


CCCTTACGAA 


TAAAGGAAAG 


AACACTCTGG 


TCTCGATTAT 


CCGCATCAAT 


9540 


GATTTCAATA 


CCATCATAGC 


TGGTATCAAT 


TTCCCACAGA 


CAGCGATGAT 


CTTTGTAAAA 


9600 


CTGGTTTAGC 


TGAGAAGCGA 


AATACTTCAT 


CTTAGCATTC 


ATTGGGTCTT 


CTAGGTTAGA 


9660 


CCATTCCAAC 


TGTTCTTCAG 


ATTTCCATTC 


TAGGAATTGA 


CCGTATTCGC 


TACCCATGAA 


9720 


GAGCAATTTC 


TTACCACGGT 


GACAAATTTG 


GTACGTATAG 


AGATTGCGCA 


AGCCTGCGAA 


9780 


TTGATTGTAA 


CCATCTCCCC 


ACATCTTATG 


CATCATACTC 


TTCTTGCCAT 


GAACCACTTC 


9840 


ATCGTGCGAG 


AATGGCAAGA 


G AT AATTCTC 


CTTGAAAACA 


TACATAAAGC 


TGAAAGTCAC 


9900 


CAGGTTAAAG 


T CAT ATTT AC 


GATAGATCGG 


ATCTTCTTCG 


TAGAAACGGA 


GGATATCATT 


9960 


CATCCAGCCC 


ATGTTCCATT 


TGT AGTC AAA 


TCCTAGACCA 


CCAATCTCTT 


TCATTCCCGT 


10020 


AATCTTGATC 


GCAGACGAAC 


TTTCTTCTGC 


AATCATCATC 


ACATCTGGAT 


ATTCTAACTT 


10080 


AATAACCTCA 


TTCAAGCGCT 


GAAGGAAATA 


ATAACCTTCA 


TAGTTGAGAT 


TTCCGCCATC 


10140 


TTTA7TAGGT 


GTCCATGGAG 


CATCATCATA 


GTCCAAATAG 


AGCATGTTGC 


TAACAGCATC 


10200 


CACACGAATA 


CCATCCAAAT 


G AT AG AC AT C 


AATCCAATGC 


TTAATGCAAG 


AAATTAAGAA 


10260 


GGACTGGACT 


TCATTTTTTC 


CAAGGTCAAA 


ATTAAGGGCA 


CCCCAACCAT 


GGTTATGAGC 


10320 


CTTATTATGC 


TCTTGGTATT 


CAAAAGTCGG 


TCTCCCATCA 


TAATAGGCTA 


AGGCATCATC 


10380 


GTTGATGGTA 


AAGTGACTGG 


TACCCAGTCC 


ACAATAACCC 


CAATATTATG 


GGTATGACAC 


10440 


TCCTCGACAA 


AATCTTGAAA 


CTCCTCTGGT 


CGGCCATAAG 


CATGCTCTAA 


AGCGAAGTAA 


10500 


CCCATAAGCT 


GATACCCCCA 


ACTCAAGCCC 


AAAGGATGGG 


ACATCAAGGG 


CATAAACTCA 


10560 


ATATGAGTAT 


AGTTCATTTC 


AACGAGATAA 


GGAATGAGTT 


CATCCTTGAG 


CTGGGCAAAA 


10620 


CTATAAGGAC 


TGCCATCAGA 


ATTTCTTTTC 


CATGATCCAG 


CGTGAACTTC 


ATAAATATTG 


10680 


ACAGGACGCT 


CTTCAAAGCC 


CCAACGTTTT 


. CTTCGTGCCA 


GCCAAAGTCC 


ATCCTTCCAT 


10740 
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T7CTTCTCAG GAAGCTCTGT TACGATTGCC CCTGTTCCTG GACGAGCCTC ATACCTGACA 10800 

GCAAAAGGGT CAATCTTCAT CAGTTGATGA CCATTTTGAC GTGTGACATG ATATTTGTAA 10860 

ATATGCCCTT CTTGAGCCAT ATTGGTAAAG ACTTCCCAGA CCCCAAAATC ATTTCTTACC 10920 

ATTGGAATCT GATTTTCAAT CCAGTTGGTA AAATCACCAA CCAAGTGAAC AGCCTGAGCA 10930 

TTAGGTGCCC AAACACGGAA GGTATAGCCA TGCTCTCCAT TTAGTTCTTC CCTATGTGCT 11040 

CCTAGATAAT GTTGGAGATA AAAATTTTCA CCCGTCATAA ACGTTTTTAA TGCTTCTCTA 11100 

TTATCCATAT ACTCCCCTTC TCCTGTAAGC GTTTTCTATG TTTTTATTAT ACTACCTTTT 11160 

TAGAGAAGAT TCAAGTAAAT TACTATACTT CTTTAATTAT TTTGAAAATC TACAACAAGT 11220 

TCACTTACTC GTTCAATTGT AAATCAATAT TTTTTCAAAA AATTGCGAAA ACGCCTTTCT 11280 

TTTTCTACTA TAGTGAAATG AAATAAAACA TGCGCAAATC GATTAAGGAA TTTAATCTAA 113 4 0 

TTTCTAACAA TGTCTTAGAA ATCAAAGTGT ACT ATTT T AA CTCC 11384 
12) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7577 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TCTTGATTTG TTACTAGACG TTGACCAACG TCCTTCCCCT GGAAAAGGAA TTCTCCTTAG 60 

TTTCCAACAC CTTTTCGCCA TGTTTGGTGC GACCATCTTG GTACCATTCA TTTTGGGAAT 120 

GCCTGTATCT GTTGCCCTTT TTGCTTCAGG TGTTGGAACA CTCATCTACA TGATTGCTAC 180 

TGGTTTTAAA GTTCCAGTTT ATCTAGGTTC TTCATTTGCC TTTATCACAC CTATGTCACT 2 40 

GGCTATGAAA GAAATGGGGG GGGATGTATC TGCTGCCCAA ACAGGGGTTA TCTTGACTGG 300 

TTTGGTCTAT GTCCTTGTTG CTACCAGCAT CCGATTTGTA GGAACAAAAT GCATTGATAA 3 60 

ACTCTTGCCA CCAATCATTA TCGGTCCTAT GATCATCGTT ATCGGTCTTC GACTTGCAGG 4 20 

TTCAGCTGTT ACCAATGCAG GTCTTGTAGC AGACGGAAAT TGGAAAAATG CTCTGGTAGC 4 80 

CGTTGTTACT TTCCTAATTG CTGCCTTTAT CAATACAAAA GGAAAAGGCT TCCTACGAAT 54 0 

CATTCCATTC CTCTTTGCCA TTATCGGTGG TTACCTTTTC GC ACT AACTC TTGGCTTGGT 60 0 

TGACTTTACA CCAGTTCTTA AAGCCAACTG GTTCGAAATT CCTCGTTTCT ACTTGCCATT 6 60 

TAGCACAGGT GGTGCCTTTA AAGAGTACAA TCTTTACTTT GGTCCAGAAG CCATCGCTAT 72 0 
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CTTGCCAATC GCTATCGTAA CAATTTCTGA ACATATCGGA GACCATACTG TTTTGGGTCA 78 0 

AATCTGTCGT CGTCAATTCT TAAAAGAACC AGGTCTTCAC CCTACTCTTC TTGGTGACGG 840 
TATCGCAACT TCTGTTTCTG CCTTCCTTGG TGGACCAGCC AATACAACTT ACGGAGAAAA 900 
TACAGGGGTT ATCGGTATGA CTCGTATCGC TTCTGTCTCA GTTATCCGTA ACGCTGCCTT 960 

CAT CGCG ATT GCCCTCAGCT TCCTTGGTAA ATTCACTCCC TTCATTTCAA CT ATT CC AAA 1020 

CGCTGTACTT GGTGGTATGT CAATCCTTCT CTATGGGGTT ATCGCCAGCA ATGGTTTGAA 1080 

AGTCTTG ATT AAAGAACGTG TTGATTTCGC TCAAATGCGA AACCTCATCA TCGCAAGTGC 1140 

TATGTTGGTT CTTGGACTTG GAGGAGCTAT CCTTAAACTT GGTCCAGTTA CACTTTCAGG 1200 

TACTGCCCTT TCAGCCATGA CAGGAATCAT CTTGAACTTG ATCTTGCCAT ACGAAAATAA 12 60 

AGACTAAGAG TCTAAATACA CCTAATCCAC TCAGACAGCT GAGTGGATTT TTCGTATACC 13 20 

ATAATAAAAG TGTCTTAACA AAATTATTAA AATCAAAAAA CGTATAATAT CAGATATTCT 13 8 0 

AAAACCTTGA TACTGTACGT TTTATCATAG AAATTTTTAC TTTATTTTCT CATC AAATG A 144 0 

GATTTCCATC AATCTCTTGT CTTACTTGCG TTTCTTCTTC GCTTTCTTCA TTTTGTTAGC 1500 

CATACGTTTC ATGGACTGTT TCATGGCAAA TTCACCAATT TTACCTTTCA AACCGCCACC 1S60 

AAACATCTGG CTCATATCTG GCATTCCTGC TCCTCCGAGA GCTGATAAGT CAGGCATACC 162 0 

CCCTTGTCCC ATCATTCCTT CAAGGGCAGA CATATCCATT CCTCCCATAT TTGGC AT ATT 16 80 

TTTAGGAAGG TTATTTGGAT TAATCCCCAT TTGCTTCATC ATT TT AT TC A TATCCCCAGA 17 40 

CATAACACCC TGCATGAGCT GTTTAGCCTG GTTAAAGTCC TTGATGAATT TATTGACTTC 18C0 

GACGAATGTA TTTCCAGAAC CAGCAGCAAT ACGACGGCGA CGGCTTGGAT TTAACAAATC 1360 

TGGGTTTTCA CGCTCTTCAG GTGTCATCGA AG AC A C AAT G GCACGTTTAC GAGCAATCTG 1920 

GCGTTCATCC ACCTTCATGT TTTGAAGGGC TGGATTGTTG GCCATACCTG GAATCATCTT 1980 

GAGCAAGTCT TCCATCGGCC CCATATTTTG CACCTGATCT AATTGATCGA TGAAATCATT 2 04 0 

AAAATCAAAG GTGTTTTCGC GCATCTTCTC AGCCATTTCA AGGGCTTTTT GTTC ATCG' L* A 2100 

TTCCTGAGAA GCTTTCTCAA T C AAAGTG AG CATATCCCCC ATACCAAGGA TACGCCTAGA 2160 

CATGCGGTCT GGGTGGAAGG TTTCAATGTC CGTAATCTTT TCACCTGTAC CAGTGAACTT 222 0 

GATTGGTTTT CCAGTAATGT GACGAACAGA CAGAGCAGCA CCACCACGAG T ATCG C CATC 22 80 

AATCTTGGTA AGGATGACCC CAGTCACTTC CAACTGAGCA TTAAACTCAC GCGCAACATT 2340 

GGCTGCTTCC TGACCAATCA TAGCATCAAC GACAAGCAAG ATTTCATTTG GTTGAGCCAA 2 400 

TGCTTTCACA TCACGAAGCT CATTCATGAG GAGCTCATCA ATCTGCAAAC GACCCGCAGT 2 4 60 

ATCAATCAAG ACATAGTCGT TATGATTAGT TTGGGCTTGC TCCAAACCTT GACGTACAAT 2520 
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GGTAC I ; CTG 


TTCCAAGTGC 


AAAGACAGGC 


ACATCAATCT 


GTTGTCCCAA 


2 580 




1 Ijb I t- AA I IAj 


CAGLIGGALU 


ft m ft ft » m ft 

ATAAATATCC 


GCCGCAATCA 


TCAAAGGACG 


2640 


AGCATTvT. LT 


TCTTTCTTGA 


GTTTGTTGGC 


CAATTTACCA 


G C AAAGGTTG 


TTTTACCAGC 


2700 


Ut-(_ HOI AAA 


CCAACCATCA 


TGATGATGGT 


TGGAATCTTA 


GGTGACTTGA 


TAATTTC TG C 


27 60 


CfT* ft fp^ivj™ a iv 
t-G rATt_ AOAA 


C CT AAAA CGG 


CTGTCAATTC 


CTCATCAACG 


ATTTTAATAA 


TCTGTTGCGC 


2 820 


AGGAI 1AAGT 


GTATCAATGA 


CCTCATGCCC 


GACTGCACGC 


TCACGAACTT 


TCTTGATAAA 


2880 


GTCCTTTACA 


ACAGGCAAGG 


C AACGTCGG C 


CTCGAGCAAG 


GCCAAGCGAA 


TTTC TTTGGT 


2940 


i GCCTCTTGG 


ACATCAGATT 


CAGAGATTTT 


TCCTTTTTTA 


CGTAGATTTT 


TAAAGACGTT 


3000 


CT GC AAACGT 


TCTGTTAAAC 


TTTCAAATGC 


CATTTTTCTT 


CCTCTTATTC 


TCTATTATCA 


3060 


ATGCTTGT 4 A 


AAATTTCTAT 


CTGCTCCTGC 


AGAAACTCAT 


CCTTGGGATA 


GCGCTCCAAA 


3120 


ATUTGA i C AA 


AAATCTGACT 


GCGGACAATA 


TAGTCCGAGT 


ACATGTGCAA 


TTTCATCTCA 


3180 


i AATCTTCCA 


ft ft ffV^ffWIWIMM^ 

GAATCTTTTC 


TGTTCGCT TG 


ATATTGTCAT 


AGACAGCCTG 


ACGACTGACA 


3240 


CCGAAC . CCT 


CGGCAATTTC 


AGCAAGGCTG 


TAATCATCAG 


CGTAGTAGAG 


CTCGATATAA 


3300 


TTCATTTG CT 


TATCTGTCAA 


AAGCGCCGCA 


TAAAATTCAA 


AGAGCGCATT 


CATACGATTG 


3360 


GTTTTTTC G A 


TTTCCATAAC 


TTTTATTATA 


CCAAAAATTA 


GCCTAATCTA 


CCACACTAGG 


3420 


AAGCCGATCC 


ft ft A ft ft j-m ft #** m 

AAGAAGATAG 


ATAGCTAAAT 


TTGAAAAAGA 


CATGAGCCTA 


GCCCCAAGTA 


3480 


ATTTCCAATT 


GATAGCTGGC 


AAAGGGATGT 


CCCTCTTGAT 


TTTGT AGTTG 


ATAATCTAGT 


3 540 


TCAATCTTTT 


GCCTATCAAC 


TTGATAATCG 


CTCGTTTGGA 


TGATAAACTC 


CTGCATGCCC 


3600 


ATAGGTGTAG 


a ft ft m ft m » a a a 

GAATATAGGC 


TAAACTATCG 


CTATCCTTTA 


GAAAGCGCAT 


AATGGTCTTG 


3660 


L»^jA 1 i AvjAAA 


A rCvjGk-TCA * 


^.ACAAGTTCT 


TGACCATGAA 


ft HnWH ft ft ^ A m 

ATTTAATCAC 


TACTTTTTCC 


3720 


TTTTCCTCAT 


T AT AG AAAAG 


CAGGTAGCT A 


ft m TO a A^nm 

TAATCTCCTT 


TTTCATGCAC 


TTCCACATCA 


3780 


TAAAGCTGGT 


CAATCACTTC 


CAACTGCTCA 


TCAAACTGAA 


TCGTATTTCG 


CATCCGAATC 


3840 


TTCACATCAG 


GCCCTCTTTC 


TTGTCTCTTG 


TCCTACTATT 


TTACCAAAAA 


GAGCAGGATT 


3900 


TTGCTATAAT 


GGTCATATGA 


ACGAAAAAGT 


ATTCCGTGAC 


CCTGTTCACA 


ACTACATCCA 


3960 


TGTCAATAAT 


CAAATCATCT 


ATGACTTGAT 


T AAT AC AAAA 


GAATTTCAGC 


GTTTGCGCCG 


4020 


GATCAAACAA 


CTGGGAACTT 


CCAGTTATAC 


CTTCCACGGT 


GGAGAACACA 


GTCGCTTCTC 


4080 


TCACTGTCTA 


GGAGTCTATG 


AAATTGCACG 


ACGCATCACA 


GAGATTTTCG 


AAGAAAAATA 


4140 


TCCTGAGGAA 


TGGAATCCTG 


CCGAGTCTCT 


CTTCACCATG 


ACCGCTGCTC 


TCCTACACGA 


4200 


CCTTGGGCAT 


GGTCCCTACT 


CCCATACTTT 


TGAACATCTC 


TTTGATACAG 


ACCATGAAGC 


4260 
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CATTACTCAG GAGATTATTC AAAATCCTGA GACAGAGATT CACCAAGTCC TGCTACAAGT 4 32 0 

GGCACCTGAT TTCCCAGAAA AGGTGGCCAG TGTCATTGAC CATACCTATC CTAATAAGCA 4 3 80 

GGTCGTGCAG CTCATTTCTA GTCAGATTGA CGCAGATCGC ATGGACTATC TCTTGCGCGA 44 4 0 

CTCCTATTTT ACAGGAGCAT CCTATGGGGA ATTTGACCTG ACTCGAATCC TCCGAGTCAT 4 500 

TCGTCCTATC GAAAATGGTA TCGCCTTTCA GCGCAATGGC ATGCACGCCA TCGAAGACTA 4 5 60 

CGTCCTCAGT CGCTACCAGA TGTACATGCA GGTTTATTTC CACCCCGCAA CACGCGCCAT 4 620 

GGAAGTTCTC CTACAGAATC TTCTCAAACG CGCCAAGGAA CTCTATCCTG AGGACAAGGA 4 680 

TTTCTTTGCC CGAACTTCTC CACACCTCCT GCCTTTCTTC GAAAAAAATG TG ACCTTG AC 4 7 40 

TGACTATCTG GCTCTGGATG ATGGCGTGAT GAATACCTAC TTCCAGCTTT GGATGACCAG 4 800 

TCCTGACAAG ATTCTTGC AG ATTT ATCGC A TCCCTTTGTC AACCGCAAGG TCTTTAAATC 4 8 60 

C ATT AC CT TT TCACAAGAGG ACCAAGATCA ACTTACTAGC ATGAGAAAAT TGGTTGAGGA 492 0 

TATCGGCTTT GATCCCGACT ACTACACTGC CATTCATAAG AACTTTGACC TCCCTTATGA 4980 

TATCTATCGT CCCGAATCTG AAAACCCACG G AC ACAG ATT GAGATTTTAC AAAAAAATGG 504 0 

AGAACTGGCC GAACTCTCTA GCCTGTCTCC TATCGTCCAA TCCCTTGCTG GCAGTCGCCA 5100 

CGGAGATAAT CGCTTTTATT TTCCAAAAGA AATGTTGGAC CAAAACAGCA TCTTTGCAAG 5160 

CATTACCCAG CAATTTTTAC ACTTGATTGA GAACGATCAT TTTACCCCAA ATAAAAACTA 5 220 

GAAGAGGAAA TTTATGAGTA TTAAACTAAT TG CCGTTG AT ATCGACGGAA CCCTTGTCAA 52 8 0 

CAGCCAAAAG GAAATCACTC CTGAAGTTTT TTCTGCC AT C CAAGATGCCA AAGAAGCTGG 53 4 0 

TGTCAAAGTC GTGATTGCAA CTGGCCGCCC TATCGCAGGC GTTGCCAAAC TTCTAGACGA 54 0 0 

CTTGCAGTTG AGAGACGAGG GGGAC7ATGT GGTAACCTTC AACGGTGCCC TTGTCCAAGA 54 60 

AACTGCTACA GGACATGAGA TTATCAGCGA ATCCTTGACT TATGAGGATT ATCTAGATAT 5520 

GGAATTCCTC AGTCGCAAGC TCGGTGTCCA CATGCATCCC ATTACCAAGG ACGGTATCTA 55 80 

TACTGCAAAT CGCAATATCG GAAAATACAC TGTACACGAA TCAACCCTCG TCAGCATGCC 564 0 

TATCTTCTAC CGTACCCCTG AAGAAATGGC TGGCAAAGAA ATTGTTAAAT GTATGTTTAT 57 0 0 

CGATGAACCA GAAATTCTCG ATGCTGCGAT TGAAAAAATT CCAGCAGAAT TTTACGAGCG 57 6 0 

CTACTCCATC AACAAATCTG CTCCTTTCTA CCTCGAACTC CTTAAAAAGA ATGTAGACAA 5820 

GGGTTCAGCC ATTACTCACT TGGCTGAAAA ACTCGGATTG ACCAAAGATG AAACCATGGC 5 8 80 

AATCGGTG AT GAAGAAAATG ACCGTGC CAT GCTGGAAGTC GTTGGAAACC CCGTTGTCAT 5 9 40 

GGAAAATGGA AATCCAGAAA TCAAAAAAAT CGCCAAATAC ATCACCAAAA CAAATGACGA 60 00 

ATCCGGCGTT GCCCATGCCA TCCGAACATG GGTACTGTAA AAGTATCATT TTTCAATAAG 60 60 
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AATTGATTAG CAATAAAATC CAATGAATTT TTTTAGCAAA CTATTT AATT TAAAACAAAA 6120 

TAATCATAAT AGAGACACAA ATTCTGATTG TAACAATTTT TACCTAAACG AAT TAG AATG 6130 

TGGCCTTACT CCTGGGCAAC TCATACTCAT AGATTGGACT CAAAAAACAG GGAGAAATTA 62 4 0 

TAATTTCCCA AGATATTTTA AATACTCTCT TCAAATTGAC CCTGAATCTA CACACAATCA 6 3 00 

ATT AT AC AAA TTAGGATACT TCACTAAAAA TAAGACTTTA TCATATCTTA CAGTAGTAGA 6 3 60 

ATTAAAAACT AT ATT AT CT A AACATAATTT AGCTACTTCT GGAAAAAAAG CAGAATTAAT 64 20 

TACAAGAATA ATT AAT AATG TTAACATTGA CAATTTAGAT ATTCCGTTCG AATTTAAACT 64 80 

AACAAAAGAA CCACAAAATC TTATTATCGA ACATAGTGAC TATATCAAAG CATACTATGA 6 54 0 

TAAAGACATA ACTATCGAAG ATTATTGTAA AGAAAAAAAC AATATCTCTT TTAAAGCAAC 6600 

TTTTGGTGAT ATAAAATGGA GTCTCTTAAA TAAACAAGCT CATAGGAATA CTGTATCAGG 6660 

AGATTTTGGA TGCTTATCTA AC AC ACG AAA GGCTCAGGGA AG AC ATTTG G AACAAGAAGG 67 20 

T AAT ATT AAA CATGCTTTAA TATATTACAT AGAATCTTTG ATAATTACTA TTTCAGGATT 67 8 0 

AGAAAACAAT TTTTCAGCCA CTCATTATCC AGTATATTAT CCCGATTCGA TACCTGACTA 684 0 

CTCACTAAAA CATATTCAAA CATTAATGGA ATCATTATCT GATGACGATT ATGATTT T GC 6 900 

TTTTGATGAA GCATTATTTC GCTTCTCAAT TTTGAATGCA AATCATTTTT TATCTAAGGA 69 60 

AG AT ATTG AC TATTTAAGAG TTAATTTACC TCGTTCCACT GCTGAAGAAA TAAACAATTA 7020 

CTTAAAGAAA TATGAATGTT ATAGTCCTTT AAATAATTTA G AACTTC ACG ATTTTGAATA 7080 

AATTGACTAT ACAAACATTT AT AT ACT C G A TATAGTCTCA ATTTTATCTG ATGATTGCCC 7 140 

AAAT TTTTC A ATAATAAAAC GCATAATATT ATGG AG AC AA TCCCCTATAT TATGCGTTCT 7 2 00 

TTTAATATCA AAGACTTT7T CACAAACTTC TTTGATATCT AATTACATGC CCCCTGCAGG 7 2 60 

AATCGAACCT GCAACTACTC CTTAGGAGGG AGTTGTTATA T C C ATTG AAC TAAGGGAGCT 7 32 0 

AGATAAAAAC TCTCCTAAAT GAGCAGAGTT TTTTAGTCGA ATTAACGACG GATTTCTTTG 7 3 80 

AT A CG AGCTG CTTTACCTTG AAGAGCACGC AAGTAGTACA ATTTCGCACG ACG T ACTTT A 74 4 0 

CCGTAACGAA CAACTTCGAT TTTTTCAACA CGTGGAGTGT GGATTGGGAA GATACGCTCA 7 500 

ACACCTACAC CG TT AG AG AT TTTACGAACT GTGTAGTTTT CTGAGATTCC AGCACCTTTA 7 5 60 

CGTGCGATAA CAACACG 7 57 7 

(2> INFORMATION FOR SEQ ID NO: 47: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4945 base pairs 
tB) TYPE: nucleic acid 
{€) STRANDEDNESS : double 
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ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CCTCGCTGAT GATTGGTGCT GTTTT ATTTG CTGGTCCAGC CTTGGCTGAA GAAACTGCAG 60 

TTCCTGAAAA TAGCGGAnCT AATACAGAGC TTGTTTCAGC AGAGAGTGAG CATTCGACCA 120 

ATGAAGCTGA TAAGCAGAAT GAAGGGGAAC ATGCTAGAGA AAACAAGCTA GAAAAGGCAG 180 

AAGGAGT AGC GATAGCATCT GAAACTGCTT CGCCAGCAAG CAATGAAGCT GCAACTACTG 2 40 

AAACTGC AG A AGC AGCT AG C GCAGCTAAAC CAGAGGAAAA AGCAAGTGAG GTGCTTGCAG 300 

AAA C ACC ATC TGCAGAAGCA AAACCTAAGT CTGACAAGGA AACAGAAGCA AAGCCCGAAG 3 60 

CAACTAACCA AGGGGATGAG TCTAAACCAG CAGCAGAAGC TAATAAGACT GAAAAAGAAG 4 20 

TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA ACCAAAGGAA ATC AAATTT A 4 80 

ATTCTTGGGA AGAATTGTTA AAATGGGAAC CAGGTGCTCG TGAAGATGAT GCTATTAACC 54 0 

GCGGATCTGT TGTCCTCGCT TCACGTCGGA CAGGTCATTT AGTCAATGAA AAAGCTAGCA 600 

AGGAAGCAAA AGTTCAAGCC TTATCAAACA CCAATTCTAA AGCAAAAGAC CATGCTTCTG 6 60 

TTGGTGGAGA AGAGTTCAAG GCCTATGCTT TTGACTATTG GCAATATCTA GATTCAATGG 720 

TCTTCTGGGA AGCTCTCGTA CCAACTCCTG ACGTT ATT G A TGCAGGTCAC CGTAACGGGG 780 

TTCCTGTATA CGGTACACTC TTCTTCAACT GGTCTAATAG TATTCCAGAT CAAGAAAGAT 84 0 

TTGCTGAAGC TTTGAAGCAA GACGCAGATG GTAGCTTCCC AATTGCCCGT AAATTGGTAG 900 

ACATGGCCAA GTATTATGCC TATCATGGCT ATTTCATCAA CCAAGAAACA ACTGGAGATT 960 

TGGTTAAACC TCTTGG AG AA AAGATGCGCC AGTTTATGCT CTATAGCAAG GAATATGCTG 102 0 

CTAAGGTAAA CCATCCAATC AAGTATTCTT GGTACGATGC CATGACCTAT AACTATGCAC 10 80 

GTTATCATCA AGATGGTTTG GGAGAATACA ACTACCAATT CATGCAACCA GAAGGAGATA 1140 

AGGTTCCGGC AGATAACTTC TTTGCTAACT TTAACTGGGA TAAGGCTAAA AATGATTACA 1200 

CTATTGCAAC TGCCAACTGG ATTGGTCGTA ATCCTTATGA TGTATTTGCA GGTTTGGAAT 12 60 

TGCAACACGG TGGTTCCTAC AAGACAAAGG TTAAGTGGAA TGACATTTTA GACGAAAATG 13 20 

GGAAATTGCG CCTTTCTCTT GGTTTATTTG CCCCAGATAC CATTACAAGT TTAGGAAAAA 13 80 

CTCGTGAAGA TTATCATAAA AATGAAGATA TCTTCTTTAC AGGTTATCAA GGAGACCCTA 14 40 

CTGGCCAAAA ACC AGGTG AC AAAGATTGGT ATGGTATTCC TAACCTAGTT GCGGACCGTA 1500 

CGCCAGCGGT AGGTAATACT TTTACTACTT CTTTTAATAC AGGTCATGGT AAAAAATGGT 15 60 

TCGTAGATGG TAAGGTTTCT AAGGATTCTG AGTGGAATTA TCGTTCAGTA TCAGGTGTTC 16 20 
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TTCCAACATG GCGCTGGTGG CAGACTTCAA CAGGGGAAAA ACTTCGTGCA GAATATGATT 1630 
TTACAGATGC CTATAATGGC GGAAATTCCC TT AAATT C T C TGGTGATGTA GCCGGTAAGA 174 0 

CAGATCAGGA TGTGAGACTT TATTCTACTA AGTTAGAAGT AACTGAGAAG ACCAAACTTC 1800 

GTGTTGCCCA CAAGGGACGA AAAGGTTCTA AAGTTTATAT GGCATTCTCT ACAACTCCAG 1860 

ACTACAAATT CGATGATGCA GATGCATGGA AAGAGCTAAC CCTTTCTGAC AACTGG AC AA 19 20 

ATGAAGAATT TG ATCTT AG C TCACTAGCGG GTAAAACCAT CTATGCAGTC AAACTATTTT 19 80 

TCGAGCATGA AGGTGCT GTA AAAGATTATC AGTTTAACCT AGGACAATTA ACTATCTCGG 204 0 

ACAATCACCA AGAGCCACAA TCGCCGACAA GCTTTTCTCT AGTGAAACAA TCTCTTAAAA 2100 

ATGCCCAAGA AGCGGAAGCA GTTGTGCAAT TTAAAGGCAA CAAGGATGCA GATTTCTATG 2160 

AACTTTATGA AAAAGATGGA GACAGCTGGA AATTACTAAC TGGCTCATCT TCTACAACTA 22 20 

TTTATCTACC AAAAGTT AG C CGCTCAGCAA GTGCTCAGGG TACAACTCAA GAACTGAAGG 2280 

TTGTAGCAGT CGGTAAAAAT GGAGTTCGTT CAGAAGCTGC AACCACAACC TTTGATTCCG 2 34 0 

GTATGACTGT AAAAGATACC AGCCTACCAA AACCACTAGC TGAAAATATC GTTCCAGGTG 2 4 00 

CAACAGTTAT TGATAGTACT TTCCCTAAGA CTCAAGGTCG AG AAGGT ATT GAAGGTATCT 24 60 

TG AACGGT AC CATTACTAGC TTGTCAGATA AATGGTCTTC AGCTCAGTTG ACTGGTAGTG 25 2 0 

TGGATATTCG TTTGACCAAG CCACGTACCG TTGTTAGATG GGTCATGCAT CATGCAGGAG 2 580 

CTGGTGGTGA GTCTGTTAAC GATGGCTTGA TGAACACTAA AGACTTTGAC CTTTATTATA 264 0 

AAGATCCAGA TCGTGAGTCG AAGCT AG CT A AGGAAGTCCG TGGTAACAAA GCACACCTGA 270 0 

CAGATATCAC TCTTGATAAA CCAATCACTG CTCAAGACTG GCGCTTGAAT GTTGTCACTT 27 60 

CTGACAATGG AACTCCATGC AAGGCTATTC GTATCTATAA CTGGAAAATG TATGAAAAGC 2 82 0 

TTGATACTGA GAGTGTCAAT ATTCCGATGG CCAAGGCTGC AGCCCCTTCT CTAGGCAATA 28 8 0 

ACAAGGTACA AGTTGGCTTT G C AG ATGT AC CGCCTGGAGC AACT ATT AC C GTTTATGATA 2 940 

ATCCAAATTC TCAAACTCCG CTCGCAACCT TGAAGAGCGA AGTTGGAGGA GACCTAGCAA 3 COO 

GTGCACCATT GGATTTGACA AATCAATCTG GTCTTCTTTA TTATCGTACC CAGTTGCCAG 3 060 

GCAAGGAAAT TAGTAATGTC CTAGCAGTTT CCGTTCCAAA AGATGACAGA AGAATCAAGT 3120 

CAGTCAGCCT AGAAACAGGA CCTAAGAAAA CAAGCTACGC CGAAGGGGAG GATTTGGACC 3180 

TT AG AG GT GG TCTTCTTCGA GTTCAGTATG AAGGAGGAAC TGAGGACGAA CTCATTCGCC 324 0 

TAACTCACGC AGGTGTATCA GTATCAGGTT TTGATACGCA TCATAAGGGA GAACAGAATC 3 3 00 

TTACTCTCCA ATATTTGGGA CAACCGGTAA ATGCTAATTT GTCAGTGACT GTCACTGGCC 3 3 60 
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AAGACGAAGC AAGTCCGAAA ACTATTTTGG GAATTCAAGT AAG^vXAA CCGAAAAAAG 3 4 20 

ATTACCTAGT TGGTGATAGC TTAGACTTGT CTGAACGACG CTTTGCAGTG GCTTATAGCA 34 80 

ATGACACCAT GGAAGAACAT TCCTTTACTG ATCAGGGAGT TGAAATTTCT GGTTACGATG 3 54 0 

CTCAAAAGAC TGGTCGTCAA AC CTTG ACGC TTCATTACCA ACGCCATCAA GTTAGCTTTG 3 600 

ATGTTTTGGT ATCTCCAAAA GCAGCATTGA ACGATGAGTA CCTCAAACAA AAATTAGCAG 3 660 

AAGTTGAAGC TGCTAAGAAC AAGGTGGTCT ATAACTTTGC TTCATCAGAA GTAAAAGAAG 37 20 

CCTTCTTGAA AGCAATTGAA GCGGCCCAAC AAGTCTTGAA AGACCATGAA ACTAGCACCC 37 80 

AAGATCAAGT CAATGACCGA CTTAATAAAT TGACAGAAGC TCATAAAGCT CTGAATGGTC 3 8 40 

AACAGAAATT TACGGAAGAA AAGACAGAGC TTGATCGCTT AACAGGTGAG GTTCAAGAAC 3 900 

TCTTGGCTGC CAAACCAAAC CATCCTTCAG GTTCTGCCCT AGCTCCGCTT CTTGAGAAAA 3 960 

ACAAGGCCTT GGTTGAAAAA GTAGATTTGA GTCCACAAGA GCTTACAACA GCGAAACAGA 4 020 

GTCTAAAAGA TCTGGTTGCT TTATTGAAAG AAGACAAGCC AGCAGTCTTT TCTGATAGTA 4 080 

AAACAGGTGT TGAAGTACAC TTCTCAAATA AAGAGAAGAC TGTCATCAAG GGTTTGAAAG 414 0 

TAGAGCGTGT TCAAGCAAGT GCTGAAGAGA AGAAATACTT TGCTGGAGAA GATGCTCATG 4 200 

TCTTTGAAAT AGAAGGTTTG GATGAAAAAG GTCAAGATGT TGATCTCTCT TATCCTTCTA 4 2 60 

TTGTGAAAAT CCCAATTGAA AAAGATAAGA AAGTTAAGAA ACTATTTTTC TTACCTGAAG 4 3 20 

GCAAAGAGGC AGTAGAATTG GCTTTTGAAC AAACGGATAG TCATGTTATC TTTACAGCAC 4 380 

CTCACTTTAC TCATTATGCC TTTGTTTATG AAT CTGCTG A AAAACCACAA CCTGCTAAAC 44 40 

CAGCACCACA AAACACAGTC CTTCCAAAAC CTACTTATCA ACCGACTTCT GATCAACAAA 4 500 

AGGCTCCTAA ATT GGAAGTT CAAGAGGAAA AGGTTCCCT? TCATCGTCAA GAGCATGAAA 4 560 

ATACTGAGAT GCTAGTTGGG GAACAACGAG TC AT CAT AC A GGGACGAGAT GGACTGTTAA 4 620 

G AC ATGT CTT TGAAGTTGAT GAAAACGGTC AGCGTCGTCT TCGTTCAACA GAAGTCATCC 4 6 30 

AAGAAGCGAT TCCAGAAATT GTTGAAATTG GAACAAAAGT AAAAACAGTA CCAGCAGTAG 4 7 40 

TAGCTACACA GGAAAAACCA GCTCAAAATA CAGCAGTTAA ATCAGAAGAA GCAAGCAAAC 4 800 

AATTGCCAAA TACAGGAACA GCTGATGCTA ATGAAGCCCT AATAGCAGGC TTAGCCAGCC 4 860 

TTGGTCTTGC TAGTTTAGCC TTGACCTTGA GACGGAAAAG AGAAGATAAA GATTAAATAT 4 920 

CGAAAAATCT TGTGAAATCT TTCCG 4 94 5 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25002 base pairs 

(B) TYPE: nucleic acid 



WO 98/18931 



PCT/US97/ 19588 



431 

<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GACAACTCAA GTAGCTTTTT CTTATTTTGA AAAAGGAGAT C AG AGTTT AA CTATGTCAGA 60 

AAAATCACAA TGGGGGTCGA AACTTGGTTT TATTCTAGCA TCTGCTGGCT GGCCATCGGG 120 

CTTGGTTCCG TTTGGAAGTT TCCCTACATG ACTGCTCCTA ATGGCGGTGG AGGCTTTTTA 180 

CTAATCTTTC TCATTTCCAC TA TT TTAATC GGTTTCCCTC TCCTGCTGGC TGAGTTTGCC 240 

CTTGGCCGTA GTGCTGGCGT TTCCGCTATC AAAACCTTTG GAAAACTGGG CAAGAATAAC 3 00 

AAGTACAACT TTATCGGTTG GATTGGCGCC TTTGCCCTCT TTATCCTCTT ATCTTTTTAC 3 60 

AGTGTTATCG GAGGATGGAT TCTAGTCTAT CTAGGTATTG AGTTTGGGAA ATTGTTCCAA 4 20 

CTTGGTGGAA CGGGTGATTA TGCT CAGTT A TTTACTTCAA TCATTTCAAA TCCAGCCATT 4 80 

GCCCTAGGAG CTCAAGCGGC CTTTATCCTA TTGAATATCT TCATTGTATC ACGTGGGGTT 540 

CAAAAAGGGA TTGAAAGAGC TTCGAAAGTC ATCATGCCCC TGCTCTTTAT CGTCTTTGTT 600 

TTTATCATCG GTCGCTCTCT CAGTTTGCCA AATGCCATGG AAGGGGTTCT TTACTTCCTC 660 

AAACCAGACT TTTCAAAACT GACTAGCACT GGTCTCCTCT ATGCTCTCCG AC AAT CTTTC 72 0 

TTTGCCCTCT CACTAGGGGT TACAGTCATG TTGACCTATG CTTCTTACTT AGACAAGAAA 780 

ACCAATCTAG TCCAGTCAGG AATCTCCATC GTAGCCATGA ATATCTCGAT ATCCATCATG 84 0 

GCAGGTCTAG CCATTTTCCA AGCTCGATCC CCCTTCAATA TCCAGTCTGA AGGGGGACCC 90 0 

AGCCTGCTCT TTATCCTCTT GCCTCAACTC TTTGACAAGA TGCCTTTTGG AACCATTTTC 9 60 

TACGTCCTCT TCCTCTTGCT CTTCCTTTTT GCGACAGTCA CTTTTTCTGT CGTGATGCTC 102 0 

GAAATCAATG TAGACAATAT CACCAACCAG GATAACAGCA AACGTGCCAA ATGGAGTGTT 1080 

ATTTTACCAA TTTTGACCTT TGTCTTTGGC ATTCCTTCAG CCCTATCTTA CGGTGTCATG 114 0 

GCGGATGTTC ACATTTTTGG TAAGACCTTC TTTGACGCTA TGGACTTCTT GGTTTCCAAT 12 00 

CTCCTCATGC CATTTGGAGC TCTCTACCTT TCACTTTTTA CAGGCTATAT CTTTAAAAAG 12 60 

GCTCTTGCAA TGGAGGAACT CCATCTCGAT GAAAGAGCAT GGAAACAAGG ACTGTTCCAA 13 20 

GTCTGGCTCT TCCTTCTTCG TTTCTTCGTT TCGTCATTCC AATCATCATC ATTGTGGTCT 13 80 

TCATTGCCCA ATTTATGTAA TCAAAAAGGA CTTGAGTAGT GAACTCAGGC CCTTTCTTTT 14 40 

TATGGATGGC TAACAATCAA TTCCAAACCT TGCCCTTCCA CACTCCAACC TTCAACATCA 1500 

CTTGGTAGGA TAAAGTGGCT GCCTTTTTGA ATTGGATAAT TTTTCCCCTC AACAGTTAGC 15 60 
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TGACCTTGAC CAGCCAAGAC ACTCAATAAG CTGTAGTCAG CTGTCTTTTC AAAGTCAACT 1620 

TTTCCAGTAA TTTCCCACTT GTAAACTGCG AAGAAATCAT TAGATACAAG GAGAGTGGAA 1680 

CCCAAATCAT CTGCTTTAAC AGTTACAGGA CGGCTATTTG CTGGCTCACC AATGTTC AAG 1740 

ACATCGATGG ATTTTTCAAG ATGAAGTTCA CGCAAGTTGC CTTTGTCATC CTTGCGGTCA 1800 

AAGTCATAGA CGCGATAGGT GGTATCGCTA GACTGCTGGG TTTCAAGGAT TAAGATACCC 1860 

GCCCCGATAG CGTGCATAGT CCCGCTTGCT ACATAGAAGA AATCTCCAGC CTTAACAGGG 1920 

ACTTTGCTCA ACAAGTCATC CCAGTTCTTG TCCTCGATTT GCTGGCGGAG TTCTTCTTTT 19 80 

GACTTGGCAT TGTGACCGTA GATAATCTCT GAACCTTCAT CCGCTGCGAT AATGTACCAG 2 04 0 

CATTCTGTTT TTCCGAGTTC GCCTTCATGC TCGAGTCCAT AAGCATCGTC TGGGTGAACT 2100 

TGGACACTGA GCCAGTCGTT GGCATCGAGG ATCTTGGTCA AAAGTGGAAA TACAGGTTCT 2160 

GGACGATTGC CAAATAATTC ACGGTGTTCC GCATACAAAG TAGCAAGATC TGTTCCCTCG 2 2 20 

TAACGACCAT TGGCAACTTT AGAGACTCCA TTTGGATGGG CTGAGATGGC CCAATATTCT 2 2 80 

CCGATTTTTT CACTTGGGAT GTCGTAGCCA AACTCATCAC GTAGCTTGGC TCCACCCCAG 2 3 40 

ATTTTTTCTT GCATAACTCA TTGTAAAAAT AATGGTTCTG ACATCTCGAT CTCCTGTCTG 2 4 00 

ATTTTTCTCC CCTCATTATA GCAAAAAAAG AGTTCGAATT GAACTCTTTT TTACATCTTA 2 4 60 

TAAAGCAGGG AGAAGATTTT ATAAAAATAG TAAACAAATG TGCTCTACCC GATGCTTGCA 2 520 

CCATTGCTAT AAATGACATC CTTGTACCAA TAGAAGGACT TCTTCTTGCT ACGTTTGAGA 2 580 

GCTCCGTTTC CTACATTATC TCGATCTACA TAGATAAAGC CATAGCGCTT ATTCATTTCC 2 640 

CCTGTGCCAG CTGAAACCGG ATCGATACAG CCCCAAGTCG TATAACCAAG CAAGTCAACC 2700 

CCGTCTTGGT AAATGGCATC TCGCATGGCC TTGATGTGCG CCTCTAAGTA AGTAATCCGA 2 7 60 

TAGTCATCTG CTACATAACC ATTCTCATCC GGTGTATCCA TAGCACCGAG TCCATTTTCT 28 20 

ACGATAATAC TAAACTAAAA TCAAAAAGCA TTATATAATA GTGATATGAA ATCAACTAAA 2 880 

GAAGAAATCC AAACCATCAA AACACTTTTA AAAGACTCTC GTACAGCTAA AT AT CAT AAA 2 940 

CGCCTTCAAA TCGTTCTATA GTAAAATGAA ATAAGAACAG TACAAATCGA TCAGGACAGT 3000 

CAAATCGATT TCTAACAATG TTTTAGAAGT AGGGGTGTAC TATTCTAGTT TCAATCTACT 3 060 

ATATTTCGTC TGATGGGCAA ATCTTATAAA GAG ATT AT AG AACTTTTATA GTAGTTTCAA 3120 

ATAAGATGTG AACAACTCTA TCAGGAAAGT CAAATTAATT TATAGAAATA TTTT AGC AG C 3180 

CAAGGTGTAC TGTTATAGAT TCAATACACT ATAGACTGTA ATCAAACAAC GATTTGGCGA 324 0 

AATGTAAAAA AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTGGTGGT 3 300 

CGTAACCATG CATATATGAC AGTTGAGGAA GAGAAAGCCT TTCTTCCCCC CCATTTGAAG 3 3 60 
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GCTACAGAGG 


CACGAGAATT 


TGTTACAATT 


GATGCCTTAT 


TTCAGGCTTA 


T AAAAAGG AG 


3420 


TTAGGTCGTT 


CCTACACACG 


TGATGCCTTC 


TATCAACTGT 


TGAAGCGCCA 


TGGTTGG CG A 


3430 


AATATTACGC 


CACGTCCAGA 


ACATCCTAAG 


AAAGCAGACG 


CTC AAAC CAT 


TGTTGCGTCT 


3540 


AAAAATAAAA 


TCTCAATCCA 


AG AAGGC AAG 


AAAGCGTTTT 


AAAT AT AGT A 


GACGTTTTCG 


3600 


TAAGGTTTGC 


TTGATGTACC 


AAGCTGAAGC 


TGGTTTCGGT 


AGAATCAGTA 


AACTGGG ATC 


3660 


TTGTTGGGCT 


CCAATAGGAG 


TAGGTCCACA 


TATCCATAGT 


C ACT AT AT AC 


GAGAATTTCG 


3720 


CTATTGTTAT 


GGAGCTGTTG 


ATGCCTATAC 


AGGCGAATCA 


TTTTTCTTAA 


TAGCTGGTAG 


3780 


ATGTAATACT 


GAGTGGATGA 


ACGCCTTTTT 


AGAAGAGCTT 


TCACAAGCTT 


ATCCTTTTAC 


3840 


TCCTTATGGA 


CAATGCTATA 


TGG CAT AAAT 


CAAGTACCTT 


AAAGATTCCG 


ACTAATATTG 


3900 


GTTTTGCATT 


TATTCCTCCA 


T AC AC AC C AG 


AGATGAACCC 


CATTGAACAA 


GTGTGGAAAG 


3960 


AGATTCGTAA 


ACGTGGATTT 


AAGAATAAAG 


CCTTTCGAAT 


TTTGGAAGAT 


GTCATGAATC 


4020 


AACTCCAAGA 


TGTC AT AC AA 


GGATTGGAGA 


AGGAGGTGAT 


AAAGTCCATC 


GTTAATCGGA 


4080 


GATGGACTAG 


AATGCTTTTT 


GAAAGCAGAT 


GAGTATTATA 


TGCAATTTCT 


TTATATAAAA 


4 140 


AGACCGGATT 


GCTCCGATCT 


TTCAATAGTT 


CATATTCTCA 


ATTTCTATTT 


TAAAAATAGC 


4200 


TAAGGTTAAC 


GTCAAAT G AC 


TACGCGACCT 


ATTTCATACG 


AT AAAAATC A 


AGCACTAGAC 


4260 


CAGCAGGTCC 


TTGAACTAAT 


AAGGACTCTG 


TTCCCCAATC 


GGTTACAGTT 


GGTCCGTGTA 


4320 


AAACCTTTAT 


ACCAAGCTCG 


TTCAACCCTT 


TGTAGTTCTG 


GTCTACATCC 


TCAACCTCGA 


4380 


TATGAATAAT 


GATTCCTGAC 


TGAAAGTTTT 


CCAAAGGAAC 


CAAATGATTT 


TGTGACAACA 


4440 


TAAGGCAGTG 


ACTACCAATC 


GTAAACTGAG 


CAAAACCATC 


ATTAGCATAA 


TCTGCCTTTT 


4500 


TATCCAAGAT 


ATGCTCCAAG 


TCAGCACAGA 


CTTGGGGAAC 


ATTTCAAACG 


ATAATATCTA 


4560 


ATTCATTTAA 


ATTCATTTAC 


TCTCCTCCAT 


AAAAAGACCG 


GATTGCTCCG 


ATCTTTTAAA 


4620 




ATGAAAATCA 


AAGAATAAAG 


TCTACAAGTT 


TCATATTTGA 


TTTTCGGCGA 


4680 


GAGGAATTAT 


TTAATTGCGC 


GTGATTGCAA 


TCCTTCTTCT 


TCCAAGAAGA 


GACGGAATGG 


4740 


TACGAGTTCT 


TCTGCTTCGT 


ATTTTTCCTT 


GAAGGCTTTG 


ATAGCTTCTT 


CTGAGTGAAG 


4800 


TTTTGGATCC 


AATTCAAGTA 


CTTCTACTGG 


AAGTGGACGG 


TGTTGAGTGA 


TGCGAGCATC 


4860 


GATGACAACA 


GTTTTACCTT 


CTTTGTTCAA 


TTTAACAGCT 


TCTGCAACAA 


CTGCATCGAT 


4920 


GTCTTCGATA 


CGGTCAACTG 


TGAATCCAAC 


AGCTCCTTGA 


GCTTCCGCAA 


TTTTAGCGTA 


4980 


GTCAGCGTTT 


GTGAAGTCTA 


CACCAAACAA 


GTGTTTGTTT 


GTATCTTCGT 


ATTTGTTCTT 


5040 


GATGAAGCCG 


TACTCAGCAT 


TTGAGAAGAC 


AAGGTTGATA 


ACTGGAAGGT 


CGTATTGAAC 


5100 
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GTTTGTGATA ACCTCTGCGT AG C AC ATGTT GAATGCTCCG TCACCCATGA TGTTCCATAC 5160 

TTGGCGATCT GGATTGTCTT TCTTAGCAGC GATACCACCA GGAAGGGCAA TACCCATTGT 5220 

CGCAAAGAGT GGAGATGTAC G C C AC ATGTT CTTAGGTGTC ATGTGAAGGT GACGAGTAGA 5280 

TGTTTG AGT A GTGTTACCTA CGTCGATTGA GTAGATAGCG TCTTG AT C AG CATGTTTGTT 5340 

GATTGCATTG TAAACTTGAT ACAATTGCAA TTCACCCTCA GTTTTACCTT CGAGTTTGTT 54 00 

CATGTAATCA CGCCAGTTTT GGTTGTTCTT AACGTTTGCA CGCCACCATG GAGTTGATTC 54 60 

AACTGGGTTT ACTTTGTCAA GGATAGCTTT AGCTGCTTGA CCACCATCAC CAAGGATTGA 5 52 0 

AGCGTCAAGG GCATGACGTT TACCAAGTTT GTAAGGGTCG ATATCGACTT GGATGAATTT 55 80 

TTCAGTGTTC TTGAATGCTT CGTAAACTTC AGCAAATGGG AAGTTTGAAC CAAGGAAAAG 5640 

AACTCTGTCT GCTTCAAAGA CCACTTCGTT GCCTGGTTTC CAACCAACAC GGTAAGCAGA 57 00 

ACCTGTCAAA CCTTCATAGT TCCATTCGAA AGCTTCAAAG TTTTTACCAG TTGTGATGAT 57 60 

TGGTGCTTTG ATTTTACGTG ACAATTCAGT AATCACTTCA CCAGCTTTAA CACCACCAAA 5820 

TCCAGCATAG ATAACTGGGC GTTCAGCATT GTTCAAGATT TCAACAGCTT TGTCGATTTC 58 80 

AACTTCGTTC AAAGCAGCAG CGATGAATGA GCGTTCGTAT GAACCTGAAC CGTAGTATGA 594 0 

GTTTTCATCG ATTTCTTGGA AACCGAAGTT TACTCGAATT TCAACAACAG CTGGACCTTT 6000 

TTTAGAAACT GCAGCACGGC AGGCTTCGTC AATTACTTTT GGCAATTGCT CAGCGTAAGC 6060 

TACACGTTTG TTGTAAACAG CGATACCGTT GTACATTGGG TTTTCCTTAA GCTCTTGGAA 6120 

AGCATCCATG TTCAATTCGT TAACTGGACG TGATCCAAGG ATCGCTAGGA ATGGAGTGTT 6180 

ATCCATAGCT GCATCGTAAA CACCGTTAAT CAAGTGAGTC GC AC CTGG AC CACCTGAACC 6 240 

AACTGCAACC CCGATTGAGC CGCCGAATTT AGCTTGC AT A ACCGCTGCAA GAGCACCTGT 6300 

CTCTTCGTGG CGAACTTGTA AGAAACGGAT ATCTTTGTCT TCAGCCAAAG CGTCCATCAA 63 60 

TGAGCTGAGT GTTCCTGATG GGATACCGTA GATTGTATCT ACGCCCCATG TTTTCAATAC 64 20 

GTTAAGCATT GCTGCAGATG CAGTAATTTT CCCTTGAGTC ATAATGATAA CTCTCCTTCA 64 80 

ATTTTTTTAA ACTTGGAGAA TACGATTACA TAGAATTGGA AACGTTCTCC AAATTTTTAC 654 0 

TATTCCACTG TATCATATTT ATGCTGACTT TTCTAAAAAT CTCCTCAAAA CTCTCTATTC 6600 

TCTATTCTAA T AC AG TTTTG AAAGTTCTGT CATTTCTGTT TTATAACAAA GAAATCTAGT 6660 

CATTACTTTT AGTCTATTTT ACTAAAATTT AACAGAAGGG AACTGGTCAG AACAGATACA 67 20 

GAACTAAAGG CCATGGCTAG ACCTGCCAAT TCTGGGTTGA GAGCCAGTCC AACACCTGAA 6780 

AAGACTCCTG CTGCAATCGG AATTCCGACA ACATTGTAGA TAAAAGCCCA GAAAAGATTG 684 0 

AGTAGAATTC GATGAAAGGT TTTCTTACTC ATATCAAAGG CACGAACCAC TCCTAAAAGA 6900 
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TTATTGGTTG 


TCAACACCAA 


ATCTGCTGAC 


TCGATGGCGA 


TATCTGTTCC 


AG CTC C CAT A 


6960 


GCAATCCCCA 


CATCTGCTAC 


ACTAAGGGCA 


GGAGCGTCAT 


TGATACCGTC 


CCCAACAAAG 


702C 


GCTACTTTCC 


CTGACTGTTG 


CACTTTATGG 


ATTTCATCGC 


CTTTTTCTTC 


TGGCAAGACG 


7080 


CCTGCAATGA 


CCTCTTCAAT 


TCCGATTTGA 


TCTGCAATAG 


CACGCGCCAC 


ACCAGCATTC 


7L40 


TCTCCTGTCA 


GCATGACTGT 


TCGGAGACCA 


CGTTTTTTTA 


GCTGACTGAT 


GGCTAGCTTA 


7200 


GCATTTTCCT 


TAGGAATATC 


TTGCAAAGCA 


AGCAAGCCTT 


TGATTTCATT 


GTCAACAGCT 


7260 


AAGAACACAA 


CTGTCTTAGC 


TTCTTTTTCT 


AGTTCTTCTA 


GTTTATCTTG 


ATAAGTATTA 


7320 


GAAATATCCA 


TGCCATCCAG 


CATTTTAGCA 


TTTCCAAGTA 


AAACTTGTTT 


TCCATTGATT 


7380 


CGCCCTGAAA 


CACCTTTCCC 


GTGCAAGCAC 


TGAAAATTTT 


CAACAGTTTG 


AAACTCAAGT 


7440 


CCAGCTTCAC 


TCGCTCGCTT 


AACGATAGCC 


TCAGCCAGTG 


GGTGTTGAGA 


AGCATCTTCC 


7500 


AAGGAGGCTG 


CCAACCCAAA 


CACTTCTACT 


TCGTCGCCGA 


TGACATCTGT 


TACCACAGGT 


7560 


TTCCCTTCCG 


TCAAAGTCCC 


GGTCTTATCA 


AAGACAAGGG 


TTTGAACTTT 


CTGGATTTCC 


7620 


TGTAAGACAG 


TTCCATTTTT 


GAGGAGAACC 


CCCATCTTCG 


CACTACGTCC 


TGTCCCCACC 


7680 


ATAAGGGCTG 


TCGGTGTTGC 


AAGTCCCAAG 


GCACAAGGAC 


AGCCGATAAT 


CAAAACCGCC 


7740 


ACTCCGTAGA 


GAAGAGAGGA 


CACAAAGCTA 


GCTCCAAGCA 


CAACCACACT 


ATCCCTGAGC 


7800 


AAGACGAACC 


AAACCCAAAA 


GGTCATGATT 


CCTAAAATGA 


CAACTACTCG 


GACAAAAATC 


7860 


CCTGAAATCT 


TATCCGTCAA 


GTCCTGAATC 


GGCGCACGAC 


TTGTCTGAGC 


TTTCTTCACA 


7920 


AAATCCACAA 


TCTGAGCCAA 


AACAGTCTCT 


GAGCCAACTT 


TTTCTGCTCT 


AAAGACAAGC 


7930 


GTTCCACTAT 


CATTCATCCT 


TGAGCCAATG 


ACACTATCTC 


CAACTGTCTT 


GTCCACAGGC 


8040 


AGACTCTCAC 


CTGTCACCAT 


GGATTCGTCA 


ATACTAGAGA 


CACCTTCTAC 


TACGACACCA 


8100 


TCAACAGCAA 


TCTTTTCACC 


GGGACGCACT 


CGAATCAGGT 


CGCCTACCTT 


GACTTGTTCC 


8160 






f\ l V— M 1 v_ At- I 




LOOT * TTAGv. 


i iGCAAGTCC 


8 2 20 


ACT AATTT CT 


CCACAGCTTG 


GGACGTATTT 


TTTCTCAT^T 


TTTCCTCAAA 


AACTGCTCCC 


8280 


AAAAGAACGA 


AAAAG AG G AT 


AAATCCAGCA 


CTTTCGAAGT 


AAACAGGCAG 


ACCAGCAAAG 


8340 


AGAGCAACTA 


GGCTATAGAA 


ATAAGCCACT 


AGAGTTCCCA 


GCGCAACCAA 


GGTATCCATC 


8400 


TTGGCATTGT 


GCTTTTTAAA 


ACTGGCCCAA 


GCACTCTGGA 


TATATGGCTT 


ACCTGCAACT 


8460 


AACATAATAG 


GCGTTGTTGC 


TAGAAAGGTT 


CCCCAATGCA 


TGACTTGATG 


ACTAATGCTA 


8520 


CCTGTCAACA 


TCCCAATCAT 


GAGAATCACA 


AGAGGCACAG 


TAAAGATACT 


AGTAATCCAA 


8530 


AAACCTTGCA 


GGAGAGATAG 


AGATTTTCCA 


GTCTTCTCAA 


CGACTGTATA 


GCTTCCCTTT 


8640 
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TGCATCTTCA 


TGCCACAAGA 


AAATTCATGT 


CGCCCTAATT 


CTTGAGCCGT 


AAAACGAATG 


8700 


ACTTTCTCCT 


CATCTACGCC 


GATTGGTTCC 


AAGATACCTT 


CTTCTTCAAA 


CAGAATTTCC 


87 60 


TTATAACAGT 


TTGAAGGAGT 


AGC ACGATG A 


AAGGTAATCT 


CAGCTGGAAT 


TCCCTTTTGA 


8820 


AGCTGGATAT 


GGGCTGGATG 


AT AGC CTTTT 


TCAGCTCGGA 


TACGGATTTT 


TTG AATGCC A 


8880 


TTTTCTAAGC 


TTGCTTTCAC 


AATTTCTGTC 


ATAGTCTCCA 


CCTACTCTAC 


AATCATCTTG 


8940 


CCGTGCATCA 


TGTTCATACC 


ACAAGCAAAG 


CCAAACTCTC 


CAGCCTGTTC 


AGGCGTGATT 


9000 


T C C ACT AC AT 


ACTCTTCCCC 


CATTGGCAGG 


TTCGCATGTA 


CACCAAAATC 


TGGAAAAACA 


9060 


ATTTGATCCA 


GACATGGTGA 


AGGATCCTTG 


CGGTCAAAGA 


CAATGCGTGC 


TGGCACTGAT 


9120 


TTCTTGAGGA 


CAATCAACTC 


AGGAGTATAG 


CCTCCCATGA 


CTTCCACTCG 


AATCTCTTGG 


9180 


TATCCGTTTT 


TTTGCTGGGC 


TTTTTGTCCA 


GATTTTTCAG 


GCTTTTTGAA 


AAACCAAAAC 


9 2 40 


AAGATAAACG 


CGATAAGGGC 


AATACAAATA 


ATGGTTACAA 


TACTATTTAA 


CATGACGTCT 


9 300 


CCTTTACATA 


CAATTACATC 


TTACTTCTCT 


TACAGCACTT 


GATTTCTTCT 


CTGAAATCAC 


93 60 


AGCTTCCAAG 


TCTTCCAAGT 


CAGTCTGACT 


AAATTCACAT 


TCTACAATCA 


AGTCAGCCAA 


94 2 0 


CAAATTCCTA 


ATCCTACGGG 


AACAAACCTT 


GTCTTTGATA 


TCTTGGACAA 


GTAAATCCCG 


94 80 


ACTTTGGTCT 


AGAGTTAAAA 


GGGCTGAATA 


AACAAAGGAC 


TTGCCTTCTT 


TTTTCCGAGT 


9540 


CAAACACTCT 


TTATCAACCA 


GACGAGCCAA 


AAGTGTCTCA 


ACCGTGGACT 


TGGACCAGTC 


9600 


AAAC CGCTCT 


GCCAAAACCC 


TAATC AAATC 


TGTACTGGTC 


TGCTCCCCCT 


GCATCCAAAT 


9660 


AATCTTCATG 


ACCTGCCATT 


CTGCATCTGA 


AATCTGCATT 


ACCATACCTC 


CAAAATCTAC 


9720 


ATTTGTCAAT 


TACACTCATC 


AGTATACTCT 


TAAAATCTAC 


ATTTGTCAAT 


TAT AG AAAT A 


. 9780 


ATATTTTCTT 


CGAAAAATAG 


AATTTT AATC 


ATTTGAAAAA 


CGATTTGCAG 


TC AAAT ATT A 


9 840 


CT AT AT AAAC 


AATAAAAATA 


TGCTATACTA 


AAGAAAAAAG 


AAAACAACCA 


CTAGGGGTGC 


9 900 


GTAAAGCTGA 


GATTAACGAC 


TGTTAGATCC 


CTCTGACTCA 


ATCTAGGTAA 


TGCTAGCTGA 


9960 


TGGAAGTGGA 


AATGATAATG 


GGGACTAGCA 


GTCTTCTATT 


GCCTTTCTAA 


AACAGACTAG 


10020 


CTTGTTCTTA 


AGAATACAAA 


CTTCAGTTGG 


TTGGGAGGTT 


TTAGATGACT 


TATTTACCCG 


10080 


TTCCTTTGAC 


CATTGCAGGG 


ACTGACCCTA 


GTGGTGGTGC 


TGGCATTATG 


GCAGATTTAA 


10140 


AGTCATTCCA 


AGCGAGAGAT 


GTCTATGGAA 


TGGCTGTTGT 


AACCAGTCTT 


GTCGCTCAAA 


10200 


ATACCAGAGG 


TGTTCAGCTA 


ATCGAGCACG 


TTTCTCCTCA 


AATGTTGAAA 


CCCCAATTCG 


10260 


AGAGTGTCTT 


TTCTGATATT 


CCACCTCAGG 


CTGTAAAAAC 


TGGAATGTTG 


G CT ACT ACTG 


10320 


AAATC ATGG A 


AATCATCCAA 


CCCTATCTTA 


AAAAACTGGA 


TTGTCCCTAT 


GTCCTTGATC 


10380 


CTGTTATGGT 


TGCTACAAGT 


GGAGATGCCT 


TGATTGACTC 


AAATGCTAGA 


GACTATCTCA 


10440 
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AAACAAACTT ACTACCTCTA GCAACTATTA TTACGCCAAA TCTTCCTGAA GCAGAAGAGA 10500 
TTGTTGGTTT TTCAATCCAT GACCCCGAAG ACATGCAGCG TGCTGGTCGC CTGATTTTAA 10560 
AAGAATTTGG TCCTCAGTCT GTGGTTATCA AAGGCCGACA T CTC AAAGGT GGTGCTAAAG 10620 
ATTTCCTCTT TACCAAGAAT GAACAATTTG TCTGGGAAAG CCCACGAATT CAAACCTCTC 10680 
ACACCCATGG TACTGGATGT ACCTTTCCTG CAGTGATTAC TGCTGAACTA GCCAAGGGCA 107 40 

ACAGTCTTTA CCAGGCAGTT GATAAGGCCA AGGCCTTTAT CACAAAAGCT ATTCAAGATG 10800 

CCCCTCAACT CGGTCATGGT TCTGGTCCAG TCAACCATAC AACTTTTAAA GATTAAGAAA 10 8 60 

AAAAACTCTC TAGTTCCCAC TTTAAGGGAA TTAGACAGTT TTTATACTCT TCGAAAATCT 10920 

CTTCAAACTA CGTCAGCTTC CATCTGCAGC CTCAAAACAC TGTTTTGAGC TGACTTCGTC 109 80 

AGTCTTATCT AAAACCTCAA GGCAGTACTT TGAGCAACCT GCGACTAGCT TTCTAGTTTA 11040 

CTCTTTGATT TTC ATTGAGT ATTAATTAGG AAAGAATGTT ATGCAACTTT TTTAAAAAGG 11100 

CTTGCGTTTT TGCCTCAATA TCTTCTGCTT GCATCAAATC ACGTACAACA GCTACACCAG 11160 

CTATGCCAGT GCCCATAAGC TGATCAATAT TCTCCGAAGT CAAGCCTCCA ATAGCAACTA 112 20 

CTGGAATGGC AACCGTTTGG CAAATTCTTT TCAAGGTCGA TATCAGAGTA ATGGGCGCAT 11280 

TTTCCTTGGT GGTGGTTGGG AAAATGGCTC CTGTACCCAA GTAATCTGCA CCTGATTTCT 11340 

CCGCTTCCAG AGCTCTTTTA ACCGTTTTAG CGGTGACACC GAGGATTTTT TCAGGACCCA 11400 

AGACTTTGCG AG CT ACCG AA ACTGGTAATT CATCATCTCC G AT ATG C AG A CCTCCTGCAT 114 60 

CAACCGCAAG ACAAACATCC AACCGATCAT CG ATT AT C AA GGGTACCTGA TAAGCATCTG 11520 

TTATTTCCTT GACTTGTTTT GCCAGTTGAT AATATTGATT CGTTGTGAGA TTTTTTTCTC 11580 

GCAATTGGAC TATGGTAACC CCTCAACCGC AGGCCGTCTC AACTTTTGCA ACAAAGCTTT 11640 

CCACGGAATC TTGATAGCGA TTGGTTACCA GATATAGTCT AAGTGCTTCT CTATTCATAA 11700 

ACCTCTCCTT TGATGGTATC TAGCCAATTT TCATCTCTTC TTAGGAGCGA AAGCTGATTG 11760 

AGTACTTGGT AACGAAATTC TTCCAATCCC ATTCCTTGAA CAACTATTTT CTCAGCAGCG 11820 

AT ATT GAG AT AAGAGACTGC TAAGCAAGAA GCTTCAAAAC CAGTCTTTCC TTGGCTGAGA 11880 

AAAACAGCTG TTAAGGCTCC AACCAAGTCT CCTGTCCCTG TTATCCAGTC TAATTCAGTA 11940 

CAGCCATTTC CCAGTACAGC GACCTGATTT TTCGAAACGA CGAGGTCCTT GGGACCTGTG 12 000 

ACTAAGAAAG ACATACCAGG ATAGGTCTGA CACCAGTCTT TCAAGACTTG AAGCAAATCC 12060 

TCCGTTTCTT GATCTTTACC ACTCGCATCG ACCCCAACGC CGTGGTGCTT TAATCCAACA 12 120 

AGACTTCGAA TTTCTGACAT GTTTCCTTTA AGGACCGTAG GTCTATAGTC TAAAAGGTCT 12180 
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TTAACTAACC TCTTACGAAT GGATGAAGTC GTTACGCCAA CCGCATCTAC TACCATCGGG 1224 0 

AG AG AAG ATT GGTTTGCATA CGAAGCTGCC ATGCGGATTG CTTTTTC CTT CTCAGCTGAC 12 300 

AAATGCCCCA AATTGATGAA GAGAGCCTGA CTTTGCTTAG TAAAATCAAG AACTTCACGG 12 3 60 

GAATCATCTG CCATGACAGG TTTGCATCCC AGAGCCAAAA TCCCATTTGC CAGCATCTCA 12420 

C AAG AAATCT CATTGGTAAT GCAGTGAATG AGGGAACTAG AGCCTATAGG AAAGGGATTT 12 4 80 

GTAAATTCCT GCATCAGTCT ATCCTTTC AC TAAAGAAATA TCCCTGCACT TTTTTAAAGA 12540 

ATTCCTCCTT G ATT AAAAAT CGAAAGGCAA TAAAGGAAAT CGCTGTACCA ATCAAGGTTG 12 600 

CTCCGAAAAA TCGACGCGTG TAGATAAACC AGCTAAGCTT AGCAGCTGAT CCTGTAAAGA 12 660 

GTACCATAAC AGGATAGGAA ACAATGGAAC CAATAATACC TGTTCCCAAA ATCTCTCCTA 12720 

GAGCAGAATA GTGAAATTTT CGACCGTACT TATAAAAGAG ACCTGCTAGA AGGGCTCCAA 127 80 

AAGTCGCTCC TGTGAGAGCT AAAGGCGGAA TCCCTTGAGT CGTCATACGG ATAAAGGCTG 1284 0 

TGACTGTAGC CATAGCCAAG GCATAAACAG GTCCCATCAT GATTCCTGCT AGAATATTGA 12 900 

CT AC ACTGG A CATCGGTGCC ATTCCCTCAA TTCGAAAGAT AGGTGTAACG ACTACATCAA 12960 

CGGCAATCAT CATAGATAAA ATGGTTAATT TGTGAACTTG TAATTGGTGC TTTCTCATGC 13 020 

TTCTATTCTT CTCCTTTTTC TAAAGACTGT AAATCGCTCT TCCATGTCTG GTGTTGGTAG 13080 

GCCATTTCCC AAAACTTGGC TTCCATATGA ACACTGATGT GGAAGGCATC TAGCATTTTT 1314 0 

TGCTTGTCTG TCTCGTCACT TTCT CG AT AG AGCTG ATTG A CCAGTGCTCC CTCCTCTCTG 132 00 

ATCTGTTGCT CT AACTC AT C CGTAATATAA GTTTCAATCC ATTGTTG AT A GAGAGGATTT 13 2 60 

GGTGATCCTT T AAG ATT AAG TGATTTGCCT A7ATCATCCT ATAACCAAGG ACAAGGAACC 13 320 

AAGCTTGCAA AAGCC ATGGC TAAGTTCGGT TCTGCAAATT GCCTATAAAT ATGAGAAATG 13 3 80 

TAATGATAAC AGGTTGGAGC GATTGGATGT TCCTCCATTT CCTGGTCGCT GATTTCCAAT 13 4 40 

TCCTTGAAAA ATTGTTGG CG AATAAATAAC TCACCCTCCA CTAAACCCTG AGCATTTTGT 13 500 

TTCAAGAGTC TTTTCATCTC TTGGTTTGAA GTCTTATCAG CCAAAAGATG AT AG ATTT CT 13560 

CAGAAACCCT TCAGATAGTA GGCATCCTGA ATCAGGTAAT AGCGGAAAAT GGCAGGTTCT 13 620 

AAATTCCCCT CTTGTAATTG TAAAATAAAG GGATGATGAA AGGAAGCCTG CCAAGCTTTC 13 680 

TTGGATAATT CCATCGCAAT ATCTGTAAAT TCCATAATAA CTCCTTTATA AAAATAGACT 13740 

GGTTTGAAGC AATAAAAAGA AAAGCAGGTA GATTAATTTT GTTTTTTTAG GAATATAAAA 13 800 

AGTCCGATAG CTATTCTTCA ACTCTGCATG TTCGTCATAT CCGTGAGCAG ATAGAGCTCT 13860 

CAGGTAAAGA TGGCGCCACC TAAAGACTGT CATCAGAACC TTACTGTAAA TCAAGGGCGA 13 920 

CCAAAAATGT AGTTCTTGAC CACGTAATAG G C AAG C TTCT TTGAGGGACT TCATTTCTTC 13980 
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CTGAATGAGA 


GGAAAAGAAT 


TGAATACCAC 


AATC AAGG C A 


TAGGACTCAAG 


AGCGTGATAG 


14040 


CCCCi ITTGA 


GCCAAGTACA 


AG AG AAGCTC 


TTTT AGTG AA 


ACAGAGGAAA 


CAAAGACAAG 


14 100 


G C CG AT AC AA 


ACTGTC ACAA 


AGGCCCTCGT 


TCCAAGCATG 


ACTGCCTGTG 


AAGCATCTCC 


14160 


GTGTAACTGA 


ACTGCC C AGT 


AGTTGGCAAA 


AGATGGTAAA 


ATGGCAAGTA 


TGATCATCCA 


14220 


AGCTAACATT 


TTAAATCGAC 


GGTAATAGAG 


CATAAAGAGA 


ATACAAAATG 


CGACTACCGA 


14280 


AAGAGTCAGA 


GCAATCGAAG 


GAATGAAAGA 


TGTTTCCAAG 


GATAAAATCA 


GCAAGAAGAC 


14340 


ACTG AT AAT C 


GGTGTCTGGG 


TTGCTACTTT 


G AC CAT AC T A 


TCTCACCTCC 


CCTTGGGTAT 


14400 


TGCTACTCTG 


AGATGTAAGT 


GGTTTGGTAA 


TGGTCACTTC 


TTTCACATGC 


CGAAGACCCT 


1446C 


GACTAGTCAT 


CTCAATCCAA 


TAATCAACCA 


CAGAAATCAA 


AGGGTCTAAA 


CGATGACTAA 


14520 


TG AG C AGAAA 


ACTTCTTCCT 


TGATTCCTCT 


CCTCCACAAT 


CCACTTCCAA 


AAATAATGGC 


14580 


AGGCTCTATC 


ATCCAAACCT 


GCAAAAGGTT 


CATCTAGCAA 


GATCACGGAA 


GCCTTACTGG 


14640 


TCAAGATGGT 


CAGGAGCTGA 


AGAATTTTTT 


GCTGACCACC 


ACTTAATTGA 


TAGGGACTCT 


14700 


TATCGACTGC 


CTGCTCCAAA 


TCAAAATATC 


GTAAAGCTTG 


AAAAATCCGC 


TGATTTCTTT 


14760 


CAGAATCAGG 


TCCATCTAAT 


TGAAGCTCCT 


CTCGCAGACT 


GACTCCGATA 


AACTGCTTCT 


14820 


CAGCTTCCTG 


AACAACACCA 


GTCAGATCAC 


GATACAAACT 


CTTTTTCTTT 


TTCAGGACCG 


14880 


AACCCTTCCA 


AGTAATGCTC 


CCCTTATACT 


TTTGAAATTG 


AAGAATAGAC 


CGAAAGAGGG 


14940 


TTGATTTCCC 


GACACCATTG 


TCACCCAGGA 


TACAGGAAAT 


CCCTTGATAG 


AATGTGAAAT 


15000 


CAGCAATTCA 


AAAG AGGG GG 


CCATTACCAA 


CCTCACCAGT 


CACACGGTTC 


AT ATGG AAT A 


15 0 6 0 


GTTCCGGGCT 


AGAAGCAACT 


TCCTTTGAAG 


CAACCTGTGT 


CA7CTCATAG 


GAAGCCATTT 


15120 


G AAAC A CTT C 


CCTTAGTTTT 


CCGTCTCTTA 


GCTCCACCAT 


ATGGTCGATA 


TAGGCTTTAT 


15180 


AGTCAGATAA 


ATCATGGTCG 


CACAAAATAA 


CTGTCTTCCC 


ATCATAGACC 


AACTCTTTTA 


15240 


GAATCTCCAA 


TATCTCGATT 


CTGCTCTTGC 


GGTCAATGGA 


AGCGAAGGCC 


TCATCCAAGA 


15300 


GATAGACCCT 


AGGATTCATG 


GCAAAGAGGA 


CAGCCAGCGC 


TCCTTTTTCC 


TTTTCCCCAC 


15360 


CTGATAAGTG 


ATGCATGAGA 


CGGTGCAAGA 


TGTCCTTGCA 


ACGACATTGC 


TGGACAACCT 


15420 


CTGCTATTTT 


AGAATCAATT 


TCCTGAAGGT 


GATAGCCGAT 


ATTTTCCATG 


GTAAAAACCA 


15480 


ACTCCTCAAA 


CAAGCTCTCC 


ATGGTAAATT 


GATGATTAGG 


ATTTTGCAAG 


AGAATACCAA 


15540 


CCGTCTGGAC 


ACGTTCGACG 


AT AG AAAG C T 


GACTGACCTC 


GCTCCCATCT 


ATCAGGACTT 


15600 


GACCGCTA7A 


GGGAAGAGAA 


CTAACTTGGG 


CAATCATTTC 


AAAGAGGCTG 


CATTTTCCAG 


15660 


ACCCACTACT 


CCCAACTAAC 


AAGGTAAAGG 


CTTGCGCATG 


AAAAGTAAAA 


TCAAACGGCT 


15720 
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CAGAGAAGAT TCCGGACTGA ATCGCTCGTA GTTCCAGACC CATCTATGCT TTTCCTCCAG 15780 

TTGCAAACTG ATGATACAGT TTGACAATGG CACGAACCAA GATGGTACAG AAGAAATAAA 15 84 0 

CAGAAATAAA ACGTACCACA AGCAAGGAAA GGACAAACGG AAGGGAAAAG GCGTAGTAAC 15 900 

CTAACTTAAT GTATTCATAG ACAAAGCTAA CAAGCGTAAT CCCAATACTA TTAGCAGTTA 159 60 

GAGAGAGCCA ACTTTCATAG CGATTCTTAG TTACGATAAA ACCAAATTCA CTTCCCAAAC 16020 

CTTGAACAAA GCCAGACAAA AGAGCTCCTA GACCAAATTG GCTACCATAA AGGACTTCAG 16080 

CAAGCGCAGC TAGCACTTC7 CCAATCGTTG CACTTCCGAC TCTCGGAACA AAGATGGCAG 15140 

CAATGGGCGC AGCCATACAC CAGAGACCGA AGAGGATTTC ATTGGCAAAG GCCTGCAAAC 162 00 

CAAGAGGTGT TAAGAGTAGA CTGAGAATAT TATACACATA TCCTGAACCA ACGAAAACCC 16260 

CACCAAAAAA GATAGACAAG AAAGCAAGCA AGATAACATC TTTTAACTGC CATTTTTTCA 16 320 

ACATAAAAAA CTCCTTTTTT TAAAGAAAAG TGAGGCACTC AAGAAGACCG ACCTAAATAC 163 80 

TTTGTATAGC AGACTGAATT TAGAACAGTA CACAAGAACA CTAAAATATT TCTAGAAATT 16440 

AATTTGAATT TTCTAATTGA TTTGTTCGCA TCTTATTTCA ATCTACTATA TCATCTTCAT 16500 
CCAGTTTCGT AAAAGAAAAA AC T CT AATT A C AG AT AC AAA TTAGAGTTCA GCTTACAAGA 
TTAGACAGTT CTTTTCGACA TACGAAAAAA ACATTTCACA TTTCCCTTCG CCAGTCTTAA 

CTGTATCAGG TTCAATGGCT ATCATCTCAG CCTAAAGCAC CCCAAATGTC TTTATTATTT 16680 

AATTATGTGA TT ATT AT AAC ACACATTTTA TACTAGTTCA AGAAATTGAA CTGGAAATAC 16740 

AGCCTTGCAC TCACAAAGAC AGCAGATCTT TCTTTTCCAA AAAACAAATG ACCTGTTTGA 16800 

TGAATTAGCC ATTCAAGCTG AATCTGGACA TAGCTTTTTA AAAAAGGAAA ATCCTACTTA 16860 

CTTAGAATCC AAGG AT AG AT ATCTATTGTT CACTCATTTC CCGAACAGTT TTTTCTATAT 16920 

TTTTTGCATA CGATATTGCC GAAATCATTG AAACGCCATC CATATTGGTC TTTATAATGT 16 980 

CTTTAATATG TTTCGTCTGT ATCCCACCAA TTGCAACTAA AGGCATTTGT GGCAATAGTT 17 04 0 

TTCTCATCAA TTCAAGACCT TCATAACCTA TAGTACCACC AGCATCATCC TTTG ACTGC G 17 100 

TACCAAATAC AGGCCCAACA CCTACATAAT CTACATATTC AACTTTTGAT TGTTGAAATT 17160 

CTTCTTCGTT TCTTATAGAA AG AC C AATT A TTTTATCTGG CAT C AATTTT CTAATTTCAT 17 220 

CAACACCAAT ATCATCTTGA CCTACATGTA CGCCATCGGC GTCAATTTCC ATTGCTAAAT 17 280 

CTATATCGTC ATTAACGATA AATGGAACAT TCTATTTTTT ACAAAGTTCT TT AATTTGG A 17 340 

TAGCTAGCTC AAGTTTTTCT AAGCCTTCTA AAGCACCCTC ACCTTTTTCT CGAAATTGAA 174 CO 

ATAAGGTTAT ACCACCTTTT AAGGCTTCCT CAACGACTGT ATATAGATTT TTTCCTTGGC 17 4 60 

AAGTAGTCGT TCC AC AAAT A AAATATAGTT TTAGTAATTC TTTATGAAAC ATCTTACTTC 17520 
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ACTC TTTTG A ATTCCTTTAC ATCTTCATCT GTAATCTCGT ATAAGGCATT TATAAATTCA 17 530 

ACTTTAAATG TCCCAGGAAG ATGTCCATTT GGACGTTTTT CTGCTATTTC TCCAGCCA7A 17 64 0 

TTGTAAACCA ACACTGCTGT TTTTAATGAT TTCAATTCTT GACCTTTTTC TAGTCCGATA 17 7 00 

AAGCTTGCTA CTACAGCTCC TAATAAGCAT CCTGTCCCAA TGACTTTCGG CATCATAGCA 17760 

CTACCATTAT GAATCATTAC CACTTCTCCA TTAACAGCAA TGGCATCCAC TTCACCTGTT 17 820 

ACTACTATTG GAATATTGAA CTTCTCATTT GCTGCTAGAG CAATTTCGTC AATATTATCT 17880 

ACGCCCGCAC TATCTACTCC TTTAGATGCC ACATCTATTC CTACTAAAGA GGCAATCTCG 17 940 

CCAGCATTTC CTCTAATCGC TGCTAGTTTA TAATTGTTGA TTAGATCATC TCCTACTTTT 13000 

TTTCTATATT CTCCTGCTCC ACAGGCTACA GGATCTAAAA CTGCTGGGAC ATTATATTTC 18060 

TCTGCAATTT TCAGAGCAGC TTGGTATAAT TTCCAATTTT CATCTGTCAA TGTTCCTATG 1312 0 

TTTATTAATA AACCACCAGC ATACTTTAAC AAATC CTCT A AATCTGCTGC AAACTCACTC 18180 

ATGGCTGGTG AGGCGCCCAG TGCTACTAAT CCATTTGCTG TGAAATTTTT TACTACATCA 18240 

TTGGTTATAC AAATGACCAA TGGTG CTTTT TCTTTTAATA ATTTTAAACT TGTCATATTG La30O 

AAATCCTTCC TTTTCACTTT ATACGATCTA CTAATTTCGA TTTATCTTTA GTTGAGAATT 18 360 

TTTTTCATTT ACATTCAATG ATTTATACTC AATG AAAAT C AAAGAGCAAA CTAGCACCCT 184 20 

AACCGCAGGT TGCTCAAAAC ACTGTTTTCA GCTTGTCCAT AGAACTGACG TGCTTTGAAG 184 80 

ACATTTTCCA AGAGTCTTAC CTCATCAAAT TTGTAAATAT CATGAGCCTT CTCTAGACAT 18540 

CGTAACCAAT ATCAAAAAAA GCTAATTCTA AAGCCACTCC TTGATTCCAG CGTTCCTGAA 13600 

GTTCTGTCAA ATCTTCTCGA TTTTTACCGA CACGATTGAG TTCGTCAACC AGAAATTGAA 13660 

CCCACTCTGC AAAGAAAGGA CCTCTGTGGA GATTGATCCA TTCCGAATGA ATATAGACTT 187 20 

CAGGTAAAGC CAAATCTTTA GAACCCCAGT CTAAATAGAG ACCTTCTGCA ATGACCAGCA 187 80 

TGACCAAAAG ATGGGCATAG TCTGATGAAG CCACCGCCGA ATACATTAGA TCCTGAAAGC 18840 

CTTTTGTTAC AGGGTGCAAA GTCACTTCTA GATAGTCATT CTCTGCTACT TTTAACTCTT 18900 

TAAAAGCCTT TTGGAAATAA CCATCTTCAT CTGCTTCAAG AAAGCCTAGT TGCTTGGCAA 18 960 

AACGAAGCTT GG ATT C AA GT TTATCTGCGT GACTACGCAG GCACCCAGCA TGGATAAGAA 19020 

GGCATCAAAG AAGTGATAAT CTTGAATCAG ATAGTCCTTT AAGACCTTAT TCTCAATTGT 19080 

CCCCGCAAAA AGTTCCTTAA CAAAACGATG ATTGATTGCA GCCTGCCAAT CCTTCTGACT 1914 0 

GCTTTTTAAT AATTCTCCAA CAGTCAAACC TCGCTGAAAT GCATAGTCTT GTGTTTCCAT 192G0 

A7TTACTTCT CCTCTCTTTA CTTGTTAGTA ATTAATAAAA CACCAAGAAA TATCAAGCAA 192 60 
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AATCGTAATT CCACTTCATC CTTTTAAAGC ACATCGAGAG CA??"' - :AGA GAGCTAACTA 19 3 20 

AACAAGCCTA TCCAGTTTAT ATAAACAAAA AACTCCAATT AC AATC AAG A ATTAGAGTTG 19 3 80 

ACTTACAAGA TTAGACCCTT CATTTCACCA TACGAAAAAA CTGTTCACAT TTCCCTTCGC 19440 

CAGTCTTAAC TGTATCAGGT TCAATGGGTA TTATCTCAGC CTAAAGCACC CCAAATGTCT 19500 

CTATTATTTA ACTACTGAAC CAGTATAGCA AAAAATGAAA GCCCTAGCAA GATATTTGAC 19560 

CGAAAAATAT CTTT AT AT AT AATATATTGA AACTAGAATA GTACACCTCT ACTTATAAAA 19620 

CATTGTTAGA AATCGATTTG ACT GTCCTG A TTGATTTGTC CTATTCTTAT TTCATTTTAC 19630 

TATAGTTTTC GATAGCAATT TATTCTTCCA ATACACGAAG AAAAACCTCC ACATTCAGTG 19740 

GAGGCAATCT GTTTTATCAA T AC AATTTT A AGTCACGAGG GTCAACTGGG AACGTTGGGT 19 800 

TGTATGGATT GTGACGGAGC TTGAAGTGTT TGACATCTTC AATGGTCTGA GTTCCAGACA 19 8 60 

ATTG CAT AAC TGTCTTCAAT TCCGCATTCA AGTGTTCAAA GACTTGACGC ACACCGACAC 199 2 0 

TACCACCGAG AGCCAAGCCA TAGATGACAG GGCGTCCAAT AGCAACCAAC TCTGCTCCTG 19 980 

ATGCCAAGGC TTTAAAGACG TGTTGACCAC GACGAACACC AGAGTCAAAG ACAATCGGCA 20040 

CACGTCTATC AACTGCTTCT GCCACTTCTT GAAGCGAGTC AAAGGC AGCT GGTCCACCGT 20100 

CGATTTGACG AC C ACCGTGG TTGGTTACCC AGATACCAGA AGCTCCTGCA GCAAGCGAAC 2 0160 

CTTCAACGTC CTCACGCCAT TGTGGTCCCT TGACATACAC AGGAAGACCA GAGTATTCAG 20220 

CGATAAATTC TACATCGCGT GGAGACAAGC GTTGTTTAGC TGATTTGTAA ACAAAGTCCA 20280 

TTGATTTACC ACCACCTTCT GGCAGGTATT CTTCAACAAT CGGCATGCCA ACTGGGAAGA 2 0340 

CAAAACCATT ACGCTTATCC ACTTCACGAT TCCCCCCTAC AGTAGCATCT CCCCTCAAGA 20400 

CAATCGCTTT ATAACCTTCA GCCTTCACAC GGTCCATGAT GTGGCCGTTG ATACCGTCAT 204 60 

CCTTACTAAA GTAAAATTGA AACCAATGAG GTGTCCCTTG GAGGGCTTCA GAAATCTCTG 20520 

GAAGGTCAAC AGTAGAGTAA GAACTGGTTG TATAAAGAGA ACCAAACTCA TGCACACCAC 2058 0 

GCGCAGTCGC CACTTCCCCC TGTTCATTTG CCAATTTATG AGCCGCAACA GGTGC C AT AA 2 0640 
TGATTGGAGA AGATAGTTTT TCACCTGCAA ATTCAATCTC TGTACTTGGA TTTTCTACAT 207 00 
TGCAAAGTGT ATGAGGAACG ATGAGCTTGT GGTTAAAGGC ACGGATATTC TCTCTTAAAG 207 60 
TGAAAGTATC TTCCGCCCCA CT AG CG AT AT AGCCAAATGC TGCTTTAGGA AT AACTT GTT 20 8 20 

GCGCCATTGG CTCCAAATCA TAGGTATTGA TGAArTCTAC ATGACCTTCT GCATTGCTTG 2 0 880 

TTTTGTATGA CATAAAATGT CCTCCTTAAT AAGTAAGCCT TTACTTTGTG TATTACAAAA 2 0940 

ATATCTTAAC TCTTTTTCAA AACTTTTAAA AT ATTTTCT T TGGAAATTTC ACAAATTTTA 21000 
TGTCTATGAT AAAAATCCTT AT AAC GGC AA TAAAAAATAG ATATTATCCA AAGAAGATTT 21060 
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TAAGTGCTAC AATAACTGTA TTATTTCTAG ATGGGACCTT CTATTTTTGG ATTGATCCAT 21120 

TGTTGAACAA TATCTACCAC TATATCAAAA GGCATTCTTT CTGACCTTGC ATATTGCAGT 21180 

TTGGGGAATT TTGGGATCCT TTCTGCTCGG TTTAATCGTT AG T AT C AT CC GACATTATCG 212 40 

AATCCTTGTT TTGGCGCAAG TAGCGACAGC CTACATTGAA TTGTCACGTA ATACGCCCCT 21300 

TTTGATTCAA CTCTTCTTTC TCTACTTCGG TCTTCCCCGA ATCGGGATTG TCCTATCTTC 21360 

AGAAGTCTGT GCAACGCTTG GGCTTGTCTT TTTAGGAGGC TCCTATATGG CAGAATCTTT 214 20 

CCGAAGTGGG CTGGAAGCCA TCAGTCAAAC CCAGCAGGAG ATTGGCCTCG CTATTGGTCT 21480 

GACACGTCTA CAGGTCTTTT ACTATGTGGT TCTTCCGCAA GCAACAGCGG TGGCACTCCC 21540 

CTCCTTTAGT GCCAATGTCA TTTTCCTTAT CAAGGAAACC TCTGTTTTCT CAGCAGTGGC 21600 

TTTGGCCGAC CTCATGTACG TCGCCAAGGA TTTGATTGGT CTCTACTATG AGACAGACAT 216 60 

TGCGCTAGCT ATGTTGGTAG TTGCTTATCT AATCATCCTC CTACCCATCT CACTGGTCTT 217 20 

TAGCTGGATA GAAAGGAGGC TCCGCCATGC AGGATTCGGG AATCCAAGTA CTCTTTCAAG 21780 

GAAATAATCT CCTGAGAATC TTACAGGGAT TGGGCGTTAC G ATTGGG AT A TCCATCCTGT 21840 

CTGTCCTCTT ATCCATGATG TTCAGAACAG TCATGGGAAT CATCATGACC TCCCATTCTA 21900 

GAATCATACG ATTTTTAACA CGATTGTATC TGGAATTTAT CCGTATCATG CCCCAGCTGG 21960 

TGCTACTCTT C ATCGTT T AC TTT GGCTTGG CTCGAAACTT TAATATCAAT ATCTCAGGTG 22020 

AGACTTCAGC TATTATCGTT TTTACCCTCT GGGGAACAGC TG AAATGGG A GACTTGGTAC 22080 

GTGGAGCTAT CACTTCTCTC CCTAAACATC AGTTTGAAAG TGGACAGGCA CTCGGCTTGA 22140 

CTAATGTTCA ACTTTACTAC C AC AT CATC A TCCCACAAGT CTTAAGAAGA CTGCTACCGC 22200 

ACCCTATCAA TCTTGTCACT CGGATGATTA AAACCACTTC ATTAGTTGTT TTCATTGGGG 22260 

TTGTGGAAGT GACCAAAGTT GGACAACAAA TCATCGATAG CAATCGCCTG ACCATCCCAA 2 2 320 

CTGCTTCATT TTGGATTTAT GGAACCATTC TAATCTTATA TTTCGCAGTT TGCTACCCTA 22 380 

TTTCCAAACT ATCCACTCAC TTAGAAAAAC ATTGGAGAAA CTAAATGTCT GAAACTATCT 22 4 40 

TAGAAATCAA GGAACTAAAA AAATCCTTCG GAGACAATCC CATCCTCCAA GGACTTTCTC 22 500 

TAGAAATCAA AAAAGGGGAA GTTGTTGTCA TCCTAGGGCC ATCTGGTTGT GGGAAAAGTA 22 560 

CCCTCCTTCG TTGCCTCAAC GGCTTAGAAA GTATTCAAGG TGGAGATATT CTTCTGGATG 22 62 0 

GTCAGTCTAT CGTTGAAAAT AAAAAAGATT TTCACCTAGT TCGCCAAAAG ATTGGCATGG 22 680 

TCTTTCAAAG TTATGAACTC TTTCCCCATC TGGATGTCTT AC AAAAC CTC ATCCTAGGCC 2 274 0 

CTATCAAAGC TCAAGGAAGG GACAAGAAAG AAGTAACGGA AGAAGCTTTG CAATTACTAG 2 2 800 
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AGCGTGTCGG 


TTTGCTGGAT 


AAACAACATA 


GCTTTGCCCG 


TCAATTATCT 


GGTGGACAGA 


22960 


AGCAACGTGT 


TGCAATTGTC 


CGTGCCCTCC 


TAATGCATCC 


AGAAATCATC 


CTTTTTGACG 


22920 


AGGTGACTGC 


TTCGCTGGAT 


CCAGAAATCG 


TGCGTGAGGT 


CCTGGAACTT 


ATCAATGATT 


22980 


TGGCCCAAGA 


AGGCCGTACC 


ATGATTTTAG 


TAACCCACGA 


AATGCAGTTT 


GCCCAAGCCA 


23040 


TTACTGACCG 


GATTATCTTC 


CTCGACCAAG 


GGAAAATCGC 


TGAAGAAGGA 


ACAGCTCAAG 


23100 


CCTTCTTTAC 


CAATCCGCAA 


ACCAAACGAG 


CCCAGGAATT 


TTTAAACGTC 


TTTGACTTTA 


23160 


GCCAATTCGG 


CTCATATCTA 


TAAAGGAGAT 


TCTTATGAAA 


CTATTCAAAC 


CACTCTTAAC 


23220 


TGTTTTAGCA 


CTTGCCTTTG 


CCCTTATCTT 


TATCACTGCT 


TGTAGCTCAG 


GTGGAAACGC 


232S0 


TGGTTCATCC 


TCTGGAAAAA 


CAACTGCCAA 


AGCTCGCACT 


AT C G ATG AAA 


TCAAAAAAAG 


23340 


CGGTGAACTG 


CGAATCGCCG 


TGTTTGGAGA 


TAAAAAACCG 


TTTGGCTACG 


TTGACAATGA 


23400 


TGGTTCTTAC 


CAAGGCTACC 


CTACGATATT 


GAACTAGGGA 


ACCAACTAGC 


TCAAGACCTT 


23460 


GCTCTCAAGG 


TTAAATACAT 


TTCAGTCGAT 


GCTGCCAACC 


GTGCGGAATA 


CTTGATTTCA 


23520 


AACAAGGTAG 


AT ATT ACT CT 


TGCTAACTTT 


ACAGTAACTG 


ACGAACGTAA 


GAAACAAGTT 


23530 


GATTTTGCCC 


TTCCATATAT 


GAAAGTTTCT 


CTGGGTGTCG 


TATCACCTAA 


GACTGGTCTC 


23640 


ATT AC AG ACG 


TCAAACAACT 


TGAAGGTAAA 


ACCTTAATTG 


TCACAAAAGG 


AACGACTGCT 


23700 


GAGACTTATT 


TTGAAAAGAA 


TCATCCAGAA 


ATCAAACTCC 


AAAAATACGA 


C C AAT AC ACT 


23760 


GACTCTTACC 


AAG CTCTTCT 


TGACGGACGT 


GGAGATGCCT 


TTTCAACTGA 


CAATACGGAA 


23320 


GTTCTAGCTT 


GGGCGCTTGA 


AAATAAAGGA 


TTTGAAGTAC 


GAATTACTTC 


CCTCGGTGAT 


23880 


CCCGATACCA 


TTGCGCCAGC 


AGTTCAAAAA 


GCCAACCAAG 


AATTCCTACA 


CTTCATCAAT 


23940 


AAAGATATTG 


AAAAATTAGG 


CAAGGAAAAC 


TTCTTCCACA 


AGGCCTATGA 


AAAGACACTT 


24000 


CACCCAACCT 


ACGGTGACGC 


TGCTAAAGCA 


GATGACCTGG 


TTGTTGAAGG 


TGGAAAAGTT 


24060 


GATTAGTCAT 


TAACTCTTAA 


AAGGAACTGG 


ATTTTAAGCT 


CCAATCCCTT 


TTTAAGATTT 


24120 


TACCTATAAC 


ATCCTGAGTC 


TATCTAAGAT 


GTTCAATCTG 


AACACAGTGT 


ACATACTTTA 


24180 


TCTTCTATTG 


CATATACTTT 


AT C AC AT AAG 


ATACCAATAT 


CCTCTTCACT 


ATGACTAGCA 


H 1 2 4 0 


ATCAAAATTG 


TTGTCCCTTT 


TTCACTAGAG 


AGCTTTCTAA 


ACAATGTTCT 


C AT ATTTT CT 


24300 


ACACTTGATT 


TATCCAAGGC 


ATT C AT AG GT 


TCATCTAGTA 


AAAGAATAGA 


GGGATTCTCC 


24360 


ATAATTGCTT 


GAGCAATCCC 


TAGCTTTTTC 


CTCATACCTA 


GCGAATAAGT 


TTTAACTTTC 


24420 


TGGTCTTTTT 


GCTC AT AT AG 


ACCAACTATT 


TTCAGTGTAT 


CATTGATTTC 


CTGATTACCA 


24490 


ACTACTCCTC 


GTATGCTTGC 


CAAATATTGT 


AAATTCTTAA 


AGCCACTATA 


ATAATTTATA 


24540 


AAACCAGGTT 


CTTCAATCAA 


AGCTCCCAAA 


TTAGCTGGAA 


TTTTTCTCTC 


AGGAACAATA 


24600 
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TTTTCCCCAT TGATTAACAC TTCTCCATAA GACGGACTAT ATAAACCAGC TATTAATTTA 2 4 660 

AACAAT AC AC TTTTCCCTGA GCCATTCGCA CCAGTAATTC CTATAATTTC CCCCTGTTTA 2 4 720 

CAACTAAAGT TAAGGTTTTG AAAAACACAT GTCTTTTTTA ATTTCAACTC AATATTTTTT 24 7 80 

AATGTAATTA TTTCATTCAT TCTATAAACC TCCTCTTTTG ACGAGTGAAA TAGAAAATGC 2 4 840 

TTTGAAAAAG AAAGACTAAA AATAGCAACT GAAGAAATAA ATCTCGTCCT ATATCTCCAT 2 4 900 

TCCCTCGATT CAAAATATAA AATAGATAAT TACTTCGATT TCCTACAAAT AGACCACCAA 24960 

ACACAATCAT GAGTAAAAAG AAACTAACGC AAGCAAAGTT CG 2S002 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 49: 

CAGGTACGGT GAGGCGCAAC TAAAATATAA TTTTCATCTT GATTAGGAAT TTTATCAGTA 60 

TTATGATAGT CAGCATTCCC ATTGATGGAC CATAAGAGCA ATACAACTAA TCCACGCAAA 12 0 

TAAGTATAAA ACATGCGATC TCCTTCGATT GTTTTCTTGT T ATT ATT AT A CCTTATCAAA 180 

GGAGGGCTGG CAAACTTTTC CCTTGACTAG ATACATATTT AGGATGAAAT TAGAATTCTG 24 0 

TTAAAAAAAA TGATATAATA GAATTTATGG ATAAAAATAA GATTATGGGA TTAACCCAAA 3 00 

GAGAAGTCAA GGAAAGACAG GCTGACGGTT TGGTCAATCA CTTTACCGCA TCAGCCAGTA 3 60 

CCAGCACTTC GCAAATCGTT AAA CG AAA TO TCTTTACCCT TTTTAACCCT TTGAACTTTC 42 0 

CCATTGCTTT GGCTCTTGCC TTTGTGCAGG CTTGGAGCAA TCTGGTCTTC TTTGCTGTTA 4 80 

TCTGCTTTAA CGCTTTTTCT GGGATTGTGA CCGAGCTACG AGCCAAACAC ATGGTGGACA 54 0 

AGCTCAATCT CATGACCAAG GAAAAGGTCA AAACCATCCC TGATGGTCAG GAAGTTGCTC 6 00 

TTAATCCTGA AGAATTAGTG CTAGGAGATG TCATTCGTTT GTCTGCAGGA GAGCAGATTC 6 60 

CTAGTGATGC CTTGGTTTTG GAAGGCTTTG CGGAAGTCAA TGAAGCCATG TTAACGGGAG 7 20 

AAAGTGATTT GGTGCAAAAG GAAGTTGACG GCTTACTTTT GTCAGGAAGT TTCCTAGCCA 7 80 

GTGGGTCAGT TTTATCTCAA GTTCACCATG TCGGTGCAGA CAACTATGCT GCCAAACTCA 84 0 

TGCTTGAGGC TAAGACCGTT AAACCCATCA ACTCCCGTAT CATGAAATCG CTGGACAAGT 900 

TGGCTGGTTT TACTGGGAAG ATTATCATTC CCTTTGGTCT GGCTCTCTTG CTGGAAGCCT 9 60 
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TGCTTTTAAA AGGCCTGCCT CTCAAGTCAT CCGTTGTAAA CTCCTCGACA GCTCTTTTGG 1020 

GAATGTTGCC TAAGGGAATT GCCCTTTTGA CCATTACTTC GCTCTTGACT GCAGTGATTA L080 

AGTTGGGCTT GAAAAAGGTC TTGGTGCAGG AGATGTACTC TGTTGAGACC TTGGCGCGCG L140 

TGGATATGCT CTGTCTGGAC AAGACGGGTA CCATCACCCA AGGAAAGATG CAGGTGGAGG 1200 

CTGTTCTTCC GTTGACGGAA ACGTATGGTG AAGAGCCTAT TGCCAGCATC TTGACTAGCT 1260 

ACATGGCCCA TAGTGAGGAT AAGAATCCAA CTGCCCAAGC CAT TCGC C AG C GTTTTGT GG 1320 

GAGATGTTGC TTATCCTATG ATTTCCAATC TTCCCTTCTC GAGCGACCGC AAGTGGGGGG 1380 

CTATGGAGTT AGAAGGCTTG GGGACAGTTT TCTTAGGGGC ACCTGAGATG TTGCTTGATT 14 40 

CTGAAGTCCC AGAAGCTAGG CAGGCCTTGG AGAGAGGATC ACGTGTCTTG GTCTTAGCTC 1500 

TCAGTCAGGA GAAATTAGAC CATCACAAAC CACAGAAACC ATCTGATATT CAGGCTCTAG 15 60 

CCTTGCTCGA AATCTTGGAC CCCATTCGAG AGGGAGCAGC AGAGACGCTG GACTATCTCC 1620 

GTTCTCAGGA GGTGGGACTC AAGATTATCT CTGGTGACAA TCCAGTTACG GTGTCCAGCA 1680 

TTGCCCAGAA GGCTGGTTTT GCGGACTATC ACAGCTATGT AGATTGCTCA AAAATCACCG 1740 

ATGAGGAATT GATGGCCATG GCGGAGGAGA CAGCTATTTT CGG ACGTG TT TCCCCTCATC 1800 

AAAAGAAACT CATCATCCAA ACGTTGAAAA AAGCGGGACA TACAACGGCT ATGACAGGGG 1860 

ACGGGGTTAA TGATATCTTG GCCCTTCGTG AGGCGGATTC TTCTATCGTG ATGGCGGAGG 1920 

GGGATCCAGC AACCCGTCAG ATTGCCAATC TGGTTCTCTT GAACTCAGAC TTTAATGATG 1980 

TT CCTG AG AT TCTCTTCGAG GGTCGTCGCG TGGTCAATAA CATTGCCCAC ATCGCCCCGA 204 0 

TTTTCTTGAT AAAG AC CATC TATTCCTTCC TGTTAGCAGT CATCTGTATT GCCAGTGCTT 2100 

TACTAGGTCG GTCAGAGTGG ATTTTGATTT TCCCCTTCAT TCCGATCCAG ATTACCATGA 2160 

TTGACCAGTT TGTGGAAGGT TTCCCACCAT TCGTTCTGAC TTTTGAGCGA AATATCAAAC 2220 

CTGTTGAGCA GAATTTCCTC AGAAAATCCA TGCTTCGTGC CCTACCAAGC GCTCTCATGG 2280 

TCGTCTTCAG CGTC CTGTTT GTGAAAATGT TTGGCGCGAG TCAAGGTTGG TCTGAGTTAG 2 340 

AAATCTCAAC TCTACTCTAT TATCTCTTGG GGTCAATTGG TTTCTTATCC CTATTTACAG 2400 

CCTGCATGCC ATTTACCCTA TGGCGTGTCC TCTTGATTGT TTGGTCAGTA GGAGGTTTCC 24 60 

TAG C CAC AGC TCTCTTCCCA AGAATTCAAA AACTGCTTGA AATTTCAACC TTAACAGAAC 2520 

AAACCTTGCC TGTTTATGGT GTCATGATGT TGCTCTTTAC CGTGATTTTC ATCCTGACCA 2 580 

GTCGTTACCA AGCGAAAAAA TAAATCAAAA CCACCAGTGT GAACTGGTGG TTTGTTCTGC 2 640 

GGCTATAAGC CGCTTCTACC GGCCAGGGCC AAAGGCCCAC CGAAATAGCT TCCTCGCGCA 2700 

CCACTTTCCC GAGCAGGTGC TAAAGCACCT TAGTTACTTC CTCTTATTTA TTTCGCCAGT 27 60 
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AAACGGATCT 


ACTGACTCGA 


ATAACGTGAG 


CTGGTCTGCT 


ACTCTCTCTT 


CTTGTAATTG 


2820 


ATTCTGAATA 


TATTCAGCTA 


TCACTTTCTG 


ATTACGGCCT 


ACCGTATCTA 


CATAATAGCC 


2880 


TCTACACCAA 


AACTTGCGAT 


TGCCATATTT 


GTATTTTAAA 


TTCGCATGCT 


TATCAAAAAT 


2940 


CATCAAACTG 


CTCTTGCCCT 


TTAAATAGCC 


CATAAAGGAC 


GAAACACTAA 


GTTTCGGAGG 


3000 


AATACTGATA 


AGCATGTGAA 


TATCGTCTGA 


ACAAGCATTC 


GCTTCATGGA 


TTATTACACC 


3060 


CTTACGCTCA 


CATAAGTCAC 


CTATGATTCT 


TCCGATACTA 


GCTTTGTATC 


TGCCATAAAT 


3120 


GATTTGACGA 


CGATATTTGG 


GTGCAAAAAC 


AATATGATAT 


TTACAATTCC 


ATGTGGTATG 


3180 


TGATAAACTT 


TGATTATCCT 


CTCTCATGAG 


GTACCTCCTC 


TATGATATGT 


TGTAGTGGCG 


3240 


GAGAAACCAC 


TTCTATCTTA 


TCATTTTAGG 


AGGTTCTTTT 


TGTTACCACG 


CTAAAAGCTC 


3300 


TATGGAAcCA 


CTAGCATAGC 


TAGTGGTTTT 


CGGGAGACAA 


CAAGAAAGAC 


TGCAATCTGT 


3360 


GGATTGCAGT 


TTTTTATACG 


ATGGATCTAT 


CCTAGATCTG 


ATGTGCAAGG 


CCTACGTGCC 


3420 


G ATC ATCT AT 


CGGTGAACCC 


AAGAGCGACC 


CTCAAGCCTG 


CTTGG AT T G A 


GGTAATAGAT 


3480 


TCAAATATCT 


GTAGTTAGAC 


TATTTGAACT 


TTGATGTAAG 


AAAGAGAAAG 


CGACAGATTG 


3540 


AAGTAATTTT 


AACTCTCTTC 


TATTGCTAGA 


AC AAATGGTC 


GGATAGGTTG 


GTAGTTTGAA 


3600 


AATGAAGATG 


CTATCTATTG 


TTAAATGGAA 


CATAGTGTTA 


TTT ATT AG AA 


AATCGTTTGG 


3660 


TTTATTTCTT 


ATCAAATACG 


AAAAGCAACT 


TAAATATTTC 


AACTAAAATA 


GATGTTATGA 


3720 


AGAAAAGGTA 


AAATGATTTT 


GGCATAGTGA 


GGTTCTGTTC 


TATTTGATAT 


CATATTTTTG 


3780 


ATAAAAACAA 


AAATGTCCAT 


TGCAAAGGAC 


AAAATGCCAA 


GTATATTATT 


TTTTGAAAGC 


3840 


GATATAATGG 


ATTCATAAAG 


GAGGTGTATC 


GTGTCTAGAA 


AACAAGAACA 


AATGGAAACG 


3900 


TTGTTGCTCC 


TTTTGCG AC A 


TAGTAAGGAT 


TATATATCTG 


CTAAAGTATT 


GGGAGAAAAA 


3960 


TTAAATTGCT 


CTGATAAAAC 


GGTTTATCGC 


CTTGTCAAGG 


GAATCAACAA 


AGATTGTCCG 


4020 


GTAGAAGCAT 


TCATTTTATC 






AATTAAATCC 


AAGAAGTTCC 


4080 


CTCGTGGACG 


TTGATGGGAA 


TTTTACAGAG 


GCTTTTGATC 


CTGAAGTAAG 


GCGTGAAAAA 


4140 


TTACTAGAAC 


GTCTCTTGTT 


GACTGCTCCT 


AAGCCACATT 


CTATTTATGA 


TTTAGGAGAG 


4200 


GAATTCTACG 


TAAGCGAGTC 


AGTAGTACTA 


AAAGATCGTC 


AGATATTACA 


AGAGAGTCTA 


4260 


GCAATTTATG 


GCTTAGATTT 


AAAAATGAGA 


CAACGAAAGC 


TTTTTATTGA 


TGGGGATGAG 


4320 


GCTCAAATTC 


GTTCAGCCAT 


TCTAAATCTA 


CTGCCAATCT 


TTAATCAGTT 


GGATTTAGAG 


4380 


CAAATTACAC 


AGAATAAGGT 


■ TCAGCCTCTT 


GACGGAGAAC 


TTGCTCACTT 


TTGTTTGGGA 


4440 


TTACTGATTA 


CACTTGAGAG 


AGAATTGGGG 


GTAAACATTC 


CCTATCCATA 


TAATATAAAT 


4500 
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ATT TTCTCTC 
GTAGCACCTT 
AT TCAAG AAA 
TATCAATACG 
TCTCAGCGAG 
GAGATTGAAA 
AGGAGATTAG 
TATCCTAATC 
TTTGCTTCCT 
GAAAAGCGAG 
G AG CTTTT AC 
GCTTATCATC 
ACGGTAGCTT 
GAGGGTGATA 
GCTTGTCCTT 
TCAAATCAGC 
TGAGAGACAG 
CTTTCAACAT 
GGATATCCAG 
ATGTATTAAA 
GTTAACAAAA 
GTCTGATCCA 
AACTATTGGT 
AGAGAGAGGC 
GTTCTGCTGT 
AGACCATTGA 
AAGCTGCTAG 
AAGTTGTTGC 
AATAAGAAAA 
TCGGAATTCC 



ACCTG7ATAT 
CAAAACCTAC 
TTGAACAATA 
TTGTATCTTC 
TTTTAGATGT 
CGACAGATCC 
ATAATAGAGT 
TGGTTAAAGA 
TGAGTCTGGA 
CACGTCCTCT 
GAGCACGATT 
AATTAGATGA 
TGCAGGAACC 
AACAACGTCT 
ATGGATATAT 
CTTCTTGTTT 
GGAAGTATAG 
CATGTCTTAG 
TGTGTTGACC 
ACATTGATGA 
GAAGAATTAC 
GACGGATTTA 
TGAGACGGGT 
AGAAGGACAG 
GGAGAAGGCG 
TGGGAAAGGG 
GGAGCATTTG 
CAAATTAGTT 
AATTTTGGAG 
CCACACTTAT 



TTTTATCAGT 
TATTGTTGAT 
TTTTAGGATG 
G AG AT TGC AA 
CACTCATTAC 
TGACTTTGTT 
ACAGATTAAG 
GTTAACAACT 
CGAGATTGGT 
AAAAACAGTA 
AGAAAAGCAA 
GCTGATAAAT 
AGCAAGTGTC 
TCAAGCAAAA 
CTGTTCAAAA 
CTGAAGATAC 
AGGTTGCTAA 
TGATTACTAG 
TTATTATCGG 
GAAGACTAGC 
GGGAGATAAT 
CAGATGCAGA 
TATGTGTCTG 
ACCGGTATTG 
GGGGTAGTCA 
GTCAAAGTAA 
AAGACCTTAT 
CGCGCTCAGA 
GGTATCCGTA 
ATTGCACAGG 



448 

AGGAATCGTC 



GAGAAAATTT 
AAGGTTGATG 
AAACCATTTT 
TATTTTAGCC 
GACTTGGCGA 
AATAGTCTTT 
ATTTCTAAAG 
TTTCTACTCT 
GTGATGTGTA 
TTTTCTGAAT 
CTATATCCAG 
CCGTTTGTCC 
ATTCAGGAGA 
TCGTCAAGAA 
AGAAAAAATA 
AGGTGTTCTT 
ATTAAAATCA 
TTTGGCCATT 
AGATGAATCA 
ATATGGAAAT 
GCAAAGAAGA 
ATAGAGACCA 
GGAATTATAT 
TAGCTATAAA 
TTGTACTCTT 
CACTCTTTGC 
CATCTGATGA 
TGAAAATTGT 
AAAAATTAGA 



GT AGT ACT AG 
ACAGTGTCTC 
CAGTTGAGAT 
CTTCCGGGAA 
GTATGTGTAT 
GTCATATCAG 
TATCACAAAT 
AAGTGACTCT 
TATATTTTGC 
CATCAGGTGT 
TGG AT ATT AT 
ATTTAGATTT 
TAGTTAGTGT 
TAAACTATGA 
GCCTACAAAC 
GAAGAGCTTC 
CT AC CAC ATT 
CCTATCACAC 
GCAGTATCAC 
TTCATAAATC 
CAAAGATATT 
GGTTTTTGAG 
ATTTATCGAA 
TGCTATTCCC 
TCACAATGAG 
TGCAGTTGGT 
TCGAAAACTT 
TGTGATTCCA 
TGGTGTTGCA 
GAATGCCGCA 



TATTCATGTT 
TCAAAAAATT 
TGACTATCTT 
GCTTCCTTTT 
GGACAATAGA 
TCCCTTACTG 
T CTTTT AACC 
AGTATTTGGT 
ACGGTTTCAA 
CGGAACTTCA 
TGATGTAGTT 
CATTGTGACG 
TTTTCTAACC 
ATAATCTTTC 
AATTAGCAAA 
TATATTACCG 
GTGAAGGAAA 
AATGGTCGAA 
AGGACAAGTC 
AATTAAAACA 
CTTAATGTGA 
GCATTAGCTC 
GGTCTTTATC 
CATAGCAAGA 
ATTCCTTGGG 
GATGATACAG 
GGTAATGACG 
G CTTTTT GTT 
GCTTGTACTG 
AAGGTAGCTG 



4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
54-00 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
i940 
6000 
6060 
6120 
618C 
6240 
6300 
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GACATGTGAT TCATGTTGAG ACTCAGGGGA CAATAGGGGT AGAAAATGAA TTGAGTCAAG 63 6 0 

AGCAGATTGA TGCAGCGGAT GTAGTTATTT TAGCAGTTGA TGTTAAGATT TCTGGTATCC 642 0 

AAC GCTTTGA GCGTAAAAAG ATTATCAAGG TTCCAACAGA AGTGGCAGTC AAATCTCCCA 64 80 

ATAAACTGAT TGCTAAAGCT GTTGAGATTG TTACGAAATA ACTGAAAATA TTTAAGGAGA 654 0 

AAATATATGT TGAAACACTT AAACTTAAAA GGTCACTTAT TGACACCCAT TTCCTATATG 6600 

ATTCCAATTG TTTGTGC TGC AGGATTCTTA GTTGCCATTG GTTTAGCAAT GGGGGGTGGT 6660 

GTTCCTGACC CTCTTGTAGC AGGAAAATTC ACTATCTGGG ATGCTTTAGC AACTATGGGT 6720 

GGTAAAGCCC TTGGTCTCTT GCCACTTGTT ATTGCTACAG GTTTGTCTTA CTCGATTGCT 67 30 

GGTAAGCCAG GGATTGCACC ACGTTTTGTT GTTGGTCTAA TTGCCAATTC TGTTGGTT C A 6 84 0 

GGGTTTATCG GTGGTATCTT GGGAGGTTAT ATAGCTGGTT TCTTGGTTCA AGCGATTATT 6 900 

AAAAAGGT C A AAGTACCAAA CTGGATTAAA GGTTTAATGC CAACCTTGAT TATTCCTTTT 6 960 

GTAGCCTCTT TGCTAACTAC TTTGATTATG ATTT AT ATT A TTGGAGCGCC TATCGCAGCC 7 0 20 

TTTACCAACT GGTTGACGAG CTTATTACAA AGCTTGGGAA G T GCTTC AAA TGG TTT'G ATG 7 080 

GGGGCAGTTA TTGGAATTCT CAGTGCTGTT GACTTTCGTC GCCCACTTAA TAAAACAGTC 7 14 0 

TATGCGTTTG TGTTGACTTT ACAGGCTGAA GGTGTGAAAG AACCATTGAC TGCTTTACAA 7200 

TTGGTGAATA CTGCTACACC AG TTGG ATTT GGATTGGCCT ATTTTATCGC GAAATTACTC 7 260 

AAAAAAAATA TCTATACTCA AGAGGAAATC GAAACATTGA AATCGGCTGT TCCTATGGGG 7 3 20 

ATTGTCAATA TTGTTGAAGC TGTAATTCCG ATTGTTATGA ATAACTTGGT TCCAGGTCTC 7 3 80 

ATTGCAACAG GTATCGGTGG TGCTGTTGGT GGTGCTGTTT CTTTGACAAT GGGTGCTGAT 7 4 40 

TCTGCTGTGC CATTTGGTGG ACTGCTTATC TTACCAACCA TGACTCGTCC AGTAGCTGGT 7 5 00 

ATTTGTGCCT TGTTAGCTAA CATTGTAGTC ACAGGACTTG TCTACGCGAT TTTGAAAAAA 7 560 

CCAATAAAAC ATGCAGAACC AGTTATGACT GTTGAAGAAG AGATTGATTT GTCAGATATT 7 620 

GAAATTTTGT AAGAGGGTAA CGATGTCAAG AATTGAATTT TCACCATCTT TG ATG ACC AT 7 680 

GGATTTGCAC AAATTCAAAG AGCAGATTAC TTTTTTGAAT GATAAAGTAG CATCTTATCA 774 0 

TATCGATATT ATGGATGGCC ATTTTGTTCC CAATATTACC TTGTCTCCTT GGTTCATTCA 7 800 

AGAAGTTCAA AAAATTAGTG ACACACCTTT ATCAGTTCAT CTGATGGTCA CAGACCCAAC 7 860 

CTTTTGGGTA GATCAAGTTC TCGATTTACA ATGTGAGTAT ATTTGTATTC ATGCTGAAGT 79 20 

TCTGAATGGT CTTGCTTTTC GTTTGATTGA T AAAAT T CAT GATGCAGGTC TAAAGGCTGG 79 80 

TGTTGTCCTT AATCCTGAAA CACCTCTTTC TACAATCTTT CCCTACATTG ATTTACTTGA 80 4 0 
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CAAAGCAACT 


ATTATGACTG 


TAGATCCAGG 


TTTTGCAGGA 


CAACGCTTTT 


TGGAGTCTAC 


8100 


CTTGTATAAA 


ATCCAAGAAC 


TCCGTCAGCT 


TAGAGTTCAG 


AATGGTTATC 


ACTACATCAT 


3160 


TGAGATGGAT 


GGTTCTTCGA 


GTCGTAAGAC 


TTTCAAACAA 


ATTGATGTGG 


CAGGACCAGA 


8220 


TATTTATGTT 


ATAGGTCGCA 


GTGGATTATT 


TGGTTTGGAT 


GACGATATTG 


CCAAAGCCTG 


82 80 


GGATATCTGT 


TCT AG AG ATT 


ACGAAGAAAT 


GACCGGAAAA 


ACAATGCCAA 


TCAAATAATG 


8340 


GTTTGAGAAG 


AAATTTATTA 


GTTAGGAGGA 


ATATATGTCA 


CTACAATCAG 


TTAACGCCAT 


8400 


TCGTTTTCTT 


GGCGTAGATG 


CTATTAACAA 


ATCTAATTCT 


GGTCACCCGG 


G AATTGTC AT 


8460 


GGGTGCTGCG 


CCAATGGCTT 


ATAGCCTATT 


T ACAAAGC AC 


CTTAGAATTA 


CACCTGAGCA 


8520 


GCCAAACTGG 


ATTAACCGAG 


ATCGCTT TAT 


CTTGTCTGCG 


GGTCATGGAT 


CAATGCTACT 


8580 


GTATGCTCTC 


TTGCATTTAA 


CAGGGTATAA 


GGATGTATCC 


ATGCACGAGA 


TTAAAAATTT 


8640 


CCGGCAATGG 


GGATCTAAGA 


CACCTGCTCA 


TCCTGAAGTG 


ACGCATACGT 


CTGGTGTGGA 


8700 


TGCGACATCT 


GGTCCGCTTG 


GTCAGGGGAT 


TTCTACTGCC 


GTTGGTTTCG 


CCCAAGCAGA 


8760 


G CGTTTTTT A 


GCTGCTAAGT 


ACAACAAAGA 


TGGTTTCCCT 


ATTTTTGACC 


ATTATACTTA 


8820 


TGTTATCGCT 


GGAGACGGTG 


ACTTCATGGA 


AGGAGTCTCT 


GCGGAGGCGG 


CTTCTTATGC 


88 80 


AGGTCATCAA 


GCTTTAGATA 


AGCTTATCGT 


CCTCTACGAC 


TCCAACGACA 


TCTGCTTGGA 


8940 


TGGTGAGACC 


AAAGATACTT 


TCTCTGAAAA 


TGTTCGCGTC 


CGTTACGATG 


CTTATGGTTG 


9000 


GCATACAGTT 


CTGGTAGAAG 


AT GG AAC AG A 


TTTAGCAGCA 


ATTTCTACAG 


CAATTGAGAC 


9060 


GGCCAAGTTT 


TCTGGTAAAC 


CGAGTTTGAT 


TGAAGTGAAA 


ACGGTAATTG 


GTTACGGCTC 


9120 3 > \ 


ACCCAATAAA 


AGTGGTACAA 


ATGCTGTTCA 


TGGTGCACCA 


CTAGGAGCAG 


AAG AAAC AGG 


9180 


AGCAACTCGT 


AAGTTTTTGG 


CATGGGATTA 


CGATCCATTT 


GAAGTACCAG 


AGGAAGTATA 


92 4 0 


TTCTGATTTC 


AAGACAAATG 


TAGCGGATCG 


TGGTCAGGAG 


GCATACGATG 


CTTGGGCTAG 


9300 


TTTGGTGTCT 


GATTACAAGG 


TTGCTTATCC 


CGAAGTTGCT 


AGTGAGATTG 


ACGCTATTGT 


Q1£A 
7 JOU 


AGCTGGAAAA 


TCCCCTGTAA 


CCATTACTGA 


AAAAGACTTC 


CCTGTCTATG 


AGAATGGCTT 


9420 


CTCTCAAGCA 


ACTCGTAATT 


CGTCCCAAGA 


TGCTATTAAT 


ACAGCAGCAG 


TTTTACCAAC 


9480 


CTTCTTAGGT 


GGATCGGCAG 


ACTTAGCTCA 


CTCTAACATG 


ACCTACATCA 


AGGCAGATGG 


9540 


CTTACAAGAT 


AAATATAATC 


CATTAAACCG 


CAATATTCAG 


TTTCGGGTAC 


GTGAATTTGC 


9600 


CATGGGAACA 


ATCCTCAATG 


GAATGGCTCT 


TCATGGTGGT 


TTACGAGTTT 


ATGGCGGAAC 


9660 


CTTCTTTGTT 


TTCTCTGACT 


ACGTCAAAGC 


TGCTATTCGG 


CTATCAGCCA 


TTCAGGAGTT 


9720 


GCCTGTAACT 


TATGTCTTTA 


CCCATGATTC 


AATTGCCGTT 


GGTGAAGATG 


1 GTCCAACTCA 


9780 


TGAACCAGTT 


GAACATTTGG 


CAGGTTTACG 


CTCAATGCCA AACTTGACTG 


; TTATCCGTCC 


9840 
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AGCGGATGCC CGTGAAACTC AAGCGGCTTG GCATCATGCC TTGACX'ACTA CCACCACTCC 9 900 

AACTGTCATT GTCTTAACCC GTCAAAACTT GGTAGTTGAA GAAGGGACAG ACTTTGGTAA 9 9 60 

GGTCGCTAAA GGAGCCTACC TCGTGTATGA TACCCCGGGA TTTGATACTA TTATCATTGC 1002 0 

TACAGGATCT GAGGTCAATC TAGCTATCAA AGCTGCTAAG GAATTGGTTT TACAAGGTGG 10080 

TAAAGTACGT GTGGTATCTA TGCCCTCAAC CCAACTATTT GATGCTCAAG ATGCTACCTA 10140 

CAAGGAAGAC ATTTTACCAT CTAAGACTCG TCGTCGTGTG GCCATTGAAA TGGCAGCGAC 10200 

CC AAAGTTGG TACAAGTATG TTGGTTTGGA TGGCGCGGTC ATCGGTATTG ACATCTTCGG 102 60 

TGCGTCTGCC CCAGCTCAGA CTGTGATTGA TAATTATGGA TTTACGGTAG AGAATATCGT 10 3 20 

TGCTCAAGTT AAGTCCCTAT AGAAACCAAT TACAATGAAG ATACACCTCT TGTC AG AC T A 10 3 80 

GCAGATGTAG TG AT AG AC AC TAATCAGATG ATTGGTTATT TAAAAACTGT AATGAAAATG 10440 

TAATAATTTA TCTACGAAAG TTATAGTAGA TAGTATACAC AATAGAGTAT ACCCTGAAAC 10 500 

GGTTGCGAAG TACGCTAATC ACTTTGCTAC TGATCTAGAT AGTTTCTTTA ATCAATAAAC 10560 

ACAGCATCCA CAGATTGACT TAGGATATTG TAACTTTTTT GAAAGCTAGA GAGAAGGTCT 10 620 

CTAAAATTAA AAAAC GC AT A GTATAGGATG TTGAAATGAT GAACTGCACC CCAAAAGTTA 10 680 

GACAGAAAAA AATCTAACTT TTGGGGTGTT TTTATTATGA AATTAACTTA TG ATG AT AAA 10740 

GTTCAGTTCT ATGAACTTAG AAAACAAGGA TATATCTTAG AGAAGCTTTC AAATAAATTT 10800 

GGGATAAATA ATTCTAATCT TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG 108 60 

TTCGT C AAAA AAGGGAAAAA TCCTTACTAT TCTCCTGATT TAAAACAAGA AATG ATT CAT 10920 

AAAGTCTGAC ATGAAGCCTG GACTAAAGAT AGAGTTTCTC TTGAATACGG TCTCCCAAGT 10980 

CGTACGATAC TTCTTAACTG GCTAGCACAA TAG AGG AAAA ACGGGTATAC TATTGTTGAG 11040 

AAAACAAAAG GGAGAGTACC TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT 11100 

CCGATTGAAG GAGGAAAAAG AGAAATAAGA AAGACAGAAA TTGTTCAAGA ATT AATG ACT 1116 0 

GAGTTTTCGT TAGATCTTCT TCTAAAAGCC ATTAAACTAG C TCGTTGG AC CT ACT ACT AT 11220 

CACTTGAAAC AGCT AG AT AA ACCAGATAAG GACCAAGACC TTAAAGCTGA AATTCAATCC 11280 

ATCTTTATCG AACACAAGGG AGATTATGCT TATCGCCGGG TTCATTTAGA ACTAAGAAAT 11340 

CGTGCTTATC TGGTAAATCA TAAAAGAGTT CAAGGCTTGA TGAAAGTACT CAATTTACAA 1140 0 

GCTAGAATGC GACAGnAACG AAAATATTCT TCTCATAAAG GAG 114 4 3 

(2) INFORMATION FOR SEQ ID NO: 50: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5338 base pairs 
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(B) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 50: 
CCAATTACAT TATATTATCA AAATCGTCGA AACTGGCTCC ATGAATGAGG CAGCCAAGCA 60 

ACTCTT T ATC ACTCAGCCAA GTCTCTCCAA TGCAGTGCGA GATTTGGAAA ATGAAATGGG 120 

CATTGAGATC TTTATCCGCA ATCCCAAGGG AATCACCTTG ACCCGTGATG GCATGGAGTT 180 

TCTCTCTTAT GCCCGTCAGG TTGTCGAGCA GACCCAGCTT CTGGAGGAAC GCTATAAAAA 2 40 

TCCTGTCGCC CACCGCGAAC TCTTTAGCGT TTCGTCTCAA CACTATCCCT TTGTGGTCAA 300 

TGCCTTTGTC T CTTTGCTC A AGAAAAGCGA TATCCACAAA TACGAACTCT TCCTTCGTGA 3 60 

AACTCGGACT TGGGAGATTA TCGACGACGT CAAGAACTTC CGCAGTGAGG TCGGGGTCCT 4 20 

CTTCTTAAAC AGTTACAACC GTGATGTTTT AACCAAGATG CTGGATGACA ATCACCTGCT 4 80 

AGCCCACCAT CTCTTC AC AG CGCAACCGCA TATCTTTGTC ACCAAGACCA ACCCTCTGGC 540 

AAAGAAAGAC AAGGTG AAAC TGTCTGATTT GGAGAATTTC CCTTACCTCA GCTATGACCA 600 

AGGGACGCAC AACTCCTTCT ACTTTTCAGA AGAGATTCTT TCTCAAGAAC ACCACAAGAA 6 60 

ATCCATTGTG GTCAGTGACC GTGCCACCCT CTTTAATCTC TTGATTGGTT TGGATGGTTA 7 20 

TACCATTGCG ACAGGCATTT TGAACAGCAA CCTAAACGGA GACAATATCG T TT CT ATC C C 130 

AC TGG AT ATT GATGACCCGA TCGAGCTGGT CTATATCCAG CATGAGAAAA CCAGCCTATC 3 40 

TAAGATGGGC GAACGCTTTA TAGACTATCT CCTAGAAGAA GTTCAGTTTG ATAGTTGAGA 90 0 

AATGATAAGA ACCAATATGT AGGCTAGCAA CAACCTGCAC ATTGGTTCTT TTTACTTATA 9 60 

ATTAAAAGTT TCCCCTGCCA ACTTATCAGC TAGCTTGGGA AAGAGAGTAT AAAACTTATG 102 0 

GGCTAGGTTC AACAAAATCG GGAGATTGAG TTCTCGTTTC TTTTTTCCTA TAATCTTGAC 1080 

AAT CTTTTT A GCCACTGCAT CTGGTTCTAG CAGGAAGCGA TCAACCGATT TAAGATAAGT 114 0 

TCCATCTGGG TCGGCTTGGT CGAAAAATCC TGTACGGATT GGTCCTGGAT TGACTGTTGT 1200 

CACATAGACT CCATAGGGCA TAAGTTCGAG TCGCAGAGCA TTTGAAAAAC CAATAGCCGC 12 60 

AAACTTGGTC GCTGAGTAAA GACTAGACTT GCCAGTAGCT ATTAGACCTG CCATGCTGAC 13 20 

GATGTTGATG AT ATGCC CTT TGCTGCTTTC CTTCATACGA GCCGCAAGGT GACGAGACAG 13 80 

ATTCATCAGG GCAAAGGTAT TG AC CTC AAA CATCTGGTGA AT AT CTT TAT CAGCAATCTG 14 4 0 

GTCAAATCCC TCAAAAATCC CGTAACCAGC GTTGTTAATC AAGACATCAA TCTTGCCATA 1500 

GCGGAGATAA AGATCAGTTA CCAGAGCTTC TAGGGCTGAA TCCTCGGTAA T ATC AATTTC 1560 
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AATCAATTCT 


GCATGGGAAT 


AATTTCCGTA 


GAGTTGGGCT 


AATTTTTCCT 


t r\ 1 I 1 v, 1 ttLL 


16 2 0 


AAGCAAGATG 


AGTTCGTCAT 


TGGGCAGGAG 


TTTGACCATT 


TCTTGACCTA 


GACCAC(~CICT 

\J V- V- rtV— V_ U\. 1 


16 80 


AGCTCCGGTA 


ATGAGAATAG 


TAGGCATACT 


TATCCTTTCT 


GTGACTGCTA 


\Jn 4. l i wvn^. 1 


17 4 0 


TCTTCCAAGT 


CTTTGACCAC 


ATGGACATTT 


TCAAAAATTG 


TGGCAGCGTC 


• * * 1 1 VJ rt\J J 


1 80 0 


TTGCTAATAT 


CTTTTGAGAG 


G AAACGGG C A 


CTGATATGGT 


TGAGTAGGAG 


GCGTTTGGrA 

VJ ^ — ^-1 X A 1 VJ * — 


18 6 0 


CCTCCTTCTA 


CCGCTACTTG 


TGCAGCTTGC 


ATATTAGTTG 


AGTGACCATG 




19 2 0 


ATTTTTTCAT 


CACCCTTGCC 


ATAAGTGGAC 


TCATGAACTA 


GGACATCTGC 


AT^GAPAGrr* 


1980 


AGACGCACAC 


TGGCACCCGT 


TTTTCGAGTG 


TCTCCTAAAA 


TAGTGATAAT 


rTTArrTfiGA 

\— I » A^V_ ^ i Vj VJ f\ 


2 04 0 


CGTGGCGCTG 


AG AT AT AG TC 


TGCTGCCTTG 


ATTTCAGT^C 


CGTCTTCCAA 


AAPAAGATrr 


2100 


TGGCCGTTTT 


TGATTTTACC 


AAAAAGCGGG 


CCGAACGCAA 




V- 1 l unu 1111 




TCAGCATCCA 


GCGTCCCTTC 


TAGATCCTTT 


TGCATGACAC 


GATAGCCAAC 


ACAGAAAATA 


222 0 


GTGTGGTCCA 


GCTCCTCTGC 


ATACACAGTG 


AATTTATCGG 


TTTCAAGAAT 


"TTACGCAGA 


22 80 


GAATCTTGGT 


CAAACTCATG 


GAAATGAATG 


CGGTAGGGCA 


GACGAGAACC 




2 34 0 


AGGCTGGTTA 


AGACAAATGA 


CTTGATTCCT 


TGAGGTCCGT 


AGATTTCCAA 




2 4 00 


TCTTCATTGG 


CCTGAAAGGC 


ACGG CT AG AA 


AGGAAACCTG 


GCAAACCAAA 


AATGTGGTCT 


2 4 60 


CCATGCAGAT 


GGGTAATAAA 


GATTTTGCTG 


ACCTTACGTG 


GTCGAATTGT 


GCTTTPPAGA 

VJvl 111 V V- r\ 


2520 


ATGCGATTTT 


GCGTACCTTC 


TCCACAGTCA 


AAGACCCAAA 


CTTCGTTAAT 


CTCATCGAAA 


2580 


AGTTTCACGG 


CGAGACTTGA 


AACGTTGCCG 


GCTTTAGAGG 


GCTGACCAGC 




2 6 4 0 


AAAAATTGAA 


TATCCATTCG 


ATACTTTCTA 


ATTAATCAAT 


ATA^AACATG 


CC^GTGGGGT 


2 7 00 


TTTCCGATCG 


CAAATAGCGT 


TTGCCAGAAA 


AAG C AG C AG C 


TTCTTGCAAT 




2 7 60 


GGCTGTAGCC 


TTTGAGACGT 


TTTCGACCAT 


CAGCCAATCT 


TTCCAAATCA 


GTCAAAGCTG 


2 8 20 


TGAGACTTTC 


TAGGCTGATA 


ACTTCCTCGT 


CCTCGACAGG 


CTTCATGTAA 


ATCTTACCAG 


2880 


ACT CT T C AAA 


GACTAATTGA 


TGGGGGAAAA 


TTTGCGCAAT 


TTCAAAGAGC 


AAGTCATCCG 


2940 


AGATTTTCTC 


CTCATTTTCA 


AAGAAAATCC 


GACCAAGGCC 


GTCACTCTCA 


TAACAAAAAC 


3000 


CAAAGGATTT 


ACCAGACAGA 


TTAAGCCGAA 


TAAAAGGCTT 


ATTTTCTAGG 


GTGAAACTTC 


3060 


GCTCAGTATT 


GTAAAGATTC 


AGTTCCTGAC 


TG AGTTCTG C 


AAAATAATCC 


GTCGCAGCCT 


3120 


GAGGACTCTT 


TTTCTGATAG 


AGTTCTG C AA . AGTAGGCATT 


AACAACACTT 


GGCGGAGGTG 


• 3130 


TAATAAGTGT 


TAACTGCTCC 


TGATCTGTTT 


TACCAGCTAC 


AAGCTGATCC 


AGATAGACCT 


3240 


TGTCCAGACT 


TGTATAACCT 


CCATACTTTA 


GAGCCAAAGT 


TTTAATATCA 


GTCATAAAAT 


3300 



WO 98/18931 



PCT/US97/19S88 



454 



TCTTCTAACC 


TCCATTTATT 


TTTCTCGGAA 


ATGTAGCCTG 


taasc\.:ttc 


GCCGTCTTCC 


3360 


TGATAATCAC 


CTTCTTCCAG 


AATTCCAACA 


CTCTCTAAAT 


C ATG AATCTT 


GTAGGACTTT 


3420 


G AAAAAGG C A 


CTCGCAGGGT 


AAATGCTTCA 


AAAATTTCCT 


TAATCTTATC 


TAGCAATAAT 


3480 


GCTTGCAAGT 


TTTCACGACT 


GTCCT C AG AC 


TTGGCAGAAA 


TGAGGGTATA 


TGGCGTTTGG 


3540 


GTAGGCGTGA 


AATCCTCCAC 


CAAATCCGCT 


TTATTATAAA 


GCGTCAAGTG 


AGGAATATCT 


3600 


TCCATGTCCA 


GGTCTTTCAT 


GATGGAGAGA 


ACCGTTTTTT 


CATGCTCCTC 


GTGGTAAGCA 


3660 


TTGCTAGCAT 


CGATAACATG 


AACCAGAAGG 


TCCACATGCT 


TGCTTTCTTC 


CAAGGTTGAC 


3720 


TTGAAACTCG 


ACACCAACTC 


TGTCGGCAAA 


TCTTGGATAA 


AGCCAACGGT 


ATCTGTCAAA 


3780 


GTTACTTGGA 


GATTGCCTCC 


CAGATGAATA 


CTCTTGGTTG 


TCGCATCCAG 


AGTCGCAAAG 


3840 


AGCTCATCTG 


CTTCATACTG 


CGTCTTACTG 


GTCAAGATGT 


TCATGATACT 


TGATTTCCCA 


3900 


GCATTAGTAT 


AACCAATCAA 


ACCAATCTTA 


AAAGTGCTAG 


ACTCCAAACG 


TTTTTCTCTG 


3960 


ACAGTCGCAC 


GATTTTTCTC 


AACCACCTTG 


AGCTGGCGCT 


CGATATCCGT 


GATTTGATTG 


4020 


CCAACGCTAC 


GACGCTTCAG 


CTCCAGCTGG 


CTTTCACCAG 


GACCACGGGA 


ACCAATTCCC 


4080 


CCTgCCTGAC 


CGCTGAGCAT 


AATCCCCTGA 


CCAACCAAGC 


GAGGCAAAAG 


GTATTTGAGT 


4140 


TGGGCTAGGT 


GGACTTGGAG 


cttcccttca 


TGGCTTCGAG 


CCCGCATGGC 


AAAGATATCC 


4200 


AAAATCAACT 


G CAT A CGGTC 


AATGACCTTA 


ACACCGAGAA 


CTTCCTCTAC 


ATTGACATTC 


4260 


TGCCTTGGGG 


TCAGACGATT 


GTTGACGATG 


ACAGTAGTGA 


TTTCTTCTGC 


ATCCACCATA 


4320 


AGCGCAATCT 


CTTCCAACTT 


ACCAGAGCCG 


ACGAACGTCT 


TGCAATCATA 


TTTTTCACGT 


4380 


TTTTGTCTGT 


AGCTATCTAC 


AACGACTGCC 


CCTGCCCTTT 


TCGCTAAACT 


AGCCAATTCT 


4440 


TCCATGGAGA 


GGTCAAAACT 


GTCCATACCC 


TGCAATTCCA 


CACCAATCAG 


CAGGACTCGC 


4500 


TCCTCTTTTT 


TCTCCGTTTC 


AATCATCTAA 


AAACTCCTCT 


ATCTGGCTTA 


AAArGCGGTC 


4560 


TTGTACACCA 


GATTCTCCAA 


TCTGATAAAA 


GGTGACCTGC 


ATGCGATTAC 


GGAACCACGT 


4620 


CAGCTGACGC 


TTGGCAAAAC 


GACGAGTCGC 


CTGTTTAAGA 


CTCTCACTAG 


CTTCCTCCAA 


4680 


CGTCTCCTCT 


CCACGGAAAT 


AAGGAAAGAG 


TTCCTTATAG 


CCAATTCCTT 


T AGC AGCCTG 


4740 


TACATTAGGG 


GAATGGTCAA 


ACAGCCACTT 


GGCCTCATCC 


AAAAGCCCAG 


CCTCAAACAT 


4800 


CAAATCCACT 


C GGTGGTTG A 


TACGCTCATA 


AAGTTGACTA 


CGTTCATCAT 


CCAAGCAGAT 


4860 


AATCAGCGGT 


TCATACAAGG 


TCTCTTGATT 


TTCCAAATCC 


TGACCAAAAT 


GGGCAATTTC 


4920 


TAAGGCACGC 


ATAGCACGAC 


GACGATTAAA 


CTGGGGAATC 


TCAAGGCCTG 


CTTGATCCAC 


4980 


CAAATGGGCT 


AATTCCTCA? 


CTGAATATGG 


CTCCAAACTA 


GCTCGATAAG 


CTAAAATCTC 


5040 


CTCATGAGGA 


GTCTCCCCAC 


CTAGGTGGTA 


ACCTTCTAGC 


AAGCTCTGGA 


TATAAAGTCC 


5100 
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AGTCCCACCG GCGATAATGG CTACCTTCCC ACGGTTGTGA ATACCCTCAA TAGTCATCTT 5160 

AGCTTCTGAA ACAAAATCAA AAGCCGAGTA ACACTCGGTT ATCTCTCTAA CATCGATTAA 52 2 0 

ATGATGAGGA ACAGCTGCCT GCTCTTCTGG ACTAGCCTTG GCCGTCCCAA TATCAAGTCC 5 2 30 

TCGATAGACT TGCTGGCTAT CTCCACTAAC CACTTCGCCA TTAAAACGCT TTGCGGGG 5 3 38 



(2) INFORMATION FOR SEQ ID NO: 51: 

<l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGAAACCCA TCTAGTCTCC ATCGTTTGGG AGACCAAGCA ACACGAATCT TAGATGCTTC 60 

TCGCCAACAG ATTGCAGATT TAATCGGTAA GAAAAGCGAT GAAATCTTCT TTACCTCGGG 12 0 

TGGAACAGAA GGGGATAACT GGCTTATCAA GGGTGTGGCC TTTGAAAAAG CTCAGTTTGG 180 

CAAGCACATC ATTGTTTCAG CCATTGAACA TCCAGCAGTC AAAGAGTCAG CCCTCTGGTT 240 

GAAAAGTCAA GGATTTGAAG TGGATTTTGC TCCAGTTGAT AAGAAAGGCT TGGTCGATGT 300 

TGAGGCGTTA C AGGTTTG AT ACGGCATGAT ACAATCCTCG TTTC CATC AT GGCTGTGAAC 3 60 

AATGAAATCG GCTCTATCCA ACCTATTGAG GCTATTTCAG AATTCTTGGC AGACAAGCCG 4 20 

ACTATTTCCT TCCACGTTGA TGCGGTTCAC GCGCTTGCCA AAATTCCGAC TGAAAAGTAT 4 80 

CTGACAGAAC GGGTGGATTG CGCGACTTTC 7CTAGTCACA AGTTCCACGG GGTTCGAGGT 54 0 

GTTGGCTTTG TCTATATCAA ATCTGGCAAG AAGATTACAC CTCTTCTTAC AGGTGGTGGC 600 

CAGGAGCGAG ATTATCGTTC GACAACTGAA AATGTGGCAC GGATTGCAGC GACAGCCAAG 660 

GCCCTCCGTT TGTCTATGGA AAAGCT AG AT ATCTTTAGGA GCAAGACTGG GCAGATGAAG 720 

GCAGTGATTC GCCAAGCTCT TCTGAACTAT CCGGATATTT TTGTCTTTTC AGATGAGGAA 7 80 

AACTTTGCAC CTCATATTCT GACTTTTGGA ATCAAAGGTG TTCGAGGTGA AGTCATCGTT 840 

CACGCCTTTG AAGACTATGA TATTTTCATC TCAACAACCT CAGCTTGTTC ATCTAAGGCA 900 

GGAAAACCAG CCGGTACCTT GATTGCCATG GGAGTGGACA AAGATAAGGC CAAGTCAGCT 960 

GTGCCTCTTA GCCTAGACTT GGAAAATGAT ATGAGTCAGG TCGAGCAGTT TTTGACCAAG 1020 

TTAAAATTGA TTTACAATCA AACTAGAAAA GTAAGATAGG AGCATTCATG CAGTATTCAG 10 30 

AAATTATGAT TCGCTACGGA GAGTTGTCAA CCAAGGGTAA AAACCGTATG CGTTTCATCA 114 0 
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ATAAACTTCG 
CAGATCGCGA 
CTCTCAAACA 
TAGAAGTTTT 
CCTTTAACAT 
ACCAAACACT 
GTCCTGACAT 
CCATTCGTGG 
CAGGAGGGAT 
AGGCAGTTCA 
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TACAAAAGCA 
CCAGTTTATC 



TAATAATATT 
CCGTGCCCAC 
AGTTTTTGGA 
GAAGTCTTCT 
TTCTAGCAAG 
TGGAGGGGCT 
CAATCTTCAG 
GGCTGGTGGT 
T G AC TC AG C T 
CTTTGCTAGT 
TAAATTGACC 
AG AG G AAATC 
GATGCGGATT 
AAGTCTAGGT 
CAACACTCCC 
CCAGG AAATC 
TGCACCAGAT 
TATGGATGTT 
ACCTCAAGCC 
ATCCAAAAGA 
AAACTAACTT 
CTGACAATGA 
TAT AC AGGT A 
GACCGTTCCT 
TCTTTCATTG 
CGCATGATTG 
TGGAAGTTCC 
GCTGAGTTTG 
GACTGCCAGC 
GGTTTGGAAT 



TCGGACGTTT 
GCTTACCTCA 
ATTCAAAACT 
GTCCAAGAGA 
CGTAGCGACC 
GTATTCGAAG 
GTGGAGATTC 
TTGCCAGTTG 
GTAGCAGGTT 
CCACCATATA 
AACTTTGGCG 
AAAGCCAAAG 
ACTGACCGTA 
CAAGTAGCCA 
ATCATTCGTC 
GATACCTTTG 
CGTCCAAAAA 
GAAGGCTTGG 
GAAAAAGATG 
ATAGCGAAAA 
TTTTTATTTT 
ATCAATCCTA 
AGGAGCGCCG 
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AAGAATCTCC 
TCATGGAGGT 
ATACGGCTAT 
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TACCCAAGTT 
TTACACAGCA 
TTATAAGGTT 
CATCTACAAG 
ACTTGATAGT 
TGTGCAAGTT 
AGCCTATCTT 
TAAAGGGATG 
TAAGCGTGGG 
TGCCCTCAAG 
GTTTATAGAG 
TTATTTGATG 
ACGAAATGGT 
TGAAAGTATG 
CATGGACAAG 
CCAACCGTTT 
AATTAAGAAT 
AGTGGCTGGA 
CTTGATTGAC 
AAGTTAGTTT 
TGATATAAAA 
AAAATGAAAG 
CTTCTTCCTA 
GACGGGCAAA 
CCAGCTATCA 
GGTATGGGGC 
CAGCAGTTTG 
TTTATCGATG 
TCACTAGGAG 
TTGGGCGTTT 



AAGGTAACAG 
GTTGCAGAAT 
GAAAAATCTG 
GAAGGTATGA 
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CTCATGTTGT 
GTGGATATCG 
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AAACTGGCTC CACCAAGAAG CCTTTAACCG CTATTTCGAG TGCCAGAAAC TATCGCCTGA 3000 

CCAGCGCATC TTCATCTATG TAGGAACAGA AGAAGCAGAT GATACAGACA AGACCTTGAT 3 060 

GGATGGCAAT ATCAAACAAG CCTATATCGA CTCCTCCCTT T G CT ATT AC C ATGATTTGAT 3L2 0 

AGCAGGGGGA GTACATCTGG ATAATCTTGT GCTAAAAGTT CAGTCTGGTG CCATCC AT AG 3180 

TGAAATCCCT TGGTCAGAAA ATCTACCAGA TTGTCTGAGA TTTTTTGCAG AAAAATGGTA 32 4 C 

AGTTAAGAAA GGAAAAAACG AAATG CAT AT TGAACATCTT AGCCACTGGA GTGGTCATCT 3 300 

TAACCGTGAA ATGTACCTTA ACCGTTATGG ACATGGTGGG ATTCCAGTTG TGGTCTTTGC 3 3 60 

TTCATCAGGT GGTAGTCACA ACGAATACTA TGATTTTCGC ATGATTGATG CCTGTGCTTC 342 0 

CTTTATCGAG GAAGGCCTTG TCCAGTTCTT TACCCTATCT AGTTTGGATA CTGAGAGCTG 34 80 

GTTGGCTACT TGGAAAAATG CTCATGACCA AGCGG AAATG CACCGTGCCT ACGAACGTTA 3510 

TGTGATTGAG GAGGCCATTC TTTTATCAAG CACAAGACAG GTTGGTTTGA TGGCATGATG 3600 

ACGACAGGTT GCTCTATGGG AGCCTATCAT CCACTCAATT TCTTCCTCCA GCATCCAGAT 3660 

GTCTTT AC C A AAGTGATTGC TCTCAGTGGT GTTTACCACG CACCTTTCTT TGTCGGTGAT 37 20 

TACTACAACG ATGATGCTAT TTACCAAAAC TCGCCAGTAG ATTATATTTG GAACCAAAAC 3780 

GACGGCTGGT TTATTGACCG TTACCGTCAG GCAGAGATTG TGCTGTGTAC GGGCCTTGGA 384 0 

GCCTGGGAAC AAGATGGTTT GCCATCCTTT TACAAGCTCA AAGAAGCCTT TGACAAGAAA 3900 

CAAATTCCAG CCTGGTTTGC TGAATGGGGA CATGATGTCG CCCATGACTG CGAATGGTGG 39 60 

CGTAAACAAA TGCCTTATTT CCTCGGTAAT CTCTATTTAT AAAAGGAGTT ACCTATGAAT 4 02 0 

TACCTTGTTA TTTCTCCCTA CTATCCACAA AACTTTCAAC AGTTTACCAT CGAACTAGCT 4080 

AATAAAGGCA TCACAGTCTT GGGAATTGGT CAACAGTCTT ACGAGCAATT GGATGAGCCC 4140 

TTGCCCAATA GCTTGACCGA GTATTTTCGT GTTGATAATC TTGACAACAT AG ATG AAGTC 4 2 00 

AAACGTGCAG TTCCTTTTCT CTTT T AT AAA CATGGTCCAA TTGGCCGCAT CGACTCTC'\C 42 6 0 

AATGAATACT GGCTTGAGCT AGACGCAACA CTCAGAGAAC AATTCAATGT TTTTGGTGCC 4 3 20 

AAACCAGAGG ATCTCAAAAA GACGAAATAT AAGTCTGAAA TGAAGAAACT TTTCAAAAAA 4 3 80 

GCAGGTGTTC CTGTGGTACC TGGAGCTGTT ATCAAGACGG AAGCAGATGT TGATCAAGCA 44 4 0 

GTGAAAGAAA TCCGTCTTCC AATGATTGCC AAACCTGATA ATGGAGTGGG AGCAGCCGCA 4 500 

ACCTTTAAAC TTGAGACAGA AGACGATATC AATCACTTCA AGCAAGAATG GGACCATTCA 4 560 

ACCCTTTATT TCTTTGAAAA . ATTTGTCACT TCCAGCGAAA TCTGTACCTT TGACGGGCTC 4 62 0 

GTGGACAAGG ATGGAAAGAT TGTCTTCTCA ACAACCTTTG ACTACGCCTA TACACCGCTT 4 680 
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GACCTCATGA TTTATAAGAT GGACAATTCT T ATT ATGTG C TCAAGGATAT GGATCCTAAA 47 40 

CTGCGCAAGT ATGGGGAAGC AATTGTCAAA GAATTTCGTA TGAAAGAACG GTTTTTCCAT 4 300 

ATT G AGTTCT TCCGTGAGGG GGACGATTAT ATTACCATCG AG T ACAAT AA CCGCCCTGCA 4 860 

GGTGGTTTTA CCATTGATGT TT AT AACTTT GCTCATTCCT TGGACCTTTA T CGTGGCT AT 4 9 20 

GCAGCTATTG TCGCAGGAGA GGAGTTCCCG GCGTCAGACT TTGAAACTCA GTATTGTTTG 4 9 80 

GCTACTTCTC GCCGTGCAAA TGCTCACTAT GTTTATTCAG AAGAGGATTT GCTTG C C AAA 504 0 
TATAGCCAGC AGTTCAAGGT TAAAAAAGTC ATGCCAGCTG CCTTCGCGGA ACTTCAAGGA . 5100 

GATTACCTGT ATATGCTCAC CACTCCGAGT CGACAAGAAA TGGAGCAGAT GATTCCAGAT 5160 

TTCGGACAAC GTCAAGAATA AG AACT ATC G GATTAAGGAA ATTAACTCCC TTAATCCTTT 52 2 0 

TGTTTTGTCT GATAAAAAAT AAGAGCATCC CAACAAGGTA GCTATCATAA AACTTGTTCG 52 8 0 

ATAACTATTT GAAGCAGGAT TAGGTGGTCA GAAATTAAAT TTTAATATTT CAATTGAGTC 5 34 0 

ATAGTATTGT GTTTGCGTAT CCTTAAATCA GCTAAAAGCA TCCATGACGA CACCTATACG 54 00 

ATATAGTTTT CAAGATACCA AACAACTCTA TTAATATTCA ATGAAAATCA AAGAGCAAAC 54 60 

TAGGAAGCTA GCCGCAGGTT TCTCAAAACA CTGTTTTGAG CTTGTGGATA GAACTGACAG 55 20 

AGTCAGTATC ATATACTACG GCAAGGTGAA GCTG ACGTGG TTTGAAGAGA TTTTCGAAGA 55 80 

GTATAAAATA TTCAGGTGAC G CAT AG AT AT AGTTAATTGA ACCTTTGTTT GAAATCTGAT 5640 

AAAATAATGA TATTACTAAG TTTTAAAAAC TAAAGAAAAG GGAAGATATG ATTACAGGCG 57 00 

AATTAAAAAA TAAAATCCAT CAGCTGTCGG AAATTCTTTG GACAGAAGGA AACGCAAATC 57 6 0 

CTTTAACAAA TATTGAACAG TTGACTTATC TCTTATTTAT CAAAGATTTC GATAGTGTCG 58 2 0 

AGCTTGGACC TGAAAGTGAT GCTGAATTTC TAGCGATTCC TTATGAGGGA GTTTTTCCAA 5380 

AAGATAAACC TGAATACCGT TGGTCAACTT TTAAAAATAT AGGAGATGCT CAGGAAGTTT 594 0 

ATCGTTTAAT GACTCAGGAG ATTTTTCCGT TTATTAAAAA TCTCAAGGGG GATACAGATG 6000 

ATACAGCCTT TTCACGATAT ATGCGAGAAG CTATTTTTCA AATAAATAAA CCTGCTACGC 60 60 

TTCAAAAGGC AATTTCTATC TT AG ATGTTT TTCCAACTAG GCGATTAGAT GTAGATTTTG 6120 

ATAATGACAA ACAAAGTATT ACTGATATCG GAGATATCTA TGAATATCTG TTATCAAAAT 6130 

TGTCGACCGC AGGTAAAAAT GGACAGTTCC GTACACCTCG TCACATCATC GATATGATGG 6 240 

TTGAGTTGAT GCAACCGACT ATCAAAGATA TCATCTCAGA TCCCGCTATG GGTTCTGCTG 6 3 00 

GCTTCTTAGT ATCTCCTACC CGTTACTTAA AGCGTAAGAA AGATGAATGG GAAACCAATA 6 3 60 

CAGATAATAT CAATCATTTT CATAATCAGA TGTTTCATCG AAATGATACG GATACGACTA 64 2 0 

TGTTGAGACT TCGCGCGATG AACATGATGC TACATGGAGT AGAAAATCCA CAAATCAGTT 64 80 
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ACCTTGACTC GCTGTCTCAA GATAATGAAG AAGCCGATAA ATATACTTTG CTTTTAGCAA 634 0 

ATCCTCCTTT TAAGGGCTCA CTTG ACT AC A ATTCAACCTC TAATGACCTT CTTGCAACCG 6 600 

TAAAAACCAA AAAAACAGAA TTACTCTTTC TTTCTCTTTT CTTGCGAACT TTAAAACCAG 6660 

GTGGACGAGC AGCACTTATC GTACCTGATG CTGTCCTTTT TGGTTCGTCT AAAGCTCATA 6720 

AAGGAATTCG TCAGGAAATT GTAGAGAATC ATAAGCTTCA TGCTGTAATC TCAATGCCTA 67 80 

GTGGTGTGTT CAAGCCTTAT GCTGGAGTTT C AACTGC C AT TCTCATCTTT ACAAAAACTG 684 0 

GTAATGGTGG TACTGACAAA GTCTGGTTTT ACGATATGAA AGCGGATGGT TTAAGTTTGG 6900 

ATGATAAGCG ACAACCGATT AGCGACAATC ATATTCCAGA TATTATCGAA CGCTTTCATC 6960 

AT CTTG AAAA AGAAGCAGAA CGTCAGAGAA CGGATCAATC TTTCTTTGTT CCAGTTGCTG 702 0 

AG AT AAAGG A AAATGATTAT GATTTGTCTA TCAATAAATA T AAAG AG ATT GAGTATGAAA 7080 

AAGTTGACTA TGAACCAACA GAAGTCATAT TAAAGAAAAT CAATGATTTA GAAAAAGAAA 7L40 

TTCAAGCTGG CTTGGCTGAA TTGGAAAAAT TACTCAAGTA GGGAGGTGGC TGTATGAAAA 7 200 

AAGTGAAGTT GGGGGAAGTC TTATCTCTAA AAAAAGGCAA GAAAGCCACT GTACTTGCTG 7 2 60 

AACAAACAAC TCTAAGCCAA CGTTATATTC AAATAGATGA TTTAAGAAAT AATAATAATT 7 320 

TAAAATTCAC TGAAAGTTTA AATATGACTG AAGCACTCCC AG AT GAT ATT CTGATAGCAT 7 3 80 

GGGATGGAGC TAATGCAGGA ACAGTTGCTT ATGGATTATC GGGAGCTGTT GGTAGTACAA 74 4 0 

TTACCCTCTT AAAAAAGAAT GAGCGATACA AAGAAAAAAT TATATCAGAT TACTTGGGAG 7 500 

TCTTTTTGGA AACTAAATCG CAGTATTTAC GAGATCATTC AACAGGTCCA ACAATTCCTC 7 560 

ATTTAAACAA GAATATATTA CTTCATTTAC AATTAGAATT GCTAGGTATC GAAGAACAAG 7 62 0 

AC AA C AT TAT CTGTATTCTT AATACCATTA AAAG GCT TAT TACTAAAAGA AAATTTCAGT 7 $30 

TAGATGAACT AAACTTGCTC GTCAAATCCC GATTTAACGA GATGTTTGGG GAAAATAAAA 7 740 

TATTTGAAAG CATTGATAAC TTATTTGATA TTATAGATGG TGATAGGGGC AAAAATTATC 7 800 

CTAAATCAGA TGAGTTGTTT AGTGAGGACT ACTGTTTATT TTTAAATACA AAGAATGTTA 7 860 

CTAAAAACGG ATTTTCATTC G AT AC AAAG C AATTTATCAC TAAAACAAAG GATAAATTAC 7 920 

TTCGAAAAGG CAAACTTGAG CGTTATGATA TAGTCTTGAC AACAAGAGGT ACTGTTGGAA 7 980 

ATGT AGCGT A CTACGATGAA TTAATAAAAT ATAAACATTT ACGTATAAAT TCAGGTATGG 804 0 

TAATATTACG TCCCAAGACA CCAAATCTAA ATCAGAAATT TATTATCCAT GTTTTAAGGA 8100 

ATAATAATTA TAGTCGAGTG ATATCAGGAA GTGCTCAGCC TCAGTTACCA ATTACAAAAT 8160 

TAAAAAAAAT ACTTCTCCCC CTCCCCCCAC TAGCCCTCCA AAATGAGTTC GCAGACTTTG 82 20 
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TAGTCCAGGT 


CGACAAATCA 


CAATTTGCTT 


CTGAGATAGC 


TATAAAAGTG 


TGGAGAAATA 


8280 


GCTTGAAATT 


TAGTATAATA 


TAGCTAAACT 


AT TTGTTT AA 


AGTGAGAAAA 


AAATGGGAAA 


3340 


TTTTAGCTTT 


CTTTTAAAAA 


ATGACGAATA 


TGAATCTTTT 


TCAAAACCTT 


GCATTGAAGC 


8400 


TGAGAATATG 


ATTGCTACAT 


CAACTGTGGC 


TACTGCCTTT 


ATGG CGCGTC 


GTGCTTTAGA 


8460 


GCAGGCTGTC 


CATTGGATAT 


ATAGTCACGA 


TT CAT ATTT A 


GAAGCTCCCT 


ATCGTG CT AC 


8520 


TCTATCTTCT 


TTAGTATGGG 


ATGATGATTT 


TAGGGATATC 


GTAGATTCTG 


AACT C C AC AA 


9580 


GCAGATAGTT 


CTGTTGATTC 


GGTGGGGAAA 


CCATGCTGCT 


CATGGTGGTG 


AAATTAAGGA 


8640 


ACGAGAAGCG 


ATTTTAGCTT 


TGCATCATTT 


GTATCAGTTT 


GTTAATTTTA 


TC G ATT ATTG 


8700 


TTACAGCAAT 


GAGTTTGTGG 


AGCGTTATTT 


TGATGAGAAG 


TGCTTACCAC 


TTTCAGCAAA 


8760 


CATCAAATAC 


CGAGAAACTC 


CACAATCTAT 


GATAAAGTTA 


CAAGACAGTT 


T AC C AG AACT 


8820 


CCCTGATTTT 


CATGAACAGA 


TGGCTGCTCA 


GTCCGTAGAA 


GTTCAAGAGA 


CTTATACTCA 


8880 


AAAACGTGAG 


ACTGCAGCGC 


AACGGCAAGA 


TCTGCCTTTC 


C AT ATTG AT C 


AATT ATCTG A 


8940 


GGCAGACACA 


AGAAAGCTCT 


TTATTGATAT 


CGATCTCCGT 


TTAGCAGGAT 


GGATATTTGA 


9000 


AGAAAACTGT 


CGTGTTGAGA 


TAGCCGTTGA 


TGGTCTCAAG 


CACGGTTCAG 


GAATTGGTTA 


9060 


CTGTGACTAT 


GTACTTTATG 


GTAAAAATGG 


GAAAATTTTA 


GCGATTGTCG 


ACGCTAAAAA 


9X20 


AGCCTCTGTC 


AATCCAGAAG 


TAGGGGAAGT 


ACAGGTCAAA 


GAATATGCTG 


AAGCTTTGGA 


9180 


GAAACATATC 


GGCTATCAGC 


CAATTTGCTT 


TATTACAAAT 


GGGTTGAAGC 


ACTATATACT 


9240 


TGATGGTCCG 


AACCGCCGCC 


AGATTGCAGG 


CTTTTACTCT 


CAAGAAGAAT 


TGCAATTAGT 


9300 


G ATGG AT AG A 


CGTCATCTTC 


AAAAACCCCT 


TGAGGATATT 


TCTAGTAAAA 


TT AGG G AC G A 


9360 


TATTTCCGGG 


CGTCACTACC 


AAAAACATGC 


CATTGCAAGC 


GTTTGTGAAG 


CTTTCTCTGA 


9420 


TCATCGTAGA 


CACGCACTTT 


TGGTTATGGC 


AACTGGGGCG 


GGGAAAACTC 


GTACAGCAGT 


9480 


TTCTCTAGTT 


GATATCTTAT 


CACGTCATAA 


CTGGGTAAAA 


AACGTTCTCT 


TCTTAGCCGA 


9540 


TAGAACTTCC 


TTGGTTAAGC 


AAGCATATGA 


TTCGTTTAGA 


AAATTACTCC 


CAGATCTTTC 


9600 


CCTTTGTAAC 


TTCTTAGAAG 


ATAAAGAAGG 


AGCTCAATCA 


AGTCGCATGG 


TCTTTTCAAC 


9660 


TTATCCGACC 


ATGATTGGAG 


CGATTAGTGG 


TCAAGAAGAA 


GTAAATCAAC 


GCCCTTTCAC 


9720 


TGTTGGGCAT 


TTTGACCTTA 


TCATAATTGA 


CGAATCTCAC 


CGTTCTATTT 


ATCAGAAATA 


9780 


CAAGTCCATT 


TTTGATTATT 


TTGATGCAAG 


AATTGTAGGC 


TTAACAGCTA 


CTCCGCGTCA 


9840 


AG ATTT AG AT 


AAAAACACCT 


ATGGATTCTT 


TAATTTGGAG 


AATGGGGTTC 


CAACATATGC 


9900 


ATATCATTTG 


GAAGAGGCTG 


TTAAAGACGG 


AT ATTT AG T A 


GCCTATCATT 


CTATCGAAAC 


9960 


CAAACTGAAA 


CTACCTACGG 


ATGGTCTACA 


TTATGATGAT 


TTGTCCGAAG 


AAGAAAAGGA 


10020 
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ACATTTTGAT 


AGCAAATTTG 


AAGACAATAG 


CTGTGAAAAA 


GATATTGATG 


GGAGTGTATT 


10080 


TAATTCCTTT 


A TTTTC AAT A 


AAAGTACAGT 


AGAAATTGTT 


TTAAATGAAC 


TCATGACAAG 


10140 


AGGAATTCAG 


ACAGCCTCGG 


GTGATGAAAT 


TGGTAAAACT 


ATTATTTTTG 


CTAAAAATCA 


10200 


TGATCATGCG 


GAATATATCA 


CAGGTATTTT 


TAACAACCGC 


TATCCTGAAA 


AAGGGAGCGA 


10260 


CTATGCTCAG 


GTGATTGATT 


ATAGTATTAA 


GCATTATCAG 


ACCTTGATTC 


ATGATTTTAA 


10320 


AATTAAGGAG 


AAGTATCCTC 


AAATTGCGAT 


TTCTGTCGAT 


ATGTTAGATA 


CAGGTATTGA 


10380 


TGTACCAGAG 


GTTGTTAATT 


TAGTCTTCTT 


CAAGAAAGTA 


CGCTCTAAAA 


CTAAGTTTTG 


10440 


CCAGATCATT 


GGTCGAGGAA 


CCCGTCTATG 


TAAAGATTTA 


TTTGGACCTG 


AGCAGGATAA 


10500 


GG AAAACTTC 


TTGGTATTTG 


ATTATGGGGA 


CAATTTTGAT 


TATTTTCGTG 


CAGATCCAAG 


10560 


AGATGGAGAG 


GGTCGTCACA 


TTGTTTCGCT 


GACTCAGCGT 


TTATTTAATA 


TCAAAGTGGA 


10620 


CTTGATTCGA 


GAACTTCAGG 


GACTCCAATA 


CCAAGAAGAT 


CAGTTTGCGA 


GAGCATACCG 


10680 


TCAGCAGCTT 


GTCTCGGAAC 


TTCAAGGTCG 


TATAGAGAGC 


TTAAATGAGT 


TGGACTTCAG 


10740 


CGTTCGTATG 


GTTTTAGATA 


CAGTTTATAG 


CTATAGGAAA 


TTGGAAAGTT 


GGCAGAATCT 


10800 


AACTGCTCTT 


ACAAGTGAAA 


CCATTCAAAA 


AAATCTCTCT 


CCGCTTTTAT 


TTGATGAAGA 


10860 


TAAAGAAGAT 


GAGATGGCGA 


GGAGATTTGA 


TTTGTGGTTG 


CTTCATATTC 


AGTTGGGGCA 


10920 


ACTGACACCT 


AAATCTTCCA 


CTGTTCATAT 


TTCCCAACTG 


ATGAAGACGG 


CTAGAGCTCT 


10980 


TTCTCCTATT 


GGCAATATCC 


CGCAGGTTTT 


TGAGCAGCCT 


GAAATTATCA 


GG AAAGT AC A 


11040 


GGAGCCTGAA 


TTTTGGAAAG 


AAGTTAACTT 


GTCTGATTTG 


GAAAAAATTC 


CTCTTGCTAT 


111C0 


TCGAGATTTA 


TTACAGTTTT 


TGGATAAAAC 


AGACCGTAAA 


CCCTACTATG 


TTAACTTTGA 


11160 


AGATCGTATA 


CTCTCCACTG 


TTCACGAGAC 


CACAGCATTT 


TTGCACGTCA 


ACCATCTTCG 


11220 


GTCTTACAAT 


GAAAAAGTTG 


AGCATTATTT 


GAAAACTCAT 


CTGGATGACG 


AGTCCATTTC 


11280 


1 1 A 1 AL 


LAI AA I AAAA 


AGTTGACATC 


TG AT G AT ATG 


CTTGCACTTG 


AAAAATTGCT 


11340 


TTGGGAAAAA 


TTAGGTAGTA 


AAGCAGACTA 


CCAAAGTCAT 


TATGAAAATA 


AGGCAATTCC 


11400 


GAGATTGGTT 


CGTGAGATTA 


TTGGCTTAGA 


TAGAGAGTCT 


GCCAATCGTA 


TTTTTTCTAA 


11460 


ATTTTTGTCG 


GATGAGAATC 


TTAATCCCAC 


GCACATTTCA 


TTTGTAAAAT 


TGATTGTAGA 


11520 


CTACATTGTA 


GAAAATGGTT 


TTTTAGAGAC 


GAAAGTGTTA 


ACGCAAGAGC 


CGTTTAAATC 


11580 


TTATGGTTCT 


GTTCAACTAC 


TCTTCCAACA 


CCAACTACCA 


GTACTTCGTA 


ATATTGTTCA 


11640 


AATCATTGAA 


CTTATCAATA 


ATCGAGCTGG 


AGAAGCGGCT 


TAAATTCTAA 


AGTGATTGCC 


11700 


ATGCTGAGAC 


TCATTTAAAA 


TTAAAAAGAC 


TAGAAATTTA 


TCCTA7ATAT 


GAGAAGTTTT 


11760 
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ATTAGGAAGA ATGTCATCGT TTTCCTAGAA TACAGTATCA GTTGTTAAGT GGTTGATAAA 11320 

TTTC AAAGT A G AT ACTTGT A CCACGATGTT TGTTGATCGA GTTATTAACA AAAGAGCTAC 11880 

TTTGATTTTA AAGAAATAGA AAACAAAAAG CCGAGCAAGA ATTCAATTCC AGGAGAAAAT 11940 

GAAATAATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAA CTAGCTGCAG GCTGCTCAAA 12000 

ACACTGTTTT GAGGTTGCAG ATGGAAGCTG ACGCGGATTG AXCAGATTTT CGAAGAGTAT 120 60 

AAATCTTCCT AGGATAAAGC AAAAC G C AT A GTATCAAGGG TTTTCAACAC TTGATACTAT 12120 

GCGTTTTCTG ATGTTAAAGA CTTTCTACCA GGTTTTTTAA AAGCATAATT GTTAGTTGTA 12180 

GTCATTTATT ATTCTTCAAA GAAAAATGGT GGGGC GAATT TTTTCAGTTC TTCAAAGCAC 12240 

TTTTGAGCAG TATCTGCATC TTCACAGATG ATAAGACAGA CATC ATT ACC ACAAAGGGTA 12 300 

GCGATAGCGT CAGGGAAGCT CAAAGTATCA ATGATAGAAC CAAAGGATTG AGCCAGTCCA 12 3 60 

GG AAGGGTT T TTAGTAGGAC TTGGTGTTGA ACTGGGCGCA TCCAGACAAG GGCGTCTTCC 124 20 

ATGTAGAGTT CGAGACGTTT TTCCCATTTT GAGATGGAAC CATTGTTAAG AACATAATAA 12 4 80 

GCGCTATCTT CTTCGCGGAC TTTTGATAGG TTCATATTTT TGATGTCGCG TGAGAGGGTT 12 540 

GCCTGGGTTA CTTGAATGTC GTTCTCAGCA AGAAGGGCTT GCAACTCAGC CTGTGTATGA 12 600 

ATCTTGTTTT TTGTGATAAG AGCGCGTATA AGTTGGTGGC GGTGTTCTGA TTTATTCATA 12660 

ATAATGTAAC TCCTTTTAGC AAGGTAAGGT AAGCATGGAC TGAGCGAGGT CGACAGTCAA 127 2 0 

CTCGTAGTCT GTATTGTCAC G G ATGGTG AT TTCAAAGTCA GTAGTATAGA GGACTAAACG 127 80 

GAG AG T GT CT CCTTCTTTTA GCTTGTAAAT AGTTGGCTGC AGTTCAAATT GAACGTCCAT 12 840 

C C ATT CAT CT GCAGTAATAT CCTCTACTAA CAGTAAATCA TTTCTATTTT GTAAATTAAG 12900 

GTAACCTTTT GTCACGACTC GTTGTGCCTC TGGTCTAAAT GGCAATTCAC AGAGATTTTC 12960 

CAACATGTGA TAGCGACCGT TGTCAATGGT TCTAGCACTT AAAATAGCTG GATAACCTTG 1302 0 

TAGGTATTTC TTTTGCCCAA ATTCTAGCAG TTGGGCAGAT AAGAGCCCCT TG TTTGT ACT 13080 

GGATTTGATA CGAAGATTGA GCTGAGCGCG ACCGTTTAGG TGGAGATCTT TAGTCACAGG 13140 

AAGGTTAATA GTAATCTGAT TGGCTTTCCC TTGATAGAGC TCTGTATTGA AGGTTTGGTA 13200 

TGTCTTACCA TAGCGCTCAA AATCCTTATC TGGGTACTGG TTTTGAATAG CTTGCTCTTC 13260 

TTGACCAAGT GAG AAGGTTT CACAGTTTTC TTGCCCACCG AAGTTATCAA GTGATAACCA 13 320 

AGTCTGTGGA GCTGTATTGT CCTGCCAGAT AACAGTAGGA ACT TG AAAGT CTGTTTCCTG 13 380 

TCCTAGTAAT TTCTTGGTCA ATAAGGCATT TATGGACTCA CCGAAGTCAA TTGATTGCCA 13 440 

ATTGTTCATG TAAACATGGG CACCATTATG G AAAAAG AG A TGCTTGTGTA TATGAGTAGG 13 500 

AAGAGCATGG AACATCTGGT AAACATGAAG .TGGTTTGACA TTCCAATCCT GAGAACCATG 13 560 
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AGTAAAGACA ACCTCTCCCT TTACTTTATG GG CAT TGAGC AGAT/VnTTGC GGTCATGCCA 13 620 

AAACTGATTC TAGTCCCCAG TTTTTCGGTC TAGCTGAGCT TTCACTTTTT CTAAGTCACC 13 680 

TTGGTGAGCT TCATTGCCAC GGATATAGTC GCCAGCTAAG AGATTACGAG AATAGGTTAA 1374 0 

CTCAGCAAGG GAGTCAAAGT CCTCACCTGG ATAACCACCT GGGCTAGTCA CCAGACCGTT 13800 

TTCACGCTAG TAGTTGTACC ATGATGAAAT TCCTGCCTCG GCAATGATAA CTTCTAAACC 13 860 

ATCGACTCCT GTAGTCGCAA GACCATTGGA CATGGTACCT AG AT AG G AAA GTCCTGTTGT 13 920 

AGCAACTTTT CCGTTTGACC AATCAGCCTT GACTTGACGC TGGCGCGTGT GATCAGTAAA 13 930 

GGCACGGCAA CGACCGTTAA GCCAATCGAT GACATTTTTA TAAGCCTCGA TTTGCTGGTA 14 040 

G TCTCC ATT A GTCATGAAAC CTGTCGAGTC TTTGGTACCA ACACCTGAGA CAT AG AG ATT 14100 

GGCAAAGCCT CTCGGAAGGA AGTAGTCGTT TAGTGTATAG CTAGAGTTGA TGTGAGTTAG 14160 

CTTTTCCTCA GCCTCTGCTA TAAGCTCAGC TTTACCTTGG GGTTGGACGA GATTTAGTTG 14 2 20 

AGGTTTCTCT AGCTCAATCT TGTGAGGAAG CTTAACCTCA ACCTCCCCCT CCATCTTGTA 14 2 80 

GAGAGCCTTG TCACTAGCCT TGTCATTGGT TCCCTGATGA TAAGGGCTGG CTGTCATGAT 14 340 

GGCAGGGATT TTTCCATCAA AACGAGGGCG AATAATCCTA ACCTTTACTA GGTCTGATAG 14 400 

CCCTTTTTGG TCAGTATCGA CACGAGACTC AACGTAAACG ACTTCACGAA TGACATCCTG 14460 

GTTAGAAAAA GTAGCCAAAC TCTTGCCGTT AAAGTAGTGG TAGTCATTAT CCTCCGGAAT 14 520 

AAGACCATCA CTAACAAGTT CGTCGATAAG ACTATTTCCT TTTTTGGTGC GAGTATTGAG 14 580 

TAACTGATAG AGATTTTCAA TCAAGTCACC ATATATAATG GG AAATCCAG TTTCTTTACC 14 640 

AAAAACGTCA CTATCTTCGA AGTCAACCAA ATAAGAAAAG CCTAAAAGTT GAAAACCAAC 14700 

AG T AT AAAAA AT AT CT C CT G TCAGTTCATC TTCTGATTGA AAAAATGTCA GCAGGTCTCT 14 760 

TTTTTTATCA GCTGCTAGGA T AG AAAGTGG GTAGTTGGTG TCTTGATAAG TGAAAAAGAA 14 820 

ACGACGTAAA AAGGTTTCAA GTGAGTCTTT GTGATTGGCT GTATTTTGTA AATCAAAGCC 14 830 

ACATTTTTTT AGTTCAGATA AGACATTTTC TTTTGGAAAA TTGATATAAC TATATTCATT 14 940 

AAAACGCATA GAACCTCCAT ATAGAATGAC ACTTAAGGTT ATTATATCAA AAAAAAAGCA 15C00 

GAAAGGGAAT TGTTAACTTC AAAAGGAAAT AATCCAATAA AAATGAATAA AGTACTAAAT 15060 

TCAATATAGA GAACAGAGTA ACAATAAGAA TAAATAGATA GGGTATAAAA GTTCTACCAG 15120 

ATTTATATTA TATGCTTTCT ATTTTTATAT ACAATATAGT ATAAATATAA AAATGATGAC 1518 0 

AAAAATACAA ATGAATAGAA AATAAATTAG TAAGCTGATG AAATTTTTCT CAAGAGAAGC 152 4 0 

CATTTATAGG TGAAAATGGT ATAATATAGT GAGAAGGATA GAGGAGAAGT GTAAATTGAT 153 00 
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CGCACAACTA CATACAAAAA CAGTCTATAG TTTTATGGAA ACCGTCATTT CGATCGAAAA 
GTATGTGAGA GCAGCTAAAG AATACGGCTA CACTCATTTG CCTATGATGG ATATTGACAA 
TCTTTATGGC GCTTTCGACT TTCTAGAGAT TACAAAAAAA TACGGCATTC ATCCTTTGCT 
AGGGCTTGAA ATGACAGTCT TTGTAGATGA TCAGGGAGTG AATTTGCGCT TTTTAGCTCT 
ATCTAGTGTG GGCTATCAGC AGTTGATGAA GCTTTCGACA GCCAAGATGC AGGGGGAGAA 
AACTTGGTCA GTCCTGTCCC AGTACCTGGA GGATATCGCG GTCATTGTGC CTTATTTTGA 
T AG AGTTG AG TCGTTAGAAC TAGGCTGTGA TTACTATATA GGGGTTTATC CAGAAACACT 
AGCAAGCGAA TTTCATCATC CTATCTTACC TCTTTATCGG GTCAACGCTT TTGAAAGCAG 
GGATAGAGAA GTTCTTCAAG TTTTAACAGC GATTAAAGAA AATCTACC GC TCAGAGAAGT 
TCCCTTGCGT TCGAGACAAG AT GTCT TT AT ATCAGCAAGT TCTTTAGACA AACTATTCCA 
AGAGCGTTTT CCGCAAGCTT TGGACAATTT ACAAAACCTT ATTTCAGGCA T TTCTT AC G A 
CTTGGATACT AGTCTGAAAC TGCCTCGTTT TAATCCAGCT AGACCAGCAG TAGAGGAGTT 
GAGAGAGCGT GCTG AACTGG GGCTTGTTCA GAAGGGGTTG ACTAGTAAAG AATATCAAGA 
TAG ACT AG AC CAAGAATTGT CTGTTATTCA TGATATGGGC TTTGATGATT ATTTCTTGGT 
TGTTTGGGAT TTGTT GCCTT TTGGACAATC GAATGGCTAT TATATGGGAA TGGGAAGGGG 
TTCTGCAGTA GGCAGTTTGG TTTCTTATGC CTTAGACATC ACGGGGATTG ACCCAGTAGA 
GAAAAATCTG ATTTTTGAAC GCTTTCTTAA TCGTGAACGC TATACCATGC CTGATATTGA 
TATTGATATC CCAGATATTT ATCGTCCAGA TTTTATCAGA T ATGTTGGT A ATAAATATGG 
TAGTAAACAT GCGGCACAAA TCGTTACTTT TTCAACCTTT GGAGCCAAGC AAGCTCTTCG 
AGATGTCTTG AAACGCTTTG GTGTGCCAGA GTATGAATTA TCTGCAATTA C T AAG AAAAT 
CAGTTTTCGT GACAATCTTA AGTCGGCCTA TGAGGCAAA? C7CCACTTTC GTCAGCAAAT 
CAATAGTAAG TT AG AAT AC C AAAAAGCTTT TGAGATTCCT TGCAAGATAG AGGGCTATCC L6620 
AAGCCAAACC TCTGTCCATG CGGCTGCTCT TGTAATTAGT G AC C AAG ATT TAACCAACTA 
CATTCCTCTA AAGTATGGTG ATGAAATTCC ACTGACTCAG TATGATGCTC ATGGAGTTGA 
GGCTAGCGGA CTTTTGAAGA TGG ACT TTCT GGGACTACGA AATTTGACCT TTGTCCAGAA 
GATGCAAGAG TTGCTTGCTG AAACAGAACG TATTCATCTG AAAATTGAAG AAATCGATTT 
AG AAG AC AAA GAAACGTTAG CTTTATTTGC CTCTGGTAAT AC AAAAGGT A TCTTTCAATT 
TGAGCAACCA GCTGCCATTC GTCTGCTTAA GCGTGTGCAA CCAGTCTGT7 TTGAAGATGT 
CGTCGCGACT ACTTCTCTAA ATCGACCGGG TGCTAGTGAC TATATCAATA ATTTTGTGGC 
AAGAAAGCAT GGGCAGGAAG AAGTGACTGT TCTGGATCCA GTACTGGAGG ATATTTTGGC 



L5360 

15420 

15480 

15540 

15600 

15660 

15720 

15730 

15840 

15900 

15960 

16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

15560 



16680 
L6740 
16800 
16860 
16920 
16980 
17040 
1710O. 
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TCCAACCTAC GGCATAATGC TCTATCAGGA GCAGGTTATG CAGGTTGCCC AGCGACTTGC 17160 

CGGATTTAGT CTTGGGAAAG CCGATATTTT GCGTCGGGCT ATGGGGAAAA AGGATGCCTC 17220 

TGCCATGCAT GAGATGAGGG CTT CCTTT AT TCAAGGTTCA TTAGAACCTG GTCATACTGT 172 30 

G G AAAAAGC A GAG CAGGTCT TTGATGTTAT GGAGAAGTTT GCAGGTTATG GTTTTAACAG 173 4 0 

GTCACACGCC TATGCCTACT CAGCCTTGGC CTTCCAGTTG GCTTATTTCA AAACGCATTA 17 4 00 

7 CC AG C C ATT TTTTATCAGG TCATGTTAAA TTCTTCCAAC AGTGATTACT TAATAGATGC 174 60 

ACTTGAAGCA GGTTTTGAAG TAGCCTCTCT ATCCATCAAC ACCATTCCCT ATCACGATAA 175 20 

AATTGCCAAC AAGGCCATCT ATCTACGTTT GAAATCCATT AAAGGAGTCA GTAATGATTT 17 5 30 

AGCTCTCTGG ATTATTGAAA ATAGACCTTA TTCTAACATT GAAGATTTTA TAGCTAAATT 17 640 

ACCTGAGAAT TATCTGAAAC TTCCTCTGCT AGAACCTTTG GTAAAAGTTG GTCTTTTCGA 177 00 

TTCATTTGAA AAAAATCGTC AAAAAGTATT TAATAACTTA GCTAATCTAT TTCAATTTGT 17760 

GAAAGAGTTG GGAAGTTTGT TTGGAGATGC TATTTATAGT TGGCAGGAAT CGGAAGATTG 17 820 

GACGGAACAA GAAAAATTTT ATATGGAACA AGAGCTTTTA GGGATAGGTG TCAGCAAACA 17 880 

TCCACTACAA GCTATTGCAA GTAAGGCTAT TTACCCGATT ACCCCAATCC GAAATTTGTC 17 94 0 

AGAAAATAGC TATGCTATTA TCTTGGTTGA AGTTCAGAAA ATAAAAGTGA TTCGTACCAA 18000 

AAAGGGTG AA AATATGGCCT TCTTACAGGC AG AT G AT AG T AAGAAAAAAT TGGATGTCAC 18060 

TCTCTTTTCA GACTTATATC GTCAGGTTGG ACAGCAAATA AAAGAGGGAG CCTTCTACTA 18120 

TGTAAAAGGA AAAAT A C AAT CACGTGATGG CCGTCTGCAA ATGATTGCAC AAGAAATAAG 13180 

AGAAGCAGTT GCTGAACGCT TTTGG AT AC A GGTGAAAAAT CATGAATCCG ATCAAGAAAT 18 240 

TTCACGCATT TTAGAACAAT TTAAAGGCCC AATCCCAGTC ATCATCCGGT ATGAAGAGGA 18 3 CO 

ACAGAAAACC ATCGTTTCTC CCCATCATTT TGTAGCTAAA TCCAATGAAT TAGAGGAGAA 18 3 60 

ATTGAATGAA ATCGTTATGA AAACGATTTA TCGCTAAAAA TACGGAAAAT AGAACAATTT 184 20 

TCAACGTAAA TGTGGTATAA TCAGTAAGAA TGTTAAAAGA AAAAGGAGCA TAACCAATAT ]8480 

GAAACGTATT CCTGTTTTGA CTAGTGGTGC AGACGCCCCT GGTATGAACG CTGCCATCCG 18 540 

TGCAGTTGTT CGTCAAGCAA TTTCAGAAGG AATGGAAGTT TTTGGTATCT ATGACGGATA 18600 

TGCTGGTATG GTTGCCCGTG AAATTCATCC CCTAGATGCA GCTTCAGTAG GGGACATCAT 13660 

TTCTCGTGGT GGTACTTTCC TT C ACT C AGC TCGTTACCCA GACTTCGCTC AACTTGAAGG 18720 

GCAACTTAAA GGGATTGAGC AATTGAAAAA ACACGCAATT GAAGGTGTAG TTGTTATCGG 18780 

TGGTGACGGA TCTTACCACC GCGCTATGCG TTTGACTGAA CATGGCTTCC CAGCTATTGG 18840 
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TCTTCCAGGT ACAATCGATA ACGATATCGT TGGTACTGAC TTTflT \ TCG GTTTTGACAC 18900 

AGCGGTTACT ACTGCCATGG ACGCTATCGA TAAGATTCGT GATACATCAT CAAGTCACCG 189 60 

TCGTACTTTT GTAATCGAAG TTATGGGACG TAACGCTGGT GATATCGCTC TTTGCGCTGG 19020 

TATTGCAACT GGTGCTGATG AAATC ATC AT CCCTGAAGCA GGCTTCAAGA TGGAAGATAT 19080 

CGTAGCAAGC ATCAAAGCTC GTTATGAATG TGGTAAAAAA CACAATATTA TCGTCTTAGC 1914 0 

TGAAGGTGTG ATGTCAGCGG CTGAATTTGG TCAAAAACTT AAAGAAGCTG GAGATACAAG 19 200 

CGACCTTCGT GTAACAGAAC TTGGACATAT TCAACGTGGT GGTTCTCCAA CTGCGCGTGA 192 6 0 

CCGTGTTTTG GCGTCACGTA TGGGTGCACA TGCTGTTAAA CTTCTTAAAG AAGGTATCGG 19 320 

TGCTGTTGCG GTTGGTATTC GTAACGAAAA AATGGTTGAA AATCCAATTC TTGGTACTGC 19 3 80 

AGAAGAAGGG GCATTGTTTA GCCTTACTGC AGAAGGTAAG ATTGTG GTT A ACAACCCAGC 194 40 
TACAAA 

(2) INFORMATION FOR SEQ ID NO: 52: 

til SEQUENCE CHARACTERISTICS : 

IA) LENGTH; 16S93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



19446 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



TCGTAAATAT 


GCTCTCTTTT 


TGGATTTTGT 


TTCTTAATCT 


GTTTCGCAAG 


TGCCTTCATC 


60 


ATAGAAATAG 


G ACC AC AC AT 


ATAGACGGTT 


GCATGTTCGG 


GCACTTCTTT 


TTGTTCAAAA 


120 


TTAAGATAGC 


CGTCTTTCGT 


ACTGTCGATT 


AGATGGAGTT 


CAAAATTAGG 


ATTTTTCTGA 


130 


GCATAGTTAC 


GGAGT AAATC 


TAGGTAGACT 


CCATTTTCAT 


CTCCACGGAA 


GCTATAGTAC 


240 


AAGTGAACCT 


GTTTATCTAA 


AATAGGATGT 


TCACGGATGT 


AAGAGATGAA 


GGGCGTGATC 


300 


CCAATACCTC 


CAGCAATCCA 


AACCTGATTT 


TCTCGTCCTT 


CTTCTATGAT 


CATGTGTCCG 


360 


TAAGCTCTGT 


CTAGGGTTAC 


TTTGCTCCCG 


GCTTGAAGAT 


TATCATAGAT 


ATTCTTGGTA 


420 


TGGTCGCCTG 


AAGTTTTAAC 


AGTAAAGTAA 


AGAGTTTGAC 


CATGACCTCC 


TG AG AT AG AA 


480 


AAGGGATGCG 


GAGCACTTTC 


AAAGCCTTCT 


TGGAAAATCT 


TT AG AAAGG C 


AAATTGTCCT 


540 


GATTGATAGT 


TGAAAGGTCT 


GCTAAGATGG 


ATTTGAATTT 


CTCTAGTATC 


GTGATTTAAG 


600 


CGTTTG AG AT 


GGGTAATTTT 


CCCTAGATAG 


GGGAAGGAAA 


TCTTTTGATA 


T AG AAAAATG 


660 


ATATAAAAAC 


CAGCTAGTAA 


GCCTAAAAGG 


GCATAGCTAC 


CAACAAGAAA 


ACTTAGAAGA 


720 


TTAAATGTAA 


GGAGACGATT 


GCCCATTATC 


ATGTAGATGT 


GAAAGAGTCC 


T AAAAT AT AG 


780 
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GCTAGGTAAA CCAGGCGGTG AATCCATCGC CAAGCTTCGT ATTGGATGTA TTTGCCTAAA 84 0 

TAGGCGACAA CGATGATCCT GGCAAAGATA TAGATGGCAA GATTGCCAAA CTGAGCAGCT 900 
AAGCGAGAGC CCCACAAACC GCCCATACTA AAGTTATGAA AGATTAGTAG GATGATTCAG 960 

AGAAAGGCTG TGAATTTGTG GACGGTGTAG ACCTTCTCCA AACTGTGAAA CCAGCTTTCT 1020 

AGTAGTGGGA GACGAGTGGC TAGGATAAAA GTCAGAGATA GGCTTGTTAA AGCTAGTCCT 1080 

GGAATCATGA ATTGGGGAGA AGTGTTCATC CAAGTCAAAA GAGTCAAGAT AAAACTAGCT 1140 

ATGATAAAGA GTAGTCCTTT GACTCATTTC ATAGAAAATT CCATTTCATT TAGATTTCGA 1200 

TTTGTTGTAA ATAAATTTCT T AC ATTTT AT CATAGAAAAT GTATGGTGTC AAATTGAGGT 12 60 

CTATAAATAT CTACTCTCAT CAAAAAACTC TCCAATTGAA CTGGACAGTG GCTGTTTATA 13 20 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGCA AGTTGCTCAA AACACTGTTT 1380 

TGAGGTTGCA GAT AG AG CTG ACGTGGTTTG AAG AG ATTTT CGAAGAGTGT TATTCTGCAG 1440 

CTTGTTGCCA ACGTTTGGCT AGCATATGAG ACAGGCTAGA AATTGCTAGG TTAAAGCTGA 1500 

AGTACATGAG GGCAATCAGG ATGTAAAGAC TGAAGACCTG CTCTGGTTCG AAATAACGGC 1560 

CCATGAGAAT TTGGCTGGCT CCAAAGAGTT CTTGTAGGGC GATAACAGAG TAGAGGAGAC 162 0 

TCGTATCCTT AATCACGGTA ACAAACTGAG AAATGATGCC TGGTAGCATT TTGCGGATGG 1680 

CTTGTGCGAG AATGATGTAG TAGAGGATTT GGGCTGAGGT GAAGCCTTGT GACATTCCTG 17 40 

CTTCGTACTG TCCCTTGTCT ACGGCATTGA GACCGCCTCG AATAATCTCA GCCAAGGCTG 1800 

CTGATCTAAA GAGAGTAAAG GCTGTAATAC CTGCTGGTGT GGATTTCAT7 TTGAACACCA 18 60 

AAAAG AT ACT AAAAATCCAG AGAACCTTCG GAACGTTGCG CACAAACTCG ATATAAATAC 1920 

TGGAAATAAT GCGTAAGACA GCATTTTTGC CATTTCTCGT GACAGCTAGC ACCGTACCGA 19 80 

TGATAGTAGA GAGGATGATG GCAATCAGAG AAATATAGAG GGTCAAGCCA AATCCTTTAA 2 040 

AGATAAAGAC TAGGTTATCT GGGGTTAAAA CTTCTAAAAT AGATTCCATA GTAACCTCCT 2100 

AAAGTGAATA GGCTTTTTTG TTGGCTTGCT CCATCTTGCG ACCAAACTGG GCAACAGGGA 2160 

ACCATAGAGC AAAGT AG AG A AGAGCAGCAC CTAAAAAGGC TGGTATATAG TTTCCGTTGA 2 2 20 

GAGCCGACCA AGACTTAGTC ACAAACATCA AGTCTACTCC AGAGATGATA GCTACAGTAG 2 280 

AGGTGTTCTT GATGAGGTTA AC AATTTGGT TGGTCAATGG AGGGAGAATG ATGCGGAAGG 2 340 

CCTGAGGCAA GATAATCAAG CGCATGGCAC T G AT AT AGGT AAAACCTTGC GACAAGGCGG 2 400 

CCTCCATCTG ACCACTAGGA ATAGACTGAA TCCCTGAACG AATAACCTCA GCGATATAAG 2 4 60 

CGCCGTGATA GAGTCCCACG CAGAGAACGG CTGTCCAATA AATTGGAATC ATGATGATAT 2 5 20 
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GGTCACTGAT 


AAGAGGTAGG 


CCATAAAAAA 


CAATAACAAA 


CTGCACCAAG 


AGGGGAGTAT 


2580 


TTTGGTAAAA 


TTCAACAAAG 


ATGCGAGCTA 


AAATGCGTAA 


AATTGGACGT 


TTACTGGTTG 


2640 


ACATGGCACC 


AAAGAAGATG 


CCCAAAACCA 


TAGCGAGGAT 


AAAGGAACCA 


ACCGCTAGGG 


2700 


CAAGGGTGAA 


GAGGAAACCA 


TTGAAAAATT 


GTCCAAAATC 


CTGAAAATAG 


GCTGTCCAAG 


2760 


ATGATAAATC 


TGTCATGGGG 


TGTCCTCCTT 


AATCTGCAGT 


ATGGCTAGAT 


GGTTTGAGCT 


2820 


TGTAACGGTC 


ATAAAGTTTC 


TGCAAACTAC 


CATCCTTGCT 


CCATTTAGTA 


AC C AAGTT AT 


2380 


CAAGATAGTC 


GTTCAGCTCT 


GTATTTGATT 


TCTTGGTAAC 


AATACCGTAG 


TCAGATGGCT 


2940 


TGAAACTATC 


ATCTAGTAGT 


GCTGTCCGTT 


TACTAGTGTA 


GCCAGATAGA 


ATAGAGCGGT 


3000 


CAACGGAAAA 


GGTATCGATA 


CGATGAGCGT 


GCAGGGAAGT 


AATCAATTCT 


GGGTAGGAAC 


3060 


CAAGTTCGAC 


GAATTTAAAC 


TTCAGACCTT 


TCTTTTTACC 


CAGTTCAGTA 


ATCAGGCGTT 


3120 


GGGTGATAGA 


ACCTTGGGCG 


ACTCCGATGG 


TTTTGCCGTT 


TAGGTCCTCA 


ATCTTTTTGA 


3180 


TTTTCGCAGA 


TTTATTGACC 


AAAAATCCAG 


AAGCGTCTGT 


GTAGTAGGGA 


CTGGTAAAGT 


3240 


TGTAGAGTTT 


TTTGCGTTCG 


TCCGTGATGG 


TAAACCTCCC 


GATATCCATA 


TCGACCTGTT 


3300 


CATTGTCTAG 


AACGGGGCCG 


CGGGTTTGTG 


CTGTAACCGG 


CACATAGCGA 


ATCTTGACCT 


3360 


TGAGTTCATC 


AGCTACCATC 


TTGCCCAAGT 


CCGTTTCGAT 


ACCAGAATAA 


GTACCGGTCT 


3420 


TGGGATCTTT 


GTAACCAAAA 


TTGGGAACGT 


CTTGTTTGAC 


ACCGACAACC 


AGTTCCCCTC 


3480 


TTTTTTGAAT 


GTCTGCGATA 


CTTGTATCAG 


CCTGCACTGC 


TTTGGCAGCA 


GCAAGGCCG A 


3340 


AAAGGCTAAT 


CAATAATGCT 


GATAAAAAGA 


ATT TTTTTT C 


ATAGGCGCCT 


CCTTATTTGA 


3600 


CTTTG7CACT 


TTCGTGGTTG 


ATAATTTTCC 


TGAGGAATTG 


TTGGGCACGA 


GGTTCGCTTG 


3660 


GATTCTCAAA 


AAAGTTATCG 


ACATCTGTCG 


TATCTACTAA 


AACTTCTCCG 


TCGGCCATAA 


3720 


AGATAATGCG 


GTCCGCAACC 


TCTCGAGCAA 


AGCCCATTTC 


GTGGGTAACG 


ATGATCATGT 


3730 


TCATCCCATC 


ATGCGCCAGT 


TTCTGCATAA 


CTGCTAGAAC 


ATCTCCGATA 


GTCTCAGCAT 


3840 


CAAGAGCAGA 


TGTTGGTTCA 


TCAAAGAGGA 


GGAGTTCCGG 


ATGCATAGCA 


AGACCACGAG 


3900 


CGATGGCGAT 


CCGCTGTTTT 


TGTCCACCAG 


ATAGCATGGC 


GGGATAGGAA 


TCTTTCTTGT 


3960 


CCCACATATT 


TACAAATTCC 


AGATATTTTT 


GGGCGGTTTT 


TTCAGCTTCT 


TTTTTATCAA 


4020 


TTCCTAGAAC 


TTCAATGGGT 


GCAAGCGTTA 


CGTTTTCTAA 


CACAGCTTTG 


TGTGGATAAA 


4080 


CGTTAAAATG 


TTGAAAAACC 


ATGCCGACTT 


CCTTGCGAAG 


AGGTACCAAA 


TCTTTCTGGC 


4140 


TGGCACCACC 


AACTTGGTGC 


CCATTGACTA 


GGAGACTTCC 


TTTGTCAACA 


GTCTCTAAAC 


4200 


CATTGATCGT 


ACGGATAAGA 


GTGGACTTCC 


C AG AG C C AG A 


AGGTCCAAGC 


AGGACAACAA 


4260 


CTTGTCCTTT 


TTCAAAACGG 


AGATTGATGT 


TGCGGAATGC 


GTGGTAGTCT 


GCGTAATATT 


4320 
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TTTCGACCTT TTTAAATTCT ACTAAAGCCA TGAGAGATCT CTATTGTGTT ATATTTTATA 4 3 80 

ACACGGTTCT ACAATAAAAG AATGTTCTTG T C AAAT CAT A TCTGAAAAAA TTCACTATAG 4 44 0 

TGAAATAAGA ACAGGAAAAA TCGATCGGGA CAGTCAAATC GATTTCTAAC AATATTTTAG 4 500 

AAGTAGAGGT GTACTATTCT AGTTTCAATA TACTATAAAA TGTTATAAAA AAGCAATCTC 4 5 60 

GATAGAGAAA ACGTCTAAAT CATGTTATAA TGAAGCAATA GAATTCTTAG AAAGAGTGGA 4 6 20 

TGTCTTTTTG ATAACACCTA CTTATCAATG GC AGTTTGCC CTGCAGGTAG AAGATGCGGA 4 630 

TTTTACAAAG ATAGCCAAGA AGG CT GG ACT GGGTCCTGAG GTGGCTCGGT TATTGTTTGA 4 7 4 0 

GAGAGGGATT CAGAACCAAG AAAGTCTGAA GAAGTTTTTA GAACCTTCCT TGGAGGACTT 4 8 00 

ACATGATGCT TATCTGCTCC ATGATATGGA CAAGGCAGTG GAGCGGATTC GTCAGGCTAT 4 8 60 

TGAAGAAGGG G AAAAT ATT C TTGTTTATGG AGACTATGAT GCGGATGGCA TGACTTCGGC 4 920 

TTCTATTGTG AAGGAAAGTT TGGAACAACT TGGTGCTGAG TGCCGAGTTT ACCTGCCAAA 4 9 80 

TCGTTTTACC GATGGCTATG GCCCTAATGC TAGTGTTTAT AAATACTTTA TCGACCAAGA 504 0 

AGGGATTTCC TTGATTGTGA CGGTCGACAA TGGGGTTGCT GGTCATGAGG CT ATTGC ATT 5100 

GGCTCAGTCT ATGGGAGTAG ATGTCATTGT GACACACCAT CATTCCATGC CTGAAACCCT 5160 

GCCAGATGCT TATGCTATTG TCCATCCTGA ACATCCAGAT GCGGATTATC CTTTTAAATA 522 0 

TTTGGCTGGT TGTGGAGTTC CTTTCAAGTT GGCTTGTGCC CTCTTAGAAG AAGTGCAAGT S280 

GGAATTCCTT GATTTGGTCG CTATTGGAAC TATTGCAGAT ATGGTG AG TC TGACGGATGA 5 34 0 

AAATCGTATC TTAGTTCAAT ATGGTCTGGA AATGTTGGGT CATACCCAGC GCATTGGTCT 54 00 

GCAAGAAATG CTGGACATGG CTGGGATTGC TGCCAACGAA GTAACAGAAG AAACGG TTGG 54 60 

TTTCCACATT GCTCCTCGTT TGAATGCCTT GGGTCGCTTG GATGATCCCA ATCCTGCCAT 5 52 0 

TGATTTGTTG ACTGGATTTG ATGATCAGGA AGCGCATGAG ATTGCCCTTA TGATTCACCA 558 0 

GAAAAACGAA GACCCCAAGG AAATCCTTCA GTCTATCTAT GAAGAAGCCA AGACCATCGT 564 0 

GGATCCTGAG AACAAGGTTC AGGTCTTGGC CAAGGAAGCC TGGAATCCTG GGGTTCTAGG 5700 

AATCGTGGCT GGTCGTTTAT TGGAAGAATT GGGACAGACA GTCATTGTTC TTAAT AT AG A 57 60 

AGACGCTCCT GCCAAGGGCA GTGCTCGTAG TGTGGAAGCG GTCGATATTT TTGAAGCTCT 5320 

GGATCCCCAT CGAGACCTCT TCATCGCCTT TGGAGGTCAT GCAGGTGCAG CGGGTATGAC 5880 

GCTGGAAGTT GAGCAACTCT CAGATTTATC TCAGGTTTTG GAAGATTATG TTCGTGAAAA 5940 

ACGTGCAGAT GCTGGTGGCA AGAATAAGTT AAACCTAGAT GAAGAGTTGG ATTTGGAGGC 6000 
ACTTAGCTTG GAAACGGTCA AAAGTTTTGA ACGTTTAGCT CCTTTTGGAA TGGATAATCA . 6060 
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GAAACCTATT TTTTATATCA AGAATTTTCA GGTCGAAAGT GCTCGTACTA TGGGGGCAGG 6120 

TAATGCCCAT CTAAAGCTGA AAATTTCCAA GGGTGAGGCG AGTTTTCAAG TGGTAGCCTT 6180 

TGGTCAAGCC AGATGGGCGA CAGAGTTTTC TCAAACCAAG AATCTAGAGT TAGCGGTTAA 62 4 0 

ATTGTCTGTC AACCAATGGA ATGGCCAAAC TGCCCTCCAG TTGATGATGG TGGATGCGCG 63 00 

AGTGGAAGGT GTTCAACTTT TTAACATTCG TGGAAAAAAT GCAGTCTTGC CAGAAGGTGT 63 60 

TCCAGTCTTG GATTTTCCTG GAGAACTGCC AAATCTTGCG GCTAGTGAAC CTGTTGTCGT 64 20 

AAAAAACATT CCAGAGGATA TTACTCAGCT GAAGACCATT TTTCAGGAAC AG C ATTTCTC 64 80 

TGCTGTCTAT TTCAAAAATG ATATTGACAA GGCTT ATT A T CTGACAGGTT ATGGGACTAG 6 540 

AGATCAGTTT GCC AAATTGT ACAAGACTAT TTACCAGTTC CCAGAGTTTG AT ATT CG C T A 6 600 

CAAGCTGAAA GATTTGGCTG CATATCTTAA TATTCAACAA AT CTTGCT GG TCAAGATGAT 6 6 60 

TCAAGTATTT GAACAACTAG CCTTTGTGAC GATAAAAGAT GGTGTGATGA CAGTCAATAA 67 20 

AGAGGCGCCA AAGCGGGAGA TAGGAGAAAG TCAAATTTAC C AAAATCT C A AACAAACCGT 67 80 

TAAAGACCAA GAAATGATGG CGCTGGGTAC GGTGCAAGAA ATTTATGATT TTTTGATGGA 6840 

AAAAG AGT AG AAGTTAGGAA AGACTTGGCA AATCAACTCT TTTTTGAAAA CAGACCTTCA 6900 

TTTTGAAAAT CATCAAAAAA ATGGTATAAT GGTAGGAAAA GATTCGGCTG AAAGTATCAG 6 9 60 

AACTTTTAGA ATAAGAGGGT AGAATTGCCC TATAATCAAG ATAAACTAAG ATTTTCGACC 7 020 

AAAAATGAGT AATATCAGTT TAACAACACT TGGTGGTGTG CGTGAGAATG GAAAAAATAT 7080 

GTACATTGCT GAAATTGGAG AGTCCATTTT TGTTTTGAAT CTAGGGTTAA AATATCCTCA 71-10 

AAATGAACAA TTAGGGGTCG ATGTGGTGAT TCCAAACATG GATTACCTTT TTGAAAATAG 7 200 

CGACCGTATT GCTGGGGTTT TCTTGACCCA CGGGCATGCG GATGCCATTG GTCCTCTACC 7 2 60 

GTATCTCTTG GCAGAGGCTA AAGTTCCTGT ATTTGGGTCT GAGTTGACCA TTGAGTTCGC 7 3 20 

AAAGCTCTTT GTCAAAGGAA ATGATGCCGT TAAGAAATTT AATGATTTCC ATGTCATTGA 7 3 80 

TGAGAATACG GAGATTGATT TTGGTGGGAC AGTGGTTTCC TTCTTCCCTA CGACTTACTC 7 4 40 

CGTTC C AG AG AGTCTGGGAA TTGTCTTGAA GACATCGGAA GGAAGCATCG TTT AT AC AGG 7 500 

TCACTTCAAA TTTGACCAAA CGGCTAGTGA ATCTTATGCA ACTGATTTTG CTCGTTTGGC 7 560 

AGAGATTGGT CGTGACGGCG TCCTGGCTCT CCTCAGTGAT TCGGCCAATG CAGACAGCAA 7 620 

TATTCAGGTG GCTAG7GAAA GTGAAGTTAG GGATGAAATT ACCCAAACTA TTGCTGACTG 7 630 

GGAAGGTCGT ATCATCGTTG CAGCTGTTTC CAGTAATCTT TCTCGTATTC AGCAGATTTT 7740 

TGACGCTGCG GATAAAACAG GTCGACGTAT CGTCTTGACA GGATTTGATA TTGAAAATAT 7 800 
CGTCCGCACA GCGATTCGTC TTAAGAAGTT GTCTTTAGCC AACGAAATTC TTTTGATTAA ' 7 860 
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GCCTAAAGAT ATGTCTCGCT TTGAAGACCA TGACTTGA7T ATTCTTGAGA CAGGTCGTAT 7 920 

GGGTGAGCCT ATCAATGGAC TTCGTAAGAT GTCGATTGGT CGCCATCGTT ATGTAGAAAT 7 980 

CAAGGATGGG GACCTAGTCT ATATTGCTAC GGCTCCGTCT ATTGCTAAAG AAGCCTTTGT 804 0 

TGCGCGTGTG GAAAATATGA TTTATCAGGC AGGTGGGGTT CTCAAATTGA TTACCCAAAG 8100 

TTTACATGTA TCAGGGCACG GAAATGTGCG TGATTTGCAG CTGATGATCA ATCTTTTG C A 8160 

ACCTAAGTAC CTCTTCCCTG TCCAAGGGGA GTATCGTGAG TTGGATGCTC ACGCTAAGGC 82 2 0 

TGCCATGGCA GTTGGGATGT TGCCAGAACG CATCTTCATT CCTAAAAAGG GGACGACCAT 8 280 

GGCTTACGAG AATGGAGACT TTGTTCCAGC TGGATCGGTT TCAGCAGGAG ATATCTTGAT 834 0 

TGATGGGAAT GCCATTGGTG ATGTTGGAAA TGTTGTTCTT CGTGACCGTA AGGTCTTGTC 8 4 00 

AGAGGATGGA ATTTTCATCG TGGCTATTAC AGTCAACCGT CGTGAGAAGA AAATTGTGGC 8 4 60 

TAGGGCTCGT GTTC AC ACG C GTGGATTTGT TTATCTCAAG AAGAGTCGCG ATATTCTCCG 8S2 0 

TGAAAGTTCA G AATTG ATT A ACCAAACGGT AG AAG ACT AT CTTCAAGGAC ATGACTTTGA 8580 

CTGCGCAGAT CTCAAAGGTA AGGTTCGTGA CAATCTGACC AAGTACCTCT TTGATCAAAC 8 64 0 

CAAGCGTCGC CCAGCCATTT TACCAGTAGT CATGGAAGCA AAATAATCGT TGAAATAAAC 8 7 00 

AGAGAGAAAG TCGAGTTTCG GCTTTTTCTT ATAGAAAAAT AGAAGG AG AA AATCATGGCA 87 6 0 

GTGATGAAAA TCGAGTATTA CTCACAAGTA TTGGATATGG AGTGGGGGGT GAATGTCCTC 9 820 

TACCCTGATC CCAATCGAGT GGAACAACCA GAGTGTGAAC ATATTCCCGT CTTCTACCTT 8 8 80 

TTGCACGCGA TCTCTCGAAA TCATAATAGT TGGCTTAACC GGACCAATGT AGAACCCTTC 3 9 40 

C7TCGAGGAA CTAATCTCAT CGTTGTTATG CCCAATACCA GCAATGGTTG GTACACCGAT 9000 

ACCCAGTATG GTT7TCACTA CTACACGGCT CTAGCACAGG AATTCCCACA GGTTCTGAAA 9060 

CGCTTCTTCC CTAATATGAC GAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATC 9120 

GGAGGCTACG GCTGCTTCAA ACTGGCTCTT ACG AC AAATC CTTTTTCTCA TGCAGCTAGT 9180 

TTTTCAGGTG CCCTCAGCTT TCAAAACTTT TCT CCTG AAA GTCAAAATCT GGGAAGTCCA 9240 

GCCTACTGGA GAGGTGTTTT TGGAGAGATT AGAGACTGGA CAACTAGTCC CTATTCTCTT 93 00 

GAAAGTCTGG CTAAAAAATC GGATAAAAAG ACCAAACTTT GGGCGTGGTG TCGCGAACAC 9 360 

GATTTCTTGT ACGAAGCCAA TAATCTCGCA GTGAAAAATC TCAAAAAACT ACGTTTTGAT 942 0 

GTGACCTATA GCCATAGCGC TGGAACTCAC GAGTGGTACT ACTGGGAAAA ACAATTGGAA 948 0 

GTTTTTTTAA CAACCCTACC AATTG ATTTC AAATT AG AAG AGAGACTGAC TTAGTTTGAA 9 54 0 
CTTCAGCATA GGGGGAGTAG AACTAAAATA AAATATCTTT TC ACT AG ACT TTTCAAACGm 
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AAGTAGTAGA ATAGTAATAA AATACTGGAG GAAAGAGAGT AGG AAATGT A CCGTTATCAA 9 6 60 

ATTGGCATTC CC AC ATT AG A ATATGATCAG TTTGTCAAAC AACATGAATT AGCCAATGTA 9 7 20 

TTACAAAGTA GTGCTTGGGA GGAAGTTAAG TCTAATTGGC AACATGAGAA GTTTGGTGTT 9780 

TACAGGGAAG AAAAATTACT GGCGACAGCT AGTATTTTGA TTAGAACTCT TCCGCTAGGC 9 840 

TATAAAATGT TTTACATCCC AAGAGGACCT ATATTGGATT ATGGGG AT AA AGAACTCTTG 9 900 

AATTTTGCCA ttcagtCTAT TAAGTCCTAT GCTCGCAGTA AGAGAGCGGT TTTTGTGACT 99 60 

TTTGACCCAA GTATTTGCCT ATCTCAAAGT TTAATCAATC AGGAAAAGAC AG AATTT CCT 10020 

GAAAATCTGG CTATTATTGA TAGTTTGCAA CAAATGGGAG TAAGCTGGTC AGGAAAAACG 10080 

GAGGAAATGG GAGACACCAT TCAACCTCGT ATTCAGGCGA AAATATACAA GGAAAATTTT 10110 

GAAGAAGATA AACTTTCCAA GTCAACAAAA C AGGCT ATTC GAACAGCACG AAACAAAGGG 10 200 

CTTGAGATTC AATATGGTGG ACTGGAACTA TT AG ATT CAT TTTCGGACTT GATGAAAAAA 102 60 

ACTGAGAAGC GAAAAGAGAT TCATTTG AGG AATGAAGCCT ATTATAAAAA ATTGTTAGAT 10 3 20 

AATTTTAAGG ACAAGGCCTA TATCACCTTG GCCACCTTGG ATGTTTCTAA ACGTTCGCAA 10 3 80 

GAGTTAGAAG AACAGTTAGC GAAAAATAGA GCCTTGGAAG AGACCTTTAC TGAGTCGACT 104 40 

CGAACTTCAA AAGTAGAAGC GCAGAAGAAG GAAAAAGAAC GTTTGTTAGA GG AATTGAC C 10 SCO 

TTCTTGCAGG AATATATAGA TGTAGGTCAA GCGAGAGTTC CTTTAGCGGC TACTTTGAGT 10560 

TTGGAATTTG GTACTACCTC TGTCAATATA TATGCTGGTA TGGATGATGA TTTTAAACGT 10 620 

TACAATGCAC C AATTTT AAC TTGGTATGAA ACGGCTCGCT ATGCCTTTGA ACGAGGTATG 10680 

ATCTGGCAAA ATTTAGGTGG TGTTGAAAAC TCTCTCAATG GTGGACTTTA TCATTTTAAG 10 740 

GAAAAATTTA ATCCAACGAT TGAAGAATAC TTGGGTGAAT TTACAATGCC CACTCATCCT 10800 

CTCTATCCTC TGTTAAGACT TGCTCTTGAT TTCCGTAAAA CATTAAGAAA AAAACATAGA 10 860 

AAGTAAGTAT ATGGCACTAA CAACACTCAC GAAAGAAGAG TTTCAGACTT ATTCTGATCA 10920 

GGTTTCTTCT CGTTCCTTTA TGCAATCTGT CC AG ATGGGG GATTTGCTAG AAAAAAGAGG 109 80 

GGCTCGAATT GTTTATCTTG CTTTGAAACA AG AAGG AG AA ATTCAAGTTG CAGCTCTGGT 11040 

TTATAGCCTG CCCATGCTGG GTGGTCTGCA TATGGAACTC AATTCGGGGC CGATTTATAC 11100 

CCAACAAGAT GCTCTTCCAG TTTTTTATGC AGAGTTAAAA GAATATGCCA AGCAAAATGG 11160 

TGTATTAGAG TTGCTTGTAA AACCCTATGA AACTTATCAA ACTTTTGATA GCCAAGGTAA 11220 

TCCAATAGAT GCTGAGAAAA AAAGTATTAT TCAAGATTTC ACTGATTTAG GTTATCAATT 11230 

TGATGGCTTA AC AAC AGGTT ACCCAGGTGG AGAACCAGAT TGGTTATACT ATAAAGATTT 11340 

AACTGAATTA ACTGAAAAGA GTTTGCTTAA AAGTTTTAGC AAAAAGGCTA AACCCTTGGT 11400 
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GAAAAAGGCT 


GAAACCTTTG 


GCATTCGGTT 


GAAAAAGTTA 


AAACCTGAAG 


AACTATCGAT 


1 1460 


TTTTAACAAT 


AT AAC AAAAG 


AAACCTCTGA 


ACGTAGAGAA 


TATAGTGATA 


AAAGTTTAGA 


11520 


ATATTATCAG 


CATTTTTATG 


ATACTTTTGG 


AGAACAAGCG 


GAGTTTCTCA 


TAGCAAGCTT 


11580 


AAATTTTTCG 


GACTATATGA 


GCAAATTGCA 


AGGTGAACAA 


AGTAAACTAG 


AAGAAAACTT 


11640 


GGACAAGTTG 


CGACTTGATT 


TGAGT AAAAA 


TCCTCATTCT 


GAGAAAAAAC 


AAAATCAACT 


11700 


GAGAGAATAT 


TCTAGTCAAT 


TTGAAACGTT 


TGAAGTTCGA 


AAAGCAGAAG 


CGCGAGACTT 


1 1760 


G ATTG AAAAA 


TATGGAGAAG 


AAGATATTGT 


TTTAGCTGGG 


AGTTTATTTG 


TTTATATGCC 


11820 


TCAGGAAACG 


ACTTATCTCT 


TTAGTGGTTC 


CTACACTGAG 


TTTAATAAGT 


TCTATGCCCC 


11880 


TGCACTGCTT 


CAAAAATATG 


TTATGTTGGA 


AAGCATAAAA 


CGTGGAATAC 


CTAAATACAA 


11940 


CTTCCTAGGC 


ATTCAAGGGA 


TTTTTGATGG 


AAGTGATGGT 


GTTTTGCGTT 


TTAAACAGAA 


12000 


TTTTAATGGC 


TATATTGTAC 


GCAAAGCAGG 


TACTTTCCCT 


TACCATCCAT 


CGCCTTTAAA 


12060 


ATACAAAGCT 


ATCCAGTTAC 


TCAAAAAAAT 


AGTAGGACGT 


TAAGATGAAA 


AAGTCAGTAT 


12120 


TTAGATTTCT 


TTTAGCTTCT 


TTTAGTAAAA 


TAATTCTTAT 


TTGCTAGAAA 


GGTGGAGAGA 


12130 


CATGCGCTGG 


CTTTTTCGTT 


TGATAGGGGC 


TTTCTTTTCT 


TTTGTGTGGC 


GTTTGTTTTG 


12240 


GCGTCTGGTT 


TGGATAGTTG 


TGCTCTTATG 


TGTCCTTCCT 


TTCGGACTTC 


TCTGGTATCT 


12300 


GAACGGAGAT 


TTTCAAGGAG 


CGCTAAAGCA 


AGCAGAACGG 


TCACT AAAAA 


TTGGTCAACA 


12 360 


AAGTATTGAC 


CAATGGGAGA 


AAACAGGGCA 


ACTGCCTAAC 


TTAAGCCAGA 


CAGATAGTCA 


12420 


CCAGCATTCT 


GAAGGAAGGT 


GGGCACAGGC 


CTCTCCTCCT 


ATTTACCTGG 


ATCCGCACAT 


12430 


GGATTCACCC 


TTTCAAGAGG 


CTTATTTACA 


AGCAATCCAG 


AACTGGAATC 


AAACTGGTGC 


12540 


TTTTAACTTT 


GAACTCGTGA 


CTGAGTCTAG 


TAAGGCGGAT 


ATTACGGCTA 


CGCAGATGAA 


12600 


CGACGGAGGC 


ACTCCTGTGG 


CAGGAGAGGC 


GGAAAGTCAA 


ACTAATCTCT 


TAACAGGGCA 


12660 


A TTC iTGTCC 


GTAACGGTGC 


GGTTGAATCA 


TTATTATTTG 


TCCAATCCAT 


ACTATGGCTA 


12720 


CTCCTATGAA 


CCCCTTC7CC 


ATACGGCAGA 


ACATGAGTTA 


GGTCATGCGA 


TTGGCTTGGA 


12780 


CCATACAGAT 


GAGAAGTCTG 


TCATGCAACC 


AGCAGCTTCC 


TTTTATGGTA 


TCCAGGAAGA 


12840 


GGATGTTGCA 


AACCTCCGAA 


AAATATATGA 


GACTAGTGAG 


TAGGGTACTA 


TCTTTCCCTA 


12900 


CTTTTTTTGC 


TATAATGGAA 


CTATGAACAA 


CTTG ATT AAA 


T C AAAACT AG 


AGCTCTTGCC 


12960 


GACCAGCCCT 


GGTTGCTACA 


TTCATAAGGA 


TAAAAATCGC 


ACCATTATCT 


ATGTAGGAAA 


13020 


GGCTAAAAAT 


CTGCGTAATC 


GAGTACGGTC 


CTATTTTCGT 


GGAAGTCATG 


ATACCAAGAC 


13080 


AGAGGCTCTG 


GTGTCTGAAA 


TTGTGGATTT 


TGAATTTATT 


GTTACGGAGT 


CTAATATTGA 


13 140 
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GGCACTTCTC CTAGAAATCA ACCTGATCAA GGAAAACAAG CCCAACTACA ATATCATGCT 
CAAGGATGAC AAGTCCTATC CTTTCATCAA AATCACCAAT GAGCGCTATC CACGCTTGAT 
TATCACTCGT CAGGTCAAAA AGGACGGAGG TCTTTATTTT GGACCCTATC CCGATGTGGG 
GGCAGCCAAT GAAATCAAGC GGTTGCTGGA TCGGATATTC CCTTTTCGTA AGTGTACCAA 13390 
CCCGCCCTCT AAGGTCTGTT TTTATTACCA TATCGGCCAG TGTSTGGCCC ACACCATCTG 
TAAGAAGGAT G AGG CTT ATT TCAAGTCTAT GGCCCAGGAG GTGTCTGATT TTCTGAAAGG 
TCAGGATGAC AAAATCATCG ATGATCTCAA GAGTAAAATG GCAGTAGCAG CACAAAGTAT 
GGAGTTTGAA CGTGCGGCGG AATACCGTGA CCTGATTCAG GCTATTGGAA CGCTTCGAAC 
CAAGCAACGG GTCATGGCGA AAGATTTGCA AAATCGCGAT GTCTTTGGCT ACTATGTGGA 13 680 

TAAGGCCTGG ATGTGTGTGC AGGTTTTCTT TGTCCGTCAG GcAAGCTCAT CGAGCGCGAT 
GTCAATCTCT TCCCCTACTT CAATGATCCA GATGAGGATT TTTTGACCTA TGTAGGACAA 
TTCTATCAAG AAAAATCTCA TCTAGTTCCC AATGAGGTAC TGATTCCGCA GATATTGACG 
AAGAAGCTGT CAAGCCTTTG GTGGATTCCA AGATTCTTAA CCCTCAACCT GGAGAGAAAA 
AACAACTGGT CAATCTAGCC ATAAAAAATG CTCGTGTTAG TCTAGAGCAG AAGTTCAATC 
TGCTAGAAAA AT CTGTCG AA AAGACTCAAG GAGCTATTGA AAATCT AGGG CGTTTGCTCC 
AAATCCCGAC CCCAGTACGT ATCGAGTCCT TCGATAACTC TAATATCATG GGAACTAGCC 
CTGTTTCGGC TATGGTGGTC TTTGTCAACG GTAAACCGAG TAAGAAGGAT TACCGTAAGT 
ACAAGATAAA AACGGTTGTT GGACCAGACG ACTATGCCAG CATGAGAGAG GTCATTCGCA 
GACGCTATGG TCGACTACAG CGTGAGGCTT TGACTCCTCC AG AT TTG ATT GTGATTGATG 
GGGGGCAAGG TCAAGTCAAT ATCGCTAAGC AGGTTATCCA AGAGGAACTG GGCTTGGATA 
TTCCAA 7TGC TGGGCTGCAA AAGAATGATA AGCACCAAAC CCATGAATTG CTCTTTGGAG 
ATCCGCTTGA GGTGGTGGAT TTGTCTCGCA ATTCTCAGGA ATTTTTCCTC CTCCAACGCA 
TCCAAGATGA GGTGCACCGC TTTGCTATCA CTTTCCACCG CCAACTGCGC TCCAAAAATT 
CTTTCTCATC TCAATTGGAT GGGATTGACG GTCTGGGACC TAAACGCAAG CAGAATCTTA 
TGAAGCATTT CAAGTCTTTG ACCAAAATCA AGGAAGCCAG TGTGGATGAG ATTGTCGAAG 
TTGGGGTACC TAGAGTCGTT GCAGAGGCTG TGCAAAGAAA GTTGAACCCG CAGGGAGAAG 
CCTTGCCTCA AGTAGCAGAA GAAAGAGTAG ATTACCAAAC GGAAGGAAAC CACAATGAAC 
CATAAAATCG CAATTTTATC AGATGTTCAT GGCAATGCGA CGGCGCTAGA AGCAGTGATT 
GCAGATGCTA AAAAT C AAGG GGCCAGTGAA TATTGGCTTC TGGGAGATAT TTTTCTTCCT 
GGTCCAGGCG CAAATGACTT AGTCGCCCTG CTAAAGGACC TTCCTATC AC AGCAAGTGTT 



13200 
13260 
13320 



13440 
13500 
13560 
13620 



13740 
13800 
13860 
13920 
13980 
1404C 
14100 
14160 
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CGAGGCAATT GGGATGATCG TCTCCTTGAG GCTTTAGATG GGCAATATGG CTTAGAAGAC 15 0O0 

CCACAGGAAG TTCAGCTCTT GCGTATGACA CAGTATTTGA TGGAGCGAAT GGATCCTGCA 150 60 

ACGATTGTCT GGCTACGAAG CTTGCCTTTG CTGCAAAAGA AAGAAATTGA CGGATTGCGC 15120 

TTTTCTATCT CTCATAATTT ACCTGACAAA AACTATGGTG GTGACTTGCT AGTTGAGAAT 15180 

GATACAGAGA AATTTGACCA ACTGCTAGAT GCGGAAACGG ACGTGGCAGT TT ATGGT CAT 1524 0 

GTTCACAACC AGTTGCTTCG TTATGGAAGT CAAGGGCAAC AAATCATCAA TCCAGGGTCG 15 300 

ATTGGCATGC CCTATTTTAA TTGGGAGGCG TTAAAAAATC ACCGTTCCCA GTATGCCGTG 153 60 

ATAGAAGTTG AAGATGGGGA ATTACTCAAT ATCCAATTTC GTAAAGTTGC TTATGATTAC 15420 

GAAGCTCACT TAGAATTGCC CAAGTCCAAG GGGCTTCCCT TTATCGAAAT GTATGAAGAA 15 480 

CTGCGTCGTG ACGATAACTA TCAGGGGCAC AATCTCCAAT TATTAGCCAG CTTAATAGAA 155 40 

AAGCATGGGT ATGTAGAGGA TGTGAAGAAT TTTTTTGATT TTTTGTAAGA GTTTCCTAAA 15 6 00 

ATAGCCAATG CAAACTAAAA AAGCGATTTG CTGGTCCAAT CCCTTTTACT ATATCTTATA 15660 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA CGTTGCTCAA AGCACAGCTT 15 7 20 

TGACGTTGCA G AT AAAGCT G ACCTGGTTTG AACAGATTTT CGAAGAGTGT TATTGTAACT 15780 

GAGATTGATC TGCCAGCTAA GAACCACCTA GATAGGTATT GCTGAGTTTT TCAAGGGTTC L584 0 

CGTCTTGATA GAGTTCTTTG AGCGCTTTAT C AAATTGCT C TTTAAACTCT TTTTGGTCGC 15900 

TTGAGAAAAT GATATAATTG CTGGGGCTAT CTGCAGAAGG TAAATCAACG ACTGAGAGCT 15 960 

CTAAACCACG GTCCTTCATA ATCTTTTGAA CGCATACCTT G7CAAAAACT AGCAAATCAA 1602 0 

ACTCTCCGTT AGCAAGGTCT AGGATTCGTT TACCAATATC CTCACCAGAA AAATTAATTG 16 030 

TAGCGGGATT ATCAGTGTGT TTCTCATTCC AGTTATTGAT GAATTGAGCG TTAGAAGTTC 16140 

CGGTATCCTC TTGTGTTGTT TTACCAGCGA TCTGCTCAAG AGAACTCAAA GGATTTTTCT 162 00 

TGTTGCTGAC AAGGACGAGG GGATTGTTGG AAATTGGAAG CGAGTAAAGG TATTTTTCAG 162 60 

CACGCTCTTT TGTGTAACTC AAGTTATTGG CCGCAGCCTG ATAGTGACCA GAATCAAGTC 163 20 

CTGCGAAGAT GCTCTCCCAG GCGGTTCTTT GGAATTGAAT CTCGTAGTCG CTGAGTTTTT 16 380 

CATCTACTGC CTTTAAAACT TCGATATCAA AGCCTGTCAG ATTGCCCTTG TCTTCGTAGT 16440 

CAAATGGTGG CACGTCGCCA GCTGTAGCAA GGACGATTGT CTTTTGAGCG CTAGTCTCTT 16500 

TGGGTGTAGC TTGATTCTCA CAGGCAACCA AAAATGG T AG GATAGCTAGT AATAGGCTAA 165 60 

ATTTTTTCAT ACTGTCTCCA TTCAAATGTA AAG 16593 
(2) INFORMATION FOR SEQ ID NO: 53: 
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ii) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 3510 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



CGGATATCCT 


TATATCCTTG 


TTCCTGGAAC 


CATTGTGGGA 


ATTGCTCAAC 


ACTTTTTTCA 


60 


CCTTGAATTC 


CTGGTGCAAT 


GACAGTAAGA 


ATTTCGAAAT 


CACGATCTGG 


TTTCGCCGCT 


120 


AGTTCCATCA 


ACTCTGGCAT 


ACTTTTCTTG 


CATGGACCAC 


ACCATGAAGC 


CCAAAACTTC 


180 


AAGTAAACCT 


TTTTACCCTT 


AAAATCAGAT 


AACTTAACTT 


CTTTGCCATC 


CATGGATTGC 


240 


AATGTGAAGT 


CTGGAGCATC 


TTTTC CAAC A 


GCAATTTGTT 


GT AC AGTCGT 


TTGTTGTTTT 


300 


GGCTGTTGTG 


CTGCTTGAGT 


CTTTTTAGTT 


TCTTCCTCAC 


CACAGCCCAT 


CAATACAACT 


360 


AATGACAAGA 


GACTTAAGCC 


AGCAAACATT 


ACTTTTTTCA 


TTTGTCCTCC 


TTTATTCAAA 


420 


AATTCCAGCT 


AGAACATTTA 


CTTGTCCTAA 


TAGTAACAAA 


ATTCCCATTA 


AAACAATGAG 


480 


GAAACCACCA 


ATTTTCTTT A 


GTAGCATCAT 


ATGACGCTTC 


ATTTT ACT AA 


AATATGGCAT 


540 


GACTAGACCT 


GAAGCTAGTG 


CCAATACCAA 


GAAAGGAAGG 


GCCATGCCaG 


AGTGTAAATG 


600 


AGAGTATAAA 


TCGCTCCTTG 


CCAAGCGCCA 


TTGCCTCCAG 


AAGCCCCAAC 


TGCTAAAACA 


660 


GAACTTAAAA 


CTGGACCAAT 


ACAAGGTGTC 


CAACCAAAGC 


TAAAGGTAAT 


ACCAAGTAAA 


720 


AAAGCTGACC 


AATAACGATT 


AG AATC TG AT 


TTTTTAAAGG 


TAAAACTTTT 


TTGAACTTCT 


780 


AATTTCTTCA 


AATGAAAAAT 


TTCCATCTGG 


TGAAGACCCA 


AAATGATAAT 


AATAGCTCCC 


840 


ATGCCATATC , 


. GAAACCAATT 


TGCATAGAGA 


ATATGACCAA 


AGTAACCAGC 


ACCAAAGCCT 


900 


AGAATAAAGA 


AAATGAGAGA 


GATACCAGCG 


ATAAAGCAAA 


GTGTTCGAAT 


CAAGCCTGAC 


960 


CAGAGAACCT 


TTCTCCCAAA 


CAAAGAAAAG 


CTTTTTG C AC 


TTTCTTGATC 


ATCCAATAAA 


1020 


ATCCCAGCAT 


AGACTGGCAG 


AAGAGGAAAA 


ATACAAGGAG 


AAAAAAAGGA 


TAAAACACCT 


1080 


GCTAGAAAAA 


CAGAGATTAA 


AAATACTATC 


GTTTCCAATA 


AAGAACCAAC 


TTTCTTAATA 


1140 


ATTCTAATCC 


TATTTTACTA 


TATTCAATTT 


TATTTGTAAG 


CTTTCTGCTA 


CGCAAAATCG 


1200 


TATCGGGCAC 


TATTGGACCA 


ATCTTTTCTT 


TTGCTAGTCA 


AGGCGGATCT 


TATCCCCCAA 


1260 


AATAGCCAAA 


AAGCAACGAC 


AAGGATTACT 


CATCGCTGCT 


TTTGTGAACG 


AAAATGTCTT 


1320 


TTACGTCTGA 


CATTTCATAA 


ATCATGTTTT 


ACTTGAGTTT 


GTCAAGGATT 


GCTTTAAGCT 


1380 


CCTCTACTAG 


TTTAGTTTCT 


GTCTCTGCTG 


AGCCATTTTC 


TTCTTTCACG 


AAATCAAGCG 


1440 


TTTCTTGGAG 


AAGGTTTTGG 


GCTTTGGCAA 


GGACTTTTTT 


ATCCGCTTTT 


TCTGCATCTA 


1500 
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GCTGTCCTAG 


AACCTTGATC 


AATTCCGTGC 


TTAATTGCTG 


GATT". :CAC 


TCTTTCTTAC 


1S60 


GGCGAATCAG 


C C AG AAGGC A 


ATCACGCCTA 


GGAGGGCAAG 


TAGACTGACC 


AC AATC ACT C 


1620 


CTGCCGGAAC 


TGAGTTTGTT 


TCAGTCATCT 


TATCTGAATC 


CTTACTATCT 


TCCGTTCCTT 


1680 


GTTTTGCATC 


CTTCTTGTCC 


TGTGCAGGCT 


TGCTGTCGCT 


AGCATTTGCT 


TTCACATCTT 


1740 


TGAGAGAGTC 


CAAGGCAGCC 


CAGCCTTCAC 


AGACTCTACT 


GCAGTATGCA 


GACCTTACTC 


1800 


TGTCAAGGCA 


CTATCTTCCG 


GAGCTTTTTG 


AGCATCTAGG 


AGGACAGCCT 


TGGTTGCATC 


1860 


GATTTTCGGA 


TCAGATACTG 


TTCCCAAACC 


TTTCAACCCT 


TGGTCTAACT 


CTTGACTCAA 


1920 


GGCACGAAGT 


TCAGACTTGT 


CAACTTGCTC 


TTGAGCTTGT 


GTGCTCGTTG 


AGCTAGCCGA 


1980 


AGCGCTTGCT 


ACCACTCTAG 


GATCTTGAGT 


CGGAGCTGAG 


CTTGGAGCTG 


GGACAGGGCT 


2040 


TGCAGGTTGA 


CTAGGAACAG 


TTATGGTATA 


TTG AAACT AG 


AATAGTACAT 


ATGGACTTCT 


2100 


AAAACATTGT 


TAGAATTCGA 


TTTTACTCTC 


CTGATCGATT 


TGTCCTATTC 


TTATTTCATT 


2160 


TTACTATAAT 


AACCGATGGT 


GTGGTTAATG 


TTGGTAAGAG 


AAACTTCTGA 


AACCAAGCTT 


2220 


CAAAAAAGTC 


GCTCGTCATC 


CTCTCTTCGT 


AAGTCATTGG 


AGCGATTAAT 


TCACCATTTG 


2280 


TTAGACCTGC 


AACCAAAGAA 


ATCCTCTGAT 


ATCTTCTTCC 


AGATACTTTG 


CCTCTTATTA 


2340 


ACTGACCT/TT 


TAATGAGCGA 


C CAT ATTCTC 


GATAAAAATA 


AGTATCGAAT 


CCTGTTTCGT 


2400 


CAATCTAAAC 


ACCTGCTAGG 


TGCTTTAAAC 


TATTAAAATT 


CTTAAGAAAT 


AAGGCTACTT 


2460 


TTTCTGGGTC 


TTGTTCATAG 


TAGGTCTGCT 


TCTTTTTTTC 


GAGTGTAGCC 


CATACCTTTG 


2520 


AGCGCATAGT 


GGATGGTAGT 


TGGATGACAG 


CCAAAkTCAG 


AAGCTATTTC 


AGTCAAATAA 


2580 


GCrTCTGGAT 


TGTCAGTAAG 


ATAGTTTTTA 


AGTCTATCTC 


TATCAACTTT 


TCTTGGTTTT 


2640 


GTTCCTTTTA 


CTTGGTGGTT 


TAGCTCTCCT 


GTTTTCTCTT 


TTAGCTTTAA 


CCAGCCATAA 


2700 


ATGGTATTAC 


GTGAGATTTG 


GAAAACGTGT 


GATGCTTCTG 


TTATACTACC 


TATTCGCTCA 


2760 


C AAT AAGAG A 


GAACTTTTTT 




ATTGAATATG 


CCATAAGAAG 


ATTATACC^C 


2820 


ATTGTGTACT 


ATTTTTGGTT 


CATTTCACTA 


TAACACAAAA 


T AG A TT ATT A 


TTACATAACA 


2880 


AAAAAGAGGT 


CTAAACCTCT 


TAACTCAATT 


ACTCCGCCAG 


TAGGACTCGA 


ACCTACGACA 


2940 


TCATGATTAA 


CAGTCATGCG 


CTACTACCAA 


CTGAGCTATG 


G CGG ATT AAA 


GCTAAGCGAC 


3000 


TTCCCTATCT 


CACAGGGGGC 


AACCCCCAAC 


TACTTCCCCC 


GTTCTAGGGC 


TTAACTTCTG 


3060 


TGTTCGGCAT 


GGGTACAGCT 


GTATCTCCTA 


GGCTATCGTC 


ACTTAACTCT 


GAGTAATACC 


3120 


TACTCAAAAT 


TGAATATCTA 


TTCAATTTAA 


GAAAACCGTT 


CGCTTTCATA 


TTCTCACTTA 


3180 


CTTTGGATAA 


GTCCTCGAGC 


T ATT AG T ATT 


AGTCCGCTAC 


ATGTGTCGCC 


ACACTTCCAC 


3240 
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TTCTAACCTA TCTACCTGAT CATCTCTCAG GGCTCTTACT GATATATAAT CATGGGAAAT 3 300 

CTCATCTTGA CGTGGkT"CA CACTTAGATG CTTTCAGCGT TTATCCCTTC CCTACATAGC 3 3 60 

TACCCAGCGA TGCCTTTGGC AAGACAACTG GTACACCAGC GGTAAGTCCA CTCTGGTCCT 3420 

CTCGTACTAG GAGCAGATCC TCTCAAATTT CCTACGCCCG CGACGGATAG GGACCGAACT 34 80 



GTCTCACGAC GTTCTGAACC CAGCTCGCGT 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20986 base pairs 
(3) TYPE: nucleic acid 
:C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION : SEQ ID NO: 54: 

CGG AG AAAAA CATGGCTAAG TCAAACTTTG AAAAAGT AG A ATCAGTTGTT GGCTGGGTTC 60 

GTGATAACAA AATCACAGCC TACCGTATCT CT AAAG AAA C GAATGCGCGT GAAATGTCTA 12 0 

TCATTGCTCT GGCGCAGGGT CGTGCAAAAG TAAAAAATAT TTCATTTGAA ACAGCCCTAG 180 

GCCTAATTGA TTTCTATGAA AAAAATTATG AAAAATTTGA AGATTAATCT TTGGATAACG 24 0 

GCGGATTCTT GACCTTCAAG TAGTAGAGAT AGAGAATCTG CCTTTTCATT TTGAGGACAG 300 

CAAAAAGACT GCACGGTTGA TGCAGCCTTT TCTTTTTATT TGAGATAGCG TTGAAGGAAC 3 60 

TCTTTTGTTC GGTCTTCTTT AGGATTGGTG AAGAGGTCTT CTGGTTTACC TTCTTCAGCG 42 0 

ATCACGCCCT T ATCC AT AAA GATAACACGG TGAGAGACAT CACGCGCAAA TTCCATTTCA 43 0 

TGGGTTACCA CAATCATGCT CAAGCCTTCC TGAGCCAGGT CCTGCATGAT TTTGAGGACT 54 0 

TCTCCAACCA TTTCTGGATC GAGAGCTGAT CTTCGTTCAT CAAAGAGAAT AGCGTCCGGA 600 

TTCATGGAGA GGGCACGAGC G ATGGC C AC A CGTTGTTTTT GACCACCTGA GACTTGTTTT 660 

GGTTTGCCTT GCCAGTAGCG TTCTCCCATG CCGACCTTTT CCAGCTTTTC TTTGGCAATC 720 

TTTTCAGCTT CTGTGCGTTC GCGTTTTAGG ACAGTTGTCT GAGCGACGAT TGTGTTTTCA 7 80 

AGAACATTGA CATTTTCAAA GAGGTTAAAG GATTGGAAAA CCATCCCCAA CTTTTCACGG 84 0 

TATTGCGTGA GCTCATAGCC TTTTTCGAGG ACGTTTTGTC CATGATAAAG GATTTGTCCA 900 

TCAGTTGGTG TTTCAAGTAG GTTAATGGAG CCTAGGAAGG TCGATTTTCC GCTTCCAGAG 9 60 

CTTCCGATGA TAGAGATGAC CTCTCCCTTG TGGACAGTGA GTGAAATGTC TTTTAGCACT 1020 

TCGTTTTGTC CATACGATTT TTTGAGGTGT TTAATTTCAA GGATTGCTTG TGTCATTATT 1080 

TCAAATCCTC CGTTTGCATT TGGTTAGCAC CTGTAGTGTA GGTATCCATG TCCATTCTGC 1140 
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CLC^CG AT AAA 


GCGTAGGATA 


CGTGTTACGG 


TCAACCTGAC 


GACAAAGTAA 


ATCACGGCGA 


1200 




TGTC TGG AAG 


TATTGATAGG 


TTTGTGTTGC 


CACGGTATTT 


CCTGAGAAAT 


1260 


AAAtj i l^unL 


AACAGACATA 


ACGTTCAATA 


CAGATGTATC 


TTTGATATTG 


ATGACAAATT 


1320 


i_ >\ i 1 r\\_ rtv_> i 


TGCAGGTAGG 


ATGTTACGGA 


CTACCTGAGG 


TAGGACAATC 


TTACGCATGG 


1380 


1 L 1 otj I lr\+\j 


GGTCATACCA 


AGAGCAGTCG 


CAGCTTCAAA 


TTGTCCCTTG 


TCAACTGCTA 


1440 




ACGCACGATT 


TCAGTCATGT 


AGGCACCGGT 


ATTGATTGAA 


ACGATGAAGA 


1500 


Tirrirrrar 




AGGTTGATCC 


CGAAAGCTTG 


GGCAGTTCCA 


TAGTAGATAA 


1560 






GGCGTACCAC 


GGAAAATTTC 


AATGTAGACA 


TTGAGAACCC 


1620 


Arrrr.arTar; 


TTTTTGTAGG 


CCGTAAATGA 


CTTTGTTTTC 


AG AG AG AGG A 


GCAGTACGGA 


1580 


AOALAL t_ AA I 




AT AATG AG A C 


CTATGATGGT 


TCCGACGATA 


CACATTAAAA 


1740 




AnrarrAfGr 

nv> v» v- *^«>— 


AAGAGTTGTT 


CCCAGTTTTC 


AGAAAGAATT 


TTAGCAACTT 


1800 






GTCTCTTCAG 


TTGTTGTAGC 


TTCGGCAGGT 


TGTTCCTTGA 


1860 


1 t»A 1 nLun 1 v_ 




ACTTG GTC AT 


CTTTTGAAAT 


GGTTTCAATG 


CTGGCATTGA 


1920 


I : . LiVjt- 1 AA i 




TTTTTACGAA 


GCCCGATAGC 


GATAGCTGTA 


TCTTCTTCCC 


1990 


C Avj 11*1 uAa 




ACTTG AATCA 


TCTTGAACTT 


AGAGTTCGCA 


GCTTCAGCAG 


2040 


<-p«»"> |S /»ifr /^*Pt^P 
1 AVj 1 1 IV. 




GAAACATAAG 


CATC AATG AC 


ACCAGCCTCA 


AGAGCTTGTC 


2100 






ATGGCTGTTT 


CTTTTTTAGC 


ACCTGGGATT 


TGTGC AATCA 


2160 


. In L AAAO 




TGTTGAGAAG 


TG ATTTTTC C 


ACCGTTAAAG 


TCATCCAAAG 


2220 


All 1 AbLAL i 


Trrr.T agg c* a 


GAATCTTTTT 


TGACAAGCAA 


AACTGGTTCG 


CTAGTATAGT 


2280 


HAL i i tun 


A AAGGCAATT 


TCTTGTTTGC 


CTTCTCCAGT 


TGGACTCATA 


CCTGCGATAA 


2340 


TPATCITCAAT 


cttaccagaa 


GTAAGGGCAG 


GG ACT AG AC C 


TTCCCACTTG 


GTTTTAACAA 


2400 


CCAAACGTTC 


tttacctaag 


TCCTTAGCGA 


11 1 It- 11 Utjt, 


C A'TTG A AC A 


TCGTATCCGT 


2460 


TGGCATACTG 


attggtccca 


TCGATTTTGA 


CAGCTCCGTT 


GCTATCATCA 


TCCTGGGTCC 


2520 


AGTTAAAGGG 


agcatatgct 


GCTTCCATAC 


CGATGCGTAA 


ATATTCATCG 


GCTTGAGCAA 


2580 


CATTGACAAG 


TCCTAGCATC 


AGC AAG AG AC 


TTGTGAAAAT 


AGATAAGTAy 


ATGTGGCTCA 


2640 


TGATTTCTCC 


TATTCTGATC 


TATTAAAAAA 


TAACTGTCTC 


CTATTTTATC 


GAAAAATGCG 


2700 


TAATTTTTCA 


ACATAAGTAA 


GTCTTT ACTT 


ACCAAAAAAT 


GCTATAATGA 


TAAGAAAGAT 


2760 


AAAAAG GGGG 


CTTAGTTGAT 


GAAAAAAACT 


TTTTTCTTAC 


TGGTCTTAGC 


CTTGTTTTGC 


282C 


CTTCTTCCAC 


TCTCTGTTTT 


TGCCATTGAT 


TTCAAGATAA 


ACTCT TATCA 


AGGGGATTTG 


2880 
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TATATTCATG CAGACAATAC GGCAGAGTTT AGACAGAAGA TAGTTTACCA GTTTGAGGAG 29 4 0 

GACTTTAAGG GCCAAATCGT GGGACTTGGA CGTGCTGGTA AGATGCCTAG CGGGTTTGAC 3 000 

ATTGACCCTC ATCCAAAGAT TCAGGCCGCG AAAAACGG TO CAGAACTAGC AGATGTGACT 3 0 60 

AGCGAAGTAA CAGAAGAAGC GGATGGTTAT ACTGTGAGAG TCTATAATCC AGGTCAGGAG 3L20 

GGCGACATAG TTGAAGTTGA CCTCGTCTGG AACTTAAAAA ATTTACTTTT CCTTTATGAT 3180 

GATATCGCTG AATTAAATTC GCAACCTCTG ACAGATAGTT CAGAGTCTAT TGAAAAGTTT 3 2 40 

GAATTTCATG TAAGGGGAGA CAAGGGGGCT GAAAAACTCT TTTTCCATAC AGGGAAACTT 3 3 00 

TTTAGAGAGG GAACGATTGA AAAGAGTAAC CTTCATTATA CTATCCGTTT AGACAATCTT 3 3 60 

CCGGCTAAGC GTGGAGTTGA GTTGCATGCC TATTGGCCTC GGACCGATTT TGCTAGCGCT 3 4 20 

AGGGATCAGG CATTGAAACC GAATCGTTTA GAAGACTTTA AT AAG AT AG A AGACTCGATT 3 4 80 

GTTAGAGAAA AAG AT C AG AG TAAACAACTC GTTACTTCCC TCCTCCCTTC GATCCTTTCC 3 5 40 

ATCTCCTTGT TATTGAGTGT CTGCTTCTAT TTTATTTATA GAAGAAAGAC CACTCCTTCA 3 600 

CTCAAATATG CCAAAAATCA TCGTCTCTAT GAACCACCAA TGGAATTAGA CCCTATCCTT 3 660 

TTATCAGAAG CAGTCTACTC GACCTCCTTG GAGGAAGTGA GTCCCTTCGT CAAGGGAGCT 37 20 

GGAAAATTCA CCTTTGATCA ACTTATTCAA GCTACCTTGC TAG AT G TG AT AGACCGTCGG 3 7 80 

AATGTCTCTA TC ATTT C AG A AGGAGATGCA GTTGGTTTGA GGCTAGTAAA AGAAGATGGT 3 840 

TTGTCAAGCT TTGAGAAAGA CTGCCTAAAT CTAGCTTTTT CAGGTAAAAA AGAAGAAACT 3 900 

CTTTCCAATT TGTTTGCGGA TTACAAGGTA TCTGATAGTC TTTATCGTAG AGCCAAAGTT 3 9 60 

TCTGATGAAA AACGGATTCA AGCAAGAGGG CTTCAACTCA AATCTTCTTT TGAAGAGGTA 4 02 0 

TTGAACCAGA TGCAAGAAGG AGTGAGAAAA CGAGTTTCCT TCTGGGGGCT CCCAGATTAT 4 0 80 

TATCGTCCTT TAACTGGTGG GGAAAAGGCC TTGCAAGTGG GTATGGGTGC CTTGACTATC 4 14 0 

CTGCCCCTAT TTATCGGATT TGGTTTGTTC TTGTACAGTT TAGACGTTCA TGGCTATCTT 4 2 00 

TACCTCCCTT TGCCAATACT TGGTTTTCTA GGGTTAGTTT TGTCTGTTTT CTATTATTGG 4 2 60 

AACCTTCCAC TAGATAATCG TGATGGTGTT CTAAATGAAG CGGCAGCTGA GGTCTACTAT 4 32 0 

CTCTGGACCA GTTTTGAAAA TATGTTGCCT G AG ATTG C AC GATTGGATCA GGCTGAACTG 4 380 

GAAACTATTC TCCTCTGGAA TCGCCTCTTG GTCTATGCGA CCTTATTTGG CTATGCGGAC 4 44 0 

AAGGTTAGTC ATTTGATGAA GCTTCATCAG ATTCAAGTGG AAAATCCAGA TATCAATCTC 4 500 

TATGTAGCTT ATGGCTGGCA CAGTACGTTT TATCATTCAA CAGCACAAAT GAGCCATTAT 4 560 

GCTAGTGTCG CAAATACAGC AAGCACCTAC TCTGTATCTT CTGGAAGTGG AAGTTCTGGT 4 62 0 

GGTGGCTTCT CTGGAGGCGG AGGTGGCGGC AGTATCGGTG CCTTTTAAAG AGAGCTACCA 4 6 80 
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TAGACTGAAA 


AAGTATGATA 


TAATGGAAGA 


TAGAAAAAAG 


ACAAACTATA 


AGAAAAGTCA 


4740 


ATAGTTTTAT 


CTAAACTATT 


TCTTATTTCA 


ATTTGATGAT 


TTGGCGATGA 


TTTTAGAGCA 


4800 


CGGCAAAAAG 


CCCTTGAAAA 


AGTCCATTTT 


TTCAAACCTA 


ATCCTGTGTT 


AATTTCAGAA 


4860 


ATTACATCAC 


TTTTTGTTCG 


TCAAATGGCA 


GCTCTTTTTT 


AGGATATAAA 


ACAGGGTTCG 


4920 


GATAACTTTT 


TTTGCAAGGT 


GGATGATGGC 


TACATTGTAA 


TGTTTTCCTT 


ATTCTAACTT 


4980 


AGTCTTAAGA 


TAGGCCTTAG 


AAGCAGGTGA 


AAAGCGAGGG 


CATGCTTTGG 


CAGCTTGTAT 


5040 


GAGTGCCCAC 


CGCAGATGAG 


GGGAACCCCG 


TTTGACCATT 


CTTCCAGCTA 


AATC AATCTG 


5100 


ACCTGACTGA 


TAAATAGAAG 


AATCCAGTCC 


AGCGAAAGCT 


TGTAATTGAG 


CAGGATTATC 


5160 


AAAGGCATGA 


AT ATTTCG AA 


TCTCGGCTAA 


AATGACCGCC 


CTAAACGATC 


CCCAATCCCA 


5220 


GTAACCGTCG 


TGATGACCGA 


GTTGAACTCA 


GCCATCGAGT 


CATTGATACA 


TCTTTCCGCC 


5280 


TTGTCAATGA 


GCCTCTTGTA 


ATGCTTGATG 


ATTTCGAATT 


CACGAGCAGG 


AGATGTTGTT 


5340 


CCGATAGAAC 


GAGGTGCGAC 


TGAGAGGATA 


TCCTGAATTT 


TAGAAGCGGT 


CAATCGCTTA 


5400 


ATTTCTATCA 


GCTTATCAAA 


TCCTGCCTCA 


ATCCTTTTCT 


GAGGATTAGG 


GTAGCGTGTC 


5460 


AAGAGTTGGT 


AGGTATATTC 


TGAATGCTTT 


CCAACGATTT 


TATCCAACTC 


AGGAAAGATG 


5520 


ATATCAAGAC 


AACGAGTGTA 


TTGTACTTTC 


CAATCAGACT 


CTTTTTCTTG 


AGACGATGAA 


5580 


TATGTCTAGC 


C AGT ATTTTT 


AGGTCTACTT 


GCCGATTATC 


GTGTTGAAAT 


TGTTCACGAT 


5640 


TGGGGTCAGA 


AAGAAGTTTA 


AGAGCGATGC 


CATGAGCGTC 


TTTCTTATCC 


CTTTTACTCT 


5700 


TCCGAAGTCA 


TAATGATTTG 


GCAAATTCCT 


TGATGAGCAA 


AGGATTCTAG 


GTGTAAACTT 


5760 


TATATCCTTG 


TTCATGCAGG 


AAGTTCAGTA 


GATTAAAGGC 


ATAATGTCCA 


GTATCTTCAA 


5820 


GACCGATGAG 


ACAGTCTTGG 


TTGATCTGTC 


GAATAGACAC 


ATCTAAGAGT 


TCAAAACCAG 


5880 


CTTTATTATT 


TGAAAAAGTG 


AGTGGTTTAA 


GAACAGTTTT 


TCCTGGAACA 


TTCAAGGCTG 


5940 


TAACATCGTG 


TTT ATTTTT A 






AAGCATGGGA 


CTACCTCCAG 


6000 


ATATAGTATT 


TCAAGTCTAC 


TTGGTTATCC 


ACGAATTTTT 


TGCCTTGTTA 


CCTTAGACGA 


6060 


GATCAAACGT 


CTATGCGTTA 


TCAAACTCAT 


TACCAATTGA 


AACAAAAGCT 


GTGGTTAGAG 


6120 


CCTTTCGGAA 


ATCGTCAAGC 


GATTGGAGGA 


AATGAACTAA 


TCCATAGTGG 


CTTATTCCAA 


6180 


GTATACCACT 


TGGGCTTTGG 


CAGTAGCTAA 


CTGCGCTAAA 


TATAATATAG 


GGAGTAATCT 


6240 


ATGTATCTTA 


TTGAAATTTT 


AAAATCTATC 


TTCTTCGGAA 


TTGTTGAAGG 


AATTACGGAA 


6300 


TGGTTGCCGA 


TTTCCAGTAC 


AGGTCACTTG 


ATTTTAGCAG 


AGGAATTCAT 


CCAATACCAA 


6360 


AATCAAAATG 


AAGCCTTTAT 


GTCC ATGTTT 


AA7CTCCTCA 


TTCAGCTTGG 


TGCTATTTTA 


6420 
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GCAGTTATGG TGATTTATTT TAACAAGCTC AATCCTTTTA AACCGACCAA GGACAAACAG 64 80 

GAAGTTCGTA AGACTTGGAG ACTATGGTTG AAGGTCTTGA TTGCTACTTT ACCTTTACTT 654 0 

GGTGTCTTTA AATTTCATGA TTGGTTTCAT ACCCACTTCC ATAACATGGT TTCAGTTGCT 6 600 

CTCATGTTGA TTATCTACGG GGTTGCCTTC ATCTATTTGG AAAAGCGCAA TAAAGCGCGT 6 6 60 

GCTATCGAGC CAAGTGTAAC AGAGTTGGAC AAGCTTCCTT ATACGACCGC TTTCTATATC 6720 

GGACTCTTCC AAGTTCTTGC TCTTTTACCA GGGACTAGCC GTTCAGGTGC AACGATTGTC 67 80 

GGTGGTTTCT TAAATGGAAC CAGTCGTTCA GTTGTGACAG AATTTACCTT CTATCTTGGG 68 4 0 

ATT C CTGTT A TGTTTGGAGC TAGTGCCTTA AAGATTTTCA AATTTGTGAA AGCCGGAGAA 6 900 

CTCTTGAGCT TTGGGCAATT GTTTTTGCTC TTGGTCGCGA TGGGAGTAGC TTTTGCGGTC 6 960 

AGCATGGTGG CTATTCGCTT CTTGACCAGC TATCTGAAAA AACACGACTT CACCCTTTTT 7 02 0 

GGTAAATACC GTATCGTGCT TGGTAGTGTT TTGCTACTTT ACAGTTTTGT CCGTTTATTT 7080 

GTATAAGAAA AACCTTGAAG GGGCAACTCT TCAAGCTTTT ATACTCTTCG AAAATCTCTT 714 0 

CAAACCGCGT CAGCTTTATC TGCAACCTCA AAACAGTGTT TTGAGCAGCn CTGCGGCTAG 7 2 00 

CCTCCTAGTT TGCTCTTTGA TTTTCATTGA GCTTTAAAAT CCAGTCATGG TAATCCCCAA 72 6 0 

TAGGCGGACA CCTCTTTCTT TCTTGCTTAA TTCTTCATAG AGTTGCAGGG CTATTTGGCT 73 20 

TATCTGACTA GCATCTTGTG TTTTTTGAGC AAGACTTTTT CGTTTGGTAA GAGTTGAAAA 7 3 80 

GTCCTCGTAG CGGATTTTCA AAATGACAAT TTTTCCAGCT TTTTCTTGTT GATGTAGATT 74 4 0 

GAGACCGACT TTTTCTGATA GAAGAGTCAG CTCTTTTTTG ATATCTTCCT CAGCAAGGAG 7 500 

AATCTTCCCG TAGGTTTTCT CCTTGCCGAT TGATTTACGG ATCCCATTGC ATTTGACTGG 7 5 60 

AGAGTTGTGA ATGCCACGAG CCTTTCGATA CACATCATAG CCTAGTCTAC CAAAACGGTC 762C 

TATTAGGGTT ACCTCAGGAA CTTCAAGTAA ATCAGCACCA GTAAAAACGC CCATTTGATC 7 680 

AAGACGTTCT ACTGTCTTTT TTCCTACTCC ATGAAATTTG GAAATATCCA T TTGTT T GAG 77 4 0 

AAAATCCTCA GCCTGTTCAG GTAGAATCAC TGTCAAACCA TGTGGTTTTT GATAATCACT 7 800 

CGCCATTTTA GCTAAGAATT TGTTGTAAGA AACGCCTGCG GAAGCAGTTA GATGGAGTTC 7 86 0 

TTGCCAGATA TCTTTTTGAA TGAGGCGAGC AATTTTGACC GCTGACTTGA TACCGAGTTT 7920 

ATTTTCTGTC ACATCCAAAT AGGCTTCGTC AATGCTCATG GGTTCAATCA AATCTGTATA 7 980 

GCGCTTAAAA ATAGCTCGAA TCTGGAGTCC CACAGACTTG TATTTCTCAT AATTCCCTGA 8 04 0 

GATAAAGACA GCCTGGGGAC AACGTTCATA AGCTTCCTTG GAACTCATGG CAGAATGGAC 8100 

ACCAAAAGCT CTTGCCTCAT AACT AC AG GT AGAAACCACT CCCCGTCCAC CTGTTTGCCG 8160 

AGGGTCGCTT CCAATAATGA CAGGTTTTCC TCTGACTTTA GGATTATCCC TGATTTCCAC 8 2 20 
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TGCAGCAAAA AAGGCATCCA TCTCAATATG GATGATTTTT CTTGACAAAT CATTTAACAA 
AGGAAAAATC AACATGCCTA GCACCTTTTT AT ACT CTT CG AAAATCTCTT CAAACCACGT 8340 
CAGCTCTATm TGCAACCTCA AAACAGTGTT TTGAGCAATC TGCGGCTAGC TTCCTAGTTT 
GCTTTTCGAT TTCCATTG AG TGTTACTGCT TATTyTCTTT TATTATACCC TTTTTTCTGA 
AAAAAAGAAA AAAGGACTTT ATTTTTTCAA AAATATAATA C AGTTT G AAA TAAAATATAG 
ACTG TTT TAG AAAAGAAAGT GTAAAAATAG GGAATTTTCA CTTGTTGAAA TCGGTTACTA 
TATGGTATAC TTGTCTTATG AATGTAACAG ATGACTGTTA CTAGAAAAAA GAGGACATTA 
ATATGGTTGT TAAGACAGTT CTTCAAGCAC AAGATATTTT TGACAAAGCT TGGGAAGGCT 
TCAAAGGCGT AG ATT GG AAA GAAAAAGCAA GTGTATCACG ATTTGTACAA GCTAACTACA 
CACCTTATGA TGGAGACGAA AGCTTCCTTG CAGGACCAAC AGAGCGTTCA CTTCACATCA 
AGAAAATTGT AGAAGAAACT AAAG C AC ACT ACGAAGAAAC TCGTTTCCCA ATGGACACTC 
GTCCAACATC TATCCCTGAT ATCCCTGCTG GATTTATCGA CAAAGAAAAT GAAGTTATCT 
TCGGTATCCA AAACGATGAA CTCTTCAAAT TGAACTTCAT GCCAAAAGGT GGTATCCGTA 
TGGCTGAAAC T AC TTTG AAA GAAAATGGAT ACGAACCAGA CCCAGCTCTT CACGAAATCT 
TCACTAAATA TGTAACAACA GTTAACGACG GTATTTTCCG TGCCTACACT TCAAATATTC 
GTCGCGCTCG TCACGCACAC AC TGT AACTG GTCTTCCAGA TGCATACTCA CGCGGACGTA 
TCATCCGTCT TTACGCACGT CTTGCTCTTT ACGGTGCAGA CTACTTGATG CAAGAAAAAG 
TAAATGACTG GAATGCAATC AAAGAAATCG ATGAAGAAAC AATCCGTCTT CGTGAACAAG 
TAAACCTTCA ATACCAAGCA TTGCAACAAG TTGTTCGCCT GGGTGACCTT TACGGGGTTC 
ATGTTCGCAA ACCAGCGATG AACGTGAAAG AAGCAATCCA ATCCCTTAAC ATTGCTTTCA 
TCGCTGTCTG CCGTGTGATT AACGGTGCTG CTACATCTCT AGGTCGTGTA CCAATCGTAT 
TGGACATCTT. TGCAGAACGT GACCTTGCTC GTGGTACATT TACTGAATCA GAAATCCAAG 
AATTCGTTGA TGATTTCCTT ATGAAACTTC GTACAGTTAA ATTTGCTCGT ACAAAAGCTT 
ATGACCAATT GTACTCAGGT GACCCAACCT TTATCACAAC TTCTATGGCT GGTATGGGTA 
ACGACGGTCG TCACCGTGTT ACTAAGATGG ACTACCGTTT CTTGAACACT CTTGACAACA 
TCGCTAACTC ACCAGAACCA AACTTGACAG TTCTTTGGAC TGACAAATTG C C AT AC AACT 
TCCGTCGCTA CTGTATGCAC ATGACCCACA AACACTCTTC TATCCAATAC GAAGGTGTAA 
CAACAATGGC TAAAGACGGA TATGGTGAAA TGAGCTGTAT CTCATGCTGT GTGTCTCCAC 
TTGATCCAGA AAATGAAGAA CAACGCCACA ACATCCAGTA CTTCGGTGCT CGTGTAAACG 



3280 



8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
930C 
9 3 60 
9420 
948C 
9540 
9600 
9660 
9720 
9730 
9840 
9900 
9960 
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TTCTTAAAGC CCTTCTTACT GGTTTGAATG GTGGTTACGA CGATGTTCAC AAAGACTACA LOO 20 

AAGTATTTGA TATCGAACCA ATCCGTGACG AAGTTCTTGA ATTTCAATCA GTTAAAGCGA 10080 

ACTTTGAAAA ATCTCTTGAC TGGTTCACTG ACACTTACGT ACATCCCTTG AACATCATCC 10140 

ACTACATGAC TGATAGGTAC AACTACGAAG CTGTTCAAAT GGCCTTCTTG CCAACTAAAC 10200 

AACGTGCCAA CATGGGATTC GGTATCTGTG GATTTGCTAA CACTGTTGAT AC ATTGTC AG 102 60 

CTATCAAATA CG CT AC AG TT AAACCAATCC GTGACGAAGA TGGCTACATC TACGATTACG 103 20 

AAACAATCGG TGACTACCCA CGCTGGGGTG AAGATGACCC ACGTTCAAAC GAATTGGCAG 10380 

AATGGTTGAT CGAAGCTTAC ACAACTCGTC TACGTAGCCA CAAACTATAC AAAGACGCAG 104 4 0 

AAGCTACAGT ATCACTTTTG AC AAT C A CAT CTAACGTTGC TTACTCTAAA CAAACTGGTA 10500 

ACTCACCAGT TCACAAAGGT GTATACCTCA ACGAAGATGG TTCTGTGAAC TTGTCTAAAC 105 60 

TTGAATTCTT CTCACCAGGT GCT AA CCC AT CTAACAAAGC TAAAGGTGGT TGGTTCCAAA 10620 

ACTTGAACTC ACTTTCTAGC CTTGACTTTA GTTATGCAGC TGACGGTATC TCATTGACTA 10680 

CACAACTATC ACCTCGCCCT CTTGGTAAGA CTCGTGATGA ACAAGTTGAT AACTTGGTAA 107 40 

CAATTCTTGA TGGTTACTTC GAAAACGGTG GACAACACGT TAACTTGAAC GTTATGGACT 10 800 

TGAACGATGT TTACGAAAAA ATCATGTCAG GCGAAGACGT TATCGTACCT ATCTCTGGAT 108 60 

ACTGTGTAAA CACTAAATAC CTCACTCCAG AACAAAAAAC TGAATTGACA CAACGTGTCT 10920 

TCCACGAAGT TCTTTCAATG GATGACGCC? TGGATGCATT GAGCTAATCA AGTTCTTGAA 10980 

TAATAAAAAG GAACCCTCGG TCAAACGACT GAGGGTTTTG TGCTTGGGAT AGTATGAGCA 1104 0 

ATTC CTTCGG CGCAATATGC AATGTTTTTG GCCTCTTTGT CAACTGTAGT GGGTTGAAAA II 100 

AAAGCTAACC TTGAGAAAGG ACAAATTTCG T C CTTTCTTT TTTGATGTTC AGGGCGATAA 11160 

AAATCCGTTT TTTGAAGTTT TCAAAGTTCC GAAAACCAAA GGCATTGCCC TTGATGTCTT 1122C 

TGATGAGTTT GTTAGTGGCC TCAAGTTTAG CGTTAGAATA AGGCAATTCA ATGGCGTTAG 112 80 

TCATCTAGTT TTTATAGCAA ATAAATGTGC TC AAAGTGGT TTTAAACGTC CGGTTGAGAT 11340 

GAGGTAACGT GTCTTGAATT AAGCCCCAAA ACTGGTCAGT ATTCTTCTCT TGTAGATGAA 114 00 

ATAGGAGTAG TTCATACACC TCATAGTAAT CTTTAAGTTC AGGTACTAGA GTAAAGATTT 114 60 

TCTTCAGACA CTCCCTAGGA GTTAAGGTCT CTCTGAAAGT TCTAGCATAG AAAGGCTTAA 11520 

GAGAGAGTTT CCGACTATCT TTTAGGATAA ATTTCCAGTA ATATTTAAGA GCTCTGTATT 1158C 

CCAGAGATTT ATCATCAAAT TGCTTCATGA TGTTGATTCT AGTCTGATTA AGAGCCCTGC 11640 

TCATCTGTTG GACAATGTGG AAACGATCGA GAACAATTTT AGCATTGGGA AATAATTTCT 11700 

- TAATGACAGC GATATAACTT CC AG AC AT AT CAACAGTGAC GACTTTAACT TTTTTTCTAC 117 50 
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CTTCTTTCGA 


GTACTTGAAG 


AAATGATTTC 


GGATGGTTGT 


TTGACGTCTG 


TTATCAAGAA 


11820 


TGGTCATCAT 


TTTCTTAGTG 


TTGAAATCCT 


GAGCAATGAA 


AGCCAATTTC 


CCCTTCTGGT 


11880 


AGGAGAATTC 


ATCCCAGGAG 


AGGATTTCAG 


GCAAAGTGGT 


GTAATCCTCT 


TGGAAATGAA 


11940 


ATTGCTTGAG 


CTTACGATAG 


ACGGTAGAGG 


TAGAGGTAGA 


GGTAGAGATG 


GC T AATTT AG 


12000 


AAGCGATATG 


TGTAAGAGCC 


TCTCTGTTGA 


GTAGGAGTTG 


GGCAATTTTC 


TGTCTCACCA 


12060 


TTTCCGACAT 


TTGGCAATTT 


TTCTGAACGA 


GAGTTGTTTC 


AGCTACAGTG 


ACTTTCCGAC 


12120 


AGGACTTGCA 


TTGAAATCGT 


CTCTTTTTCA 


AATGAATGAG 


GCTAGGGAAA 


CCACCAATCT 


12180 


CGATAAAAGG 


GATTTTAGAA 


GGCTTTTGGA 


AGTCGTATTT 


GATTTGTTTT 


CCTTTACAGT 


12240 


GTTTACATTT 


AGGTGGGTGA 


TAATCAAGTG 


TAGCGAAGAC 


TTCGATATGG 


GTATCGTG CT 


12300 


GAATGGCTTT 


ATTTAAGGTG 


ATGTTTTTGT 


CTTTTATTCC 


GATGAGTAAT 


GTGGTATGAT 


12360 


TGATGTGTTC 


CATAAGATAC 


TTTCTAATGA 


GTTGTTTAGG 


CG CT TTT CAT 


T AT AAGTC TT 


12420 


ATGGGACTTT 


TTTGATACTC 


AAAAAGCCCT 


ATAATCTCCA 


CAGTGGGATT 


TACCCACTAC 


1 2 480 


AG AAATT ATA 


GAGCCAGAAA 


AAACACTTTT 


GTTCACTAGC 


AGAAACTAGA 




12540 


GTTTTTCTCT 


TCAGATTTAC 


CCAAAACTGG 


GAAATATGGG 


GATAAGAATA 


GAGATGGCTT 


12 600 


AGGAAGCCCC 


TTTTTGTGTG 


TAGACAGTAC 


GATGAACTTA 


TAACAAATAG 


TGAGCCTTTT 


12 660 


TAGCAATCAT 


TGCGACCCGT 


TTGTCAAAAG 


CCTCTTTT CG 


GATATCTACA 


ATTGTCTGAT 


12720 


AGATGAGACG 


CTGTTGGCTA 


ACATGCAAAT 


CTAAGGCAAT 


CGTCAAAAAG 


TGATGTTTCC 


12"?80 


CTTTGGGATA 


CTGCTTTTTA 


ACGTAACCCA 


GGTATTCTTT 


CGTTGTAATA 


ATAATCAATG 


12 940 


GCTCTGTCAA 


ATGCTCCTCT 


GAAGGAGGAG 


GACTAATTAG 


AATATTGTAT 


CCTGTAACAG 


12900 


AGCCAACTTT 


GTCAGTAAAA 


TTCCGTAAAA 


TAATGGACTT 


TATTAAGTTT 


ACATCTGCTT 


12960 


GATTATTTAA 


AATGATAAAA 


ATCGGGATAG 


CAGGTAGTGA 


GGAAAAGATG 


GTTTCTGTCA 


13020 


AGTAGAGTGA 


GAAAAGGTAC 


AGCCGATGCT 


GGTCGATAAC 


TCCTTCAATC 


TTCTGCTCAG 


13080 


TCATCCACTC 


TTGAACAATT 


GCTTTCGAAA 


TATGATACAG 


TGGCTTGTCG 


CTTTCAATCC 


13140 


CATAATGTTC 


GTAATAATTA 


TAATAGGGAA 


CTAGATTTTG 


TAAACCAAAC 


AAAAACGTTC 


13200 


TTGTTAAGAA 


AGTCAGTGCT 


GTTAAAAAAG 


AAAGACAATT 


C G AAATGTC A 


TTTCCTAAGA 


13260 


TATTCTTGAA 


CTTGGATAGT 


ACATGCTTTC 


CTCTTGTATG 


CTGAAGAATC 


AGTTGAATAG 


13320 


TATGAGTCTT 


TTTTTCTT G A 


TTCCATTTGT 


CCTTCGAAAA 


CGAAGAATTA 


GCAGAACAAT 


13380 


AAACCAAAAA 


GATATAATCC 


AGTTCTTCCT 


GAGTAAAAGT 


CATGTTGGCA 


TGTGGCTCTA 


13440 


AGTAAGTTTG 


GCAATGTTCC 


ATCAAAATCG 


GAT A CAT AAA 


GAGGTTTTTT 


AATTTTTCAA 


13500 
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ACTCTTTGGA CTCACGGAAC TCAAGTGGAA ATTCCCGACG TTTCCAAGTG AGTGCCACTA 13 560 

GT ATGCT AAA ATGAACATAC TCGTCAGGTG TGATTTCTAA CAGTTCATGA CTGAGTTGAG 13 6 20 

AATTAGACTG C AC AATC AT A TGTGTGACCC AATCCATACT TCCATCATTC AAATCATAAA 13 680 

TCTCAATACC AAAATGAAAC TGGAGGAGTG CAATTAAAAA ACGAATGCGA TATTCAGGAC 13 740 

CAACTACTTG ATTTTTCACA AGGTCCAAAC CTACTGAACG TAGTAACAAG CC ACACTTTT 13 800 

GTCGTACGCG GTAGCCTGTT GCGATGGAAA TATACTCTTT TTGTGTAAAT TCGTTAAAGC 13 8 60 

TTTG ATT AC C TTGTAGTAGA AAGAAGCGGA CTATTTTTAA AATAGTTGAT TCGTTATAAA 13 9 20 

GCTGATGGAA GTAATAATTC GTTTG ATG AG AATGCTCTTC CATTAATTGA ACTTGTTGCG 13 980 

TATCTAAATT AAATGTCAAC TCTTCCTCGA ATGTTTCTTG TAATTCCTCC AAAATG CTT A 140 4 0 

GGAGACTTTT AG ATTGT AAT CAAGTTAAAG TAGACAGTTC ATCTAGTTCA ATAGACCGAA 14 100 

TATCCAATAA TATATTTAAA ATGGT AATTT TAT CTGT AAT TCTTTTTTCA ATGTATTTGT 14 160 

TTAGCATAGT TACCGAATCT TAGTTGCATA T AG AT AATTT T AATT ATT AT AATACAAAAG 14 2 20 

AAACTAATTG TCTTGTCAAA AAGGTTGTGG AATTTCCGAC TTTATTGATA AAACAGCATG 142 80 

TAATAAAAGG CATTTTAAAG ATAGTAATGA GTATTGGTGG ACTTTTATGC CTTATTTTTT 14 340 

TTATTAGAAA ATATTTTTTT ATCAAATATT GTCGTTCTAT AAAAAAATAT GTGATAAAAA 144 00 

TATCT ATTGT GATGCAAGTT GTTTT AATTT ATACTAGGAT AGTTAATAGT AAT ACT AT AC 14 4 60 

TAT ACT AT AT TGTATACAAG TGTGTCATTG CCAGGTTGAG AAGATAGCTA TAACGCACTT 14 5 20 

TTATACGCTT TTGCTACGTT TGTTAGTGAA CGGATTAACT CAGTGAGATA AATTTT A T C A 14 5 80 

GAACATAAGT AATCCCTTTC TTCGTGTATA CAGATTGAAA GTACCTATGA ATCATAGAAG 14 640 

GATTAACTTC T T CT ATG AAT AATGCTTAAC AGGGAGACAC ACATGAAAAA AGTAAGAAAG 14 700 

ATATTTCAGA AGGCAGTTGC AGGACTGTGC TGTATATCTC AGTTGACAGC TTTTTCTTCG 147 6 0 

ATAGTTGCTT TAGCACAAAC GCCTGAAACC AGTCCAGCGA TAGGAAAAGT AGTGATTAAC 14 820 

GAGACAGGCG AAGGAGGAGC GCTTCTAGGA CATCCCCTCT TTGAGTTGAA AAACAATACG 14 8 80 

GATGGCACAA CTGTTTCGCA AAGGACAGAG GCGCAAACAG GAGAAGCGAT ATTTTCAAAC 14 940 

ATAAAACCTG GG AC AT AC AC CTTCACACAA GCCCAACCTC CAGTTGGTTA TAAACCCTCT 15000 

ACTAAACAAT GGACTGTTGA AGTTGAGAAG AATGGTCGGA CGACTGTCCA AGGTGAACAG 15060 

GTAGAAAATC GAGAAGAGGC TCTATCTGAC CAGTATCCAC AAACAGGGAC TTATCCAGAT 15120 

GTTCAAACAC CTTATCAGAT T ATT AAGGT A GATGGTTCGG AAAAAAACGG ACAGCACAAG 15130 

GCGTTGAATC CGAATCCATA TGAACGTGTG ATTCCAGAAG GTACACTTTC AAAGAGAATT 152 4 0 

TATCAAGTGA ATAATTTGGA TGATAACCAA TATGGAATCG AATTGACGCT TAGTGGGAAA 15 300 
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ACAGTGTATG AACAAAAAGA TAAGTCTGTG CCGCTGGATG TCGTTATCTT GCTCGATAAC 15 360 

TCAAATAGTA TGAGTAACAT TCGAAACAAG AATGCTCG AC GTGCGGAAAG AGCTGGTGAC 15420 

GCGACACGTT CTCTTATTGA T AAAATT AC A TCTGATTCAG AAAATAGGGT AG CGCT TGTG 154 80 

ACTTATGCTT CCACTATCTT TGATGGGACC GAGTTTACAG TAGAAAAAGG GGTAGCAGAT 15540 

AAAAACGGAA AGCGATTGAA TGATTCTCTT TTTTGGAATT ATGATCAGAC GAGTTTTACA 15600 

ACCAATACCA AAGATTATAG TTATTTAAAC CTGACTAATG ATAAGAATGA CATTGTAGAA 15 660 

TTAAAAAATA AGGTACCTAC CGAGGCAGAA GACCATGATG G AAAT AG AT T GATGTACCAA 15720 

TTCCCTGCCA CTTTTACTCA GAAAGCTTTG ATGAAGGCAG ATGAGATTTT GACACAACAA 157 80 

GCGAGACAAA ATAGTCAAAA AGT C ATTTTC CATATTACGG ATGGTGTCCC AACTATGTCG 15840 

TATCCGATTA ATTTTAATCA TGCTACGTTT GCTCCATCAT ATCAAAATCA ACTAAATGCA 15 900 

TTTTTTAGTA AATCTCCTAA TAAAGATGGA AT ACT ATT AA GTGATTTTAT TACGCAAGCA 15 960 

AC T AGTGG AG AACATACAAT TGTACGCGGA GATGGGCAAA GTTACCAGAT GTTTACAGAT 16020 

AAG AC AGTTT ATGAAAAAGG TGCTCCTGCA GCTTTCCCAG TTAAACCTGA AAAATATTCT 16080 

GAAATGAAGG CGGCTGGTTA TGC AGTT AT A GGCGATCCAA TTAATGGTGG AT AT ATTTGG 16140 

CTTAATTGGA GAGAGAGTAT TCTGGCTTAT CCGTTTAATT CTAATACTGC T AAAATT AC C 16200 

AATCATGGTG ACCCTACAAG ATGGT ACT AT AACGGGAATA TTGCTCCTGA TGGGTATGAT 16260 

GTCTTT ACGG TAGGTATTGG T ATT AACGG A GATCCTGGTA CGGATGAAGC AACGGCTACT 16320 

ACTTTTATCC AAAGTATTTC TAGTAAACCT GAAAACTATA CCAATGTTAC TGACACGACA 16380 

AAAATATTGG AACAGTTGAA TCGTTATTTC CACACCATCG TAACTGAAAA GAAATCAATT 16440 

GAGAATGGTA CG ATT AC AG A TCCGATGGGT GAGTTAATTG ATTTGCAATT GGGCACAGAT 16500 

GGAAGATTTG ATCCAGCAGA TTACACTTTA ACTGCAAACG ATGGT AGTCG CTTGGAGAAT 16560 

GGACAAGCTG TAGGTGGTCC ACAAAATGAT GGTGGTTTGT TAAAAAATGC AAAAGTGCTC 16620 

TATGATACGA CTGAGAAAAG GATTCGTGTA ACAGGTCTGT ACCTTGGAAC GGATGAAAAA 16 680 

GTTACGTTGA CCTACAATGT TCGTTTGAAT GATGAGTTTG TAAGCAATAA ATTTTATGAT 16740 

ACCAATGGTC GAACAACCTT AC AT C CT AAG GAAGTAGAAC AGAACACAGT GCGCGACTTC 16 800 

CCCATTCCTA AGATTCGTGA TGTGCGGAAG TATCCAGAAA T C AC AATTT C AAAAGAGAAA 16 860 
AAACTTGGTG ACATTGAGTT TATTAAGGTC AATAAAAATG ATAAAAAACC ACTGAGAGGT 
GCGGTCTTTA GTCTTCAAAA ACAACATCCG GATTATCCAG ATATTTATGG AGCTATTGAT 
CAAAATGGCA CTTATCAAAA TGTG AGAAC A GGTGAAGATG GTAAGTTGAC CTTTAAAAAT 



16920 
16980 
17040 
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CTGTCAGATG 
CAAAATAAGC 
ATCCTTCCAC 
AATGAACCTA 
TTCTATCTGA 
TAAAGTGTAG 
GAAGAGATTT 
GGGGAGATGT 
T AC AT TG C T C 
CAAATTTTTA 
AACAGTTTTT 
AGATGGGGAT 
AGTGGCTGTT 
TACTAATAAT 
AACATTTAAA 
AGGAGCTAAA 
CAGTTTATCA 
AATTGAAATT 
AGAAGCAAAG 
TGTAGATAAA 
TACAAAAATT 
AGGTTTGGCA 
AG GTG AT T AT 
TTTAGCTAAA 
ATTGAATGAC 
TAATAATCCA 
GACATTGACC 
AACGTTCGAT 
AGACAAAAAT 
ACGTACTATA 



GGAAATATCG 
CTATCGTTGC 
AAGATATACC 
TTCCTCCAAA 
TAGGTTGCAT 
AAATGATAAT 
AAGTTTACTT 
TTTCGAAAAC 
ACAGTTGATT 
ACAATGCTTG 
GCGGCTCGGA 
ATGGATAAAA 
CTACCTGCAA 
GAAATTATTG 
CTCTCAGGGG 
TTTAACACGG 
ACTTATGTCG 
GAATTACCAT 
CCAAAAATTG 
GAT AC AC CTG 
CCAGCACTTG 
TTCAACAAAG 
GCTCTAACAG 
GTGAATGACC 
AAAGCAATTG 
GATCACGGGA 
AAGACATGGG 
TTGGTTAATG 
ACAGTTACTG 
AAAGGGTATT 



ATTATTTGAA 
CTTCCAAATA 
AGCGGGTTAC 
GAGAGAATAT 
GATGATGGGA 
ATCTATGTTC 
GGTGAAACCT 
TTGCACAGAA 
TTAAGAGATA 
CTGCCTTATT 
CGACAACAAC 
TTGC AAATG A 
ATGCAAAAGA 
AT G AAAATGG 
CAATGCCGGC 
CAAATTTACC 
GTGAAGATGG 
TCAACGATGT 
ATAAAGATTT 
TGAACCACCA 
CTAATTATGC 
GTACAGTGAA 
AAGTAGCAAC 
AAAACGCTGA 
TAGAAGTACC 
AT ACTC C AAA 
TTGATGCTAC 
CTCAGACTGG 
TTAACGGATT 
CAGCAGATTA 



488 
AATTCTGAAC 

GTAAATGGAG 

GAGTTTACGA 

CCTCGAACTG 

CGAGTTCTAT 

TGAACGATAC 

GTTTTATTCG 

AAAGGATTAT 

TGAATAAGGA 

ACTGACAGCG 

ATCTGTT A C C 

GTTAGAAACA 

AATTGCCGGT 

CCAAACTCTA 

AACTGCAATG 

AGCTGCTAAG 

AGCAACCTTA 

TGTGCATCCG 

CAAAGGTAAA 

AGTTGGAGAT 

AACAGCAAAC 

AGTAACTGTT 

TGGTTTTGAT 

AAAAACTGTC 

AGAATCTAAT 

GCCGAATAAG 

AGGTGCACCA 

TAAAGTTGTA 

GGATAAAAAT 

TCAAGAAATC 



CACCTCCTTA 
AAGTCAGAGA 
ATGATAAGCA 
GTGGTATCGG 
TATACACACG 
TTTTAAGAAG 
TAAGTAAACT 
TATTGTCATG 
GAAATCATGA 
AGTAGCCTGT 
GTTCATAAAC 
GGTAACTATG 
CTTATGTTCC 
GGAGTGAATA 
AAAAAATTAA 
TATAAAATTT 
ACAGGTTCTA 
CATGTCTATC 
GCAAATCCAG 
GTTGTAGAGT 
TGGAGCGATA 
GATGATGTTG 
TTGAAATTAA 
AAAATCACTT 
GATGTAACAT 
C C AAATG AAA 
ATTCCGGCTG 
CAAACTGTAA 
ACAGAATATA 
ACTACAGCTG 



TAAACCCGTT 
TGTGACTTCA 
CT AT ATT AC C 
AATGTTGCCA 
GAAACATCCG 
TAG CACTC AA 
ATCATTGAAA 
TGTAATTCAT 
AATCAATCAA 
TTTCAGCTGC 
TATTGGCAAC 
CTGGTAATAA 
TTT GG AC AAA 
TTGATCCACA 
CAGAAGCTGA 
ATG AAATTC A 
AAGCAGTTCC 
CAAAAAATAC 
ATACACCACC 
ACGAAATTGT 
CAATGACTGA 
C ACTTG AAGC 
CAGATGCTGG 
ATTCGGCAAC 
TTAACTATGG 
ACGGCGATTT 
GAGCTGAAGC 
CTTTGACAAC 
AATTCGTTGA 
GAG AAATTG C 



17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18130 
18240 
18300 
18360 
18420 
18480 
13540 
18600 
18660 
18720 
18780 
18840 
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TGTCAAGAAC TGGAAAGACG AAAATCCAAA ACCACTTGAT CCAACAGAGC CAAAAGTTGT 1B900 

TACATATGGT AAAAAGTTTG TCAAAGTTAA TGATAAAGAT AATCGTTTAC CTGGGGCAGA 18 9 60 

ATTTGTAATT GCAAATGCTG ATAATGCTGG TCAATATTTA GCACGTAAAG CAGATAAAGT 19020 

GAGTCAAGAA GAGAAGCAGT TGGTTGTTAC AACAAAGGAT GCTTTAGATA GAGCAGTTGC 19080 

TGCTTATAAC GCTCTTACTG CACAACAACA AACTCAGCAA G AAAAAG AG A AAGTTGACAA 19140 

AGCTCAAGCT GCTTATAATG CTGCTGTGAT TGCTGCCAAC AATGCATTTG AATGGGTGCC 19200 

AGATAAGGAC AATCAAAATC TTGTGAAATT AGTTTCTGAT GCACAAGCTC CCTTTGAAAT 19 2 60 

TACAGGCCTT CTTGCAGGTA CATATTACTT AGAAGAAACA AAACAGCCTG CTCGTTATCC 193 20 

ATT AC T AACT AGCCGTCAGA AATTTGAAGT CACTGCAACT TCTTATTCAG CGACTGGACA 19380 

AGG C ATT GAG TATACTGCTG GTTCAGGTAA AGATGACGCT ACAAAAGTAG TCAACAAAAA 19 440 

AATCACTATC CCACAAACGG GTGGTATTGG TACAATTATC TTTGCTGTAC CGGGCGCTGC 19 500 

CATTATGCGT ATTGCAGTGT ACGCATATGT TAAAAACAAC AAAGATGAGG ATCAACTTGC 19 560 

TTAAGTAAGA GAGAAAGGAG CCATTGATGA CAATGCAGAA AATGCAGAAA ATCATTAGTC 19 620 

GTATCTTCTT TGTTATGGCT CTGTGTTTTT CTCTTGTATG GGGTGCACAT GCAGTCCAAG 19680 

CGCAAGAAGA TCACACGTTG GTCTTGCAAT TGGAGAACTA TCAGGAGGTG GTTAGTCAAT 19740 

TGCCATCTCG TGATGGTCAT CGGTTGCAAG TATGGAAGTT GGATGATTCG TATTCCTATG 19 800 

ATGATCGGGT GCAAATTGTA AGAGACTTGC ATTCGTGG G A TGAGAATAAA CTTTCTTCTT 19 860 

TC AAAAAG AC TTCGTTTGAG ATGACCTTCC TTGAGAATCA GATTGAAGTA TCTCATATTC 199 20 

CAAATGGTCT TTACTATCTT CGCTCTATTA TCCACACCCA TCCGGTTTCT TATCCAGCTG 19980 

AATTTCTTTT TGAAATCACA GATCAAACGG TACAGCCTTT GGTCATTGTA GCGAAAAAAA 2 0040 

CAGATACAAT GACAACAAAG GTGAAGCTGA TAAAGGTGGA TCAAGACCAC AATCGCTTGG 20100 

AGGGTGTCGG CTTTAAATTG GTATCAGTAG CAAGAGATGT TTCTGAAAAA GAGGTTCCCT 20160 

TGATTGGAGA ATACCGTTAC AGTTCTTCTG GTCAAGTAGG GAG AACT CTC T AT ACTG AT A 20220 

AAAATGGAGA GATTTTTGTG ACAAATCTTC CTCTTGGGAA CTATCGTTTC AAGGAGGTGG 20 2 30 

AGCCACTGGC AGGCTATGCT GTTACGACGC TGGATACGGA TGTCCAGCTG GTAGATCATC 2 0 340 

AG CTGGT G AC GATTACGGTT GTCAATCAGA AATTACCACG TGGCAATGTT GACTTTATGA 2 04 00 

AGGTGGATGG TCGGACCAAT ACCTCTCTTC AAGGGGCAAT GTTCAAAGTC ATGAAAGAAG 204 60 

AAAGCGGACA CTATACTCCT GTTCTTCAAA ATGGTAAGGA AGTAGTTGTA AC ATC AGGG A 2 0520 

AAGATGGTCG TTTCCGAGTG GAAGGTCTAG AGTATGGGAC ATACTATTTA TGGCAGCTCC 2 0580 
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AAGCTCCAAC TGGTTATGTT CAATTAACAT CGCCTGTTTC CTTTACAATC GGGAAAGATA 20 640 

CTCGTAAGGA ACTGGTAACA GTGGTTAAAA ATAACAAGCG ACCACGGATT GATGTGCCAG 20700 

ATACAGGGGA AGAAACCCTT GTATATCTTG ATGCTTGTTG CCATTTTGTT GTTTGGTAGT 2 0760 

GGTTATTGTC TTACGAAAAA ACCAAATAAC TG AT ATT C AA TGT A CAT CAT TATGAATAGG 20820 

ATAGCAGGCT GAAGGGAAGA CCAGAGTACT CTGAGGTGAT GTTAATCAGG AATCATGGTG 2 0890 

ATGTGGCATG AATCATCAAT AACGGATATG AGGCTCGGCA GATTGTGCCA CCCTCATTGT 2 0940 

GGGTTATTGT TTGTAAAACG ATAGGACTGG TCTCGTAATC ATTTTA 2 0986 

(2> INFORMATION FOR SEQ ID NO : 5 5 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21040 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


55 : 






CCCAGCAAAA 


AGCCATCCGA 


AGATGACTTT 


TTTGCTATTT 


AATTTCTGTA 


TAAGTTACTT 


60 


CCAAGCCACG 


CTTAACAGCT 


GGACGATTGG 


CAATTTTTTC 


TGCCCATTTT 


ACTAGATTTT 


120 


G AT AACTTG A 


GGC ATC CAAG 


AATTTTGCAG 


AACCTTGGTA 


AAGATTTCCT 


TGAACTAACT 


180 


GTCCATACCA 


AGACCAGATA 


GCAATATCTG 


CAATCGTATA 


CTCATTGCCT 


GCAATATAAC 


240 


GTTTCTGAGC 


CAATTCCTTA 


TCCAATAAAT 


CCAACTGGCG 


TTTCACTTCC 


ATCGTAAAAC 


300 


GGTTAATAGG 


ATATTCCAAT 


TTTTCAGGAG 


CATAATTGAA 


GAAATGTCCA 


AATCCCCCAC 


360 


CTAGAAAAGG 


TGCTCCACCT 


GCTTGCCAGA 


ATAGCCAATT 


CAAAACTTCT 


ACCTTTTCCA 


420 


CAGGATTACT 


TGGTAAAAAG 


GCTCCAAATT 


TCTCAGCAAG 


GTAAAGAAGA 


ATATGAGCAG 


430 


ACTCAAAGAC 


TCTTACGTTT 


TCAGTACCTG 


ACTGGTCCAA 


TAAGGCTGGA 


ATCTTGGAAT 


540 


TTGGATTGAG 


CTTCACAAAG 


TCTGATCCGA 


ATTGATCCCC 


ATCCATGATA 


GCAATCTTAT 


600 


ACAAGTCGTA 


AGCCGCTTCC 


TTAAAACCAG 


CTTCTAGTAA 


TTCTTCCAAT 


AAGATAGTAA 


660 


CCTTCACACC 


ATTTGGTGTT 


CCCAGTGAAT 


AAAGCTGAAA 


AGCTTGTTCT 


CCTTTTGGCA 


720 


AGTTTTGTTC 


CAAACGGGCA 


CCTGCTGTTG 


GT CTGTTT AG 


CCCCGTAAAA 


GCTCCTTGAT 


730 


TACT AG CTTC 


ATCCTGCCAT 


ACGGTCGGTA 


ATTGATATGC 


TGACATCCGA 


AACCTCCCTT 


340 


AAATCGCATT 


CTTGTCAAAA 


CCGAGTTTGC 


GTTGAATAAA 


CTTAACGATT 


TCGACGATGA 


900 


TAATCATTGA 


GAAGCTTCCA 


GCCATAACAA 


TTCCCCATTG 


TGACAAGTCT 


AGTTTGGTTA 


960 


CGTGGAAGAT 


TCCTTCAAGC 


GGTTCTACAA 


CGATTGTTGC 


CATGAGAAGG 


ATAAAGGATA 


10 20 
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CCAAGATGGA CCAGTTAAAG GTCTTAGACT TGAATGGGCC AACTGTCAAG ATGGATTGGT 1080 

AGACAGACTT GACATTGTAG GCATGGAAGA GCTGAATCAA ACCAAGGGTT GCAAAGGCCA 114 0 

TCGTTAGGGC ATCTGCATGA ATAGCATGAT TGTCACCCAC ATGAACTGGG TAAGCAATCG 1200 

CAAGGCCATA AAC AC T CAT A ACAAGAGCTG CTTGG AGT AC ACCTTGATAA ATGATAGAAC 12 60 

TCAAAACACC AC CTG AG AAG AAGCTTGCCT TGCGTCCACG TGGTTTATGA TTCATGACAC 13 20 

CAGGTTCCGC AGGTTCAACA CCAAGAGCGA TAGCTGGGAA GGTATCCGTT ACCAAGTTGA 13 80 

TCCACAAAAG ATGAACCGGC TGT AAG AC AT CCCAACCAAA CAAGGTTGAT AGGAAGATGG 14 40 

TTAATACTTC AGCAGTATTA GCAGAAAGTA GGTACTGAAT AGTCTTTTGA ATGTTTGAGA 1500 

AGACCTTACG TCCTTCTTCC ACTGCGACGA TAATAGTCGC AAAGTTATCA TCTGCAAGAA 15 60 

TCATATCAGA AGCCCCCTTA GAAACCTCTG TACCAGTGAT TCCCATACCG AT ACCG AT AT 16 20 

CGGCTGTTTT CAGAGCTGGC GCGTCATTGA CACCGTCACC TGTCATGGCA ACGACTTTAC 16 80 

CTTGTTTTTG CCAAGCCTTG ACGATACGAA CCTTGTGTTC TGGAGACACA CGGGCATAAA 17 40 

CAGAGTATTG AC C AACG ACT TTTTCAAATT CTTCATCTGA CAGTTCATTG AGTTCAGCAC 18 00 

CAGTTAAAAC GTGACCTTCT GTATCGTTTG CGTCAATGAT TCCCAAACCT TTGGCAATGG I8 60 

CTTCCCCTGT GTCTTGGTCG TCACCTGTAA TCATAATTGG ACGGATTCCC GCTTCCTTAG 19 20 

CCACACGAAC AGCCTCAGCG GCTTCAGGAC GTTCAGGGTC AATCATCCCA ATCAAACCAG 198 0 

TAAAAATTAA ATCATTTTCA AGCTCTTCAG AAGTG AG ATT TTCTGGAATA CTATCGATAA 204 0 

TCTTATAACC ACCTGCAAGG ACACGCAAGG CTTGATCAGC CATTTCAGAA TTGTTTGTAC 2100 

GAATGAGATT TGTAACCTTC TCATCAATCG GAGCAATATC CCCAGCCTTA TCACGAAGAA 2160 

CACAACGTTT T AAG AGTTGG TCTGGCGCAC CCTTGACTGC TACAAGGAAA CGACCATCTG 2 2 20 

GCAATGGGTG AACTGTTGAC ATGAGCTTAC GGTCAGAGTC AAATGGCAAT TCAGCTACAC 2 2 80 

CAGGATATTT CTCTAAGAAA CCTTTGACAT CATAGCCCTT GTCCAAGGCA TATTGGATAA 2 3 40 

AGGCTGTTTC GGTTGGGTCA CCAATCAAGT TACCTTCCAC ATCGATTTTC GTATCATTGG 2 4 00 

CCAAGACAAC TGAACGAAGT AGTGGCATTT CAAGACCTAG TTCAATATCA TCACCTGACT 24 60 

CATGTAGAAC CGCATCGTAG AAGACTTTTT CGACTGTCAT CTTGTTCATA GTCAGCGTAC 2 520 

CAGTCTTATC AGAAGCGATG ATTTCAGTTG AACCAAGTGT TTCAACTGCT GGCAACTTAC 25 80 

GAACGATGGA ATGTCCTT7G GCCAAAACTT GAGTACC AAG AGAAAGAACG ATGGTAACGA 2 64 0 

TAGCACGAAG TCCTTCTGGA ATGGCTGCAA CGGCAACGGC AACAGAAGTC AACAACTCAC 2 7 00 

CAAGTGGATT TTTCCCTTGA ATGAAGACAC CCACTACAAA AGTAACAAGG GCAATGACCA 27 6 0 



WO 98/18931 



PCT/US97/19588 



492 

AGATAGCATA GGTCAAGACC TTAGAAAGGT TGTTCAAATT TTGTTTGAGT GGTGTATCAG 2 820 

TCTCATCCGC ATCTTGAAGC ATACCAGCAA TATGACCAAC TTCAGTGTAC ATACCTGTAT 2 8 80 

TGACAACAAC ACCCATCCCA CGACCATAGG TTACGTTTGA GTTTTGGAAG GCCATGTTGA 2 94 0 

CACGGTCACC AATACCAGCA TCTGTCGCAA GCTCGACTGA C AAGT CTTTT TCGACTGGTA 3 000 

CAGATTCACC TGTCAAGGCT GCTTCTTCAA TTTTAAGAGA GTTGGCTTCT ATCAAACGTA 3 060 

GGTCCCCTGG TACCACGTCA CCTGCTTCAA GGGCAACGAT ATCGCCTGGT ACCAATTCTT 3120 

TAGAGTCAAT CTCTGCCATG TGTCCATCAC GAAGAACGCG GGCAACTGGA CTAGACATGG 3 130 

ATTTGAGGGC TTCAATAGCT TCTTCAGCTT TTCCTTCTTG GTAAACACCA AAGGCAGCGT 3 24 0 

TGATGATAAC CACAGCTAGG ATGATAATGG CATCTGCGAT ATCTTCCCCA CCAGAAGTCA 3 3 00 

CGACTGACAA GATTGccGCC GCAACTAGGA TGATAATCAT CAAATCCTTA AATTGCTCGA 3 3 60 

TGAATTTGAC CAAGATTGAT CCTTTCTCGC CTTCTTCGAG TTCATTGTGC C C AAATT CGG 34 20 

CAAGGCCCTT TTCCGCCTCA CTTGATGACA AACCTTGCTC GGTCGCATCC ACAGCCTGCA 34 80 

AGACCTCTTC AGGGCTCTGA GTATAAAACG CTTGGCGTTT TTGTT CTTTT GACATGTGTC 3 540 

TCCTCCTTGA CATTGTGTGC AAAACAGACT CTCTTTCTCT CAT AG CTTTT CACGACAAAC 3 600 

AAAAAGAAAC CTGTTAATCA TAACAAGTCT CGCTGTTTAA GATAGGGCCG GAAAGCATAC 3 6 60 

TTTTCAGCAT AAAATT CGG A ATGACGACAC TATCACAGGT TTCTGCCAGC TACTCCCTTC 37 20 

AGTAGTACCA TT AT AC C AAA TTTTGGGGAG TTTTCAAAGA GTAAAAACTG CCTTATTTGA 37 80 

ATTTTTCCTT GAAAACCAGT ATAATGGTAG AATGCTATGT GACTAGAAAG GAAGTTGAAT 3 840 

GAAGCAATCT ATCTCAAATC TCAAGTTAGC TGAGCGTGGA GCCATTATCA GTATTTCGAC 3 900 

CTATTTGATC TTGTCTGCAG CC AAATT AG C AGCTGGTCAT CTCCtTCATT CATCCAGTTT 3 9 60 

GGTGGCCGAT CCTTTTAATA ACGTATCGGA CATCATTGGA AATGTGGCCC TCTTAATCGG 4 02 0 

CATTCGGATG GCGCGCCACC TGCAGACCGT GACCACCGTT TTGGTCATTG GAAGATTGAA 4 080 

GATTTGGCAA GCTTGATCAC TTCTATCATC ATGTTCTATG TCGGTTTCGA TGTTCTAAGA 414 0 

GATACCATTC AAAAGATTCT CAGTCGGGAA GAAACGGTCA TTGATCCTCT TGGTGCAACT 4 200 

CTAGGAATCA TTTCTGCAGC GATTATGTTT GTGGTCT AT C TCTACAATAC TCGCCTCAGT 4 2 60 

AAGAAATCCA ACTCCAATGC GCTGAAGGCA GCTGCTAACG ACAATCTTTC TGACCCTGTT 4 3 20 

ACCTCACTTG GAACCGCCAT TGCCATCCTA GCTAGTAGTT TCAATTATCC GATTGTGGAT 4 3 80 

AAACTGGTTG CTATCATCAT CACTTTCTTT ATCTTGAAGA CTGCCTATGA TATCTTCATC 44 4 G 

GAGTCTTCCT TTAGTCTTTC AGATGGCTTT GACGACCGCC TGCTCGAGGA CTACCAAAAG 4 500 

GCTATCATGG AAATTCCCAA AATCAGCAAG GTCAAATCGC AAAGAGGTCG CACCTACGGT 4 5 60 
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AGCAACATCT ACCTGGATAT TACACTAGAG ATGAATCCTG ACTTCTCTGT TTTTGAAAGC 4 62 0 

CATGAAATCG CGGATCAGGT CGAGTCTATG CTGGAGGAGC GTTTTGGCGT CTTTGATACC 4 68 0 

GATGTCCATA TCGAACCAGC ACCTATCCCT G AGG AT G AAA TTTTAGACAA TGTCTATAAA 474 0 

AAATTGCTTA TGCGTGAACA ATTGATTGAC CAAGGAAACC AACTAGAAGA ACTCTTGACT 4 800 

GATGATTTTG TCTATATTCG CCAAGATGGA GAGCAGATGG ATAAAGAGGC TTATAAGACC 4 8 60 

AAAAAAGAGT TAAATTCTGC TATCAACGAC ATTC AAATT A CTTCCATCAG TCAAAAAACC 4 920 

AAACTCATCT GCTATGAGTT AGATGGTATC ATCCATACCA GTATCTGGCC TCGCCACGAA 4 98 0 

ACCTGGCAAA ATATCTTTCA TCAAGAAACC AAAAAAGAAT AGAGAAATCC TTTCATGAGA 504 0 

CCGCATTTTT CTATTCTTTT ATACTCAATA AAAATCAAAG TGC AAATT AG GAAGCCGGTC 5100 

ACAGGCTGTA CTTGAGTCGG CAATGTGAAG CCGACATAGT TTGCACTTTG ATTTTCGAAT 5160 

AGTCTTAACT ATC AAATTC A CTGAGATACT CATAGCGTTC GTATTTTTCA AGGAGTGCTT 52 20 

CATTTTTCTC ATCCAATTCT TTTTGGAGAG TAGCCAGCTT ACCAAAGTCA GAGCCGTTAG 52 80 

CCTGCATTTC CTCTTCAATA GCAGCGATAC GTTTTTCCAA GGTTTCAATA TCACCTTCAA 53 40 

TACTTGCCCA CTCCTGCTTT TCTTGGTAGG TCATGCGTTT CTTGTCTTCT CGAACCTTGA 54 00 

CCACTTTTTC CTTTTCGGCC TTTTGCACTT GATTGGCCAT ATCTGTTTCA AAAGCTTTTT 54 60 

CATCAAGATA GTCGGTGTAA TGACCAAAGA AAGGACGAAT CTTGCCATCC TCAAAAGCGA 55 20 

GAATCTTCGT CGCTACCTTA TCCAAGAAAT AGCCGTCGTC ACTGACTGTT AAAACGGGAC 5530 

CTGCAAAACC TTGCAAGAAA TTCTCTAAGA CTCTCAAAGT TGCAATATCT AGGTCATTGG 56 4 0 

TTGGCTCGTC TAAAAGAAGA AC ATTTGG TT TTTCCAAAAG CAGTTTGAGG AGATAAAGAC 5700 

GTTTTTTCTC ACCCCCTGAC AATTTCTCAA TCAAAGTCCC ATGCCTCGAA CGTGGGAAGA 57 60 

GGAATTGCTC CAGCAACTCA GCGATGGAAG TCGTAGAACC ACCACTGGTC TTGACCTCCT 5 820 

CTGCCACTTC CTGCAGGTAA TTGATCACAC GCTTGCTTTC ATCCAAACCC TCAATTTGTT 588 0 

GAGAGAAATA GGCGATGCGA ACAGTTTCCC CAATCACAAC TTGTCCTGCT GTCGGCTCAA 594 0 

GACTTCCTGC AATCAGGTTA AGTAGGGTTG ATTTTCCAAC ACCATTGTCC CCAACAATTC 6000 

CAATACGGTC TTTAGCCTGA ACTAAGAGAT TAAAATTTTG CAAAATCCGC TTATTTTCAT 60 6 0 

AGG CAAAGG A AACATCCTGA AACTCGATGA CTTTCTTCCC AATCCGACTG GTTTCAAAGT 6120 

TCATAGTCAA GTCTGTCTCA GCACTACTGC CTGAAACTTC CTTTTTCAGA TCATGGAAAC 6180 

GATTGATACG AGCTTGTTGC TTGGTCGCAC GCGCCTGCGG TTGTCTGCGC ATCCAGGCCA 62 4 0 

ATTCTTGTTT GTAGAGTTGT TCTTTTTTGT GAAGAAGAGC CGCGTCGCGC TCATCCTGTT 6 3 00 
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CCGCCTTTAC 


GCGAACATAG 


TCCTGGTAAT 


TTCCCTGGTA 


CTCGGTCAAG 


CCTGCACGAT 


6360 


CCAACTCGAA 


AATCCGTGTT 


GACAAAGCGT 


CTAAGAAATA 


ACGATCGTGA 


GTGATAAAAA 


6420 


CGACGGTCTT 


CTTAGAATTT 


TTCAAAAAGA 


GGGTCAGCCA 


CTCAATAATC 


GCAATATCCA 


6430 


GATGGTTGGT 


CGGCTCATCC 


AAAAGCAAGA 


GGTCGTGGTT 


GCCAACTAAG 


ACTTGTGCCA 


6540 


ACTGTACCCG 


TCTTCTCAGA 


CCACCTGACA 


ATTCCCCAAC 


AGGAGTAGAT 


AAGTCTTGAA 


6600 


TGCCCAATTT 


GCTAAGAACG 


GTCTTGACCT 


GACTTTCGAT 


TTCCCAAGCT 


TGGAGAGAGT 


6660 


CCATCTCTGC 


CATG AC ACGT 


TCCAAACGCG 


CCTGCTTGTC 


CTCACTATAG 


TCGAGCATAA 


6720 


TCAATTCATA 


CTCACGAATG 


AGCTGGATTT 


CCTTGAGTTC 


ACTAGATAGA 


ACCGTATCCA 


6780 


AAACTGTCTT 


TCTATCATCA 


AAAT C A GG AT 


CCTGAGTCAA 


GTAACCAATC 


TGGTAATCAT 


6840 


TTTTAGCTGA 


AAAAGG ACTG 


ACATCCCCAT 


CAAATCCAGA 


AACACCAGAA 


AGGACGTCCA 


6900 


AAAGGGTGGT 


CTTGCCAGTC 


CC ATTG AC AC 


CGATTAAACC 


AATT CTGTCT 


AAGTCATGGA 


6960 


TAATAAAGGA 


AATATCCCTA 


AAAACGGTCT 


TGTCACCAAC 


GCATTTACTT 


AGTTTTTCAA 


7020 


CGATAAAATC 


ACTCATTTTT 


TCTCCCTCAG 


GTAAGCATGG 


ATGGCTTCAC 


GATTATTCTC 


7080 


CAATTCTCCA 


TCGACAATGG 


CAAACTCAAT 


CTCTGTTAAA 


ATCTCTCCCA 


AGTCTGGGCC 


7140 


TGGCTGATAG 


CCATATTCCT 


TGATCAAAAT 


ACCGCCATTA 


ATCTGAATCT 


CTTTCTTGTC 


7200 


ATGGATAGTC 


AAGCTTTGGT 


ATTTTTCTGT 


GATGGCTTGT 


GGGTTGACTT 


CTTTTCCTTG 


7260 


ACCTTGACGA 


AGATTTTCAG 


CCTGTAAAAG 


CAAATCTATG 


TCAAAGCGAT 


AACAATCTCG 


7320 


CTTGCTCAAT 


TCTCCATTTT 


CACGCAGAGC 


CAAAATAATC 


AGCAAATCCT 


GAACTTGCTT 


7380 


GGCAAACTGG 


CGTGAGGTCT 


TCCAAGATTT 


CAAAAATGAC 


TGCGCATTTT 


CAATCTCCAA 


7440 


AGC C CAT ACT 


AAAGCCGCCC 


AGGCTTGTTC 


AGAGGATTCA 


AAAGTAAAAT 


CAGTCTCCAA 


7500 


ATCAAACAGT 


CTGTTGACCT 


TGTCCTGCCT 


AGATGCCATA 


TCAGGGAGAT 


AGTCATAAGC 


7560 


TTGACTCTCA 


ATCATGGAAG 


CCAAGCCCCT 


TCTCCAAAAT 


GGAGCCAGCA 


AGAGTTTATC 


7620 


AAACTCGACG 


AAGC7ACCCT 


CTACAGAAAT 


TTTCTCCAAA 


AGCGGCGTCA 


AGGTCTTCAT 


7680 


AGCTTTAAAT 


GTTTCTGGCT 


C AAGTGC AAA 


ACCAAGACTA 


GCCTGAAAAC 


GGAAACCACG 


7740 


CATAATCCGT 


AAAGCATCTT 


CGTTGAAACG 


CTCACTAGCC 


ACTCCAACTG 


CTCGCAAGAC 


7800 


TTGCTTTTCC 


AAATCTTCTA 


AACCATGGAA 


CAAGTCAACG 


ATTTCTCCTG 


TCTCATCCAA 


7860 


GGCAAAGGCG 


TTGACTGTGA 


AATCACGGCG 


TTTGAGGTCT 


TCTTCTAGCG 


AT CG T AC AAA 


7920 


GGAAACCGCA 


CTGGGTCTGC 


GATAGTCCAC 


AT AG AC AT CC 


TCTGTCCGAA 


AGGTTGTTAC 


7980 


CTCATACTCC 


TCATCCCCAT 


CTAAGACCAA 


GACGGTTCCA 


TGCTCGATTC 


CGATATCGGC 


8040 


TGTTCGCGCA 


AAAATCTGCT 


TGGTCTCTTC 


TGGATAAGAA 


GACGTCGCAA 


TATCCACATC 


31C0 
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GTGGATAGGG CTATGGAGAA GGGCATCTCG AACAGAGCCC CCAACAAAAT AAGCCTCAAA 8160 

GCCTGCTTCT TTAATTTTTT CTAATACTGG TAAAGCCTTC TGAAATTCAG AAGGCATTTG 8 22 0 

CGTTAATCTC ATAATAAGTG TTCTAATCCA TAGACAAGCT CATGACGCTT GACAACTTCT 82 80 

TTAATTCCCA AATTGACTCC TGTCATGAAG GAGATGCGAT CAT AG G AGTC ATGACGGAGG 8340 

GTCAACCCTT CTCCCTGATT GCCAAAGATG ACTTCCTGAT GAGCTACCAA GCCTGGCAAA 84 00 

CGAACTGAGT GGATGCGCAT ACCATCAAAG TCAGCACCAC GAGCACCAGC AATCAGCTCT 34 60 

TCCTCATCTG CTGCACCTTG CTGAATTGAC TCTCGAACCT CTGCCATCAA CTCAGCTGTT 8 52 0 

TTAATGGCTG TTCCACTCGG AGCATCCTTT TTCTTGTCAT GATGGAGCTC AATAATCTCC 8530 

ACATTTGGGA AATATTTGGC AGCCTGCGTC GCAAATTGCA TGAGTAAGAC AGCACCCAAC 8 64 0 

GCAAAGTTAG GGGCAATCAG GCCACCCAAG TCTTGGGCAC C AG AAAATTC TTTTAGCTCT 8700 

GCAATTTCTT CACTCGTGAA ACCAGTCGTT CCAACTACTG GAGCAAAGCC ATTTTCAAGA 87 60 

GCAAAACGTG TATTTTCGTA GGCAACAGCT GGAGTAGTAA AATCTACCCA GACATCCGCT 882 0 

TCAAAACCAG CTAAATCAGC CTTATCCTTG AAAACAGGAA TACCCTGCCA TTCTGACTCA 88 8 0 

GACTCAAAAG GATCCAAAAC TGCCACCAAG TCCAAGTCTG GATCAGTCAA TACCATCTGA 89 4 0 

CAAGCAGCCT GGCCCATCTT TCCCTTAAAA CCGGCAATAA TTACTCGAAT ACTCATCTCT 9000 

ACTCCTGTCT AAG AT AC AAA GTCCGTAAGA ACACAAACTG AAAAT AGG AA TTCCAATCAA 9060 

GAAGTGTCTA CTTCTTGGAA GAACTATCTT TTTCACACAG GGTTCCAGGC GTGTTCAATT 9120 

ATCAAGATAC AAAGGACCTT AGCTGCCTCT GAAAAATAGG GAATGGCACT GACTTTCCAC 9180 

GAAAGGCAAG ACACGCATCT TTTTTCAAGA GGCAGGTAGT CCGTGTTCAA TTTCTAAGAT 9 2 40 

ACAAGGCATC TTAACTACCC TAGAAGCGCC AACTAAATCA CTGGAATATA ACCCAGAGCA 9 3 00 

ATACTTCCTG CTCCTAGGTG CGTTCCAATG ACACTACCAA ATGTAGCAAG TGAAACATCC 9 36 0 

GAACCCAAGC C AAAATC AAG CAAGTGcTGA CGCAATTCTT CAGCCTTTTC AGGAGCATTC 9 42 0 

CCATGAATGA CAATGACCCG GTATTGACCT GAAGCCGTTG TTTCCTTGAT AATTTCAATT 94 8 0 

AAGCGCTTGG TGGCCTTCTT TTCAGTACGA ACTTTTTCGT AAACTTCAAT CACACCTTGA 954 0 

TCGTT AAAAT AAAGGATTGG CTTAATGCTA AGCAAATTGC CCAAAATGGC AGCCCCATTT 9 600 

GAAAGGCGTC CACCTTTTAC CAAATGATCC AAGTCATCTA CCATGATAAA GGCTGACGTA 9660 

CGGCTGATTT GAATGGCTAG CTTATCCTGA ATGCTGGCAA AATCATCGCC CTGATCACGC 97 2 0 

CAATTAAAGA CGCTTTCAAC CATGATCCCT AGGGGAGCAC TTGT AATC AA AGTGTCTGGG 97 8 0 

AAAGCAATGG TTAAGCCCTC ATAGTCATCG ACCATATACT GGATATTTTG GTAAAAACCT 984 0 
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GAAATTCCAG AAGATAGGAA AAGCCCCAAG GCATGTGTAT AGCCTTGTTC TTTGAGCGAA 9900 
GTTAAGATCT CATCTAACTT GGCAATACTT GGTTGACTCG TCTTAGGCAA TTCAGAAGCC 9 9 60 

TG AG CCATTT TTTGGTAAAA TTCCTCAGCA GACAGATTGA TGCCTTCGAC ATATTCCTCA 10020 
CCATCAATAT TGACAGGAAT ATCCAAGACA AACAAGTCTT CTCTTTGCAA GATCTCTGCA 10080 
CTGAGATAAG CAGAGGAATC TGTGAAAACA GCTAATTTCA TATTAGAACT CCAAATTAAT 10140 
TCCTGGTAAG TCTAATCCAA TTTCAGTCAC TTCGTAAGTC AAACGATTGA GCATGTTCAA 10200 
ACATGGACGA GCCAAGC7TT CCACCTCTTC TTGGTTCAAT TCACTTGGTT C ATTG AC AAT 102 60 

ACGGCCATCG ATATGGTTTA CTTGTGAGAT TGTTCCACTA ATGACAAACT TATCAAATAC 10 3 20 

AATCATAAAG CTCAAGATGA CAATCAAGGA AGTCACTTGA TTTTCTTGGT CATGTTGGAG 10 3 30 

CAATTGGAAA TTCACATCCA CCTTGGTTTC AGGAGCTCCA TTTTCATTTT CCCATTCAAA 10440 
ATTACGCGCA TCAAAATGAT ACTGACTAAC AAATT CTTC T TCACGTTTAA GATTCATCTC 105 00 

TTTCTCCATC GGCTACAATA TTATAAGCTA TTGTACCATA ATTTTTTATT TTCATCTAGT 105 60 

TTTCTAGGA? TTAGTCAATC C C AATTTC AG CACGAACTAC ATCTGTGATG GTATCAACAT 1062 0 

AGTAGTTTAC TTCTTCTGTT GTAGGCGCTT CTGCCATAAC ACGCAAGAGG GGTTCTGTTC 10680 
CACTTGGACG AACAAGGATA CGCCCGTTCC CCGCCATTTC TTCTTCCATC TTCTCGATGA 10740 

TGGCCTTGAT AGCTGGCACT TCCATGGCCT TTTCCTT C AT GACCTTTTCC ACTCGGATAT 10800 

TAACTAATTT TTGTGGATAA ATCGTTACTT CTGCCGCCAA CTCTCATAAC CTCTTACCAG 10860 

TTTCCTTCAT GATTTTAGTC AATTGAACTG CTGATAATTG ACCATCACCT GTGGTATTGT 1092 0 

AATCCATCAA GATAACGTGA CCAGACTGTT CACCACCAAG CTTGTAGCCT GATTTTCTCA 1 0 98 0 

TTTCTTCAAC AACGTAGCGG TCACCAACTG CAGTAACTGC CTTGTTAATA CCTTCGCGAT 11040 

TCAACGCCTT GTGCAAACCA AGGTTAGACA TAACAGTTGT CACAATTGTA TTTTGAGCCA 11100 

ATTGTCCTTT TTCAGAAAGG TATTTTCCGA TGATGTACAT AATCTTCTCA CCATCAACGA 11160 

TGTCACCATT CTCATCAACA GCAATCAAGC GGTCACTGTC TCCATCAAAG GCCAAACCAA 11220 

TAGCTGACCC ACTTTCTTTG ACCACTTCTT GAAGGGCTTC TGGATGTCTT GAACCAACAT 11280 

TAAGGTTGAT GTTAAGACCG TCTCCTGTTT CCCCGATAAC CGTCAATTGG GCACCAAGGT 11340 

CTGCAAAGAT TTGACGGGCA CTGGTAGAAG CTGCTCCATT AGCTGTATCC AAGGCAACCT 114 00 

T C ATT C CATC AAGAGGAGTT CCAGTTGAAA CAAGGTATCC TTCATACTTA CGCArGCtTC 114 60 

TGGATAATCT AC C AAAATTC CTAAGCCTTC TGCACTTGG A CGAGGAACAG TGTCTTCCTC 11520 

AGCATCTAGC AAGGCTTCAA TTTCTCCTTC TTTTTCATCA TCTAGTTTGA AGCCATCACC 11580 

GCCAAAGAAC TTGATTCCGT TATCAAGGCC TGGGTTGTGG CTAGCAGAAA TCATGACACC 11640 
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GGCACTTGCT CCTTCAGTTT CAACCAAGTA AGCTACTGCT GGTGTTGCAA GG AC AC C AAG 117 00 

TTTGTATACG TGAATCCCTA CTGAAAgGAG ACCTGCCACC AAGGCCGATT CCAACATTTC 11760 

CCCTGAAATA CGTGTGTCAC CTCCTACAAA GACTTTCCGC GCTTCCGTTT CATGTTGACT 11820 

AAGAACATAG CCTCCAAAAC GTCCTAGTTT AAAGGCTAAT TCTGGTGTTA GTTCTAGGTT 118 80 

AGCTTCTCCA CCGACTCCAT CAGTCCCAAA ATATTTACCC ATTGTTATAA AAT CCTTTTC 11940 

TATTTTTAAT TCGTTTTTGA ACTAGTTGCT TTCGTTGACC AAGATGTCTC CGATGAACTG 12000 

CTTGTACTTG AATTTGATCT GCTTGAACTT GGTGCTACTG GTTTTGTAGT CACCTTCATT 1206 0 

ATTGTATCAA ACGG AGTGAT AACTGCCGGT AAGACAACAC CATTGCGCTC GATTGCCTGC 1212 0 

AAAGGTACTG AACCACTCTA ATTACCTGTT ATACGTTCGC TAGTTGGCAA AACAGCCATA 12180 

ATCTTATCAA TTCTATCCAA TGTCTCTTGG TCACTCGTAA TAGACACTTC TTTATCTGAC 122 4 0 

ACCATGACAT TTTCAATTTG TACCCGACTA TCAATTTGAC TACGGTCAAT CTCTGGTACA 12 3 00 

ATCTTTACCT TATCCTTCTG AGCCTTCTTA CCAATCTTGA CTGTAATTTT TTGCGGAGTC 12 3 60 

GCCACAGCGG TCAGCCCATT GGGTAAATCT TCAATCCTCA AAGGAACTTC AATCGTTCCA 124 20 

ACACCGGCAT CTCTTAGGTC AGCAGTAACC TTCAATTTAC CTCTACTTTC TTGCATTTCA 12480 

CrTACCTAGCG ATAGCCGATT TGCACCAGTC AAGACCACTG ATACTTCTGA AGCAAAACCG 1254 0 

CTAATAAAAT ACTTATCACT ATTATAGCGT AT GTC AAT AG GG AC ATTTGT T ACTGT ATT A 12600 

GTATACGTTT CCGTTTTTAC CTGCCTAGCA CTGGT ACTGT TTTGAAAATT CGTCGCCGTA 12 66 0 

GCATAGACAA ATAAGACACA AGCAAAAAAG AGTGAGGATA TGATATATAA ACTATTTTTT 1272 0 

TTCATGTTTC . CATCCTCCTA GCAATCGTTC TTTAAAACTA AGACCCACTT CCTCTTTTGG 12780 

AAGTAAGATT TCACGTAATT C TGTTT C AAA TTCATCAAGT GTTAGGTTGT GCTTAAACCT 12840 

TCCATTATAG GTTATCGAAA TTCCTCCCGT TTCCTCTGAT ACGACAAAAG TCAAGGCATC 12900 

TGAGACTTCT GATAAACCGA TAGCCGCCCG GTGTCTGGTC CCAAATTCCT TGGAAATCCC 1296 0 

TGTCTTTTTT GTCAAGGGCA GATAGGCAGA CGTCACAGCG ATACGTTCTT CTTTGATAAT 13020 

CACCGCACCA TCATGTAGGG GAGTGTTGGG AATAAAAATG TTAATGAGAA GTTCTGCACA 13 080 

AATCTTAGCA TCCAAGGGAA TTCCTGTCGA AATATACTCC TGCAAGGTAC GTACACGCTG 1314 0 

AATAGCAACC AAGGCCCCGA TTTTACGAGG ACTCATGTA? T C AAC AG ACT TAACAAAGGC 13200 

ACG AAT CATC TGTTCCT C AG C ACT AAT AGG GGCATTGGAA AAGAAATCTG TCGCTCTTCC 13260 

CAAACGTTCC AAACCAGTCC GAATCTCTGG AGAGAAGATA ACAACCGCCG CAATAACCCC 1332 0 

ATAAGTAATA ATTTGATTGA TTAACCAAGA AATCGTAGTC AAACC AATC A TATTTGCAAG 13 3 80 
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GATTTGAGCT 


AAAATAAACA 


CCAAAACTCC 


AC GT AC C AAA 


ATCATAATCT 


TGGTTCCTGC 


13440 


AATAGCTTTT 


GTAAAATGGT 


ATAAAATATA 


AGCAACAATC 


AAAATATCAA 


TC AG ATTG AT 


13500 


AGCTATCGTC 


CATGGACTTG 


CAAACAAACT 


GGTCCAATAT 


TGCAGATTGG 


ATAATTGTTG 


13560 


AAAATTCATC 


CCTGATATCC 


TCCCTATCAA 


AACACTTTCG 


TCCTATTATA 


CCATTTTCTG 


13620 


GCATTTTTTT 


CCCTATCCTA 


GTCCATTTTA 


CATTGAACAA 


AftftT ATC AT A 


AAATAAACTG 


13680 


ACTAAAAAAA 


ACAAAGGAGA 


AACTATGTCT 


CAACTCTATG 


ATATTACCAT 


TGTGGGTGGT 


13740 


GGTCCTGTCG 


GGCTTTTTGC 


AGCCTTTTAT 


GCCCACCTAC 


GCCAACCCAA 


GGTTCAAATC 


13800 


ATCGACTCTC 


TTCCCCAGCT 


AGGTGGACAA 


CCTGCTATTC 


TCTACCCTGA 


AAAGGAAATC 


13860 


CTAGACGTAC 


CAGGCTTCCC 


AAACCTGACT 


GGAGAAGAGT 


TGACTAACCG 


CTTGATTGAA 


13920 


CAGCTAAATG 


GATTTGATAC 


CCCTATTCAT 


CTCAATGAAA 


CGGTTCTTG A 


GATTGACAAA 


13980 


CAAGAAGAAT 


TTGCCATCAC 


AACTTCTAAA 


GGAAGTCACC 


TGACTAAAAC 


AGTTATCATC 


14040 


GCTATGGGTG 


GCGGTGCCTT 


CAAACCACGT 


CCGCTGGAAC 


TTG AAGGGGT 


TGAGGGCTAT 


14100 


GAAAATATCC 


ACTACCACGT 


TTCTAACATT 


CAGCAATACG 


CTGGTAAGAA 


AGTGACGATT 


14160 


CTTCG7GGCC 


GAGACTCGGC 


TCTGGATTGG 


GCTTTGCCTT 


TTGAAAAAAT 


CGCACCAACT 


14220 


ACCCTTGTTC 


ACCGCAGAGA 


TAATTTCCGT 


GCCTTGGAAC 


ACAGTGTTCA 


AGCCTTGCAA 


14280 


GAATC AT CTG 


TAACCATCAA 


GACACCATTC 


GCCCCTAGCC 


AACTCCTTGG 


AAATGGAAAA 


14340 


ACACTTGATA 


AACTTGAAAT 


C AC AAAAGT C 


AAATCTCATC 


AAACTGAAAC 


CATTGACCTA 


14400 


GACCACCTCT 


TTGTCAACTA 


TGGTTTCAAA 


TCTTCTGTCG 


GTAACCTTAA 


AAACTGGGGG 


14460 


CTCGACCTCA 


ACCGTCACAA 


GATTATCGTC 


AACAGCAAAC 


AGG AATCC AG 


CCAAGCAGCT 


14520 


ATCTATGCTA 


TCGGTGACTG 


CTGCTACTAT 


GACGGAAAAA 


TTGATCTGAT 


TGCGACAGGC 


1-1580 


CTCCGAGAAG 


CTCCAACTGC 


TGTCAACAAC 


GCTATCAACT 


AC ATTG AC CC 


TGAACAAAAA 


14640 


GTACAACCAA 


AACACTCTAC 


TAGTTTATAA 


AAAAGAACCA 


CGAGTCACAT 


AGGATTCGTC 


14700 


GTTTTATAAT 


TCATCCGCTA 


TCTTATTGAT 


TTTTCTGAGT 


CTGTGATTGA 


CACCACTTTT 


14760 


GGTCAGAGGG 


GTGCTGAGAC 


TATCTGCTAA 


CTGCTGGATA 


GAGTAGTCTG 


GGTGCTGAAT 


14820 


CCTCAGTTGC 


GCCACTTCCT 


GCAAATCTAC 


TGGCAAATTT 


TCTAAGCCCA 


TGATATCTTT 


14880 


GATTTTACTG 


ATATTGTTAA 


TGGTCTTCAT 


GCTGGC AG AA 


ACTGTCCGAG 


CGATATTAGC 


14940 


TGTCTCGGCA 


TTATTAGCCC 


GATTGAGGTC 


GTTACGGCTT 


TCTCGCAAAA 


TCTTAACCCG 


15000 


CTCAAAATCA 


TCACCTGCCT 


GCATGGCTCC 


TATTACTATC 


AAGAAGTCCA 


TAATGTCTTC 


15050 


TGCTCGCTGG 


AGATAGGTCA 


CAGCCCCCTT 


CTTGCGCTCA 


AGCACCTTGG 


CATCCAGTAA 


15120 


AAACTGTTGG 


AGAAGGGAGG 


CAATTCCTTG 


CGCGTGGTCC 


AGATAAACAG 


AACTGATTTC 


15130 
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CAACTCCTAC TTGCCTGACT CAGGGTCACG AATGCTCCCA TTTGCCAAGA AAGCGCCACA 15240 

GAGATAGGCA CCACCTGCTT CCTCATCCGA TAAAATCGCC TCATCAATAC CTGTTTCCAG 15 300 

GCCAAAGAAA GAGTCTGCCA AGTGCAAATC ACTTAACAAA TCCTGCACCT TTTCATCTGT 153 60 

AAAAACGGTA TAGACGCGAT TCTTGCGAAG ATTGCTCCGT TGGTGGTGAC GAATTTCAGA 154 2 0 

TTTGATTTCA TAGAGATGGA GAAAGGACTC ATAGAGGTGA CGGGCCAGTT TGGCATTTTC 15430 

TGTCACAACT GACAAAGTCA AGCCCGAAGT CGAGAGACCG ATGCTACCAG ACATTTTGAT 15540 

AATGGCAGAT AATTCATGCC AGCTCAGATG GTGTTGGCCC AGGATTTCTT CTTTTACTGC 15600 

TACTGTGAAA CTCATTTTTT CACCTGTATA ATGCGCATCA ACTCGTCCAC AATCAAATCT 15 6 50 

CCATCGTGGA AGGCACCGCC ATTTTCCAGA CGAAGGAAGT TAGATGAAAT CACGCGCGAA 15720 

ACTTGCTTAC AAAGACCTAC AAAATCGTGT TCCACTTGCA CTAAGTATTC ATCAAAACGG 157 80 

TTGGAATTCA TGTATTCCTG AGGCACTTTT TCAATATTCA CCAAGACAGT GTCGATAAAA 15 840 

GGGCGACCAA GGTGACGATG CAAGACTTCC ACGTCCTCGC TATCTGTAAA GTGTTCCGTC 15 900 

TCCCCACGTT GGGTCATGAT ATTGCAGACA TAGGCAATTT CTGCCTTGGT TTCCAAAAGA 15 9 60 

GCCCGCCCAA TTTCCTTAAT CACCATATTC GG C AAAAT AG AGGTAAAGAG GGAACCTGGC 16020 

CCTAGGACAA TCATGTCACT TTCAAGGATG GTCTGCACTA CTCGACGGCT GGCCAGAGGC 16080 

GTATCATCGT TTAGGGCATT GGTCACATAG ACATTGTCAA TTATGCCTCG ATCCTCTACA 16140 

ATATGACTCT CTCCAGCCAC TTCTCTCCCA TCCTGAAAGA CTGCATGAAG GGTCAAAGGA 162 00 

TGGTCACTGG AAGGATAAAT TTTCCCTGTT GTATGGAAAA ATTTGCTCAA TAACTGCATG 16250 

GCATTATAGG TTGAACCCTG CATTTCTCAC AAGCCAGCAA TGATGAGATT TCCCAATGGA 16320 

TGGCCAGCAA AGGCTCCGGC ATCCTCAGAG AACCGATACT GAAAGACCTT CTC AT AAAAC 163 30 

TTAGGCATAT CCGACATGGC CACAAGGACA TTACGAACAT CACCTGGCGG TGTCAACTGT 1644 0 

TGCATATTTT TTCGGAGTTC ACCTGAAGAA CCACCATCAT CTGCCACCGT CACGATAGCT 16500 
GCGATTTCCA C AT CTTTTT C CCGCAGACTT TTTAGAATGA CGGGACTTCC AGTCCCTCCA 
CCAATCACCG TTATCTTTGG TTTTCTCATG AACGGTTTAC CGTTTCCTTT CTGCGGTCTT 

TGTCGCGATG CCCTTCATTA ACAGACCAAT TCTTGGATAA GTCCTGCGCC AAGC GTTT AG 16 6 80 

CAAATGCCAC ACTACGGTGT TGTCCACCCG TACATCCCAT GGCAATGGTC AAAACGGACT 16740 

TACCTTCCTT TTGGTAACTT GGC AG AATCG GCTCAATCAA GGCCAATAAA TGTTGATAAA 16 800 

AGTCTTCTGA CTCAGGATGG TTCATGACAT AATCATAAAC AGGTTCATCC ACACCCGTTT 168 60 

GGTTTCTCAG TTCTGGTAAA TAATAGGGAT TTGGCAAGAA ACGGACATCA AAGACCAAGT 169 20 



16560 
16620 
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CCGCATCAAT CGGGATTCCA TACTTAAATC CGAAAGACAT GACTTCGATA CGG AAAG ACT 16980 

GGGCTTGTTC TTGGTCTGAA AACTGCTCTG CAACCGTTT7 GCGCAgCTCA CGTGGAGTGA 17 040 

GTTCAGTCGT ATCCACCACA TTTTGGCTCA TATTTTTCAA ACGTGCCAAG AGTTCACGTT 17100 

CCAACTTGAT TCCATCTAAA ATACGACCGT CTGCTGCTAG TCCGTGACTC CGTCTGGTTT 17160 

CCTTGTAACG AGCGACCAAT TCCTTATCAG CCGCATCCAA AAAGAGGATT TTGAAATCCA 17 2 20 

AACCATCTTG ATTTTCCAAC TCATCCAAAA CAGCTTGAAT CTCTGAAAAG AAAGAACGGC 172 90 

TACGCATATC CACTACCAAG GCCAACTTAG GATTGTCTTC CTTAATTTCA ACCAGCTGCA 17 3 40 

AAAACTTAGG CAACAGAGCT GGCGGCATAT TATCAATGGT GAAATAACCT AGATCCTCGA 174 00 

AGGACTGAAT GGCTACAGTT TTCCCTGCGC CACTCATCCC TGTCACAATC ACCAAGTGAA L7 4 60 

GTTGTTTCTT TGTCATCTTT TTCTCCTTA? ATCAAAAGAA GTTTGGCAAC ACCAAACTTC 175 2 0 

AACTACCTTA TCCAATCTCT GCG ATG ACT T CAATTTCGAC TTTTACATCA CGAGGAAGAC 17 580 

GAGCTACCTC CACAGCTGAA CGAGCTGGGA ATTCCTCTTT GAAGGCCGTT TGGTAAACCT 17 640 

CATTAAAAGG AACAAAGTCG TTCATATCGC TCAAGAAGCA AGTTGTTTTG ACAACATGGT 177 00. 

CAAAGTCTGT TCCTGCTTCT GCCAAAATAG CACCGATGTT TTTCAAGACT TGCTCTGTCT 17760 

GTTCTTGGAT ATTCTCTCCT ACAATTTCCC CAGTTTCACG CGATAGGGGA ACTTGACCGC 17 82 0 

TAGCAAACAA AAGGTTGCCA ACGATTTTTC CTTGAACATA GGGTCCCATA GCCTTTGGGG 17880 

CCTTATCTGT ATGAATTCTT TTTGCCATTT TCTTTTCCTC ACAATTTTTC TAAGATTGCA 1794 0 

TCCCAAGCCT CATCCATCCC TGCCTTACTG AC AG ATG AAA ACAGGATGAA ATCGTCACTC 18000 

GGCTCAAAGT TTAATTTCTT TTTGATTGCT GATTCATGCT TGTTCCATTT ACCACGAGGA 18060 

AT CTTGTCCG CCTTGGTCGC CACAATGATG ACTGGAATCT CATAATACTT GAGAAATTCG 18120 

TACATCTGCA CATCATCTGC TGACGGGTCA TGACGAAGGT CAACTAGACT GACAACCGCA 18180 

CGGAGATTTT CCCGAGTCGT TAAGTACTCC TCAATCATGC ACCCCCACTT TTCACGTTCC 182 4 0 

TTTTTAGAAA CACGAGCATA GCCATAACCA GGCACATCCA CAAAGCGCAT CTTGTCATCA 18300 

ATGTTAAAAA AGTTCAGGAG CTGGGTTTTA CCAGGTTTTC CTGATGTACG GGCGAGATTC 18 3 60 

TTACGGTTCA ACATAGTGTT GATAAAGCTG GATTTACCAA C ATTTG AAC G CCCTGCTAGG 134 2 0 

GCAATCTCTG GCAGTTCATC CTGCGGATAG TGGGACTTAT TAGCTGCACT GAGCAAGATT 184 8C 

TCAGCATTGT GTGTATTAAG TTCCATAGTC ACCTCTAGGC TGTTTCT AG G ATCGGTTTAT 18540 

CCGTTCCATC TACAGTTTCT TTAGTGATGC GAACCAATTT CACATTTTCC TGACTCGGCA 18 600 

CCTCAAACAT GACATCTAGC ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCCTG 18660 
TCTTCCGTTC GATTGCTTTA TTAGCAATCT CTTGAAGGGC TTCGTCGTCA AATTCCAACT * 18.7 2 0 
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CAACATCATC ATAAGAAACC AAGGTTTGGT ATTGTTTCAC 
TCAAGATGCG AACCAAGTCA TCAACGGTCA ATTGCTCAAG 
GTCCAATCAA CTCAGGCATA ATACCAAATT TTTGAATGTC 
TGTATGAGCT CTTTTCGTCA ATCGCCTTAT TATTTTGACC 
CCAGACGTTG TTTGACAATT TCTTCAATAC CATCAAAAGC 
TATTTTTTGT ATCCACTTGA ATCATCTCTT GTTGTGGATG 
GTACGCTAGC AACAGTTCCC TCAATAATCT TGAGAAGGCC 
AAACATCACG TGTGATAGAC ACATTCTCAC TCTTCTTGGC 
CATAGATAAT GCCACGCTCT GCACGTTCGA TGTTAAAGTC 
GGAGGATATT TTCCACATCC TCACCCACAT AACCAGCCTC 
CAATAGCAAA AGGTACATTC AAGCTCTTAG CCAAGGTCTG 
AACCAGTTGG GCCAATCATC AAAATGTTTG ACTTCTGCAA 
CGCCTGTATC GTGGAAATTG ATGCGTTTGT AGTGGTTATA 
GCTTGGCACG ATCTTGACCA ATTACATAGT GGTTCAAGAT 
TTGGCACCTC AG AC AAGT CT GCCAAGACTT CCTCAACCAA 
GAGCTAACTC CACGCATTCA TTACAAATAA AAGCATTGTT 
CTTCTTCTTG GTTTTTGCCA CAAAATGAGC AATAAACCAT 
TAGACATGAT TTCCTTCCAT TCTATACTGT CATTCTATCT 
GCATGAATAC TATTGACCAG ATTGGTAAAG GCATTTAACC 
TAACCCTTTT TACGAAAAGC TTGTGCTCCT GCCAGAAAGC 
CTGAATAGAC CAAATAAACT CCGTTCCATT AGACTTCCTT 
TAAAATCATA AGGATTCTTC TCATCTTTGG CGTAAAATTT 
GAGACAAGTC AAGTTCTTCA GGGAAATAGG TATCTCCTTC 
TGACAATCAC TTCATCAAGG TAAGGTTCAA AAGCCTGAAA 
TGTAGAGATT C T T TTCTTGA GCCTGATACC AGTCAAGAAC 
TAGCAACCCC ATCTATCTTT TCTTCCGGAT TACGCGTCAA 
GAAGCAAGCG ACGCCCCATC CCATCAAAGG TCACACGCCC 
GAGTTGTTTC TTTAAAGTGC TGCAATTCTG CTGGCAAATC 
TACCAATCAC ACCCTCTTCA TCCTGGGCCC AAATAGCTAC 



CAAGGCATTT CTTGGCTCTT 187 80 

AGCCGCAAAA ACACGCAAGC 18840 

TTCAGCGATG ATTTCTTGCA 18900 

AAATCCGA7G ACTTTTTCAC 189 SO 

ACCACCCACG ATGAAGAGGA 19020 

TTTGCGTCCA CCTTGAGGCG 19080 

TTGTTGCACC CCTTCACCAG 19140 

AATCTTGTCA ATTTCATCCA 19 2 00 

AGCAACCTGC AAGACTTTGA 192 60 

CGTCAGAGCT GTCGCATCCG 193 2 0 

GGCAAGGAAA GTTTTCCCTG 193 80 

ATCCACATCT TCTCACTCTT 19440 

AACCGCCACT GCCAAGGCAC 19 500 

ATGGAGGACT TCAATTGGTT 19 560 

TTCTTCTCGA ATGATTTCCT 19 620 

GCCAGCAATT ATTTTTTGTA 1.9 680 

CATATCATTT TTTCTATTTG 19740 

AAAATAAGGT C ATGT AAAAA 19 800 

AAAGGAGGAT AGAAAGCCCG 19860 

AGATGAAACA CAGAAAACCC 19920 

TCTCTTGCGG TATTGGATGG 19980 

GCTTGAAACT GTCTC AAAAA 2 004 0 

CACCCGAGCA TGAATGTGAG 20100 

AATTTGCTTC CCACCGATAA 20160 

AGACTGGACG TCCTGAAAAG 20220 

AATCAAGGTT TCCCGTTTTG 202 80 

CATCAAGATA GCATGATTCA 2 0340 

CCAAGGCAGA CGATTTTCCT 2 0 400 

GATTTTCTTA GTCATGCTTC 204 60 
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CATCCTTTTC ACTGATAGTA CTATTTTATC AAAAAACTCA AAAAAAGACT GGTTTGGAAT 2 0520 

AG CTT AC AAA ATAGAAAAAA TCTGTAAGAA ATTTCCTACA GATTTATCTA TGTTTCCTTA 20530 

TTTCTTACAA ACCAGGTCCT TGTCCAAGTT CGGCTGCAAG CATCCAAATT GTTTTATCTG 2064 0 

TTTCAGTTTT AGCGCCTGCA AACATACCGT TTGTCACATC GTCACCTTCT TCATCAGTGA 207 00 

CATCCAAACC TTTTTGGAAA AGTTCTGACA AGTAACGGTA GATAACAAGA ACACGTTCCA 20760 

AGCTTTCTTC AACATTACGG TATTCACCAG CTTCTTCTTC GATTTCACTA TTTTGAAGGA 20920 

ACTCTGTCAA TGTAGAGAAT GGGCTTCCAC CGAGTGTAAT CAAGCGTTCA CTCATTTCAT 20 880 

CCAATTGACC GTCAAGAGCT TCCATGTACT CATCCATTTT TCGATGCCAT ACAAGGAAAC 2 0 940 

CACGACCATG C AT AT AC C AG TGCACTTGGT GCAAAGCAAC GTGAGCTACA TACAAATCAG 21000 

CAACAGCTTG GTTCAAGACT TCCTTTGTTT TTGCCAATGC 21040 
(2) INFORMATION FOR SEQ ID NO: 56: 



<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 3 87 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D ) TO POLOGY : 1 i near 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATTCTTAATA CGATTAAAAG GCTTATTACT AAAAGAAAAT TTCAGTTACA TGAACTAAAC 60 

TTGCTCGTCA AATCCCGATT TAACGAGATC TTTGGGGAAA ATAAAATATT TGAAAGCATT 120 

GATAACTTAT TTGATATTAT AGATGGTGAT AGGGGC AAAA ATTATCC7AA ATCAGATCAG 180 

TTGTTTAGTG AGGAGTACTG TTTATTTTTA AATACAAAGA ATGTTACTAA AAACGGATTT 240 

TCATTCGATA CAAAGCAATT TATCACTAAA ACAAAGGATA AATTACTTCG AAAAGGCAAA 3 00 

CTTGAGCGTT ATGATATAGT CTTGACAACA AGAGGTACTG TTGGAAATGT AGCGTACTAC 3 60 

GATGAATTAA TAAAATATAA ACATTTACGT ATAAATTCAG GTATGGTAAT ATTACGTCCC 4 20 

AAGACACCAA ATCTAAATCA GAAATTTATT ATCCATGTTT TAAGGAATAA TAATTATAGT 4 80 

CGAGTGATAT CAGGAAGTGC TCAGCCTCAG TTACCAATTA C AAAA TT AAA AAAAATACTT 540 

CTCCCCCTCC CCCCACTAGC CCTCCAAAAT GAGTTCGCAG ACTTTGTAGT CCAGGTCGAC 600 

AAATC AC AAT TGGCAATCCA AAAATCTCTG GAAGAACTTG AAACTTTGAA GAAATCTCTG 660 

ATGCAGGAGT ATTTTGGCTG ATATTCTGCC ATTGTAATTA CGGTAATGAT TTGTTATAAT 720 

ACTTCAAAGG AGGAAATCAG ATGGTAGTAA AAACAAGAAA ACAAGGAAAT TCAATCACCA 7 80 

TTACGATTCC AAGTGAATTT AATATTCCAA GTGGTGTTAA ATACGAAGCG AAATTGTTAC 840 
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C AAGTGGTG A GATTATCTTT ACTCCTGAAG AATTGGGGCA 

ATGATGCCTT TGACTTAAAT TTAGATAAAA TATTTGACGA 

CTTTGGTGGA AAAATGACAA TCTATTTGAC AGAAAAGCAA 

AGCAATTCAA CGGTATTCTC CAAATGAGAA AATTCAAACA 

TATGATTGTG AACTTACCAG AACAATTTGT CTTTGGGAAG 

TGATAAAGCA ACGATACTAT TTGTCCAATT GATAAAGAAG 

TAAAAGAACT GCTTTCTTCG TTTTGGTCAA ATTTTTACAA 

TGTAACGGTA GAAGAAGCAG TAAAAATGTG TGTAACCATC 

TGAAAAAATG ACAAGCTACT CCAAATGGAT TTCTGAACAT 

AAAGTAACCT AGTATGCTGG ATTTG AATG A GCACAAGAAA 

AGAATTCTGT AATGCAGAAA CTGATATTGT CTCTTTTTAT 

AATTATGGAA TCAAAAGTTA CAATTATCAT GCAAGAAATG 

ACAATTACTA GCGTTGAGAG AGAGTTTAGA ACATCATCTA 

GAAGTATTCG AATAATAACC TGTTGCAACT ATTTATTACC 

TAGCTCAAAA AC AATTC CT T ATTATCAGAC GACGATTGAA 

AGAGTCTGTG ACACAACTCA CAACAGATGA TTTAAGGAGT 

TGAAAAGGAT TGTAGTAAGG CAAATTTAGA CAATATTAGG 

TGCTTGGCTT GAGCAAGAGG ATATATCATT AAAATTCCCA 

AAGACTGAGC AAAATGTGAA GGAAACTTAT ACTGATGAAC 

AACT GTG AAA ATTTG AG AG A TTTGGCAATA ATAGACCTAC 

GTAGGGGAGC TTGTACAGTT GAATCGTTCA GATATTGATT 

GTCTTTGGTA AAGGAAAGAA GCAGACACCA GTATATTTTG 

TTAAGAAATT ATCTTAACGA CAGAAAAGAT AGTCACCCTG 

GGAAAAGTCC AGAGGCTTGG AATTGCTGGT GT AG AG ATTC 

AAACTCCGCA T AC AAAAGGT TCACCCACAT AAGTTCAGAA 

ATTGATAAAG GTATGCCTAT CGAACAAGTC CAAAAACTGC 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10669 base pairs 
(3) TYPE: nucleic acid 
tC) STRANDEDNESS : double 



GCAGGTTTCT TATGTATCTG 900 
ATACGACGAT GTTTTCAAAG 9 50 

ATTGAAAAAA TAAATGCTTT 1020 

GTTAGTCCTT CTGCCTTAAA 109 0 

CCTCTTTATC CAACAATTTT 11*10 

CATGTTTTTG CTAATGCTAA 1200 

TTAAACGGCT ATCGTTTTTC 1260 

GCAGTAGAAG CTTTAACTGA 13 20 

TCTGTTAGAG AAAAGGTCAA 13 80 

ATAAATGAAC AGACAATATT 14 4 0 

TGATGAATAA GAAAGTGAGA 1500 

TTACCTCTTT TAAATAATGA 1560 

GTAGACGGAA AAAAGCAGCA 162 0 

GCCAAGCAGG TAGAGGGCTG 16 30 

AACTTGTTTA ATGCTATTAA 17 4 0 

TATTTAGCAA ATTACCAGTC 18 00 

CGTATATTGT CTTCTTTTTT 18 6 0 

TTCGACGGAT ACAGAAAATT 19 2 0 

ATTTGGAAAT TATGCGTGAT 19 3 0 

TACCATCGAC AGGTATGCGT 2 0-10 

TTGAAAACAG AG AG T GTG TT 2100 

ACGCTCGTAC GAAAATTCAT 2 160 

CTCTTTTTGT AACGCTAGTT 2 22 0 

GCTTAAGAAA GTTAGGAGAC 2 2 80 

GAACTTTAGC GACTAAGGCA 2 3 40 

TAGGTCA 2 3 87 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
ATATTAAAGC CACTTTCTGT GCGCTAGGGA AAAATGTTCC TGGGAATGAG GACTTGGTGA 60 
AGAGGATAAA ATCTGAAGGT CATGTTGTTG GAAACCATAG CTGGAGCCAT CCGATTCTCT 120 
CGCAACTCTC TCTTGATGAA GCTAAAAAGC AGATTACTGA TACTGAGGAT GTGCTAACTA 13C 
AAGTGCTGGG TTCTAGTTCT AAACTCATGC GTCCACCTTA TGGTGCTATT ACAGATGATA 2 40 

TTCGCAATAG CTTGGATTTG AG CTTT AT C A TGTGGGA7GT GGATAGTCTG GACTGGAAGA 3 00 

GTAAAAATGA ACCATCTATT TTGACAGAAA TTCAGTATCA AGTAGCTAAT GGCTCTATCG 3 60 

TTTTGATGCA TGATATTCAC AGTCCGACAG TCAATGCCTT GCCAAGGGTC ATTGAGTATT 4 20 

TG AAAAATC A AGGTTATACC TTTGTGACCA T ACC AG AG AT GCTCAATACT CGCCTAAAAG 4 80 

CTCATGAGCT GT AC TAT ACT CGTGATGAAT AAGCAAGAAA AAAT AGGTCT GTTAGATATT 54 0 

TGACAGACTT ATTTTTTACA GAATATAGTA CTACTTAAAA AATGTTTTAT GCTATAATTG 600 
ATGAATAAAA TAGAAGGAGA AGCATATGAA TACCTATCAA TTAAATAATG G ACT AG AAAT 660 
TCCAGTATTG GGATTTGGAA CTTTTAAGGC TAAGGATGGA GAAGAAGCCT ATCCTGCACT 720 
GTTAGAAGCC TTGAAGGCTG GTTATCGTCA TATTGATACG GCGGCGATTT ATCAGAATGA 780 
AGAAAGTGTT GGTCAAGCAA TCAAAGATAG CGGAGTTCCA CGTGAAGAAA TGTTCGTAAC 84 0 

TACCAAGCTT TGGAATAGTC AGCAAACCTA TGAGCAAACT CGTCAAGCTT TGG AAAAATC 900 
TATAGAAAAA CTGGGCTTGG ATTATTTGGA TTTGTATTTG ATTCATTGGC CGAACCCAAA 9 60 

ACCGCTCAGA GAAAATGACG CATGGAAAAC TCGCAATGCG GAAGTTTGGA GAGCGATGGA 1020 

AGACCTCTAT CAAGAAGGGA AAATCCGTGC TATCGGCGTT ACCAATTTTC TTCCCCATCA 10 90 

TTTGGATGCC TTCCTTGAAA CTGCAACTAT CGTTCCTGC G G7C AATCAAG TTCCCTTGCC 114 0 

GCCAGGTGTG TATCAAGATC AAGTCGTAGC TTACTGTCGT GAAAAGGGAA TTTTATTGGA 12 00 

AGCTTGGGGG CCTTTTGGAC AAGGAGAACT GTTTGATACC AAGCAAGTCC AAGAAATAGC 1260 

AGCAAATCAC GGAAAATCGG TTGCTCAGAT AGCCTTGGCC TGGAGCTTGG CAGAAGGATT 13 20 

TTTACCACTT CCAAAATCTG TCACAACCTC TCGTATTCAA GCTAATCTTG ATTGCTTTGG 1 3 80 

AATTGAACTG AGTCATGAGG AGAGAGAAAC CTTAAAAACG ATTGCTGTTC AATCGGGTCC 14 4 0 

TCCACGAGTT GATGATGTGG ATTTCTAGAA AATCATAAAA AGAATTGTAC ATTATTCTAA 1500 

TTTTTGATAT AATAGTCAGC AGGAAAGAAA GTCTTATGGC GTTCTTCAAG CGAGCTTGGG 15 60 

ATAGTGGGAG CCAAGTAGGG CAAAATAAAG GCCTGGCGCT TTCTGTAGTA TTTTCAAAAA 1620 
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CAATGAAGTA 


ATAAATTAGG 


GTGGAACCGC 


GTTTCTGACC 


CCCCTAGGTT 


AAATCAACCT 


1630 


AGGATTGTCA 


GATGTGGTTC 


TTTTGCTTAT 


TCAGTCTATT 


GTGTGAAAGA 


AAGGAGAGCC 


1740 


GTGGACAACC 


TTTATCTTGT 


AAAAG ACG AT 


AGTCAACTAG 


CTACATTTCG 


TGATTTTGTA 


1800 


GTAAGAAATA 


CTGAAAAGTT 


GAAAGATTAT 


CAATCTTTTT 


TAAAGAATGA 


ACTTGCACTC 


I860 


TGTGATTTAC 


CGCAAGCTGT 


TATTTGGTCA 


GATTTTAATG 


CTGCTACACA 


GATTATTAGG . 


1920 


GAAAGTGCTG 


TTCCAACCTA 


TACAAATAAT 


AGACGAGTGG 


TTATGACGCC 


TGATTTAGCT 


1980 


GTTTGGAAAG 


AATTGTATTT 


GTATCAGTTG 


ATGGACTACG 


AGTGTTCTGA 


GCAAACTCAA 


20-30 


G C AAT AG AAA 


GTCACTATCA 


TTCTTTAT CT 


GAAAATTTCC 


TCTTACAGAT 


TGTAGGACAT 


2100 


GAGTTAGCTC 


ATTGGTCGCA 


CATTTTTTAG 


ATGATTTTGA 


TGGTTATGAC 


TCTTATATCT 


2160 


GGTTCGAAGA 


GGGGATGGTT 


G AAT AT ATT A 


GTCGCAAGTA 


TTTCTTGACA 


GAAGAGGAAT 


2220 


TTCAAGCGGA 


AAAAATTTGT 


AATCAATCTC 


TCGTAGAACT 


TTTTCAGAAG 


AAGTATAGTT 


2280 


CGCATTCATT 


GAATGATTTT 


GCTTCTTCGA 


CTTATGATAA 


GAACTATGCA 


AGTATTTTTT 


2340 


ATGAATACTG 


GCCCAGCTTT 


TTGACAGTAG 


ATAAGTTGGT 


AG AAAATTT A 


GGT AGTGT AC 


2400 


AAGCGCTCTT 


AGATTCTTAT 


CATTTATGGG 


CAAATACAGA 


AAAAACTTTT 


C CCTTGTT AG 


2460 


ATTGGTTTGT 


T C AGCAGAAA 


TTAATTGAAA 


AAGAAATATA 


AAAACTAAAG 


GAGTAAACAA 


2520 


TGTCTAAGAA 


ATTAACATTT 


C ACTGC AT C A 


GTGGC AG AG A 


CCTCCTTACA 


GTCGGG CTGC 


2580 


TCCACGCTCA 


GCACTAGAGT 


GCCTGAGCTA 


GACGCAGTAC 


TAACTCGTCT 


TGCCTCGTAT 


2640 


GATCGACGAG 


GCAGACTCGT 


GTCGCAAGTA 


ATTATTTTTT 


ATTAACGACT 


ATTCAATCTC 


2700 


TAACAAATTA 


ACATTTCACT 


GCGTCAGTGG 


CAGAAACCTC 


CTTACAGTCG 


GACTGCCCTA 


2760 


CGCTCAGCAC 


TAGAGTGCCT 


GAG CT AG AC G 


CAGTACTAAC 


TCGTCTTGCC 


TCGTATAATC 


2320 


GACGAGGCAG 


ACTCGTGTCG 


CAAGAAATTA 


TTTTTTATTA 


AGGAGTATTC 


AATGTCTAAG 


28B0 




1 L (_ AALiAAA 1 


*T" A TTTTP A f^T 1 
1 A 1 I i I \jf\\. L 


1 lUV.AAt.Aftl 


III \j\jt\J\ 1 Ij A 


t-k_AALiAI I v.- I 


29 4 0 


ATGCTTATGC 


AGGCTTATGA 


TAATGAAAAA 


GGTGCGGGGA 


CAATGAGTCC 


7TACACTTTC 


3000 


CTTCGTGCTA 


TCGGACCTGA 


G C C AT GG AAT 


GCAGCTTATG 


TAGAGCCATC 


ACGTCGTCCT 


3060 


GCTGACGGTC 


GTT ATGGGG A 


AAACCCTAAC 


CGTCTCTACC 


AACACCACCA 


ATTCCAGGTG 


3120 


GTCATGAAGC 


CTTCTCCATC 


AAATATCCAA 


GAACTTTACC 


TTGAGTCTTT 


GGAAAAATTG 


3180 


GGAATCAATC 


CTTTGGAGCA 


CGATATTCGT 


TTTGTTGAGG 


ACAACTGGGA 


AAACCCATCA 


3240 


ACTGGTTCAG 


CTGGTCTTGG 


TTGGGAAGTT 


TGGCTTGACG 


GAATGGAAAT 


CACTCAGTTC 


3300 


ACTTATTTCC 


AACAAGTCGG 


TGGATTGGCA 


ACTGGCCCTC 


TGACTGCGCA 


AGTTACCTAT 


3360 
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GGTTTGGAGC GCTTGGCTTC TTACATTCAA GAAGTAGACT CTGTCTATGA TATCGAGTGG 34 20 

GCTGATGGTG TAAAATACGG AGAAATCTTT ATCCAGCCTG AGTATGAGCA CTCAAAATAT 34 30 

TCATTTGAAA TTTCGGACCA AGAAATCTTG CTTGAAAACT TTGATAAGTT TGAAAAAGAA 3540 

GCTGCTCCTC CATTAGAAGA AGGCTTGGTA CACCCTGCCT ATGACTATGT TCTCAAATGT 3 600 

TCACATACCT TTAATCTGCT TGACGCGCGT GGTGCCGTAT CTGTAACAGA GCGTGCAGGC 3 660 

TATATCGCTC GTATCCGTAA CTTGGCCCGT GTCGTAGCCA AAACCTTTCT CGCAGAACGC 37 20 

AAACGC C TAG GCTACCCACT TTTGGATGAA GAAACAAGAG CTAAACTCCT AGCAGAAGAC 3 7 80- 

GCAGAATAAA GAGAGTGACA AATT ACG AAA ATGGGCGAAC AGAGTGAGCC CTCAGCCAGT 3 84 0 

TGCCGCAGTG ATGAAGGTAT CCTTAGTGAA ACTAAGGATA CTAGGCAAAA TTGGAGACTT 3 900 

TTGGCTCCAA TTTTAGCAAT GAAACAACGA AGTTGGTTGC TTCCCTGCCA ATCACATAAG 3 9 60 

GCAAACTGGA AAATAAAAAG ATACTTTTCG GAGAAAAAAC A TG AC AAAAA ACT T ATT AG T 4 020 

AGAACTCCGT CTTGAAGAAT TACCAGCCTA TGTTGTTACG C C AAGTG AAA AACAACTAGG 4080 

CGAAAAAATG GCACCCTTCC TCAAGGGAAA ACCCCTGTCT TTTGAAGCCA TTCAAACTTT 4140 

CTCAACACCA CGTCGTTTGG CTGTTCGTGT AACTGGTCTT GCAGACAAAC AGTCTGATTT 4200 

AACAGAAGAT TTCAAGGGTC CAGCAAAGAA AATTGCCTTA GATAGTGATG GAAACTTCAC 4 2 60 

CAAAGCAGCT CAAGGATTTG TCCGTGGGAA AGGTTTGACT GTTGAAGATA TCGAATTCCG 4320 

TGAAATCAAG GGTGAAGAAT ATGTCTATGT CACTAACCAA GAAATTGGTC AACCACTTCA 4 3 90 

AGCCATTGTT CCAGGCATTG TGGATGTCTT GAAGTCACTG ACTTTCCCTG TCAGCATGCA 4 4 40 

CTGGGCGGGA AATAGCTTTG AATACATCCG CCCTGTTCAC ACTTTAACTC TTCTCTTGGA 4 50C 

TGAGCAAGAG TTTGACTTGG ATTTCC7TCA TATCAAGGGA AGTCCTGTGA GTCGTGGCCA 4 560 

TCGTTTTTTG GG AC AAG AAA CCAAGATTCA GTCAGCATTG AGCTATGAAG AAGACCTTCG 4 620 

TAAGCAGTTT GTAATCGCAG ATCCATGTCA ACGTGAGCAA ATGATTGTTG ACCAAATCAA 4 680 

GGAAATTCAG GCAAAACATG GTGTACGTAT CGAAATTCAT GCGGATTTGC TGAATGAAGT 4 740 

CTTGAATTTC GTTGAATACC CAACTGCCTT CATGGGAAGT TTTGATGCTA AATACCTTGA 4 8 00 

AGTTCCAGAA GAAGTCTTGG TGACTTCTAT GAAGGAACAC CAGCGTTACT TTCTTGTTCG 4 360 

TGATCAAGAT GGAAAACTCT TGCCAAACTT CATTTCTGTT CGTAACGGAA ACGCAGAGCG 4 920 

TTTGAAAAAT GTCATCAAAG GAAATGAAAA AGTCTTGGTA GCCCGCTTGG AAGACGGAGA 4980 

ATT CTTCTGG CGTGAAGACC AAAAATTCGT GATTTCAGAT CTTGTTCAAA AATT AAA CAA 504 0 

TGTCACCTTC CATGAGAAGA TTGGTTCTCT TCGTGAACAC ATGATTCCTA CGGGTCAAAT 5 ICO 

CACTGTACTT TTGGCAGAAA AAGCTAGTTT GTCAGTGGAT GAAACAGTTG ACCTTCCTCG 5150 
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TGCAGCAGCC ATTTACAAGT TTGACTTGTT GACAGGTATG GTTGGTGAAT TTGACGAACT 52 20 

CCAAGGAATT ATGGGTGAAA AATACACCCT TCTTGCTGGT GAAACTCCAG CGGTGGCAGC 5 2 80 

TGCTATTCG7 GAACACTACA TG C CT AC ATC AGCTGAAGGA GAACTTCCAG AGAGCAAGGT 5 340 

CGGCGCAGTT CTAGCCATTG C AG AC AAATT GGATACGATT TTGAGTTTCT TCTCAGT AGG 54 00 

ATTGATTCCA TCAGGTTCTA ATGACCCTTA TGCCCTTCGT CGTGCAACTC AAGGTGTGGT 54 60 

TCGTATCTTG GAT GCCTTTG GTTGGCACAT TGCTATGGAT GAGCTGATTG ATAGCCTTTA 55 2 0 

TGCATTGAAA TTTGACAGTT TGACTTATGA AAATAAAGCA GAGGTTATGG ACTTTATCAA 5580 

GGCTCGTGTT GATAAGATGA TGGGCTCTAC TCCAAAAGAT ATCAAGGAAG CAGTTCTTGC 564 0 

AGGTTCAAAC TTTGTTGTCG CAGATATGTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG 57 00 

CAAGGAAGAA GATTTTAAAC CATCTGTTGA ATCACTTTCT CGTGCCTTTA ACCTGGCCGA 5760 

GAAGGCAGAA GGGGTTGCTA CGGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC 5820 

TTTGGCACAA G C AGT AG AAA CACTCATTTT ATC AGG AC CT GCAAGTCAGC AATTGAAACA 5880 

ACTTTTTGCG CTTAGCCCAG TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA 594 0 

AGATCAGGCT GTCCGTCAAA ATC GTTTGG C AATCTTGTCA CAACTAACCA AG AAAGC AGC 6000 

TAAGTTTGCT TGTTTTAACC AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT 6060 

TATTACAAAC GAGAAGAAAT GGATCCGAAA AAAATTGCTC GTATCAATGA GCTTGCTAAA 6120 

AAGAAAAAAA CAGAAGGCTT AACACCAGAA GAAAAAGTGG AACAACCCAA ACTACGTGAG 6180 

GAGTACATCG AAGGTT ATCG CCGCGCTGTT CGTCACCACA TTGAAGGAAT CAAAATTGTG 6240 

GACGAAGAAG GAAACGATGT TACACCAGAA AAACTACGCC AAGTACAACC TCAAAAAGGA 6 300 

TTACATGGCC GTAGTCTTGA TGATCCAAAT TCATAATAAT ACTCTTCGAA AATCAAATTC 6 3 60 

AAACCACGTC AGCTTCACCT TGCCGTACTT AAGTACAGCC TGCGGCTAGC TTCCTAGTTT 642 0 

GCTCTTTGAT TTTCATTGAG TATATGTATT CTTTCTTTTA A C AAAG AT AG ATGAAACGAT 64 80 

AACAAAGAGA CTAGCAGTTT GTGTTTGCTA GTCTTTTTTC GCTAAAAAAG GAACCATAAT 654 0 

CGTTCCTAAA AACTATCATT AGTAACTTGC ACCGGCTGTA GCGTCTGCGT CACCACCGTG 6 600 
GCCTCCAGCA TCCCCTGAAT CAGAAGCGCC AGAAGTAGCA TCGGCGTCTC CATGACCTCC 
GGCAGCAGGA GCAAATGGTC CGCTACCACC CACCAAACGT TGACCAGTCT CTTTTAGGTA 

CCAGTCAAGC CATGGTTGGA AGTTAAAGAC G ATTTC AT TG ATACCAGCGT AT G ATC CATC 6780 
AGGATAGTAC ATTGCTTGGT AGTTGTGAGT GTTGATAACA CCTGCAGGAG AACCTGGAAC 
GATCGTACGG ACGTATTCTT GGTTTCCGTT GCGAAGTGTT CCGATAACCC ACTCTACGTT 



6660 
6720 



6840 
6900 
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CTTCATACGT 


GCTGGTGGAA 


GAGAACCATG 


AACAGTCGAC 


ATACGGCTAC 


CTGATTGAGG 


6960 


TGGTACACCT 


TTAGCGAACA 


TAGTGTCTGG 


ATCTTGGTGA 


GCGTTGTTGT 


AGTAGAGGAA 


7020 


TTGGTTGTTG 


TCGTCAGCGT 


ATGTCAATTC 


AAATGGCATA 


GCTTTCAAGA 


ACATATCAAT 


7080 


TTGGTTAACT 


GTTAGGATAC 


CGTGGTCCAA 


TTTGACATAG 


GTATCACCAG 


AAACAGCACC 


7140 


AGTGAATGCT 


GCAACTTTTT 


CTACCCATTC 


TGGATCGTCA 


GGGTCAACTT 


CTGTGATGGT 


7200 


TGTAGCGATT 


GGTTTTCCAC 


AATCCAAGTC 


TTCTGATTCG 


ATTGGTTTTC 


GTTTTTTCAA 


7260 


TTTCGAAACG 


ACTCCTACGT 


ATTTAACAAA 


GTTATCTAAG 


CAACTTTCAA 


GGAATTTAAC 


7320 


AGTGCCTTCG 


TTGGTGATAT 


TTCCGTTGT? 


ATCAAAAGCT 


TCCTTACCTT 


TACCAAGAAG 


7380 


GAATTCGTTA 


CCTGGAAGCG 


TGTAGGCATT 


AACACCTGGA 


GCATCAAGGA 


TTTTACGAAG 


7 4 40 


GTGAACTTGA 


GCACGTGATG 


TTCCTTGGTC 


ATAGTATGAT 


GCACCCACAA 


TCATAACAGG 


7500 


CTTGTTTTCA 


AATGGATGAA 


CTTCGTATGA 


AAGCCATTCA 


AGTACAGATT 


TGAGTGAAGC 


7560 


TGAGATAGTG 


TGGTTATGCT 


CAGGAGTAGC 


AATGATAACA 


CCATCTGCAC 


GAGTAATTTT 


7620 


GTTATATAAA 


TAACGTAATT 


GGAAACTTTC 


ATCCCATTTT 


TCATCTTGGT 


TAAACATTGG 


7680 


AACTTCGTCA 


ATTTCAAGAA 


CTTCTAATTC 


AAATTTGAGT 


TTGAAGTAGC 


GACGGATAAA 


7740 


TTCCAAGAGC 


TTACGGTTAT 


ATGATTGATC 


GTAGTTTGAT 


CCAACAAGTC 


C AAC AAATTT 


7800 


CATTCTTTTT 


GGTCTCCTAT 


CTTACAAATT 


TTCCCAGTCA 


AAGTCTT C AG 


CATCTTTGCG 


7860 


AAGTAATTCT 


TGTGCATTAC 


GTAATTTTTC 


TGTGATTTTT 


ACAAAGATAC 


GGAAGTCATC 


7920 


AAAGATGGCA 


TCCAATTTCT 


TGATAACATC 


AAGGTCAACC 


AAGTCGCCAC 


TTGGGTTAAA 


7980 


TGCTTCAAGA 


GAGTGTGAGA 


GCAAGAATTC 


ATCTGGAACA 


ACATTTGCCT 


TGATTTCAGG 


3040 


AGCATTCAAG 


ATTTGACGAA 


GTTGCAATTG 


GGCACGAGAT 


GAACCAAGCG 


TACCCTAAGA 


3100 


AGCACCTGTA 


ATCATGATTC 


GTTTGTTCAA 


AAGTGGGTAA 


ATACCATAAG 


ACAACCAAGC 


3160 


AAGAGCGCTC 


ATC AAAAC AG 


CTGGAATAGA 


GTGATCATAC 


TCAGGAGTAC 


CGATAATAAC 


3220 


GCCATCTGCC 


TCTTCGATTT 


TAGCAGCAAT 


TTCCAATATT 


TCAGCAGGTA 


CTTGCTTGTC 


3280 


AGCTGGTTTG 


TTGAAGACAG 


GAATGGCCTT 


GATTTCAACA 


AGTTCAATTT 


CAGCTTTGTC 


8340 


AGTAAAGTGT 


TTTTGCATGT 


ATTGAAGCAA 


TTGACGGTTT 


CTAGAACCTT 


TTGAATTTGT 


9400 


TCCAACAATA 


GCAATAAGTT 


TTAACATGAG 


ATTTCCTTTC 


TCTTTTTACA 


TAATACAATT 


3460 


TTAAAATTCC 


ATTGAAACAG 


TTGTCTCTAT 


AGAGTAGGAA 


TTCCTGAAGA 


ACAGCTTAGG 


3520 


TGGCCTTCTT 


TATCGATGAG 


GATGACTTCG 


ATGCCCTCCA 


AACTTTCGAC 


TTGCCAGAGG 


3580 


AT AGAAGC AG 


GTCTTTCTCC 


AAAGAGTCGA 


GTCGTCCAGA 


TTTCCCCATC 


GACTGATTTA 


3640 


TCAGAGATGA 


TTGTTAGACT 


CGCTAGTTCC 


GTTTCAACAG 


GATATCCTGT 


TTGACTGTCA 


3700 
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AAAATGTGAT GCTAATCTTG TCCATCGACG GTCAGGTGAC GTTCATAAAT GCCTGAAGTC 87 6 0 

ACGACAGATT TATTGACAAC AGCGATCGTC ATTAAATGAT TTCCCCTAGG ATTGGCTGGG 8 82 0 

TCTTGAATCC CGATTTGCCA TGGGTTATCC CCTCTTGCCT GATTTTTTCC AATGGTCAGG 8 880 

ATATTCCCTC CCAGATTGAT CAAGGCAGAA GTCACCCCCT CTTTCCTAAC AAATTGGGC A 894 0 

ACCTTATCCG CACTGTATCC TT TGGCT AAA CAACCTAGAT CGATCTTCAT TCCTTTCTGT 9000 

TTTAAAAACA CAGTAGAAGT AG AAG AATCT AACTCCATAC CATGAGGATT GATTAGAGGC 90 60 

AGC AC CG ATT CAATTTCTTG AGGCTGGGCG ACCTTGGCAT CTGAAAAACC GATACGCCAG 9120 

CTTTGAATTA AGGGACCAAT GCTGATATTG AGGTGGCTAG AGAGCGCTAG GCTATGCTCT 9180 

AACCCAAGTG AAATCAGCTC AAACAGGTCT GGATGAACCG TGACGGGGGC TATTCCTGCT 92 40 

TGATAATTGA TTTCCATCAA CTCAGATTCT TGACTATTGG CGTTGAAGCG GTATTCAAGT 9 3 00 

TCTTTGAGCA AGTCAAAGGA TTTTTGGAGA AAGATATCGG CTTGCTCATC CACTAATGAA 93 60 

AT AGTG AT AG TAGTCCCCAT TAGCCGTTCA GAATGTGAAC GAAGAGTCAA GCTACCAACT 94 2 0 

CCTTTCTCTT ATAGAAAATA AGTTGTAATA TCAAATAATC ATCTAAATTG AAGCCCTTAC 94 80 

ATTTCATTTT CATGTTATTA TAATACCATA AAGTT AG AAT TTTCACAAAC AAAATTTGGA 954 0 

AAAAGTCAAG AAATATGCTC ATAAAATTCA TCAGCCTTGA AAACAGGATA AATGGGGAAT 9 600 

TATTTTTGAT AAAAAATGCT GAAATAATAG TACCCCCCTT GTAAACGCTA ACGGTAAATG 9 6 60 

GTATACTAGT AAGGTAAATT TAGAATGAAG GCACGAAATT TTTATGAGTA AAATCGTTGT 97 2 0 

AGTCGGTGCT AACCACCCTC CTACAGCATG T ATCAAT AC C ATGTTGGATA ATTTTGGAAA 97 80 

TGAGAACGAA ATTGTTGTAT TTGACCAAAA CTCTAACATC TCTTTCCTAC GATGTGGAAT 9 9 40 

GGCTCTTTGG ATTGGTGAAC AAATTGACGG TGCTCAAGGC TTGTTCTATT CTGATAAAGA 9900 

AAAATTGGAA GCTAAAGGTG CT AAAGTTT A CATGAACTCA CCTGTTCTTT CAATCGACTA 99 60 

TGATAACAAA GTAGTTACAG CGGAAGTTGA AGGAAAAGAG CACAAAGAAT CATACCAAJ>A 10020 

ATTGATTTTC GCTACAGGCT CTACACCAAT CTTGCCACCA ATCGAAGGTG TTGAAATTGT 3 0080 

TAAAGGAAAC CGCGAATTTA AAGCAACTCT TGAAAACGTA CAATTCGTGA AATTGTACCA 10 14 0 

AAATGCTGAA GAAGTTATCA ATAAACTTTC TG AC AAG AGC CAACACCTCG ACCGTATCGC 10200 

CGTTGTTGGT CGTCGTTACA TCGGTGTTGA ACTTGCTGAA GCCTTTGAAC GTCTTGGAAA 102 60 

AG AAGTTGT C CTTGTTGATA TCGTTGATAC TCTCTTGAAC CGTTACTATG ACAAAGACTT 10 3 20 

CACACAAATG ATGGCGAAGA ACTTGGAACA TCACAACATC CGCTTGGCTC TAGGTCAAAC 10 3 80 

TGTTAAAGCA ATCGAAGGTG ACGGT AAAG T TGAACGCTTG ATTACTGACA AAGAAAGCTT 10440 
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TGACGTGGAT ATGGTTATCC TTGCAGTTGG TTTCCGTCCA AACACAGCCC TTGCAGGTGG 10500 

TAAGATCGAA CTCTTCCGCA ACGGTGCCTT CCTTGTAGAC AAGAAACAAG AAACATCTAT 105 60 

CCCAGACGTT TACCCTGTTG GTGACTGTGC GACTGTTTAT GACAATGCTC GTAAAGATAC 10620 

AAGCTATATC GCTCTTGCTT CAAATGCTGT GCGCACTGCT AACGTTGGT 1066 9 

(2) INFORMATION FOR SEQ ID NO: 53: 

{i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7542 base pairs 
O) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CGCGCTAATA GATACTTTAT GATAGAATAA AGAACAAGAT TGACAAGTAA CAGGAAACAT 6 0 

TATGCAAAAT C AAAC ACT C A TGCAATACTT TGAATGGTAT CTCCCCCACG ACGGTCAACA 12 0 

CTGGACGCCT CTGGCTGAAA ATGCTCCACA CCTAGCTCAT CTGGGGATCA GTCACGTCTG 180 

GATGCCACCA GCCTTCAAGG CAACCAACGA AAAAGATGTC GGCTATGGGG TCTATGACTT 24 0 

ATTTGACTTA GGAGAGTTCA ACCAAAAAGG GACTGTCCGC ACCAAGTATC GTTTCAAAGA 300 

AGACTATCTT CAAGCCATTC AAGCCCTTAA AGCACAGGGA ATTCAACCTA TGGCCGATGT 3 60 

AGTTCTCAAC CACAAGGCTG CTGCCGATCA CACGGAAGCC TTTCAGGTTA TCGAAGTTGA 42 0 

TCCTGTAGAC CGTACAGTTG AACTTGGAGA ACCCTTCACC ATCAATCGCT GGACTAGTTT 4 30 

TACCTTCGAT GGTCGCCAAG ATACCTATAA TGCCTTCCAC TGGCATTGCT ACCACTTCAC 54 0 

CGGTACAGAC TACGATGCCA AACGCAGTAA ATCTGGGATT TATCTGATCC AAGGGG AC AA 600 

CAAGGGCTGG GCCAACGAGG AATTGGTCGA TAACGAAAAC GGAAACTACG ACTACCTCAT 6 60 

GTATGCCGAC CT AG ACTTT A AACATCCTGA AGTCATCCAA AACATCTATG ACTGGGCTGA 7 20 

TTGGTTCATG GAAACGACTG GTGTAGCTGG TTTCCGTTTG GATGCCGTTA AGCATATTGA 7 80 

CTCTTTCTTT ATGCGCAACT TCATCCGCGA TATGAAGGAA AAATACGGTG ACGATTTCTA 8 40 

TGTTTTTGGT GAATTTTGGA ACCCAGACAA GGAAGCCAAT CTGGACTATC TCGAAAAAAC 900 

GGAAGAACAC TTTGACCTTG TCGATGTTCG TCTCCACCAG AATCTCTTTG AAGCCAGTCA 9 60 

AGCTGGCGCA AACTATGACC TTCGTGGCAT TTT C AC AG AT AGCCTGGTTG AACTCAAGCC 10 20 

TCACAAGGCT GTGACTTTTG TCGACAACCA CGATACCCAA CGAGGACAAG CCCTTGAGTC 10 80 

TACCGTTGAA GAATGGTTCA AGCCAGCAGC CTATGCCCTC ATTTTGTTAC GCCAAGACGG 1140 

CCTTCCATGT GTCTTTTACG GAGACTACTA TGGGATTTCA GGGCAGTATG CTCAAGAAGA 1200 



WO 98/18931 



PCT/US97/19588 



511 

TTTCAAAGAA ATCCTTGACC GCCTCCTAGC CATCCGAAAA GATTTGGCCT ATGGAGAACA 12 60 

AAATGACTAC TTTGACCATG CTAACTGTAT CGGTTGGGTA CGTTCAGGTG CTGAAAATCA 13 2 0 

ATCCCCAATC GCAGTCCTTA TCTCAAATGA CCAAGAAAAC AGCAAGTCAA TGTTTGTCGG 13 80 

TCAAGAATGG ACTAATCAAA CCTTTGTAGA TTTACTTGGT AACCACCAAG GTCAAGTTAC 14 40 

AATTGATGAG GAAGGTTATG GACAATTCCC TCTCTCAGCT AGATCCGTAA GTGTCTGGGC 1500 

AGTCAATACC ATCTAATACC TCATAATAAC CAAGCTAGGT CCAAGCGGAT TTGGCTTTTT 15 60 

TGTATTCACA AAAAGACCTA CCCAAATGGA TAGATCTTTA CTTGATTACA ATTTACCTGC 162 0 

TACTGCATCC AACAATTCTT GGATCTTAGG TTGGTTGCTT CCTCCTGCCA TGGCCATATC 16 80 

TGGTTT AC C A CCACCACGTC CATCGATGAT TGGTGCTAAT TCTTTGACAA GGTTTCCTGC 174 0 

ATGAAGGTCT TTTGTCTTGC TTGCTACAAG GACATTGACT TTGTCACCGA TAGCGGCAAC 1800 

TAGGACAAGA AG AT C AG AGT AGTCTTTTTC TTTCCAGTTA TCTGCAAAAG TACGAAGGGC 1860 

ACCGGCATCG GATACAGACA CTTGACTAGC AATGTAACGA TGACCGTTGA CTTCCTTAAC 19 2 0 

ATCTTTGAAG ATATCGCCTG CGGCTGC AG C TGCCGCTTTT TCTTTCAACT C AG C ATTTT C 19 80 

TTTTTGAAGT TGACGAAGTT GTTCTTGAAG TCCTTCTACC TTGTGAGGTA CTTCCTTGAC 2 040 

TTGAGGTGCT TTCAAGGTTG CTGCGATAGC TTTAAGAGCA TCCTCTTGTT CACGATAGGC 2100 

TTCAAAGGCT TCCTTACCAG TCACTGCCAA GATACGGCGA GTTCCTGAAC CGATTCCTTC 2160 

TTCTTTGACA ATTTTGAACA GACCAATCTC AGAAGTGTTG TCAACATGAG TACCACCACA 2220 

AAGTTCAATA GAGTAGTCAC CGATAGTCAC GACACGAACT TCCTTGCCGT ATTTCTCACC 2 2 80 

AAAGAGGGCC ATACCTCCCA TTTCTTTAGC AGTGTCAATA TCCGTTTCAA CTGTCTTCAC 2 340 

TTCAAGTGCT TCCCAAATTT TCTCGTTAAC TTGCTGTTCA ATCGCACGAA GTTCCTCAGC 2 400 

AGTTACTGCT TGGAAGTGGG TAAAGTCAAA GCGAAGGAAT TCAACTTCGT TAAGAGATCC 2 4 60 

TGCCTGTGTT GCGTGGTTTC CAAGGATATT GTGAAGGGCA GCGTGAAGCA AATGAGTCGC 2 5 20 

AGTG TGGTTT TTCATGACAC GGTGACGGCG ATTGCTATCA ATTGCCAAGG TATATTCTTG 2580 

GTTCAAGGCA AGCGGTGCAA GGACTTCAAC TGTATGAAGG GCTTGACCAT TTGGGGCTTT 2 640 

CTGAACATTG GTCACAGTAG CCACAACCTT ACCTGACTCA TCCAAGATTT GTCCGTAGTC 2 7 00 

AGCTACCTGT CCACCCATTT C AG C AT AAAA TGACGTTTCC GCAAAGATAA GAGAGGCAGT 27 60 

TCCTTCTGAA ACAGCTCCTA CTTCTGCATT GTCAGCAACG ATAGCTACCA ATTTAGAAGA 2 820 

CAATTGGCTA GCATTGTAGT TGAAGACACT TTCTACAGTG ATGTTTTGAA GAGTTTCATT 2980 

TTGCATACCC ATTGAGCCAC CCTTGACAGC TGACGCACGC GCGCGTTCTT GCTGTTCTTT 2 94 0 
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CATGGCTGCT TCAAAACCTT CACGGTCTAC AGTCATACCA GCTTCTTCAG CGATTTCTTC 3 000 

AGTCAATTCA ACTGGGAACC CATAAGTATC ATAGAGTTTG AAGACATCTG AACCAGCGAT 3060 

AACAGATTGA CCTTTTTCTT TCAAGTCTGC TACAATGCCT TGGGCAAAGT GTTGACCTGA 3120 

GTGAAGGGTA CGGGCAAATG ATTCTTCTTC GCTCTTAACG ATTTTCTCAA TAAAGTCACG 3 180 

TTTCTCAAGC ACTTCTGGGT AGTAGCTTTC CATGATTTTT CCAACAGTTG GAACCAATTT 32 40 

GTAAAGGAAA GGCTCGTTGA TACCCAATTT TTGACCATGC ATAGAAGCAC GACGGAGAAG 3 300 

ACGACGAAGA AC AT AA CC AC GACCTTCATT TCCTGGAAGG GCACCATCAC CGATAGCAAA 3 3 60 

TGAAAGAGAA CGAATCTGGT CTGCGATAAC CTTGAAGCTC ATGTTGTCG C CATCTTGGTC 34 2 0 

ATAAACCTTA CCAGACAATT TCTCGACTTC ACGGATAATC GGCATGAAGA GGTCCGTTTC 34 80 

AAAGTTGGTC TTAGCCCCTT GGATAACGGC CACCAAACGC TCCAAACCAG CGCCCGTATC 354 0 

AATGTTCTTA TGTGGCAATT CCTTGTATTC GCTACGAGGA ACAGCAGCGT CTGCGTTAAA 3600 

TTGTG ACAAA ACGATGTTCC AGATTTCAAT ATAACGGTCG TTTTCAATAT CTTCTGCAAG 3 6 60 

CAGGCGAAGA CCGATATTTT CTGGGTCAAA GGCTTCCCCA CGGTCAAAGA AGATTTCTGT 3 7 20 

ATCTGGTCCA GAAGGTCCCG CACCGATTTC CCAGAACTTG TCCTCAATTG GAATCAAGTG 3 7 80 

ACTTGGATCC ACTCCCACTT CAATCCAGCG GTTCTAAGAA TCTTTATCCT CTGGATAGTA 3 840 

GGTCATGTAA AGTTTTTCAG CAGGGAAATC AAACCATTCA GGCCTTCTCA AAAGCTCATA 3900 

AGCCCAAGTG AT AGCTT CGT CACGGAAGTA ATCCCCGATA GAGAAGTTCC CCAGCATTTC 3 9 60 

AAACATGGTA TGCTGACGCG CGGTCTTCCC TACGTTTTCG ATGTCGTTGG TACGGATACC 4 020 

CTTTTGGGCA TTCGTAATAC GTGGATTTTC AGGGATAATG GTCCCGTCAA AGTATTTCTT 4 080 

AAGGGTTGCT ACCCCAGAGT TGATCCACAA AAGAGTTGGG TCATTTACAG GAACCAAACT 4140 

TACTGATGGT TCTACTGAGT GACCTTTGGT CGCCCAGAAA TCAAGCCACA TTTGGCGTAC 4 2 00 

TTGTGC ACT A GATAGTTGTT TCATATTGTC TCCTTATTCA CTTGTTT AA7 GTGATTGGCT 4 2 60 

TTCCAGCATT TCCACATAGT CAATCGCGAC ACAGAGGGAA ATGACTAGGT CTGCATAAGC 4 3 20 

GTCTTCAAGA ACCGTTACGG TATAGGTAGA AGTCAGATGG AAGAGTTCCT TCTTAATTTC 4 3 80 

CGCAATCAAC TGATCGCCAT CATCCAGCAA TTTGAAATTC AAATCCCAGA TATTGCCCTC 4 4 40 

GATACGAAGA CCTAGATTAT CAAACTCATA CTTATCTCGC CAGAAGGTCA ACTTCTTACG 4500 

AATGACAAAA CTCGAGCCAT CCCGAAGCTG AATTTCAAAA CGAGGAAGCA AGGTCAAGAT 4 560 

TTCTTTACTA ATCTCACTGA CTTGTTCACC AGCCGCATCA TAGATGGTAA AGGTTTTAGG 4 620 

AATCTTAAAA AATGATCCCT CCACCTGATA GGCAATTTCT CCCCTGTCAT CCTTGATAGC 4 680 

GAAGCGTTCG CCTCCAACAC GAAACTTTTG TTTGACAAGA AATCTTTTCA TCAACACCTC 4 7 40 
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CAAAAATCAA AAGACAAGCT CATATCACGA AGGGCGAAAA ACCGCGGTAC CACCTTCATT 4 800 

CAATGAACTT CTCATTCTCT TGTTCTTATG CAATTGTATG ATTGAGTAGC ATGACTTCCT 4 860 

AGCTTAGATG GCTCGCAGCA CCGCCATTTC TCTGGACTAA GACAAGTGAA AATCAATTCT 4 92 0 

CAACTTTCTT ATTATAACGT TTTTTTAAGC TTGCGTCAAC TGGAAATGAT CTCCGTTGAA 4 9S0 

TTAGACCAAT TCCCTACATC TCTGATTACT TTTTCAGCAT ATATTTTTTC TTACTGCCAT 504 0 

TTTTCTTTTT ATCCCAAATT TTCATATTAC TAAACACAGC TACTAGAATA TTTCCAAATA 5100 

TAAAGGTGCC TATCACCCAA T AT AT GG ACT CACTTGTTAG GTATTGTCGA TCCAAGCCAT 5160 

CCTTTAAATG GAATAGTATA GCAGTTTCGT TAACAATCAT AAAGGTTGGC CACAAACTTT 5 220 

TTTTGAAAAA AGTAGACATT TTCATTATTT CTTGCCGCTT TCTGTAACGT TAATACTCAA 5 280 

TAAAAATCAA AAAGCAAACT AGGAAGCTAG CCTCAAGCTG TACTTGAGTA CGGCAAGGCA 5 340 

ACGCTGACGT GGTTTG AAG A GTATAGGCTT AGTATACTAC TAGGCAAGCA AATAAACAAA 5 400 

TAAACAACTA GAATAGAAAA AGATAGGGCT CTAAAAACTG ACTTCTATTC CTTAAAAACG 5460 

AACCAGCTTG ACTGATTCGT CTTCTTACGT TTATCTCCTA CTTCCGATAC ATTTTAAACT 5 520 

GTAGGAAGAG GTCGCTATAT TTCCCTGTCC ATTTATGGTC AAATTTCTCA TAAACTTCTA 5 580 

GGTGTTTCAT GGTTTCAACA TCGGGATAGA AGGCCTTATC TTCCTTTGTT TCCTCTGGGA 5 64 0 

GCAATTCCTT CGCTGGTAGG TTTGGTGTTG AATAGCCGAC ATACTCCGCA TTTTCGAGAG 5700 

CATTTTCACG TTTCAACATA AAGTTGATAA AGGCATAGGC TGAGTTTTGG TTTTTAACTG 57 60 

TTTTCCCAAT GACCA T ATTG TCAAACCAAA GATTGCTGGC CTCTGTCGGT ACCACATAAC 582 0 

GTAGATTTTC ATTTTTTTCT AACATTTGGC TGGCTTCACC AGAGAACGTC ACGCCGATTG 5880 

CAACATTATT CTOAATCATA TACCCCTTCA TCTCGTCCGC AACGATAGCC TTGATATTTG 5 94 0 

GAGTCAGTTT GTAGAGCTTA TCCACTGTCT CTTC C AACTG CTGCAGATCC TTGGAGTTCA 6000 

GGCTGTAGCC GAGGGAATTG AGTCCTAGTC CCAGCACCTC ACGCGCCCCA TCAAAGAGCA 6060 

TGATAGAATT CTTATACTCC GGCTTCCAAA GGTCATCCCA ATGCTCAGGC GCTTCATCTA 612 0 

CCATGGTTTC GTTGTAGACA ATTCCTAAGG TTCCCCAGAA GTAAGGGATG GAGAATTTAT 6 180 

TACCTGGGTC AAAGGACTGG TTG AG AAACT CTGGTCCGAT ATTTTCGATT GCTTCAATTT 62 4 0 

TTGAATAATC AAGCGGAACC AAGAGGTCTT CGTCCTTCAT CTTGTTAATC ATGTATTCAC 6 300 

TTGGAATGGC AATATCGTAG GTCGTTCCAC CCTGCTTTAT CTTAGTGTAC ATGGCTTCGT 6 350 

TGGAGTCAAA AGTCTCGTAC TGAACTTGAA TTCCTGTTTC TTCTGTAAAC TGAGTCAAGA 64 2 0 

GTTCAGGATC GATATAGTCT CCCCAGTTAT AGATAACCAA TTTTTGACTA TCTCGACTAT 64 80 
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TGATTTTACT ATCTAAATGA GTCGCAATTC CCCACAAGAC AAGGATAATC GCTGCAATTC 6 540 

CTGCTAAAAA TGAATAGATT TTTTTCATGC TTGCTCCTCC TTCTCACGAG AGATAAAGTA 6600 

ATAACCTACA ACTAGGATAA TACT AAAG AG AAAGACTAGA GCAGACAGGG CATTGATTTC 66 6 0 

TAAGGAAATC CCCTTGCGAG CACGAGAGTA AATCTC G ACT GATAGGCTTG AAAAGCCATT 67 20 

TCCTGTTACA AAGAAGGTCA CGGCAAAGTC ATCTAACGAA TAGGTGAAGG CCATGAAATA 67 80 

ACCAGTAATG AT AG ACGG AG TCAGGTAAGG AAGCATGATT TCCTTGAACA TCTGAAATTG 684 0 

ACTAGCTCCC AAGTCATAGC CCGCATGAAT CATGTCGCCA TTCATTTCCT TGACTCGAGG 6900 

CAAGACCATC AAGACCACGA TAGGAATGGA GAAGGCCACG TGACTAGATA GAACGGTCAA 6 9 60 

AAAGCCAAGT CAAAACTTGA GTTGGGTAAA GAGAATCAAG AAGCTAGCAC CAATCATAAC 7 020 

GTCAGGCGCA AC CAT GAG G A TATTATTGAG TG AT AG AAAG GCTTCTTGGT ATTTCTTACC 7 090 

AGACTGCTAG ATGTAAATGG CACCAAAAGT CCCGATAATG GTCGCTATCA AGGCTGATAG 7 140 

GAAGGCCAAG AAAAATGTCT GAGCCAAAAT CAGCATGAGT CTCCCATCTC CAAACATGGT 7 200 

TTCAAAGTGA GTCCAGCTAA AAC CTGT AAA GCTATTCATA TCATCACCAG C ATT AAAG GC 7 2 60 

ATAGCCAATC AAGT AAAAG A TAGGCAGGTA GAGGACCAGA AAGACCAGTC CC AG AT AAAG 7 320 

GTTGGCAAAT TTTTTCATCG TTCTCTCCTT TCCTTAGTCA CCCACATGCT GATGAACATG 7 380 

GTCAGGATGA GAATCACACC GATGGTTGAA CCCATACCAT ACTTCTCATT GGTTAGAAAA 744 0 

TTCTGCTCAA TAGCCGTCCC CAAGGTGATA ACGCGTTCCC ACCAATCAAA CGGGTCAGCA 7500 

TGAAGAGACT CAAACTTGGG ATAAAGACCG ACTGAACCCC GG 7 $4 2 

(2) INFORMATION FOR SEQ ID NO: 59: 

{ i ) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 9223 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAACCAAAT TCCGGTATTT TAACCTATGC TGTAAATACC ATGAAGTCTG TCATGACAGA 60 

TCAGGTCTAT AACATTAAGG TTGACACAGA AAATGGAAAT TATGTTGGTG AAGCTAGCCA 120 

TGTTTTGGTC CTTTTGACAA ATTACTTCGC TG AT AAG AAA AT CTTTG AAG AAAACAAGGA 180 

CGGCTATGCC AACATTTTGA TTCTGAAAGA TGCCTCTATA TTCTCCAAAT TATCCGTCAT 24 0 

TCCTGATTTA TTAAAAGGGG ATGTTGTCGC AAATGATAAT ATCGAGTATA TCAAAGCGCG 300 

T AAT ATT AAA ATCTCTTCAG ATAGTGAATT GGAGTCAGAT . CTTGACGCAC ATAAATCAGA 3 60 
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TAACCTACCT GTAGAAATCA AAGTCCTAGC TCAGCGAGTA GAAGTATTTT CAAAACCGAA 4 20 

AGAGGATTAG TAT AT AG AG A AAGCCTTTTT TAAGGCTTTT 7GTATACTTT AAAAGATAGT 4 80 

TCCTTTAACA ACGGACATTC CTTGCAAATA GTTTTACAAA AAT AG TAT AC TGGATTCATT 54 0 

GAGTTTGAAA ACGTTTGCGT AAAATTTGAA TGAATACTTT AG GAG AC AAA TTGATGGAAT 600 

TGAGTGCTAT TTACCATAGG CCTGAGTCGG AGTATGACTA TCTTT AT AAG GATAAGAAAC 66 0 

TCCATATTCG AATTCGAACT AAGAAAGGGG ACATTGAAAC CATCAACTTG CACTATGGGG 7 20 

ACCCTTTTAT CTTTATCGAC GAGTTTTATC AGGATACAAA AGAAATGGTC AAG AT AACTT 7 80 

CTGGTACCTT ATTTGACCAT TGGCAGGTTG AAGTCTCAGT TGACTTTGCA CGTATCCAGT 840 

ATCTCTTTGA GCTCAGAGAT ACAGAAGGTC AAAATATTTT GTATGGCGAT AAAGGGTGTG 900 

TGGAAAATTC TCTAGAAAAT CTTCATGCAA TTGGGAATGG ATTTAAGTTC CCTTAGCTTC 9 60 

ATGAGATTGA TGCCTGCAAG gTTCCTGACT GGGTTTC AAA TACGGTATGG TATCAGATAT 102 0 

TTCCTGAAAG ATTTGCCAAT GGCAATGCTC TATTAAACCC AGAAGGGACT TTAGACTGGG 10 80 

ATTCATCTGT CACACCTAAG AGCGATGATT TCTTTGGTGG TGATTTACAG GGGATTATTG 114 0 

ATCATATGAA TTACTTGCAA GACTTGGGTA TTACTGGACT ATATCTTTGT CCCATCTTTG 1200 

AATCTACAAG CAATCACAAG TACAATACGA CAGATTACTT TGAAATTGAC CGTCATTTTG 12 60 

GAGACAAGGA GACCTTTCGG GAACTGGTGG ATCAAGCGCA TCATCGTGGC ATGAAAGTCA 13 2 0 

TGCTGGATGC GGTATTTAAT CATATTGGTT CGCAATCTCT TCAATGGAAA AATGTCGTCA 13 80 

AAAATGGTGA ACAGTCTCCT TATAAGGATT GGTTCCATAT TCAACAATTC CCAGTGACAA 144G 

CTGAAAAGCT AGTTAATAAG AG AG ACT T AC CCTATCATGT TTTTGGTTTC GAGGACTATA 1500 

TCCC7AAGCT AAA7ACAGCC AATCCAGACC TCAAGAATTA 7CTTTTAAAG GTTGCGACT7 15 60 

ATTGGATTGA AGAGTTTAAT ATCGATGCTT GGCGTTTGGA TGTGGCTAAT GAGATTGACC 162 0 

ATCAGTTCTG GAAGGATTTT CGTAAGGCAG TTTTAGCTAA AAATCCTGAT CTTTATATCC 1680 

TAGGAGAAGT CTGGCATACA TCTCAGCCTT GGCTAAATGG AGATGAGTTC CATGCCGTCA 174 0 

TGAATTATCC TTTATCTGAT AGTATCAAGG ACTATTTCTT ACGAGGAATT AAGAAGACAG 1800 

A C C AGTT CAT CGATGAAATC AATGGAGAGT CTATGTATTA CAAGCAGCAG ATTTCAGAGG L86 0 

TCATGTTTAA TCTCTTGGAT TCACATGATA CAGAGCGAAT CCTGTGGACG GCCAATGAAG 1920 

ATGTTCAACT GGTTAAATCA GCCTTAGCCT TTCTCTTTTT ACAAAAAGGA ACACCGTGCA 1980 

TTTATTACGG AACCGAGCTA GCCTTGACTG GAGGACCAGA TCCAGATTGT CCTCGTTGTA 204 0 

TGCCTTGGGA ACGTGTATCA AGTGACAATG ATATGCTGAA CTTTATGAAG AGGCTGATTA 2100 
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AAATTCGGAA ATACGCCTCA GTAATCATTT CGCATGGCAA CTATAGCCTT CAAGAAATCA 2160 

ACTCTGATCT AGTAGCTCTC GAATGGAAAT ACGAAGGACG GATCCTCAAA GCAATATTCA Z2 20 

ACCAATCAAC AG AAGATT AT CTTTTAGAGA AAGAAGCAGT AGCACTAGCA AGCAATTGCC 2 230 

AAGAATTGGA TAATCACCTT CTCATCTCTC CAGATGGATT TATGATTTTC TAAAAACTAG 2 340 

TTGATGAAGA TT ATGGT AC A TTTCATACCT TATATAGTAT AATAAGGCTA GTT ACT AAAC 2 4 00 

TTGT AAAGG A GAACTTAAAT GAATTGTAGA GGACATGAAA CAAGACAAAG AATTGTTAGA 2 4 60 

CATTTTGAAG TTCAGCCTAA AGCACATATT AAGCTGTT AG CAAATCAACA AAAACATAGT 2 5 20 

GATGCAGGAG CAACTATTGA AGATGAATAT TATGTATTTA TCGCTGAGAG TAAAATTGAT 25 AO 

GGCAAGAAGG AAGTTATTCA GTGTTGCATG GGTGCGGCAA GCGATTTTTT AGAACTAATT 2 64 0 

AATCACAAAG GGCTACCTCT TTTTAATCCG CTTGT AGGT G ATTCTCATGT AAATAATAGA 2 700 

CAAGAATATG ACAATACAGG CAGTCGAAAT TTATAACCTG AAAAGTGGAA TGAAACTGCA 27 60 

AAGCAGCTTT ATAATGCTAT AATGTGGTTG ATTATTTTAT GGAATGCTAA GCCGGATACA 28 20 

CCTTTATTTA ATTTTAAAGA CGAAGTAATT AAGTATAAAA CATATGAGCC TTTTGAAAGC 2 880 

AGTATAAAAA GAGTAAATAC TACTATAAAG AATGGTAGTA AAGGGAAAAC TCTGACTGAG 294 0 

ATGATTAATG GCTACAGACC GGATAACGAT ATTAGAGATG AAATTTGTAA CTTTAATATT 3000 

CTGAAAAATA AAATTCCTGA T AT G AAAAAC CAACAAGGAA ATACAATGGA ATCTTACTTT 3 060 

TAGTTATTGT TGAATTTTGG GTATTCTATA AAATATCCTA ATTGAGATTT AAAT AGT AG A 312 0 

CTATACAATA TAGTTAAAAT ATCACTAAAA ACAACACTTT ATTGAGGTAT TGGATACGCT 3 130 

TTGCTAATAG CCTAATAATC ACATGTGGAG TGTTGCTACA ACGAAAAAGG TGATAATCCT 3 24 0 

TGATTTCAAG CT ATT TT AT A ACCATTTTGT CTTTGTAGAT AAAGGCAATT TTGACAATAA 3 3 00 

AAATCCTAAA AGGTGAATCG TTATAGATGT ATTTGTAGAT ATCGTTTGCG CATCGAAAAA 3 3 60 

ATTAATACAA GAATAAATAT T T AT AG CTCT TTAGGTGACT TTTATAGAAG TAAAGTTTAG 3 4 20 

GATAGAAAAA CAAGAAATAA CGCACCATTT TTGGTGCGTT ATGCTTTTTT ATGCTATAAT 3 4 80 

GCATTTATAA AAATAAAGGA GTTTGCTATG ATTGGAAAGA ACATAAAATC CTTGCGTAAA 3 540 

ACACATGACT TAACACAACT CGAATTTGCA CGG ATTGT AG GTATTTCACG AAATAGTCTG 3 600 

AGTCGTTATG AAAATGGAAC GAGTTCAGTC TCTACCGAAT TAATAGACAT CAT TT GT C AG 3 660 

AAGTTTAATG TATCTTATGT CGATATTGTA GGAGAAGATA AAATGCTCAA TCCTGTTGAA 37 20 

GATTATGAAT TGACTTTAAA AATTGAAATT GTGAAAGAAA GAGGTGCTAA TCTATTATCT 3 7 30 

CGACTCTATC CTTATCAAGA TAGTCAGGGA ATTAGCATTG ATGATGAGTC TAATCCTTGG 3840 

ATTTT AATG A GTGATGATCT ATCTCATTTG ATTC AT AC G A ATATCTATCT AGTAGAAACT 3 900 
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TTTG ATG AAA TACAGAGATA TAGTGGCTAT TTGGATGGAA TTGAACGTAT GTT AG AG A T A 3 9 60 

TCTGAAAAAC GGATGGTGGC CTAATGGAAA TCCAAGATTA TACTGATAGT GAATTCAAAC 4 020 

ATCCTTTAGC AAGGAATCTT CGTTCACTGA CAAGAGGAAA AAAGTCCAGT AAGCAACCTA 4 080 

TAGCGATTTT GCTTGGAGGG CAAAGTGGTG CCGGTAAGAC TACAATTCAT CGTATTAAAC 4 140 

AGAAAGAATT TCAAGGAAAT ATTGTTATCA TAGATGGTGA TAGTTTTCGT TCTCAGCATC 4 2 00 

CACACTATTT AGAACTGCAG CAAGAATATG GCAAAGACAG TGTAGAATAT ACCAAAGATT 4 2 60 

TTGCAGGAAA AATGGTAGAG TCTTTAGTAA C AAAATTG AG TAGTTTGAGA TACAATCTTT 4 3 20 

T G AT AG AGGG AACTTTACGA ACAGTTGATG TTCCAAAGAA AACAGCACAA CTCTTGAAAA 4 3 80 

ATAAGGGATA TGAAGTACAA TTGGCCTTAA TTGCGACAAA CCCTGAATTG TCGTATCTAA 4 44 0 

GTACTCTTAT CCGTTATGAA GAACTGTACA TTATCAATCC AAATCAAGCA CGCGCAACTC 4 500 

CAAAAGAACA TCATGATTTC ATTGTAAATC ATCTAGTTCA TAACACACGA AAATTGGAAG 4 56 0 

AACTAGCTAT CTTTGAAAGA ATTCAAATTT ACCAACGAGA TAGAAGTTGT GT AT ATG ATT 4 62 0 

CAAAAGAAAA TACAACTTCA G C AG CAG ATG TTCTTCAAGA GTTACTCTTT GGGGACTGGA 4 68 0 

CTCAGCTAGA CAAGGAGATG TTGCAGGTCG GGG AAAAG AG ACT T AATG AA TTACTTGAAA 47 4 0 

AATAAACAAT TGATATTTTT AGGAGAATAG AAATGAGAGG GTTTAATAAC AAGATAAAGT 4 800 

CTGTTTATCA AG AACT AAC A AATTCCAAAG AGAAATTCGG TAGCTTTCAC AAGACTTTAA 4 8 60 

TTCATTTCCA TACACCTGTT TCTTATGATT ACAAGCTATT TTCTAATTGG ACTGCAACGA 4 920 

AA TAT AG AAA AATTACTGAA GATGAACTAT ATGATATATT TTTTGAAAAT AAGAAAATAA 4 9 80 

AAGTTGATAA G A C AATTTTT TTTAGTAATT TTGATAAGGT TGTTTTTTCT ACTTCAAAAG 504 0 

AATATATTAC TTTTCTTATG TTAGCACAGG CAATCATAAA AAATGGAATA GAAATAGTTG 510 0 

TAGTAACTGA TCATAATACT ACCAAAGGTA TTAAAAAGTT ACAAATGGCA GTCTCAATCA 5160 

TAATGAAAAA TTATCCGATT T ATG AT AT AC ATCCTCATAT TTTACATGGA G T AG AAATT A 52 2 0 

GTGCAGCAGA TAAATTGCAT ATTGTATGTA TAT AT G ATT A TGAACAAGAA TCATGCGTTA 52 80 

ATCAATGGTT AAGTGAAAAT ATTATAAGTG AGAAAGATGG AAGTTATCAA CATTCACTGA 53 4 0 

CTATAATGAA GGATTTCAAT AATCAAAAAA TAGTTAACTA TATTGCTCAT TTCAATAGTT 54 00 

ATGACATTTT GAAAAAAGGT TCTCACTTAT CAGGTGCATA TAAACGAAAA ATTTTTTCTA 54 60 

AAGAAAATAC ACGATTTTGG AGTTTAATAT TAACTCGAAA GAATCTTCGC AACAACTTGA 55 20 

TATTCTCTAT AAAGAAGTTC GTGTATTAAG TTTGGGACAA AAAGTTGTAG CCATCCTTGA 55 80 

TTTTTTATTA GCATATAGTG ATTATTCTAA AG ACT T CAG A CCATTGATTA TTGATCACCC 56 4 0 
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TGAAGACAAT 


CTAGACAATC 


GTTATATTTA 


CAGGCATTTA 


GTTC AG C AG T 


TTAGAGATGT 


5700 


GAAAGCTCAA 


CGTCAAATTA 


TTTTAGCAAC 


ACATAATGCT 


ACAATTGTAA 


CAAATTCTAT 


5760 


GACAGATCAA 


GTTGTTATTA 


TGGAGTCAGA 


TGGAGTTAAC 


GCATGCATTG 


AATCACAGGG 


5820 


ATATGTTAGT 


G AAAAAT AT A 


TAAAAAATCA 


TATCATCAAT 


CAATTAGAGG 


GAGGAAAAGA 


5880 


TTCCTTCAAG 


CATAAAATGT 


CTATATATGA 


GACGGCTTTA 


TCAGAGTAGA 


GTCAG AAAAA 


5940 


GTAGGTTAGA 


AATTTAGCCT 


ACTTTTTTCT 


TTGTCCGACA 


GGCATAGTGT 


ACATCTGAGG 


6000 


TCCAAGTCCT 


CTGTGGATAT 


TTGCTGCAGA 


TGAAACCAAT 


AGCGACTCCT 


AAGCCTGAAT 


6060 


ATCGTGAGGT 


AGGGGGGATA 


GGAAGGAATT 


AGCGAAATCA 


AGGTTCTACA 


AACAGAATCG 


6120 


TGACTTGAAG 


C CAT AT AT AG 


CGGATGAGGA 


ACTCT AAAAT 


CCAAATAGGT 


GTCGTAACCT 


6180 


ATATACGTAA 


ATTACGAGAG 


TAAACTAGGA 


AAGATGTACG 


GCTTATTCCG 


TGAGCGTTTA 


6240 


GGACGTAGTA 


CAACGAATCA 


TGGGAGTCAG 


CTGAACACAT 


AGTATTGAAG 


AAATTTCTGT 


6300 


AATGGAAATG 


GAGCGAAGAA 


GTGAACAATT 


AAATGAATAC 


CTCTCTAATT 


AAATTTG TC A 


6360 


ATTCTAATTC 


CTGGTATGAA 


AAGACAGTGA 


CCTGAAAATG 


TAAACGATGG 


GAGCTGATCA 


6420 


TAAATATAGG 


ACGGTACATG 


CAGTGGTGTT 


AGAGATTAGT 


CCTTACTTGA 


TTTGTGATAA 


6480 


CTTCCCCAAA 


TTTCTTCTGC 


TATACTTTTC 


TCAACTTTTA 


AAAATCCAAC 


TAAGAATTTT 


6540 


ACCTGGGGGT 


TTGGGGGCGG 


AGCACTAAGT 


TATCTTATCG 


TTAGCTGTCA 


AAACTGGTAG 


6600 


GTTTTGATAG 


GCTGGCGATA 


TGATTTTTGG 


GATATTGTGG 


ACACAATATC 


TGAGCTCGCA 


6660 


AAGCCTTACA 


AGAATGAAAA 


TCAGTTGTTG 


GAAAAGTGTA 


CTGACATTGT 


ATGGTAGCTC 


6720 


ACATTGTCAG 


TACAAGTATT 


TTCCAAACGA 


ACT AG C ACT A 


TGAAACGAGA 


TGTGCGTGAT 


6730 


ATTCGGAAAC 


AATTTCCTTT 


AACAGAAGCA 


GAAGAAAAGC 


AAATTCTAGC 


TTTGATGAGA 


6840 


GAGCCGGGAG 


AGACTAATTT 


CTCTGATTTT 


CTTCGTAAAA 


GTTTACTTTC 


CTCTGATTTA 


6900 


C AAAAAC AG A 


TGGAGACATG 


GTTTGCCCTC 


TGGCAATCCC 


AAAAACTAGA 


ACAAATCAGT 


6960 


CGTGACGTTC 


ATGAAGTTTT 


AATCTTGGCA 


CAGTCACAAC 


GTCAAGTCAC 


C C AAG AG CAT 


7020 


GTATCTATTC 


TCTTAACGTG 


CGTGCAGCAA 


TTGATTCAAG 


AGGTTGCAAA 


CACCATACCC 


7080 


CTCAGTAAAG 


AATTTCGTGA 


GAAGTACATG 


AGGTAAGCAC 


ATGGAACATC 


GTTACCGAAC 


7140 


CAATCTCAAG 


AAAGTGTTTT 


TGTCTGATAG 


TGAGTTGAAC 


CAACTAAATA 


TAAATATCGA 


7200 


TCAAAGTGGT 


TGTAAATCCT 


TTTCTGAATA 


TGCGAGACGA 


ACTCT ACTCG 


ATCCTGGTAT 


7260 


GAATTTTATC 


ACGATTGACA 


CAAACGGTTA 


CCAAGATTTA 


GTG TTTG AG T 


TAAAGAGGAT 


7320 


TGGCAATAAT 


ATCAACCAGA 


TTGCTCGAAG 


TGTTAATCAA 


TCTCAGTTAA 


TTTCTGGTGA 


7380 


AGAATTGCAG 


GAGTTGAAAA 


AAGG AATTGG 


TGAATTGATA 


AAAGAAGTTG 


ATAAGGAATT 


7440 
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TAATCTGCAA GCGCAGAAGC TAAAGGAGTT CCATGGTCAT CACTAAACAC TTTGCCATTC 7 500 

ACGGAAAGAG TTACCGCAGA AAGCTTATCA AGTACATTCT CAATCCTGAG AAAACCAATA 7 560 

ATCTTGCCTT GGTGTCGGAC TATGGCATGA AGAATTTTCT GGACTTTCCT AGCTATGAGG 7520 

AAATGGTGCA G ATGT AT CAT GAAAATTTCA TCAGCAACGA TACGCTTTAC GATTTTCGCC 7 680 

ACGACAGGAT GG AAG AAAAT CAACGAAAAA TACACGCTCA CCACATCATT CAGTCTTTCT 774 0 

CGCCAGAGGA TCATATCACT CCTGAACAAA TCAATCGGAT AGGTTATGAG ACTGTGAAGG 7 300 

AATTAACTGG TGGCAAATTT CGTTTTATCG TTGCGACCCA TG T TG AT AAA GACCACCTGC 7 8 60 

AC AAT C AC AT CATTATCAAT TCAGTAGATA GCAATTCTGA CAAAAAGCTC AAGTGGGACT 7 920 

ACAAGGTGGA GCGAAATCTT CGCATGATTT CTGACCGTTT TTCTAAAATC GCAGGTGCTA 7 980 

AAATCATTCA GAACCGCTAT TCTCACCAGC GGTATGAAGT CTATCGTAAG ACTAATCACA 8040 

AGTATGAACT CAAGCAGCGA CTCTATTTTT TGATGGAACA TTCTAGGGAC TT TG AGG ATT 8100 

TCAAAAAGAA TGCTCCGCTA CTACATGTGG AGATGGATTT CCGTCACAAG CATGCCACCT 8160 

TTTTTATTAC GG ACTCAACT ATGAAACAGG TGGTGCGTGG CAAGCAACTC AATCGCAAGC 8 2 20 

AG CCTT A C AC AGAAGAATTT TTTAAGAACT ACTTTGCCAA AAGAGAAATA G AAAGTCTC A 8 280 

TGGAATTTTT ATTGCTGAAA GTTGAGAATA TGGATGATTT ACTTCAGAAA GCAAAACTTT 834 0 

TTGGACTAAC TATCAATCCT AAACAAAAGC ATGTTTCTTT TCAATTTGCA GGAGTGGACC 84 00 

TAAAGGAGAC AGAGCTAGAC CAGAAAAATC TTTATGATGT AGAGTTTTTC C AAG ATT ATT 9 4 60 

TTAAAAATAG AAAAGATTGG CAAGCTCCAG AAACTGAGGA TTTCGTTCAA CTTTATCAAC 3 520 

AAGAAAAGTT ATCCAAAGAA AAAGAACTTC CAAGCGATGA GAAGTTCTGG GAGTCCTATC 9 580 

AACACTTCAA GAGTAACAGA GATGCCCTTC ATCAATTTGA GGTGGAGTTG TCACTCAATC 9 640 

AAATTGAAAA AGTAGTGGAT GAT GG AATTT ACGTCAAGGT CAAGTTTGGT ATTCGT C AG G 8 7 00 

AGGG AC TT AT CTTTGTCCCG AACATGCAGC TTGATATGGA AGAGGATAAG GTGAAGGTTT 37 6 0 

TCATCAGGGA AACCAGCTCC TACTATCTCT ACCACAAAGA CGCTGCCGAG AAAAATTGTT 8820 

ATATGAAAGG TCGAACCTTA ATTAGACAGT TCAGCTATGA AAATCAAACC ATTCCATTAC 88 80 

GCAGAAAAGC GACAGTCGAT ATGATTAAAG AGAAGATTGC GG AAGTGG AT GCTTTGATTG 9 940 

AACTGCAAGT AGAAAATCAA TCTTATGTCA CGATTAAAGA TGAGTTAGTG CATGAACTAG 9000 

CAGCGTCTGA ATTGAGAATC AATGAGTTGC AAGAACGAAT GTCAACCTTG AATCAAGTAG 9 060 

CAGAATATCT ACTGGCTTCA GTTGAAAGTA AGCAAGAAAT GAAATTAAAT CTTTC AAAAC 9120 

TGAATATAAC TGAGAATATC AGTGCTAATA TTGTTGAGAA AAAATTGAAG AGCCTGGGGA 9190 
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ATCAACTGGA ATTGCAAAGG GGCAGGTATG AAAAGATGGT AGT 922 3 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

TCTGCTGGCT ACC AT C AT C T GACTTGGGCA ACACCAAAGT CTTAGTTACA ACTGTATTCT 60 

TCTCAGCATT TTCAATAACT GGCAATGCCG ACTGAAGCGT ATCTTTTTCT GTTTTTGTAG 120 

CTGGTCCAGT TTCTTTTTTC TGTCCGCAAC CAACCAGGAC AAAAAGGAAA GCTAGACTAA 18 0 

CAAGAACTAT TTTTTTCATT TCTTTCTTCT TTCTTTTTGA AATTAAAATA GAATAAGACT 2 40 

GGGAAGTGCT CCCAGCCTTG ATGTTTATAG AGCTGCACGC AAACGTGCTT CTGCATTTTC 300 

TACATTACGG ACAGAGCGTG GTACGAAGGC ACGAATATCG TCTTCCTTGT AGCCAACTTG 3 60 

CAGCCCTTTT TCATCTACAA GGATTGGGCT CTTTAAAATT CTCGGTGTTT CCATAATCAG 4 20 

ATTGAGAACT TCATTGACAC TCAAATCTTC AATA7CCACT CCAAGGGCTT TGGCATAGCG 4 80 

ATTTTTAGAC GAAACGATGC TGGCTATTCC GTT AT CTGTT TTGGTTACAA TATCCAGTAA 540 

TTCTTCTCTC GTAATT CCTT CTTTACCAAG GTTTTGTTCT TTATAACTTA ACTGGTGGGC 600 

ATTGAGCCAG GTTTTTGCTT TTTTACAGCT AGT AC AACTT CAGACTCTAT AAATTTTAAT 660 

CATGTACCTA CCCCTTTCGC TACATGTTAC TATCAGTTTA GTCT ATT AT A CCATAAAAAA "7 2 0 

CATCCGACTT GCGACCTATT TTTAATTTTT TTTGACTTTT TTCGTCATTT TCGTACTTTT 7 80 

TTCTTGACAA ACAACTAAAT GACTATCAAC TCTTTTCGAG CTAGGGTCAA TAATTCACAA 84 0 

CCTGTCTCTG TAATCAGGAT ATCATCCTCG ATACGAACGC CAT ATTTG CC TTCGATATAG 900 

ATACCTGGTT CATCGGTCAA GGCCATACCT GTCTTAATAG TTTCTGTACA AGTCTGACTA 960 

AAGTAGGGTT CCTCATGGAT ATCCAGACCA ATACCGTGGC CAATGCCGTG AGTAAAGTAG 1020 

TCACCATAAC CTGCCTCAAT GATAATATCA CGAGGGATTT TGTCAAAGTC ACGGAAACCT 1080 

AAGCCTGCCT TAGCTTGGTC AATCAAGGCT TGGTTAGCTT TTAGAACCGT ATTGT AAATC 1140 

TCTCCCTGCT CATCGCTAAC ATGCCCTAGA TAGATAGTCC GGGTCATATC ACTGACATAG 12 00 

TGGTCATAGA GACAGCCGAA GTCCATGGTG ATGGCTTCTC CCAACTCCAC TGGTTTGTGC 12 60 

ATTGGATGGG CATGGGGTTT AGAAGAATTG ATACCGCTAG CTAGGATCGT ATCAAAAGAT 1320. 

AAGCCAGATG CTCCCAACTC ACGCATGCGG AAATCAAGGA AGTTGGCAAT CTC AATTT C A 13 80 
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GTTTTTCCTG GTTTGATAAA GTCAAGCGCA TCGCGGAAAG CTTGGTCTGA GATAGAACAA 144 0 

GCCTTGCGAA TCGCTGCAAT CTCTGCCTCA TCCTTAATCA TACGAACACC TTCCACAAAC 1500 

TGAGTTTGTG GAAGCAAGTT CAAACCTGCA AAAGCTGCCT GC AT A CGGTG GTAATAAGAC 1560 

ACTGAAATCT CATCTTCAAA ACCGATACGA GTCAAGCCCA TGTCCTTAAC AATTCCTGCA 152 0 

ATGACAGCCA ATTCATCACG ATCAGCCACA AT CTC AAAAC CACTGGTTTC TTGCTTAGCT 1630 

GCGATGATAT AGCGAGAGTC TGTCACTAAG ACCTGACGGT CACGACTGAT AAAGACTGTT 174 0 

CCGTTTGAGC CCCAAAAACC AGTCAAATAA TAGACGTTTT TAAGATTGTT GATGATGATA 1800 

CCATCTAGTT CTTTTTCTTG CATTTTAGCT AGAAATGCTT GTACGCGTTT ATTCATGATG 18 60 

TAACTTTCCT TTCAAATAGT GT CCTGT AT A GCTGGCTTCG TTGGCAGCTA CTTCTT CTGG 1920 

AGTTCCTGTT ACGATGATGG TTCCACCACC GACACCGCCC TCACGTCCCA AGTCAATGAT 1980 

ATGGTCTGCC GTCTTGATAA CATCCAGATT GTGCTCCATG ACGAGGACTG TATTGCCATC 2040 

GTCTACAAAG CGAGCTAAAA CCTTGAGCAG GCGAGCAATG TCCTCTGTAT CAAGCCCTGT 2100 

CGTCGGCTCA TCCAGAATCT AG AAAG ATTT TCCTGTCGAT CGTTTGTGGA GTTCGCTAGC 2160 

TAACTTCATA CCTTGGGCTT CTCCCCCAGA AAGGGTGGTA GCTGGCTGTC CCAAGGTCAC 2220 

ATAGCCTAGC CCTACATCCT TGATGGTCTG GAGTTTGCGT TGAATTTTCG GAATGTGTTG 22 80 

GAAAAATTCT ACCGCATCGT TGACCGTCAT ATCCAACACC TGCGAAATAT TCTTTTCCTT 2340 

GTAGTGAACT TCTAGGGTTT CACTGTTATA GCGGGTTCCG TGGCAAACTT CACAAGCCAC 2 4 00 

ATAAACATCT GGCAAGAAGT GCATCTCAAT CTTGATAATC CCGTCACCTG ACCAACCTTC 2 4 60 

ACAGCGACCT CCCTTGACGT TGAAACTGAA GCGCCCCTTC TTGTAGCCTC GAATCTTGGC 2 5 20 

TTCATTTGTC TGAGCAAAAA GGTCACGTAT ATCGTCAAAA ACTCCTGTAT AGGTAGCTCG 2 580 

GTTAGACCTC GGCGTCCGTC CGATAGGGCT CTGGTCAATA TCAATCAAAC GGTCGACATG 264 0 

CTCAATCCCT GTAATAGTCT TAAACTTACC AGGTTTGTCT GAATTACGGT TCAGCTTCTG 2 7 00 

GGCAATGGCT TTTTTGAGAA TGCTGTTGAT TAGAGTCGAT TTCCCTGAAC CCGACACACC 2 760 

TGTCACTGCG ATAAATTTTC CTAGTGGAAA GCGAGCCGTG ACATTTTGCA AGTTGTTCTC 2 82 0 

ACGCGCTCCT ATCACTTCAA TAAAACGACC ATTTCCCACA CGGCGCTCTT CTGGTACTGG 2 880 

GATGACACGT TTGCCTGACA AGTACTGACC TGTGATAGAC TTGCTGTTGC GAGCCACTTG 2 94 0 

CTTACGTGTA CCTGCTGCAA CAATCTCACC ACCAAAAACA CCGGCACCAG GACCAACGTC 3000 

AATCAGATAA TCAGCCTCAC GCATGGTATC TTCGTCCTCT TCCACCACGA TAAGAGTATT 3060 

GCCCAAGTCA CGCATCTTTT TCAGACTGGC AATCAGGCCA TCATTGTCCC TCTGGTGAAG 312 0 
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ACCCATTGAC GCCTCGTCTA GGATATAGAG G AC A CCTG AT AGGTTGGAAC CAATCTGGGT 3180 

TGCCAAACGA ATGCGCTGAC TTTCCCCACC TGAAAGGGTT C CTGCTG AAC GTGACAGGGT 3 240 

T AG AT AGTT A AGACCCACAT TATTAAGGAA GGTCAAACG A TCCTTGATTT CCTTG AG AAT 3 300 

GGGACGAGCA ATCATGGCTT CATTTTCAGA CAAAGTTAAC TGGCTC ACC A AGTCCAAGTG 3 3 60 

GTCAGCGATA GACAGGTCTG AGATTTCTCC AATATGTGGC CCTTGCTGGC CGCCCACACG 3 4 20 

GACAGACAAG GCCTGGTCAT TGAGACGATA G CCTTG ACAG CTTCCGCAGC TCAGCTCATT 3 4 80 

CATGTAGAGA CGCATCTGAG TGCGAGTGTA ATCGCTATTG GTTTCATGGT AACGACGTTT 3 54 0 

GATATTATTG ATAACTCCCT CAAACGGAAT GTCGATATCG CGCACGCCAC CAAATTCATT 3 600 

CTCATAGTGG AAATGGAATT CCTTACCATC TGACCCATAG AGAATCAAGT TCTTATCTTC 3 660 

TTCTGACAGG TCCTCAAAAG GCTTATCCAT AGCCACTCCA AAGACTTTCA TGGCCTGCTC 37 20 

TAACATGTTT GGATAGTAGT TGG ATG AG AT ACCATTCCAA GGTGCTAGCG CTCCCTCACG 37 80 

TAAGGTTTTG CTACCATCTG GCACTACCAA ATCAGTATCC ACCTCCAGCT TGATGCCCAA 3 840 

GCCGTCACAC TCACTACAAG AGCCAAAAGG AG C ATTG AAA GAAAAGAGAC GAGGCTCTAA 3 900 

CT CTGGG AC A GTAAAACCAC AAACTGGACA CGCATAATGC TCAGAGAACA ACAACTCCGA 3 9 60 

GTCGTCCATG GTGTCGATAA TGACATAACC TTCTGCAATA CGAAGGGCAC CCTCAATGGA 4 020 

ATCAAAGAGA CGACTACGAA TGCCCTCCTT GATAACAATA CGGTCAACCA CGACATCGAT 4080 

ATTGTGTTGC TTGCTCTTAG ACAACTCTGG CACTTCGGTC ACATCATAGA CTTCCCCATC 414 0 

CACACGGACA CG AAC AT ACC CGTCTTTCTG AACCTTCTCG ATAACACTCT TATGTTGGCC 4 200 

TTTTTTCTTG CGCATGACAG GAGCCAAGAT CTGCAAGCGC TGG CGTTC AG GTAACTCCAA 4 2 60 

AACCTTATCA ACGATTTGCT CCACAGAAGA AG C ATTG AT A GCTCCATGTC CGTTGATACA 4 3 20 

GTAAGGCCTC CCCACACGTG CGT AG AGG AG ACGCAGATAG TCATTGATTT CAGTCGTCGT 4 3 80 

TCCCACCGTC GAGCGAGGAT TTTT AC T AG T CGTTTTCTGG T CG AT GG AAA TAGCTGGGCT 4 440 

GAG AC CAT C A ATGGCATCTA CATCTGGTTT TTCCATATTT CCCAAGAACT GACGAGCGTA 4 500 

GGCGGACAAA CTCTCTACAT AGCGACGTTG TCCCTCCGCA T AG AG AGT AT CAAAAGCCAG 4 5 60 

ACTGGACTTC CCTGAACCTG ACAAGCCAGT CACGACAACC AACTTGTCTC GCGGAATCTC 4 620 

CACATCAATA TTTTTTAAAT TATGGGCACG CGCCCCATGA ATG ACAATT T T AT CTTG CAT 4 68 0 

CTTTGTTCTT TCTAGTCCAT TATTGCTTAC C ATT AT ACC A AAAAAAGTGA GATTCTATTA 474 0 

CCCAAAAGGC CGATTTTGTA GT AT AAT AGT ACAGTGTGAA AAAATCTGAA AAAT G AG AAA 4 800 

GGATAAGGGA TATGAAACAA GTTTTTCTCT CTACAACAAC TGAATTTAAA G AGATC GAT A 4 8 60 

CGCTTGAACC GGGTACTTGG ATCAATCTCG TCAATCCGAC TCAAAATGAA TCACTCGAAA 4 920 
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TCGCCAACAC CTTCGATATT GATATTCCTG ACCTTCGAGC ACCGCTCGAT GCGGAAGAAA 4 980 

TGTCTCGTAT 7ACCATTGAA GACGAGTATA CCCTGATTAT CGTAGACGTG CCGGTCACGG 5040 

AGGAAAGAAA TAACCGCACC TACTACGTAA CCATCCCGCT TGGTATTATC ATCACTGACC 5100 

AAAC C ATT AT CACTACGTGT TTGGAACCAC TACCTGTCCT TGATGTCTTT ATCAACCGTC 5160 

GATTGCGTAA TTTCTATACC TTCATGCGTT CACGTTTTAT CTTTCAAATT CTTTATCGCA 52 20 

ATCCAGACCT TTACCTAACA GCCCTTCGTT CAATCGACCG CAAGAGTGAA CAAATCGAAA 52 80 

GTCAACTGCA TCAATCAACT CGTAATGAAG AATTGATTGA GCTCATGGAA TTGGAAAAAA 53 40 

CTATCGTCTA TTTCAAGGCC TCCCTCAAAA CAAATGAGCG CGTGATTAAG AAATTGACCA 54 00 

GTTCAACCAG CAATATCAAG AAATACCTTG AGGACGAAGA CCTGCTTGAA GACACCCTGA 54 60 

TTGAAACCCA ACAGGCCATC GAG AT GGC AG ATATTTATGG AAACGTCTTG CATTCTATGA 5520 

CAGAGACCTT TGCCTCTATC ATTTCTAACA ACCAGAACAA C ATC ATG AAA ACCTTGGCCC 5580 

TTGTGACCAT CGTCATGTCC ATCCCAACCA TGGTCTTTTC TGCCTACGGG ATGAACTTTA 5640 

AGGATAATGA AATCCCCCTA AACGGAGAGC CAAATGCCTT CTGGTTAATC GTCTTTATCG 5700 

CCTTTGCTAT GAGTGTCTCG CTCACTCTCT ATCTCATCCA TAAAAAATGG TTCTAAGAGG 57 60 

AGTTCCTATG TCTCAAATTG ATCTACAAAA AT t AACTAAG AAAAACCAAG AGTTTGTCCA 5820 

CATTGCTACC CAACAATTCA TCAAAGATGG GAAAACAGAC GCTGAAATCC AGACTATTTT 5880 

TGAGGAAGTC ATTCCCCAAA TCCTTGAGGA GC AATC T AAA GGTACAACTG CCCGTTCCCT 5940 

ATACGGCGCA CCAACTCATT GGGCTCATAC CTTCACTGTC AAAGAGCAGT ACGAAAAAGA 6000 

GCATCCAAAA GAAAATGATG ACCCAAAACT GATGATTATG GACTCAGCTC TTTTCATCAC 6060 

TAGCCTCTTT GCCCTTGTCA GCGCCCTCAC AACCTTCTTT GCGGCAGACC AAGCTTTCGG 6120 

CTATGGATTG ATTACTCTTC TATTAGTTGG ACTGGTTGGT GGATTTGCCT TCT ACTTG AT 6180 

GTACTACTTT GTTTACCAAT ACTATGGACC AGATATGGAT CGCAGTCAAC GTCCACCTTT 624 0 

CTGGAAATCT GTACTAGTTA TCCTAGCTTC TATGTTCCTT TGGTTGCTTG TCTTCTTTGC 6300 

AACAAGCTTC CTACCAGCTA GCCTTAACCC AGTACTGGAT CCATTGCCAC TAGCTATTAT 6 3 60 

TGGAGCAGCC CTCCTAGCCC TTCGCTTCTA TCTCAAGAAA CGCTTGAATA TCCGTAGTGC 6420 

AAGTGCAGGA CCAACACGCT ATCAAGAATA AGAAAACGAT AAAAGCAACT GCAGGTGCGG 6480 

TTGCTTTTTC ACTTACTTTT TTGAGTTATA TTCAATGAAA ATCAAAGAGC AAACTAGGAA 6540 

CCTAGCTGCA GGTTGCTCAA AGCACACCTT TGAGGTTGCA GATAAAACTG ACGTGGTTTG 6 600 

AAGAGATTTT CGAAGAGTAT TAAAAGTATT CTTCTGAAAT CCCACATAGC TTTCTCTTAT 6 6 6C 
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ATTTTGTG AT AAAATAGGCT CAATCTATTT CTAGGAGGAT GAGATATGCT TT CT ACT ATT 

GGTATTGTTA GTTTATCTAG TGGCATTATC GGAGAGGATT TTGTCAAACA CGAAGTGGAC 
TTGGGTATCC AACGTCTCAA GGATCTGGGA CTCAATCCCA TCTTTTT 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CTGGCTAGTT 


GCATAGAGCA 


AAGTTGCTTC 


TTCATCAACA 


AAACCGTTCA 


TTTCAAAATA 


60 


GGAAAGCAGC 


TCATCAGGAC 


TCTCCAAACG 


AATCCCTTTG 


TAATCCAGCT 


CAACTGCCAC 


12 0 


CTCTTTCAAC 


GCTGCAAGAA 


GAAGTGTTCC 


CAGGCCCTGT 


CTCTGATGGT 


CAAACTCGAT 


180 


GACTAAAGAA 


TGTACTTTTA 


GACATTGCGG 


ATTGTCTGAC 


TGGGGACTTG 


ATAAAATATA 


240 


GCCTAAAAGT 


TGATTTTCAT 


CCCTAGCTAG 


AAG AAAGGT A 


TCCGCACACT 


TACGGATACT 


300 


TTCTTCTAAA 


ATATGGGAAA 


GTTGCTGCTT 


TTCAGCTGGA 


AAAGACGAGG 


TCTGAAGTGC 


360 


CCCTATCTCA 


GGCAAATCAG 


ACTTGCTTCC 


CTGAATGATC 


TTAATTGGAA 


TTTCCATGGG 


420 


AACATCCTAT 


TGAACATTGC 


TTGTC AAGTT 


AGACAAGAGA 


CGCTCAAATG 


AGTATTCATA 


480 


GGTTTGGATG 


TCTCCTGCTC 


CCATAAAGAC 


GTAAACAGCA 


TTGTC ATGGT 


CTAGGAGTGG 


540 


AGAAACATTT 


TCAACAGTAA 


TCACTTGGTG 


^TTTTTGTTG 


ATTTTGTTGG 


CTAGGTCTTC 


600 


TACCTTAACG 


TCACCATGAT 


CTACTTCACG 


AGCCGAGCCA 


TAAATTTGCG 


CTAGATAAAC 


660 


AGCATCTGCT 


TCGTTTAAAG 


CATCGGCAAA 


CTCGTCCAAC 


AAG G C AAT GG 


TTCTTGTAAA 


720 


GGTATGCGGT 


TGAAAGACTG 


CTACAATTTC 


CTTGCTTGGG 


TATTTCTGAC 


GAGCCGCATC 


780 


CAAGGTCGCA 


ATAATTTCTC 


TTGGATGGTG 


GGCAAAGTCA 


TCGATAATCA 


CTGTATCATT 


840 


GACAATTTTC 


TCAGTGAAAC 


GACGTTTAAC 


ACCCGCAAAT 


GTTTTCAACT 


GCTCACGCAC 


900 


C AAGTT C AAA 


TCAAATCCTG 


CTGTGTAAAG 


AAGACCAATA 


ACGGCTGTCG 


CATTCATGAT 


960 


ATTGTGACGA 


CCAAAGGTTG 


GAATGTGGAA 


TTGCCCCAAG 


TTTTGTCCAC 


GGAAATGAAC 


1020 


GGTGAAGGTT 


GAACCAGTTA 


TTGAACGAAG 


AAGATCACTA 


GCTACAAAGT 


CATTGCCTTC 


1080 


AGCTTCAAAA 


CCATAATAAT 


AAATTGGTGC 


ATCAGACCTA 


ATCTTACGCA 


ATTCAGCATC 


1140 


TTCACCATAG 


ACAAAAAGAC 


CCTTGGTGAT 


, T T Q7"TTGGCA 


TAGTCGTTAA 


. AGGCATTAAA 


1200 


AACATCCTCG 


AGACTTGTGA 


AATAATCTGG ATGGTCAAAG 


TCAATGTTGG 


, T GAT AAT AG A 


126C 
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GTATTCTGGG 


TCGTAAGGCA 


TGAAGTGACG 


CTCATATTCG 


TCAGATTCAA 


AGACAAAATA 


1320 


TTTGGCATTG 


GCCGAACCAC 


GACCTGTCCC 


ATCTCCAATC 


AAGAAGCTGG 


TATCTGTAAT 


1380 


GTGAGACAAG 


ACATGAGACA 


ACATACCTGT 


CGTTGAAGTT 


TTTCCATGTG 


CTCCTGCTAC 


1440 


TCCCATGCTA 


ACAAAGTCAC 


GCATAAAGCT 


ACCTAGAAAC 


TCATGGTAAC 


GTTTGTAGCT 


1500 


GATACCATTT 


TGGTCCGCAT 


AGGCAATTTC 


GACGTTGTTA 


TCTGGACGAA 


AGGCATTTCC 


1560 


AGCGATAATT 


TCCATATCAC 


CGTCTAGATT 


TTTTTCATCA 


AAAGGAAGAA 


TGGTAATTCC 


1620 


TGCCTGCTCA 


AGACCGCGTT 


GGGTAAAGTA 


GTACTTTTCA 


ACATCTGATC 


CCTGAACCTT 


1680 


GTGCCCCATC 


TGG7GCAACA 


TCAAGGCCAA 


GGCACTCATC 


CCTGATCCCT 


TAATTCCGAT 


1740 


AAAATGATAT 


GTCTTTGACA 


TGTTTTCTCC 


C CT ATTCTGT 


CATTCTGGTC 


AGATTCAACT 


1800 


CTTGGGCAAC 


CCGACGTTCT 


TGTTCTGTTT 


GTTTACTTTT 


TTTATTGTAG 


ATTTGGCTCT 


1860 


TCTTTAGAAA 


ATCATAATTG 


TTTTTCTTTG 


GAGCAGGTGC 


TGACACTTCT 


TCATTCTTGG 


1920 


TAGGGATAGA 


ATGAACTTCT 


TCCGCCAAGA 


TATAATGAGA 


CTGGGTCAAT 


TTTTGGCTAT 


1980 


ATTTGACAAA 


TTCACCAGCA 


TTTTCCTTTT 


GGAAAGGAGC 


TGTCGGTTGA 


TTGCCCTGTC 


2040 


TAACTAGACT 


GGGCTGAGAA 


TGACGTCTCG 


CAAGGCTGAA 


ATCCTCACT? 


AGGTAGTTAG 


2100 


CAGAGCGTTT 


CTTTTTCAAG 


TCCGCACGCG 


CTTCTTCACG 


CGCCACCTCC 


GCATAGCTCT 


2160 


TTCCTTCTTT 


TTTAACCCCT 


AAAGGAGCCT 


TTTTAGGTTT 


TTCGACTTGC 


TTTTCAATCG 


2220 


GTTTT A CTGG 


TTTTTCTTCA 


GCAATAGGAG 


CCCATTCTAA 


ATAATTTTTA 


TCTCGATACT 


2280 


C AC C CTTG AT 


ATTACTGATC 


AG AT C AG ACT 


CATCATAGAG 


ATTCATGACT 


GG C ATTTC AG 


2340 


TCAACATGAC 


CTCGTCATCT 


GACACCAATG 


GAAATCGTTC 


TTGTTTCATT 


TTCTATTTCC 


2400 


TTTCAACACT 


TCATTATAGC 


GTATTGTCTT 


GATTTTTCAA 


GTGCTGCCTT 


CAGAAATTCC 


2460 


CAAAATTTCT 


CTAATTTCTG 


CTAGGGTCAG 


ACTACCACGT 


GACTCTGTGC 


CGTCCAATAC 


2520 




AGATGi lTCT 


TTTGTTCTTG 


GAGTTCCTGA 


ATTTTTTCTT 


CAATGGTTCC 


2580 


CTTGGTCACC 


AAG CG AT AG A 


CCTCAACCGT 


TTCTTCCTGA 


CCCATCCGAT 


GGGCACGGCC 


2640 


AATGGCTTGC 


GCTTCCACCG 


CAGGATTCCA 


CCAAAG GTC A 


ACCAAGATCA 


CTGTATCTGC 


2700 


ACCTGTCAGG 


TTCAGACCGA 


CCCCACCAGC 


CTTGAGGGAA 


ATCAGAAAGG 


CATCTCTTTC 


2760 


TCCTTGGTTA 


AAGGCCTTGG 


TCATGTCTTG 


TCTTTCCTTG 


GCTGGGGTTG 


AACCCGTAAT 


2820 


TTTAAAGGAA 


GTCAGGCCCA 


AGT CTGGC AG 


TTCTTGTTCA 


ATTTTTTCCA 


ACATTCCCTT 


2880 


GAACTGAGAG 


AAAATCAAGA 


CACGGTGTCC 


GCCGTCTGCC 


ACCTGT AC C A 


GT AG GTCTCG 


2940 


GAGACTATCT 


AGTTTGCCGC 


TGGCTCCCTG 


ATAATCTTCC 


ATAAACAGGG 


CAGGAGTGTC 


3000 
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ACATATTTGA CGCAAGCGCA TCAAACCAGA TAAAATTTCC ACACGACTTC GCTGAAATTC 3060 

CTGTTCTGAC ACTTGAGCCA GATGGTCTCG CATCTGTTGT AACTGGGCAA GGTAAATAGC 3120 

CTTTTGCTGC TCTTCCAGTT CATTTTTATA AACCACCTCA ATCAAGTCTG GCAATTCAGT 3180 

CAGAACTTCT TCTTTCTTGC GTCGCATCAC GAAAGGCTTG ATAAACTGAG CCACTCGCTC 3 240 

TCCTGGCAAT TTCATAAATT CTTTCTTGCT TGGC AAAAGT CCAGGCATGA CGATTTGGAA 3 300 

AATAGACCAC AACTCACCCA GATCGTTTTC AATCG G AG TT CCTGACAAGG CAAAGACCGA 3 3 60 

CGGCACCACA AATTGTCTCA AGGTCTGGGC AATCTTCGTC TGGGCATTTT TCATGACCTG 3 4 20 

AGCCTCATCT AAGAAAAGGA AGTCAAAGGC CATCCCTTGA T AAAACTC AC TGTCCTGACG 3 4 80 

GAAGGTGGCA TAGCTAGTCA CATAGATTTG ATGGCTCTCG GCAAGAATCT CCTCACGACT 3 54 0 

TGCTTTCAAA CCATGAACAA CAGTCACATC CAACTCTGGA GCAAATTTCT GAAACTCATC 3 600 

TCCCCAGTTC TAAATCAAAC CCGACGGAGC GAGAATCAAA ACCCGACTTT CTTTTGTCAC 3 6 60 

TTGACTAGTC AAAAAAGCAA TGGTCTGAAG GGTTTTCCCA AGTCCCATAT CATCAGCCAA 3 7 20 

AATCCCACCA AAACCATAAT GATGGAGCAT CTGCAACCAG CCAATTCCCT TTTCCTGATA 3 7 80 

ATCTCGCAAG TCAGCCTTGA CCTGAGTTGC TTGCAAAGGA AAGTCCTCTG GATGCGTCAA 3840 

ATCCTGGGCC AGATTCTGGA ATTCTTGTGA AAAAGAAACA CGGTCTCGCC CTTCAAAGAG 3 900 

ATGAGCTAAA CTGTAGGCCA AGGATTTCCG AGCCTGCAAG GTCCCATCTT TT AATT C AAA 3 9 60 

TTGCCCCAGT TCCTGTACAT TTTGGCGAAT TTTCTTGGTT TCTTCATCGA AAAAGT AAAC 40 2 0 

TTGATTAGAC GAATCAATAT AAAAATCCTG ATTGGCAACC AAGGCCTCCA TCCCTTGCTC 4 0 80 

GATTTCCTCC TGGACAATAT TTTGAAAATC AAACTGGATT TCCAAGAGAC CTCCCTTGGA 414 0 

GGCAATCTGC ACCTGAGGAC TCCCTAGGCT AT AAAC CTCT TCTAGTTTAT CTGATAGGTC 4 200 

AACATGCCCG AGTTTTTCAA AGACTGGAAT GATATCATGA AAAAAATGAT AGACAGACTC 4 2 60 

CCCTTTTAAG GCCTGACGCC AAGATTGAAA ATCGGCCTCA AAGCCCGCAG CCAAACAGAC 4 32 0 

TTGGAAAATT CTTTCTTCTA AGTCTGCGTC ACTTGAAAAG GGTAATTCTT CTAGCTCTTG 4 3 30 

TCGGCTAGAT ACCTGTCTAT TTCCATAATC AAA CTG AATT TCTAAACGAA TCCGATTATC 44 4 0 

TTCTTCCCTG TCAAAGTAAA AAGAGGGCGC AAAAGTTTTG ATTTGTAGAC GTTCTGGAGC 4 500 

TGAAACGGTG CCCATCTGGA TAAAAAGAGT CAGACAGGAG GCCAATTTGT CTCGATCACT 4 5 60 

GCTATCAAAT TGCAGGTATT TCTTTCCTTG TTGACCCACA GGTAACGCTT TAATTTCCTT 4 620 

GAGAAGACGC AT CTGCTGGT CTGTTAAAAA ATAAACCTGA CCTTTATGGA AAAGTACTCC 4 680 

TCCCTGATAA AAGACATTGA CCCTAGGACT CTCACTGATT TCCATTTCAA AATAATCCGA 4 740 

GTATTCTGTT ACTGTAAAGG CAAATAGATT GGCATCAGCA TGCATATCCT GAAAAAGCAG 4 800 
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GGTTTGGTAG CTATCCACTT GATGGTCAAA TTGAAAATGG GGCAAGGCCA TCAGTAAATT 48 60 

CACACCCTGC TCAAAAAAGG TCAGAGGGAA AAAGAGGTGC CGACCTTGGT TTTGGAAAAA 4 920 

GAGGTCTGGA ACCAGCCCTT CCTCCGTTAG TCCGTGCAAG AAAGTCAAAA GTTCTTGGCT 4 980 

GGCATCATCA AAGGCTTCCC AAGAAAGAGA CTCCTCATAA ATCTTGCCAA TCATATACGA 504 0 

CTTTCTCTGC TCGACAATCC TTAAAAAAAG TGGAATATCC CGAATGACAT AGT ATTTTTG 5100 

GCTATTCATT TGGCCGATTC TCAGAGTCCA CAAGATATGA TTGCTTCCTG CTTCCACCTG 5160 

ACCCACAGCT GATAACTCAT AGGCGCATTC TGATTTTGGA GATAAAATTC GATCCAAAAA 5220 

CTTGCCACCC AAGGTCACCT TGGTTTC AAC AGCCTCTTTT TCTTCATGAC CTTCTTCCAG 52 80 

ACTCCACAAG ATTTCCTGAC CACGCTCATC ATTTTTCAGA AAATGCTCTA GCGCTGCCAA 5 34 0 

ATGCACACAG TAGCCCCTCT TTTGAAAAAA ATCACAGGCA CAAAAAACCA AATCATCCTC 54 00 

TAAACTATAC CCCAGTTCTT CTTCTGCAAC GCGAGCGTAG AGCCGATTGT TCTTTTCCTT 5 4 60 

GATGATATCA ACCTTACCAG TTTCATAAAG GGCAACACCT TCGATACCAA TTTTCCCCGG 552 0 

AATCAATTTA GCCATATTTT CACCTTTACC TTATCTTTTT ATTATACCAT ATTTTCGCCT 5 580 

ATGAAAATAG CCTTCTAGGA AG A CTTTT CT CCTAGAAGGC TGGATTTTTA ACGTTTGGCA 564 0 

AAAGTAGCCA CAATCCGCTG ACAGACTTCT TG C AAC AG AG ATTTGGGCAT AGCTATATTG 5700 

ATGCGGGCAT GGAGACTTCC TTCCTCTCCA AAATCCAAAC CACGGTTG AG GATAACCTTC 5760 

GCTTCATTTC TCAACAACTC TTGCAATGTT TCATCAGTCA GGTCATAAGC TGAAAAGTCA 582 0 

AGCCAAATCA AGT AGGT AC C TTGCGGTTTC ATGACCTTGA TTTTACTCTC 7TTTCCAAAT 5 880 

AGATCCATCA CATAATTGAT CTGGTCTTCA AAGACTTGCT TGAGTTCCTC TAGCCAATCT 5 94 0 

TTACCGTATC GATAGGCAGC TTCTGTCGCC AAATAACCCA AGCCTGAAAT TTCATGCTCA 6000 

TTATTGGCCA ACAGGCGTTT CTGGAAAGCC AGTCTCAACT TAGGATTTTC AATGACTGCA 6060 

TAGCAATTTT TTGTTCCAGC AATATTAAAT GTTTTAGTGG CACTGCTCAA GACGATAGCA 6120 

AAATTTTTGA AGGCAGGATT GATGGTATTG AAAGACTGGT GTTTGTGACC AAAGACGGTC 6180 

AAATCTTGGT GAATCTCATC CGAAACTAAC AAAACACCGT GTTTTTGGCA GAGTTGGCCA 62 4 0 

ATCTTCTCCA ACACTTCTTT TTCCCAAACA CGTCCACCAG GATTGTGACC GTTGCAAAGA 6 3 00 

ACATAGAGTT TAACCTCCTC TTCCACCAAA TCCTTTTCAA GTTG GTC AAA GTCAATCTCA 6 3 60 

AACAGACTAT CCTTTTCCAC TAAGGAATTA GTAATCAATC TACGATTATT CAACTTGACA 64 20 

CTGCGAGCAA AGGGTGGGTA GACAGGCGTG TTAATTAAAA CCGCCTCCCC TTCTTTTGTA 6480 

AAGGTTTGAA TAGCTGTTGA GATGGCTGGT ACCACACCCT CGATAAAGAC AAGACCCTCT 6540 
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TTGTCAAAGT TGTAACCGTA TTGTGTAGCT TCCCACTTTT GAACTTCCTT AATTAAGTCT 6600 

TCACTGGCAT AGGTATAACC ATAAACCAGT TGGTCTGCCT AAGTTTGCAC GGCTTGGCGC 6660 

ATTTCAGGCA AGACCACAAA GTCCATATCC CCTATCCAAG CTGGTAGAAC TTCACTATCC 67 2 0 

GTTTCTGTTT CTTTCCATTT ATAGGTATGG TGCCCTAAAC GGTTGGGCAG GCTTGTAAAA 6780 

TCATATTTTC CCATCTTTGT C TT ATCCTTC TATGGCTTGG CGCAAATCTG CAATCAAATC 6840 

TCTAGCATCC TCAATCCCAA TAGACAAACG CAAGAGGTCA TCTGTCAAAC CATAAGAATC 6 9 00 

GCGTACCTCT GCTGGAATAT CAGCATGAGT TTGAGTCGTT GGATAAGTAA TAACACTTTC 6960 

CACTCCACCC AAACTTTCCG CAAAAGAGAA CACCTTGAGA CTGTTCAAAA TATGAGGAAT 7020 

GCGTGTTTCA TCGGCTACTT TAAAGGAAAT CATGCCTCCA CGACCACTGT AGAGAACTTC "7 080 

CTTAACTGCT GGAGAATCCT TCAAAAAGGC AACCACTTCT TGGGCGTTAG CTCTTGACCG 714 0 

CTCCATACGA AGAGACAAGG TCTTGAGACC ACGAAGCAAC TGGTAGCTGT CAAATGGAGA 7 200 

CAAGACTGCC CCTGTTGTAT TAAGATTGTA AAAAAGCTTC TCGTATAGTT CTAAACTATT 7 2 60 

GGTCACAACC ACTCCAGCCA AGACATCATT GTGGCCTGCT AGATACTTGG TTGCTGAATG 7 320 

GAGAACGATA TCTGCTCCAT CTTCAATCCG ACGTTGGTAG ATAGGGCTAT AGAAGGTATT 7 380 

GTCCACCACC ACTTTGCCAC CCTTAGCATG AG C C AATTTT CCTAGTTTTT CGATATCAAA 74 4 0 

TTCCAACATC AAGGCATTGG TTCGGGTTTC GAT AT AG AG A ACATCCACAT CCTTTTCTAA 7 500 

CTCGGCAATC AACTCTTCTT CTGT ATTGGC AT AGGT AAAA TCGAAATGAC CTTCCTGCTC 7 5 60 

CACTTGGTTA AACCAGCGAA AAGAACCACC CTAAAGATCA CGCACTGCCA AGACCTTACT 76 2 0 

TCCTACTGGA AAGACCCTAA AGGCCAGTAC AATAGCTGAC ATCCCTGAGC TAGTCCCTAG 7 6 30 

GGCATAGTCT GCTCACTCAA TAGCCGCCAA GACTTCCTCA CCCTTACTAC GAGTTGGATT 7T40 

TTTAGTGCCC CTA7ACTCAA ACCCAGTAGA TCCACCAAAC TCTCGATOCT CATACCTCCT 7 300 

TGAAAAATGA AGTGGTGTCA CCAAAGCACC TGTTGCCTCA TCAGACTTGA TCCCTGCTTG 7 8 60 

TGCTAAAATT GTGTTAATGT GTAATTCCTT GCTCATACAA TTCCTCCAAA TCTATAGTAA 10 20 

CTATTGTACC ACTTATTTTG TATCCTTCGT TTTCTTGTTT TCAAGAGCTA GTTATAGTTT 7 9 80 

CAAACTATAT AAAAAGGGAG TTTTTCCTGC TCCCTTTAAT AGACTATAAA ATGGTGAATC 804 0 

TCAAAAGACA CCTTCACTCT ATCATTTGCT CCTGCACAAA ACGAGCATAA CGCTCATGAT 8100 

TTTCCAGTAG TTCCTTATGA GTTCCTGAGC CAGTGATTTT CCCCTCCTCT AAGAAGAAAA 8160 

TACAATCCAC ATCTTTTACC GTTGACAAAC GATGCGCTAT AATCACAACC GTCTTCTCCT 3 2 20 

TTAGTACAGA ATAGAGGCTA CTGATAATCG CATACTCAGA ATCCGCATCA ACATTAGCAG 32 80 

TGGCTTCATC AAATATAAGA ATTTCAGCAT CTTTTAAGTA GGCTCTAGCT ATTTGAAGTC 3340 
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TTTCCTTCCC 


CCCCCTGACA 


AGAGTCGTCC 


GCGTTCACCA 


ACTTCAGTAT 


CTAGTCCCTC 


3400 


TTTCATGGAG 


CGAATCTCAT 


CACCTAGTGA 


TACTAAGTCT 


AG C ACTTTC A 


TCAATTCATC 


3460 


ATCAGTTACT 


AAGCGATTCA 


AACCGAGACA 


AAGATTGTCA 


CGAATACTGC 


C AG AT AAG A C 


8520 


TGCATTATTT 


TGTGAAACCC 


AAGCGATTTT 


ACTTCTCCAT 


TCTTTTAAGT 


TAAAATCATA 


8580 


TATACTTGAT 


TCCTCCATTA 


GAATATCTCC 


TGAAAGCGGT 


TTATAAAACC 


GCTCTAACAA 


8640 


ACGCACAATC 


GTTGATTTTC 


CTGATCCAGA 


TGGTCCAACA 


AAAG C AATTT 


TTTGCCCCTT 


8700 


GAAAATTGAA 


CAAGTAATAT 


CCTTTAAGAC 


AGGTCGATTT 


TCATCATAAC 


CAAAATAGAC 


8760 


ATGGTTAAAA 


TTCAACCCTC 


GTCCTGATAC 


CGATTTTCCT 


CCCTCAAATT 


TTTCTTTAGG 


8820 


AACTGCAAGC 


AAGTTCTCCA 


GTGCAACTGA 


AGATCCCTTG 


CTCCTAGAAT 


AAACAGTTAC 


8380 


AAAATTACCT 


AT ATT ACT AA 


TAGGATTAAG 


TAATTGAAAG 


AGGTAAATCA 


AAAACGAAAC 


8940 


CAAGGTTCCC 


AC AG AT AT AT 


ATCCTGCGCT 


G AC CC GAT AA 


CCCCCATAGG 


TTAGCATCAC 


9000 


AGCTATAGTC 


GCAAAGATAA 


ATAAGAGAGC 


AAACGGGGTC 


TCAAAAGAAC 


TAACCCTATC 


9060 


TGATTTCAGT 


GAATTGTTTT 


GTACCCTTTC 


AATACAATTA 


T C C AAAAC AT 


CCTGTACACT 


9120 


TTTCTCTGCT 


TGGTTAGTCT 


TAATTAATTC 


ATGTTCTTGA 


ATCTTTTCAG 


TCAATTGCCC 


9180 


TGTTAAATTT 


CCTCCTGTAA 


ACGACGACTA 


TACTTTTCAC 


TG AT ATTGG A 


AAGGGGCAAG 


9240 


ATAATAAACA 


TCATACAAGG 


AAGAGTGATG 


AATAAAAGTA 


GAG AAAG ATT 


CCAATCAAGA 


9300 


CTAAATAAGA 


CTACAATGGA 


ACCAAGTACC 


ATAACTAAAC 


TCAGAATAAT 


ATTTGGGAAA 


9360 


GTCGTAATTA 


AAAACTCACG 


AATGACACTC 


GTGTCATTGA 


CAATGGCAGA 


AGTC AACTC C 


9420 


CCACTTTGGC 


TCTTATCAAA 


GAAGGATTTC 


TCTACATAAA 


TCAACCCCTC 


TATCACTTTT 


9480 


TTCCTGATTT 


TTGCTATCTT 


TTTTTCACCC 


GATTGACTAA 


ACAGATAGTA 


ACCAATAGAA 


9540 


GAAAACAAGG 


CTTGACCAAT 


AAAAATCAAA 


AACGATTGAA 


ATACTTTGGA 


GCCTATATTT 


9600 


TCAATAGAAt. 


I LLLn ILiftl 




AAGATAAGGG 


GAAGCAACAA 


AG C AAGT AG A 


9660 


CTAGACAGAA 


CAAGTAAGAA 


ACTCCCCATA 


ATCACCTTAG 


TATCTACTCT 


TAATAATTTT 


9720 


AATTTCATAA 


ATACTCCTTA 


TAATATTTCA 


ACGGATAAAG 


TCGGGAATAA 


CTCAATTTGA 


9780 


GGATAAAATC 


TAATAAATCT 


TCCTATAACA 


AAACGCATAA 


CATCTAGGAT 


TTTATATACC 


984C 


TGATATTATG 


CGTTTTTAAG 


CACAAAGACT 


TCTTACACAA 


ACTTATCTAC 


AATTAGATTT 


9900 


TATTTGACAT 


GTTTTGCCAA 


TTCTTCTTGG 


GCTTTTTTAT 


TGGATTCTTC 


TTTTTCTTTC 


9960 


AACCATTTTT 


CTCTGGCTTT 


TGCATATTCG 


TCTGTTGTGA 


CAATCTTATC 


TTGTACTTTG 


10C20 


AGGTATTTAT 


ATGATTCAAC 


CCCTTTTGTA 


, CCGGTTAAAC 


CATAGGCAGC 


AGCAAATGGT 


10080 
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ACGGTTCTTC TCAATGATGC TGTTCCCCCA CGCGAAACAC TTGGAAGAAC TAAAGAACTA 10140 

TCAATCAACC AAGCTTGAAT ATCACCATAT TTCTCATAAC CTTTGCCCGG ATCTTGCTCT 10 200 

TTATTAGCTT CTTCCAACAT TTCAGTATAG ACATCCAGTC CAACTGCCTT AGCCTTGTCA 102 60 

TTGGCCTCAC CAGGCTCTAG TCCAAGATTT TGCAGAAATC CTCCACTATT AGTATTAAAA 10320 

ATATCGAGAT AGGTTGACGG GTCTTGATAA TCAGGTCCCC AACCGCCATG ATATAAATCA 10 3 80 

TAATCTTTCT GAGCAGCTGT TTGAGCAAAG TAGCCTGAAC TGTCAAACTC ATCTGATGTT 10440 

AATTCCTGAA TGTCAATCAC TACATTATCA GAACCTAAAA CAGATTCAAT TGATTGTTTG 10500 

ATAGAACTAA CTCCTTGTAT GCCTACTTTA TCTGTTACTT CCACAGTCTT ATCCAAGTGG 105 60 

ATTGGGAATT GAACACCCTT TGCTTCGAGT TCTTTCTTAG CTTCCCCAAA CTTAGCCTTG 10620 

GCTTTCTCAG GATTGTAGTA AGGGTCTTGA CCATCCGCAA ACTTGATACC TTGCCATTCC 10680 

TTACCATAGT TGACCATCTT AGAGGCTACA ACTTCACCAA AGTCTTTTCC CTTGATACTG 107 40 

ACAAACTTTC GAGGAACCAC T AGGTT AC GC AAAATCTTTG TTGCACCTTC TTTCCCTTCA 10800 

GACTGAGCCC CATAAGATGT TCTGTCAAAA CCAAAATTCA TAGCCTGACG GAAGTTTTTA 10860 

TTGACAACTG CTTCCTGAGT CGATTTCTTT TCAATGTCAC TTGTTTTACA ACT AT AATTG 109 2 0 

TAACACTTCC TATCT AGGTT AAAATTAAAG AAATATGAAG TTGAATTTTG CATACTATAG 109 80 

ATGATATTGT TTTTGTATTT TTCTTTAATC CCTTCATAGC TCGACCTGTT AGGAAAAAGA 11040 

CGAGCCGTAG TATAAGCACC AGCTGTAAAA TTACGTTCCA GTGATTCTTG GTCGCTACCA 11100 

TCATAGTAGG TCAATTTCAC ATCGTCTACA AAGACATTCT TAGCATCCCA CTAATTAGGC 11160 

TTTTT CTT AT ATTCAATAGC AGATTTTGAG ACAAGTGCTT TCATCAAGAA AGGTCCATTG 11220 

TACAAAATAC TAGATGGATC CGCCTTCCCA AAATCATCCC CTTTTGATTT CAGGAAATCT 11280 

GCATTAACAG GAAAAAGTAT CGTTGCAAGT G TTTTT GAAP TCCACTAAAC TTCTGCTTTA 11340 

AC C AAAGT AT ATTGAACCGT TTGGTCATCA AGTGCCTTGA CACCGACAGT TGAAAAGTCG 11400 

CTTGTTTTAC CAGTGATATA GTCATCCAAA CCAGCAACAC AGTCCTGCAC TAGATACAAG 114 60 

GCTTCTGATT TTTTATCAGC TGCATATTGC AAACCTGTCA CAAAATCCTG CCCACTTACA 11520 

GGCGCATATT CTTCTCCCTC AGAAGTAAAC CACTTGGCAT CCTTACGAAG TTTGTAGGTA 11580 

TAGGTCAAAC CGTCCTGAGA AACAGTCCAA TCCTCTGCTA ATGATGCAAT AATATTCCCA 11640 

TATTGGTCAT TTTCT AAT AA CCCGTCTACC AAATTTGCAA CAATATCCGA TGTTGCTGCG 11700 

CGGTTTTCTG CT AG AT AGTT CAAGCTAGAT GGATCACTTG AATAAACATA GTTGT AGGTT 11760 

TTTGACGCCG TGCTAGAATT TCCACACGCG CTCAATAAAA CTCCTGTACC CAGGACAAGA 11820 

CCTGCCAAGG TTAGATATTT GCTCTTAGAC TTTTTCATTT CCGG 11364 
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12) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TAACTGCACT AAACATAATA TAAGGAGAGA AAATGTCTGC AATAGAACGT ATTACAAAAG 6 0 

CTGCTCACTT AATTGATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 120 

TTGCTGAGGA AGTCACTTTC CCCCTATCTG ACCAGGAAAT CATCCTAGGC G AAAAG ATG A 180 

TGCAATTCCT T AAA C ATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 24 0 

GTGTTGGACT GGCTGCTCCC CAGTTAGATA TCTCAAAACG CATTATCGCT GTTTTGGTAC 300 

CTAATATTGT TGAAGAAGGC GAAACTCCAC AGGAAGCCTA CGATTTGGAA GCCATTATGT 3 60 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AAGATGCTGC TCTTGGCGAA GGAGAAGGTT 4 20 

GCCTGTCTGT TGACCGTAAC GTGCCTGGCT ATGT TGTTCG CCATGCCCGC GTTACTGTTG 4 B0 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGGCTAC AACTCCATTG 54 0 

TTGTTCAGCA TG AAATTG AC CACATTAACG GTATCATGTT TTACGATCGC ATC AATG AAA 600 

AAGACCCATT TG C AGTT AAA GATGCTTTAC TGATTCTTGA ATAAAGAAAA TCCCGTTGCA 660 

AGACGGGGTT TTCTGTTATA ATAGAGGCAT GAAAACAAAT GATATTGTCT ATGGTGTCCA 72 0 

CGCCCTTACC GAACCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 780 

CCGAGGTAAG AATGTTGAGA AAGTCAAGGA ACTAGCTACA G AAAAG AACG TCTCCATTTC 94 0 

TTGGACATCA AAAAAATCTC TCTCTGACAT TACTGAAGGT GCTGTTCATC AAGGTTTTGT 900 

TCTACGAGTG TCTGAATTTG C C T AT AG CG A GCTAGATTAC ATCCTTGCAA AAACACGCCA 960 

AGAAGAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGCGTTC 102 0 

TATCTTGCGA ACAGCCGATG CGACCAATGT TTCAGGTGTC ATCATTCCCA AGCACCGTAC 1080 

TGTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACAGGT GCTATTGAAC ACGTtCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGATAAACT TAAGGATGAA GGTTTCTGGA 12 00 

CCTTTGGAAC GGATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 12 60 

CCCTCATCAT TGGAAATGAA GGAAAAGGTA TCTCTAGCAA CATCAAAAAA CAGGTCGATG 13 20 

AAATGATTAC CATTCCGATG AATGGACATG TTCAAAGCCT TAATGCCAGT GTTGCTGCGG 13 80 
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CCATTCTCAT GTACGAAGTT TTCCGAAATA GACTATAAAA AAGTTTCCAG TCATCTGATT 144 0 

GGAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT ATAGCCTTCT TGACCAGCGA 1500 

TAGCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC TTTCAAGCGA ACATCTCCAA 15 60 

CTGCAAAGAT ACCGTCGACT GCAGTTTTCA TGTGGTT ATC TCTCACAATC CATCCTGCCT 1620 

GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG GTCCAAACCA ACATAGATAA 1680 

AGACACCACC GAAGGCTTGT TCTGTCACTT G ACCTGTTTT CACATTTTCA AATACGACTG 174 0 

ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA ATC CC AG AT A AAGCTGATTT 1800 

TTTCATTCGC AAAGGCGCGA TCTTGTAAAA CCTTTTGGGC ACGAAGTTGG TCACGACGGT I8 60 

GAACAATGGT AACACTCTTA G C AAAACG AG TCAAGAAGAG GGCTTCTTCA ACAGCTGAAT 19 20 

CTCCACCACC AACTACCAAT AAATC TTGGT CACGGAAGAA AGCACCATCA CACACAGCAC 1980 

AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG CACTCCCAAA GGACCGTGTT 204 0 

TAGAACCAGT TCCTACGATA ACTGTACGTG TTTCATATGT TTGGTCATCA GTCATCACTT 2100 

TCTTAAAATC ACCATGGCTT CGACATTTTC AACATAACCA TAAATGTGCT CAACACCAAG 2160 

ATTTTCAAGT CGTTCAAACA TCTTTTCAGC CAATTCAGGT CCACTAATAT TAGCGTATCC 2 22 0 

TGGGTAATTT TCGATATCAG ATGTATTATT CATCTGACCA CCTGGCAGAC CACCTTCAAT 22 80 

CAAAGCTACT TTTAGATTGC TTCGAGCAGC ATACAAGGCC GCAGTCATCC cTCCACGTCC 23 40 

AGCACCGATA AT AAT AGT AT CGTACATATA GATTCCTTCT TTCTTGGTGT AACTATCTTT 2 4 00 

2412 

ATTCTAACTC TG 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 77 50 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
JD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGAGCA GAGTTTACCT 60 

TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAAT ATTGT A GATCTAAACA 120 

AGGATGAATG GATTTTAGGA AC AGTTGCTG GTGCCAATAC CTTATTGGTT ATTTGTCGAG 180 

ATCAGCACGT TGCCAAACTC ATGGAAGATC GTTTCCTAGA TTTGATGAAA GATAAGTAAG 24 0 

GTCT TGGG AG TTGCTCTCAA GACTTATTTT TGAAAAGGAG AG AC AG AAAA TGGCGATAGA 300 

AAAGTTATCA CCCGGCATGC AACAGTATGT GG AT ATT AAA AAGCAATATC CAGATGCTTT 3 60 
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TTTGCTCTTT CGGATGGGTG ATTTTTATGA ATTATTTTAT GAGGATGCGG TCAATGCTGC 42 0 

GCAGATTCTG CAAATTTCCT TAACGAGTCG CAACAAGAAT GCCGACAATC CCATCCCTAT 4 80 

GGCGGGTGTT CCCTATCATT CTGCCCAACA GTATATCGAT GTCTTGATTG AGCAGGGTTA 540 

TAAGGTGGCT ATCGCAGAGC AGATGGAAGA TCCTAAACAA GCAGTTGGGG TTGTTAAACG 600 

AGAGGTTGTT CAGGTCATTA CGCCAGGGAC AGTGGTCGAT AGCAGTAAGC CGGACAGTCA 660 

GAATAATTTT TTGGTTTCCA TAGACCGCGA AGGCAATCAA TTTGGCCTAG CTTATATGGA 7 20 

TTTGGTGACG GGTGACTTTT ATGTGACAGG T CTTTTGG AT TTCACGCTGG TTTGTGGGGA 7 30 

AATCCGTAAC CTCAAGGCTC GAGAAGTGGT CTTGGCTTAT GACTTGTCTG AGGAAGAAGA 840 

ACAAATCCTC AGCCGCCAGA TGAATCTGGT ACTCTCTTAT GAAAAAGAAA GCTTTGAAGA 900 

CCTTCATTTA TTGGATTTGC GATTCGCAAC GGTGGAGCAA ACGGCATCTA GTAAGCTGCT 960 

CCAGTATCTT CATCGGACTC AG ATG AG GG A ATTGAACCAC CTCAAACCTG TTATCCGCTA L020 

CGAAATTAAG GATTTCTTGC AGATGGATTA TGCGACCAAG GCTAGTCTGG ATTTCCTTGA 108 0 

GAATGCTCGC TCAGGTAAGA AACAAGGCAG TCTTTTCTGG CTTTTGGATG AAACCAAAAC 1140 

GGCTATGGGG A TGCGTCTCT TGCGTTCTTG GATTCATCGC CCCTTGATTG ATAAGGAACG 1200 

AATCGTCCAA CGTCAAGAAG TAGTGCAGGT CTTTCTCGAC CATTTCTTTG AGCGTAGTGA 1260 

CTTGACAGAC AGTCTCAAGG GTGTTTATGA CATTGAGCGC TTGGCTAGTC GTGTTTCTTT 13 20 

TGGCAAAACC AATCCAAAGG ATCTCTTGCA GTTGGCGACT ACCTTGTCTA GTGTGCCACG 13 80 

GATTCGTGCG ATTTTAGAAG GGATGGAGCA ACCTACTCTA GCCTATCTCA TCGCACAACT 1440 

CGATGCAATC CCTGAGTTGG AGAGTTTGAT TACCGCAGCG ATTGCTCCTG AAGCTCCTCA 1500 

TGTGATTACA GATGGGGGAA TTATCCGGAC TGCATTTGAT GAGACTTTAG ACAAGTATCC 15 60 

TTGCGTTCTC AGAGAAGGGA CTAGCTGGAT TGCTGAGATT GAGGCTAAGG AGCGAGAAAA 16 20 

CTCTGGTATC AGCACGCTCA ACATTGACTA CAATAAAAAG GATGGCTACT ATTTTCATGT 163 0 

GACCAATTCG CAACTAGGAA ATGTGCCAGC TCACTTTTTC CGCAAGGCGA CGCTGAAAAA 174 0 

CTCAGAACGC TTTGGAACCG AAGAATTAGC CCGTATCGAG GGAGATATGC TTGAGGCGCG 1800 

TGACAAGTCA GCCAACCTCG AATACGAAAT ATTTATGCGC ATTCGTGAAG AGGTCGGCAA 1860 

GTACATCCAG CGTTTACAAG CTCTAGCCCA AGGAATTGCG ACGGTTGATG TCTT AC AG AG 192 0 

TCTCGCCGTT GTGGCTGAAA CCCAGCATTT . GATTCGACCT GAGTTTGGTG ACGATTCACA 19 80 

AATTGATATC CGGAAACGGC GCCATGCTGT CGTTGAAAAG GTTATGGGGG CTCAGACCTA 2040 

TATTCCAAAT ACGATTCAGA TGGCAGAAGA TACCAGTATT CAACTGGTTA CAGGGCCAAA 2100 
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CATGAGTGGC 


AAGTCTACCT 


ATATGCGTCA 


GTTAGCCATG 


ACGGCGGTTA 


TGGCCCAGCT 


2150 


GGGTTCCTAT 


GTTCCTGCTG 


AAAGCCCCCA 


TTTACCGATT 


TTTGATGCGA 


TTTTTACCCG 


2220 


TATCGGAGCA 


GCAGATGACT 


TCGTTTCGGC 


TCAGTCAACC 


TTTATGGTGG 


AGATGATGGA 


2280 


GGCCAATAAT 


GCCATTTCGC 


ATGCGACCAA 


GAACTCTCTC 


ATTCTCTTTG 


ATGAATTGGG 


2340 


ACGTGGAACT 


GCAACTTATG 


ACGGGATGGC 


TCTTGCTCAG 


TCCATCATCG 


AATATATCCA 


2-300 


TGAGCACATC 


GGAGCTAAGA 


CCCTCTTTGC 


GACCCACTAC 


CATGAGTTGA 


CTAGTCTGGA 


2460 


CTCTACTTTA 


CAACACTTGG 


TCAATGTCCA 


CGTGGC AACT 


TTGGAGCAGG 


ATGGGCAGGT 


2520 


CACCTTCCTT 


CACAAGATTG 


AACCGGGACC 


AG CTG AT AAA 


TC t ACGGT AT 


CCATGTTGCC 


2580 


AAGATTGCTG 


GCTTGCCAGC 


AGACCTTTTA 


GCAAGGGCGG 


ATAAGATTTT 


GACTCAGCTA 


2640 


GAGAATCAAG 


GAACAGAGAG 


TCCTCCTCCC 


ATGAGACAAA 


CTAGTGCTGT 


CACTGAACAG 


2700 


ATTTCACTCT 


TTGATAGGGC 


AGAAGAGCAT 


CCTATCCTAG 


CAGAATTAGC 


TAAACTGGAT 


2760 


GTGTATAATA 


TGACACCTAT 


GCAGGTTATG 


AATGT CTT AG 


TAGAGTTAAA 


ACAGAAACTA 


2820 


TAAAACCAAG 


ACTCACTAGT 


TAATCTAGCT 


GTATCAAGGA 


GACTTCTTTG 


ACAATTCTCC 


2880 


ACTTTTTTGC 


TAGAATAACA 


TCACACAAAC 


AGAATGAAAA 


GGAGCTGACG 


CATTGTCCCT 


2940 


CCCTTTTCTC 


TATTTTTTAA 


GGAGAAAGTA 


TCCTCATTCA 


GAAAATAAAA 


ACCTACAAGT 


3000 


GGCAGGCCCT 


GGCTTCCCTC 


CTGATGACAG 


GCTTGATGGT 


TGCTAGTTCA 


CTTCTGCAAC 


3060 


CGCGTTATCT 


GCAGGAAGTC 


TTAGGCGCCC 


TCCTTACTCG 


GAAATATGAA 


GCT ATT TATA 


3120 


GTATCGGGGC 


TTGCTTCATT 


GGTGTGGCCG 


TAGTCGGTCT 


AGTTGCTGGT 


GGACTCAATG 


3180 


TTGTCCTCGC 


ACCCTATATT 


GCCCAAGGAG 


TTTCATCCGA 


CCTTCGGGAG 


CATGCCTTCC 


3240 


GTAAAATTCA 


AACCTTTTCT 


TATGCTGATA 


TTGAACAATT 


TAATGCGGGA 


AATCTAGTCG 


3 300 


TTCGAATGAC 


AAATGATATC 


AAC C AG ATT C 


ACAACGTTC'7 


CATCATGACC 


TTCCAAATTC 


3 3 60 


TTTTCAGACT 


TCCCCTCTTG 


TTCATCGGTT 


CGTTTATCCT 


AGCCGTTCAA 


ACCTTACCTT 


3420 


CTCTGTGGTG 


GGTGATTGTT 


CTCATGGTAG 


TCTTGATTTT 


TGGTTTGACT 


GCTGTCATGA 


3480 


TGGGAATGAT 


GGGCCCTCGT 


TTTGCCAAGT 


TTCAAACCCT 


TCTTGAGCGC 


ATCAATGCCA 


3S40 


TTGCCAAGGA 


AAATTTACGT 


CGCGTTCGTG 


TGGTCAAGTC 


CTTTGTCCAA 


GAAAAAGAGC 


3600 


AATTTGCTAA 


GTTTACAGAG 


GTCTCAGACG 


AGCTTCTTGG 


TCAAAACCTT 


TACATTGGTT 


3660 


ATCCCTTTTC 


AGTAGTGGAA 


CCCTTTATGA 


TGTTGGTTGG 


TTACGGGGCG 


GTCTTCCTCT 


3720 


CTATTTGGCT 


GGTCGCGGGA 


ATGGTTCAGT 


CGGATCCGTC 


TGTTGTTGGT 


TCCATCGCTT 


378C 


CTTTTGTTAA 


TTACCTAAGC 


CAGATTATCT 


TTACCATTGT 


TATGGTTGGA 


TTTTTGGGAA 


3840 


ATTCTGTCAG 


CCGTGCCATG 


ATTTCCATGC 


GTCGTATTCG 


AGAAATTCTT 


GACGCAGAGC 


3900. 
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CAGCTATGAC CTTCAAGGAT ATCCCAGATG AAGAGTTGGT TGCAAGTCTT ACCTTTCAAA 3 9 SO 

ATGTGACCTT TACCTATCCA ATGGACAAGG AACCGATGCT GAAAGATGTG AG CTTT ACT A 4 020 

TTGAACCTGG TCAAATGGTT GGTGTAGTTG GAGCGACTGG TCCAGGAAAG TCAACCTTGG 4 080 

CTCAATTGAT TCCACGTCTC TTTGATCCAC AGGACGGGCC CATTAAAATC GGTGGCAAGG 4140 

ATATTCGAGA AGTGAGTGAA GGAACCCTGC GTAAAACAGT TTCCATCGTT CTCCAACGTG 4 2 00 

CCATTCTTTT TAGTGGAACG ATTGCAGATA ACTTGAGACA GGGGAAGGGG AATGCTACTC 4 2 60 

TATTTGAAAT GGAGCGCGCA GCCAATATTG CCCAGGCTAG TGAATTCATT CATCGTATGG 4 320 

AG AAAAC CTT TGAAAGTCCA GTTGAAGAAC GGGGAACCAA TTTCTCTGGT GGACAAAAAC 4 38 0 

AAAGGATGTC GATTGCGCGT GGGATTGTCA GCAATCCACG TATTCTGATT TTTGATGATT 44 4 0 

CGACCTCAGC CTTGGATGCC AAATCAGACC GCTTCGTGCA AGAAGCTTTG AATAAGGACT 4 500 

TGAAGGGGAC GACAACCATT ATTATTGCTC AAAAAATTAG CTCGGTTGTC CATGCAGACA 4 560 

AGATCTTGGT TCTAAATCAA GGACGATTGA TTGGTCAAGG TACGCATGCA GACTTGGTTG 4 620 

CCAACAATGC CGTTTACCGT GAAATCTATG AAACACAGAA ATGAAAGACA AACTATAAGA 4 68 0 

AAAGTCAATA CTTTTATCTA AACTATTTCT TATTTCAATT TGATGATTTG GCGATGATTT 4 740 

TAGAGCACGG CAAAAAGCCC TTGAAAAAGT CCATTTTTTC AAAGGTAATC CTGTGTTAAT 4 800 

TTCAGAAATT ACATCACTTT TTGTTCGTCA AATGGCAGCT CTTTTTTTAG GAT AT AAAAC 4 860 

AGGGTTCGGA TAAGTTTTTT TCCAAGGTGG ATGATGGCTA CATTGTAATG TTTTCCTTGT 4 92 0 

TCTAATTTAG TCTT AAG AT A GCCCTTAAAA GC AGGC G AAA AGCGAGGGCA TGCTTTGGCA 4 980 

CCTTGTATGA G7ACCTACCG CAGATGAGGG GAACTC CGTT TGACCATTCT TCCTGCTAAA 5 04 0 

TCAATCTGAT CTGACTGATA AATAGAAGAA TCCAGTCCAG CGAAAGCTTG TAATTGAGCA 5 100 

GGATTATCAA AGGCATGAAT ATTTCGAATC TCAGCTAAAA TGACCGCCCC TAAACGATCC 5160 

CCAATCCCAG TAACCGTCGT GATGACCGAG TTCAACTCAC CCATCAAGTC ATTGACACAT 5 2 20 

GTTTCCCCCT TGTCAATGAG CCTCTTGTAA TGTTTGATGT TTTCATTACA CGAGATAAAA 5 2 80 

CGTCTATGCG TTATCAAACT CATTACCAAT TAAAACAAAA AGCTGTGGTT AGATCCTTTC 5 340 

GGAAATTGTC AAGCGATTGG AGGAAATGAA CTAATCCACA GCGGCTTATT CCAAGTATAC 54 00 

CACTTGGGCT TTGGC AGT AG CTAACTGCGC TAAATATAAT ATAAGGAGGA G TAAAATG AA 5 4 60 

CACAGTTCAA TTTTTTTGGC ATTATTTTAA CGTCTACAAG TTCTCATTTG TACTTCTCAT 5 5 20 

CCTGATGATT GTTCTGGCGA CTTTTGCCCA AGCCCTCTTT CCAGTCTTTT CTGGACAAGC 5 5 30 

CGTCACGCAG CTAGCCAATT TAGTTCAAGC TTATCAAAAT GGCAATCCAG AACTTGTATG 5 64 0 
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GCAAAGCCTA 


TCAGGAATCA 


TGGTCAATCT 


TGGCCTGCTG 


GTTTTGGTTC 


TATTTATCTC 


S700 


TAGTGTAATA 


TACATGTGTC 


TCATGACGCG 


CGTGATTGCA 


GAATCGACCA 


ACGAGATGCG 


5760 


CAAAGGCCTC 


TTTGGTAAGC 


TTGCTCAGTT 


GACGGTTTCT 


TTCTTTGACC 


GTCGACAAGA 


5820 


TGGCGATATC 


CTGTCTCATT 


TTACCAGTGA 


TTTGG AT AAT 


ATCCTCCAAG 


CCTTTAACGA 


5880 


AAGCTTGATT 


CAGGTCATGA 


GCAATATTGT 


TTTATACATT 


GGTCTGATTC 


TTGTCATGTT 


5940 


TTCGAGAAA? 


GTGACGCTGG 


CTCTCATCAC 


CATTGCCAGC 


ACCCCATTGG 


CTTTCCTTAT 


6000 


GCTGATTTTC 


ATCGTGAAAA 


TGGCACGCAA 


ATACACCAAC 


CTCCAGCAGA 


AAGAGGTAGG 


6060 


GAAGCTCAAC 


GCCTATATGG 


ATG AG AG CAT 


CTCAGGCCAA 


AAAGCCGTGA 


TTGTGCAAGG 


6120 


AATTCAAGAG 


GATATGATGG 


CAGCATTTCT 


TGAACAAAAT 


GAGCGCGTGC 


GCAAGGCAAC 


6190 


CTTTAAAGGA 


AGAATGTTCT 


CAGGAATTCT 


TTTCCCTGTC 


ATGAATGGGA 


TG AG CCTGAT 


6240 


TAATACAGCC 


ATCGTCATCT 


TTGCTGGTTC 


GGCTGTACTT 


TTGAATGATA 


AGTCTATTGA 


6300 


AACAAGTACA 


GCCCTAGGTT 


TGATTGTTAT 


CTTTGCACAA 


TTTTCACAGC 


AGTACTACCA 


6360 


GCCTATTATC 


CAAGTTGCAG 


CGAGTTGGCG 


AAGCCTTCAG 


TTGGCCTTTA 


CTGGAGCTGA 


6420 


AC G AATTC AG 


GAAATGTTTG 


ATGCAGAGGA 


GGAAATCCGA 


CCTGAAAAGG 


CTCCAACCTT 


6480 


C ACT AAGTTG 


CAAGAAAGTG 


TTGAAATCAG 


TCATATCGTT 


TTTTCATACT 


TGCCTGATAA 


6540 


ACCTATTTTG 


AAAGATGTCA 


GCATTTCTGC 


CCCTAAAGGC 


C AG ATG AC AG 


CAGTTGTTGG 


6600 


GCCGACAGGT 


TCAGGAAAAA 


CGACTATTAT 


GAACCTCATC 


AATCGCTTTT 


ATGATGTTGA 


6660 


TGCTGGTGGT 


ATTTATTTTG 


ATGGTAAAGA 


CATTCGTGGC 


TATGACTTAG 


ATAGTCTTAG 


6720 


AAGCAAGGTG 


GGAATTGTAT 


TGCAAGATTC 


GGTCTTGTTT 


AGCGGAACGA 


TTAGAGACAA 


6780 


TATCCGATTT 


GGTGTGCCAG 


ATGCTAGTCA 


GGAAATGGTT 


GAGGTAGCAG 


CAAAAGCAAC 


6840 


CCACATTCAC 


GACTATATCG 


AAAGTTTGCC 


TGATAAGTAC 


GATACTCTTA 


TTGATGATGA 


6900 


CC AG AG CATC 


TTTTCAACAG 


CGCAGAAGCA 


ATTGATTTCA 


ATCGCTCGAA 


CCCTGATGAC 


6960 


ACATCCAGAA 


GTTCTCATTC 


TCGATGAAGC 


AACTTCAAAC 


GT AG AT ACGG 


TGACAGAAAG 


7020 


CAAGATTCAG 


CATGCCATGG 


AGGTGGT TGT 


AGCAGGTAGA 


ACTAGTTTCG 


TCATTGCCCA 


7080 


CCGCTTGAAA 


ACCATTCTCA 


ATGCAGATCA 


GATTATTGTC 


CTTAAAGATG 


GAGAAGTCAT 


7140 


TGAACGTGGT 


AACCACCATG 


AACTTTTGAA 


GCTAGGTGGC 


TTTTATTCAG 


AACTCTATCA 


7200 


CAATCAATTT 


GTTTTCGAAT 


AAGAAAGAAG 


TTCTCCTATG 


TGGGCAGCTT 


TTTCTTGTCC 


7260 


ATAAAAAATG 


TTTATCACAG 


CCTTAAAAAA 


AACATATTAG 


ACGAAAGTCA 


TTTTGAGTGA 


7320 


TATGATAGGA 


CTATCGTTAG 


CATTCGAAAG 


GAGAGGCATC 


ATGGCTAGAA 


CGGTTGTAGG 


7380 


AGTTGCTGCA 


AATCTATGTC 


CCGTAGACGC 


AGAAGGCAAA 


ATCATTCATT 


CATCTGTATC 


7440 
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TTGTAGATTC GCAGAGATCA TTCGTCAAGT CGGTGGTCTC CCTTTAGTCA TTCCTGTTGG 7 500 

TGATGAGTCA GTTGTACGTG ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 7 560 

CCAAAATGTT CATCCTCAGT TTTATCGAGA GAAAAAGACC GTCGAGAGCG ATGATTACAA 7 620 

TCTGGTCCGT GACGAATTTG AATTGGCACT CTTGAAGGAA GCGCTTCGTC AGAATAAACC 7 6 80 

AATTATGGCA ATCTGTCGCG GTGTCCAACT TGTCAATCTT GCCTTTGGTG GAACCCTCAA 77 4 0 

TCAAGAAATC GAAGCTCAGG 77 60 



(2) INFORMATION FOR SZQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A» LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GAGCTTTTAA TTCACTTACC TCTsCCGTAT CTTTATTTAA AATGAATTCT TTTACGGTTG 60 

TATTTCTTCC AAAATCTTTT ACAACAATCT TAATGTTTAG TGTCTTGTCT ATTATTTGTT 12 0 

TAATATCATT AAATGATGTA TATTCTTTTC CATTTATATA AATATGTTGT TCTTGAATCT 180 

CACCATCGAA TCCATTATTT CTTTTATCAT TGATGTTAAA GACTACACAT TTTCCATCAG 24 0 

CATATTCGAT ACTAGTATTT CCCTTAGGAT CAATGTTTAC TTCGGGTTT A ACATTATCAT 3 00 

ATAAAAACTG ATACTCCACT C C AACTG CTT TAGCATTCAA ATCGCTATAG CCAGTTTGAA 3 60 

GATAAACATT TCCATCCATA TCTGTTACCT TATCTGGAAA TCCGTTTGCT TTATAGTCTT 42 0 

TCATTCCCCA GTCCATGATG TCACCGTCTT TAACATTCAC CTTAATATTA AAATCTCTAG 430 

TGTTATCAAT CTGTAAATCT CCGTAGATTA AATAATTATC TACAACCGA7 TCATTAACTC 54 0 

TCAATTCCCA GTTAAAACCA CCCTTATCAG AAATCTTACC TCTTAAATAA AATTCTG GAT 600 

TTCGTACATA AATTTTATTA GATTTAGATG GATTAAAGTA GTTCTTATCC ATTGAAAGGT 660 

TTACTGGTTT GGTATCAATA AATAACATGG AGCCATCTTC TTTTATAGCT TCTACATTGA 7 20 

ACTTATCCTC TCCAGTGTAT TCTTTATCAT CCTTACCAAA TAATACAAGT TTAGAAGAAT 780 

CTGTCACAAG ATTTCCCTCT TT ATCG AT AG CTTCCCCTTT ATCCTTCATT TTAAATGTAA 840 

ACACTTGATA CCTTATAATG TTAAAGCCGT CCAAAGCCGA CATTAATACA GATTGGGTAC 9 00 

TTCTTCCATC TTCAACAT7T C T ACT ATC AG CATAAATTGT TGTTTCTGAA AGGGCTCTTA 960 

GATTAGGATT GGCCTTTTGT ATTTTTGCTA TATCTTCCTT G CT AT AG ACT CCATTTCCTT 1020 
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CTAACATATC 


CGTTTTTCCA 


GGATTATAGG 


TAGTCACTTT 


TAGTGCATAG 


CCTTTTCTTA 


1080 


GAATGATATT 


ATCCTTTAAC 


AGATATTGTT 


GTTTTTCTGA 


ATCAGAATAG 


ATTTT ACCAG 


1140 


ATTCCATTTT 


AGTT AAATT G 


TCTGGTTTGT 


TTTTTGAAAG 


ATCTCCTTCC 


CCTAATTCTA 


1200 


TGACATTCCC 


AT AACT TGAT 


ACATAGGGAT 


ATTCTGATTT 


AGTTTCCTTA 


ATTTTTTCAG 


1260 


GCATTCTAAT 


TTTAATTTCA 


GCTTTTTTCT 


GATCATTATC 


TTTAACAAAT 


AATCTCATAT 


1320 


CTCCTGCAAA 


AGCTAATCCA 


TCCACAATAT 


CATTAATATT 


AGCGT AT AG A 


TCAAATGTCA 


1380 


TCGTTTTTGA 


GTGGAAATCA 


TACTTGGTCG 


CTTTGATTTC 


TATAGATTTA 


TAGTTATTCC 


1440 


CATAATATAC 


CTTGGCATTT 


TTAGAAACAT 


TACTTATCTT 


TCCAAGAATT 


TCAAAGTGTC 


1500 


CATCTTTAGA 


CGGACTTAGA 


ACACCATAAA 


TTTTTGATTT 


GATTTCGTCA 


AGTTTCTCAG 


1560 


TTTCATATTC 


TAGATCAGTC 


CCATCATCGT 


AGGCTATTAT 


ATTTCCTTTA 


TCATCGTATT 


1620 


TATAATCGTA 


TTCCTCCATT 


CTCTTACCAG 


TTTC ACT TGT 


AAAATCATCA 


ACTTCTCTAA 


1680 


ATTT C 'f'TTT*^ 1 


AATGAGTTTC 


TTTAAGTCTT 


TATTTTCAAA 


GTCTCTAATT 


GTTG AAAT AT 


1740 


TTCTATCAAT 


AGTAAAACTA 


GATTTTTCTT 


TAATAGACTC 


TTCATTTTCT 


TGATGATGAT 


1800 


GTTCTACCCC 


ACTTGTATCT 


TTTTTTAGAC 


TACCCTCTTT 


TCCATTTCCT 


AAATTTTTAA 


1360 


ATTT* AG ATT C 


TGCAATCTCG 


CCAACCTTTT 


GAT ATTT AG A 


TGAATCTTGA 


TCAGGATCTA 


1920 


CTAGATAATA 


GGAAATCATC 


CCCTTTTCAT 


CAGCCTGATT 


AG CAAATTT A 


ATTCTATGAA 


1980 


TCTTTGTGAA 


ATTGCTAGAA 


C CAT CT AATG 


CAATGACTTC 


AATG ATT TT T 


CCCCTTAAAT 


2040 


CTCCCGCACC 


TTTAATTTCA 


TAAATGGTAT 


TTCCGTCTTT 


ATCAAGTTTT 


CTATTTCTTC 


2100 


CTTGACCCTC 


ACCTGCGTAA 


GTTACTTCAA 


GATTTTTTTC 


AACCTCTCCA 


TCTTCATTAA 


2160 


CAAGAGCGGC 


GCCAGCATAC 


CAAACTTCGT 


TCGCAATCTC 


GTCAAATTTT 


TCAGGATGTT 


2220 


CTTTTTGATC 


TCTCGCAAAT 


AGCGTTTCAT 


TCTTATACTC 


ATCTTTTACC 


TTATGATAAG 


2280 


TATCCTTTGT 


AATCAACTTA 


ATTTTTTCAG 


GATTTGAAAA 


ATCAACCGAA 


ACAATCTTAG 


2340 


GGGCGGTGTT 


ATCAATTTTT 


ACAGGAATAT 


AGGAAACCTG 


CCATGGGTAA 


TCTTTAGTTA 


2400 


ATCTATATTT 


AAATTTATAG 


AAATATTGAC 


CTTCCGCAAT 


CGGTTCAAAT 


TGACCTCTTA 


2460 


TCTTAGTAGC 


AGGATCTTGA 


TTATCCTTAC 


TTTCTGGTGC 


ATTTT CT T CT 


CTACCTCTAG 


2520 


GATTATAGAT 


GAGTCCATCC 


CACTTCAAGT 


CACCCCAAAC 


TTTTAGTTTA 


GATGATTTGA 


2580 


TTCCCTTTGC 


ATCATTGCTT 


TT AG AATTT A 


AAATTCCTCT 


AATAAAGTCT 


TCTCTCGAAA 


2640 


TGACTTTTAA 


GTCTCTTTGA 


TTTTCTCCCT 


CTTTATTTGT 


ATTTACTATT 


GAAATCAATC 


2700 


CTTCTTCTGC 


ACTTCTTAAT 


ACA 








2723 



(2} INFORMATION FOR SEQ ID NO: 65: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11331 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
( D > TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

AAAAAAGTGG GAATGACTCA AATCTTCACT GAAGCTGGCG AATTGATCCC TGTAACAGTT 6 0 

ATTGAAGCAA CTCCAAACGT TGTTCTTCAA GTTAAAACTG TTG AAAC AG A CGGATACAAC 12 0 

GCTATCCAAG TTGGTTTCGA TGACAAACGC GAAGTATTGA GCAACAAACC TGCTAAAGGA 180 

CATGTAGCGA AAGCTAACAC GGCTCCTAAG CGCTTCATTC GTGAATTCAA AAACGTTGAA 240 

GGCTTGGAAG TTGGTGCTGA AATTACAGTT GAAACATTCG CAGCTGGAGA CGTTGTTGAC 3 00 

GTAACGGGTA CTTCTAAAGG TAAAGG TTTC CAAGGTGTTA TCAAACGCCA CGGACAATCA 3 60 

CCTGGACCAA TGGCTCACGG TTCTCGTTAC CACCGTCGTC CAGGTTCTAT GGCGCCTGTT 42 0 

GCACCTAACC GCGTATTCAA AGGTAAAAAC CTTG CAG G AC GTATGCGTGG CGACCGCGTA 4 80 

. ACAATTCAAA ACCTTGAAGT TGTACAAGTT GTTCCAGAAA AGAACGTTAT CCTTATCAAA 54 0 

GGTAACGTAC CACCTGCTAA GAAATCTCTT ATCACTATCA AATCAGCAGT TAAAGCTGGT 600 

AAATAATAAA GAAAGGGGAA AT CAGTC AC A ATGGCAAACG TAACATTATT TGACCAAACT 660 

GGTAAAGAAG CTGGCCAAGT TGTTCTTAGC GATGCAGTAT TTGGTATCGA ACCAAATGAA 720 

TCAGTTGTGT TTGATGTAAT CATCAGCCAA CGCGCAAGCC TTCGTCAAGG AACACACGCT 730 

GTTAAAAACC GCTCTGCAGT ATCAGGTGGT GGACGCAAAC CATGGCGTCA AAAAGGAACT 340 

GGACGTGCTC GTCAACGT7C TATCCGCTCA CCACAATGCC CTGGTGGTGG TGTTGTCTTC 900 

GGACCAACTC CACGTTCATA CGGCTACAAA CTT C C AC AAA AAGTTCGTCG CCTAGCTCTT 960 

AAATCAGTTT ACTCTGAAAA AGTTGCTGAA AACAAATTCG TAGCTGTAGA CGCTCTTTCA 102 0 

TTTACAGCTC CAAAAACTGC TGAATTTGCA AAAGTTCTTG CAGCATTCAG CATCGATTCT 1030 

AAAGTTCTTG TTATCCTTGA AGAAGGAAAT GAATTCGCAG CTCTTTCAGC TCGTAACCTT 114 0 

CCAAACGTGA AAGTTGCAAC TGCTACAACT CCAAGTGTTC TTGACATCGC AAATAGCGAC 1200 

AAACTTCTTG TCACACAAGC AGCTATCTCT AAAATCGAGG AGGTTCTTGC ATAATGAATT 12 60 

TGTATGATGT TATCAAAAAA CCTGTCATCA CTGAAAGCTC AATGGCTCAA CTTGAAGCAG 1320 

GAAAATATGT ATTTGAAGTT GACACTCGTG CACACAAACT TTTGATCAAG CAAGCTGTTG 13 80 

AAGCTCCTTT CGAAGGTGTT AAAGTTGCCA ATGTTAACAC AATCAACGTA AAACCAAAAG 144 0 
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CTAAACGTGT TGG ACGTT AC ACTCGTTTTA CTAACAAAAC TAAAAAAGCT ATCATCACAC 1500 

TT ACAGCTG A TTCTAAAGCA ATCGAGTTGT TTGCTGCTGA AGCTGAATAA TCTAAGGAGG 1560 

AAATATCGTG GGAATTCGTG TTTATAAACC AACAACAAAC GGTCCCCGTA ATATGACTTC 1620 

TTTGGATTTC GCTGAAATCA CAACAAGCAC TCCTGAAAAA TCATTGCTTC TTGCATTGAA 1680 

GAGCAAGGCT GGTCGTAACA ACAACGGTCG TATCACAGTT CGTCACCAAG GTGGTGGACA 174 0 

CAAACGTTTC TACCGTTTGG TTGACTTCAA ACGTAATAAA GACAACGTTG AAGCAGTTGT 1800 

TAAAACAATC GAGTACGATC CAAACCGTTC TGCAAACATC G CTCTTGT AC ACTACACTGA 1860 

CGGTGTGAAA GCATACATCA TCGCTCCAAA AGGTCTTGAA GTAGGTCAAC GTATCGTTTC 1920 

AGGTCCAGAA GCAGATATCA AAGTCGGAAA CGCTCTTCCA CTTGCTAACA TCCCAGTTGC 1980 

T ACTTTG ATT CACAACATCG AGTTGAAACC AGGTCGTGGT GGTGAATTGG TACGTGCTGC 204 0 

TGGTGCATCT CCTCAAGTAT TGGGTTCTGA AGGTAAATAT GTTCTTGTTC GTCTTCAATC 2100 

AGGTGAAGTT CGTATGATTC TTGGAACTTG CCGTGCTACA GTTGGTGTTG TCGGAAACGA 2160 

ACAACATCGA CTTGTAAACC TTGGTAAAGC AGGACGTAGC CGTTGGAAAG GTATCCGCCC 222 0 

AACAGTTCGT GGTTCTGTAA TGAACCCTAA CGATCACCCA CACGGTGGTG GTGAAGGTAA 2 2 80 

AGCACCAGTT GGTCGTAAAG CACCATCTAC TCCATGGGGC AAACCTGCTC TTGGTCTTAA 2 340 

AACTCGTAAC AAGAAAGCGA AATCTGACAA ACTTATCGTT CGTCGTCGCA ACGAGAAATA 2400 

ATATTAAACT AGTCGCTTAA GCAACTAGTA AATCCGCCAG CTCGGTAGCG CTCCATAGGA 2 4 60 

GTGCAAGCCG CTGTCGTACA ACATTTAAAG GAGAAAATAT AAAAATGGGA CGCAGTCTTA 2520 

AAAAAGGACC TTTCGTCGAT GAGCATTTGA TGAAAAAAGT TCAACCTCAA GCTAACGACG 2580 

AAAAGAAAAA AGTTATTAAA ACTTGGTCAC CTCGTTCAAC GATCTTCCCA AGTTTCATTG 2 640 

GTTACACTAT TGCAGTTTAT GACGGACGTA AACACCTACC TGTTTACATC CAAGAAGACA 2 7 00 

TGGTAGGCCA CAAACTTGGT GAATTTGCAC CAACTCGTAC TTACAAAGGT CACGCTGCAC 27 60 

ACGACAAGAA AACACGTAGA AAATAACGAG AACATAAATG CCAGAAATTA CTTCAGCTAA 2 8 20 

AGCAATGGCT CGTACAGTAC GTGTTTCACC TCC7AAATCA CGTCTTGTTC TTGATAACAT 28 80 

CCGTGGTAAA AGCGTAGCCG ATGCAATCGC AATCTTGACA TTCACTCCAA ACAAAGCTGC 294 0 

TGAAATCATC TTGAAAGTTT TGAACTCAGC TGTAGCTAAC GCTGAAAACA ACTTTGGTTT 3 000 

GGATAAAGCT AACTTGGTAG TATCTGAAGC ATTCGCAAAC GAAGGACCAA CTATGAAACG 3 0 60 

TTTCCGTCCA CGTGCGAAAG GTTCAGCTTC ACCAATCAAC AAACGT AC AG CTCACATCAC 3120 

TGTAGCTGTT G C AG AAAAAT AAGGAGGTAA AATCGTGGGT CAAAAAGTAC ATCCAATTGG 3180 
TATGCGTGTC GGCATCATCC GTCATTGCGA TGCCAAATGG TATGCTGAAA AAGAATACGC " 3 2 40 
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GGATTACCTT CATCAAGATC TTGCAATCCG TAAATTCGTT CAAAAAGAAC TTGCTGACGC 3 3 00 

AGCAGTTTCA ACTATTGAAA TCGAACGCGC AGTAAACAAA GTTAACGTTT CACTTCACAC 3 3 SO 

TGCTAAACCA GGTATGGTTA TCGGTAAAGG TGGTGCTAAC GTTGATGCaC TCCCTGCAAA 3 4 20 

ACTTAACAAA TTGACTGGAA AACAAGTACA CAT C AAC ATC ATCGAAATCA AACAACCTGA 3 4 80 

TTTGGATGCT CACCTTGTAG GTGAAGGAAT TGCTCGTCAA TTGGAGCAAC GTCTTGCT7T 3 540 

CCGTCGTGCA CAAAAACAAG CAATCCAACG TGCAATGCGT GCTGGAGCTA AAGGAATCAA 3 600 

AACTCAAGTA TCAGGTCGTT TGAACGGTGC AGATATCGCC CGTGCTGAAG GAT ACTCTG A 3 6 60 

AGGAACTGTT CCGCTTCACA CACTTCGTGC AGATATCGAT TACGCTTGGG AAGAAGCAGA 3 720 

TACTACATAC GGTAAACTTG GTGTTAAAGT ATGGATCTAC CGTGGTGAAG TTCTTCCAGC 3 7 30 

TCGTAAAAAC ACTAAAGGAG GTAAATAACC AATGTTAGTA CCTAAACGTG TTAAACACCG 3 8 40 

TCGTGAGTTC CGTCGAAAAA TGCGCGGTGA AGC AAAAGGT GGAAAAGAAG TAGCATTCGG 3 900 

TGAATACGGT CTTCAAGCTA CAACTAGCCA CTGG ATC ACT AACCGCCAAA TCGAAGCTCC 3 9 60 

TCGTATCGCC ATGACTCGTT ACATGAAACG TGGTGGTAAA GTTTGGATTA AAATCTTC CC 4 020 

AC A C AAATC A TACACTGCTA AAGCTATCGG TGTGCGTATG GGATCTGGTA AAGGGGCACC 4080 

TGAAGGTTGG GTAGCACCAG TTAAACGTGG TAAAGTGATG TTCGAAATCC CTCCTGTATC 414 0 

TGAAGAGATT GCACGTGAAG CGCTTCGACT TGCTAGCCAC AAATTGCCAG TTAAATGTAA 4 200 

ATTCGTAAAA CGTGAAGCAG AATAAGGAGA AGGCATGAAA CTTAATGAAG TAAAAGAATT 42 60 

TGTTAAAGAA CTTCGTGGTC TTTCTCAAGA AGAACTCCCC AAGCGCGAAA ACGAATTGAA 4 3 20 

AAAAGAATTG TTTGAACTTC GTTTCCAAGC TGCTACTGCT CAATTGGAAC AAACAGCTCG 4 3 80 

CTTCAAAGAA GTTAAAAAAC AAATCGCTCG CAT C AAAAC A GTTCAATCTG AAGCCAAATA 44 4 0 

ATAGACTAGG GAAGGAGAAA TTTCAATGGA ACGCAATAAT CGTAAAGTTC TTGTTGGACG 4 500 

TCTTGTATCT GACAAAATGG ACAAGACAAT CACAGTTCTA GTTGAAACAA AACGTAACCA 4 560 

CCCACTCTAT GGTAAACGTA TTAACTACTC TAAAAAATAC AAAGCTCATG ATGAAAACAA 4 6 20 

TGTTGCCAAA GAAGGCGATA TCGTACGTAT CATGGAAACT CGCCCCCTTT CAGCTACAAA 4 680 

ACCTTTCCGT CTTGTAGAAG TTGTTGAAGA AGCGGTCATC ATCTAATCAA ACCTGAAAGG 4 74 0 

AGAAAACTGA AATGATTCAA ACAGAAACTC GTTTGAAAGT CGCAGACAAC AGCGGTGCTC 4 800 

GCGAAATCTT GACTATCAAA GTTCTTGGTG GTTCAGGACG TAAATTTGCA AACATCGGTG 4 8 60 

ATGTTATCGT GGCATCTGTA AAACAAGCTA CTCCTGGTGG TCCGGTTAAA AAAGGTGACG 4 9 20 

TTGTTAAAGC AGTTATCCTT CGTACTAAAT CAGGTGCTCG TCGTGCTGAT GGTTCATACA 4 9 80 
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TCAAATTTGA CGAAAACGCA GCAGTTATCA TCCGTGAAGA CAAAACTCCT CGCGGAACAC 504 0 

GTATCTTTGG CCCAGTTGCA CGTGAATTGC GTGAAGGTGG CTTCATGAAG ATCGTGTCAC 5100 

TTGCTCCAGA AGTACTTTAA TTTTTAGGAA CAAACTAGTC CCCTAGCTTC AAGCTAGGGT 5160 

GCCCTTATGG GCGTAAGAAA AATCAAGGAG AAACCTAATG TTTGTAAAAA AAGGCGACAA 52 2 0 

AGTTCGCGTA ATCCCTCGTA AAG AT AAGGG AACAGAAGCT GTTGTCCTTA CTGCCCTTCC 5 2 80 

AAAAGTAAAC AAAGTTATCG TTGAAGGTGT TAACATTGTT AAGAAACACC AACGTCCAAC 5340 

TAACG AG CTT CCTCAAGGTG GTATCATCGA GAAAGAAGCA GCTATCCACG T AT C AAACGT 5 400 

TCAAGTTTTG GACAAAAATG GTGTAGCTGG TCGTGTTGGA TACAAATTTG TAGACGGTAA 54 60 

AAAAGTTCCC TACAACAAAA AATCAGGCGA AGTGCTT G AT TAATCACGAA GGAAAGGAGA 5 5 20 

AGTATAATGG C AAATC GTTT AAAAGAAAAA TATCTTAATG AAGTAGTTCC TGCTTTGACA 5 5 80 

GAACAATTCA ACTACTCATC AGTGATGGCT GTGCCTAAAG T AG AT AAG AT TGTTTTGAAC 5 64 0 

ATGGGTGTTG GTGAAGCTGT A TCAAAC GCT AAAAGCCTT G AAAAAGCTGC TGAAGAATTG 57 00 

GCACTTATCT CAGGTCAAAA ACCACTTATC ACTAAAGCTA AAAAATCAAT CGCCGGCTTC 5 7 60 

CGTCTTCGTG AAGCTGTTGC GATCGGTGCA AAAGTTACCC TTCCTGGTGA ACGTATGTAC 58 20 

GAATTCTTGG AT AAATTGG T ATCAGTTTCA CTTCCACGTG TACGTGACTT CCACGGTGTC 5880 

CCAACAAAAT CATTTGATGG AC GCGGG AAC TACACACTTG GTGTGAAAGA ACAATTAATC 5940 

TTCCCAGAAA TCAACTTCCA TGACGTTGAC AAAACTCGTG GTCTTGACAT CGTTATCGTA 6000 

ACAACTGCTA ACACTGACGA AGAGTCACGT GCATTGCTTA C AGGC CTTGG AATGCCTTTT 60 60 

GCAAAATAAT ATAGGAGGTA AATCTAATGG CTAAAAAATC AATGGTACCT AG AG AGG CT A 6120 

AACGCCAAAA AATTGTTGAC CGTTATGCTG AAAAACCTGC TCCATTAAAG GCGGCAGGGG 6180 

ACTACGAAGG TTTATCTAAA TTACCTCGCA ACGCCTCACC GACTCGTTTA CATAATCGTT 624 0 

GTAGGGTTAC GGGGCGCCCA CATTCAGTTT ACCGCAAATT TGGTCTGAGT CCTATCGCTT 6 300 

TTCGCGAACT TGCGCATAAA GGTCAAATTC CTGGTGTAAC AAAAGCATCT TGGTAATTTA 6 3 60 

AGATATCAAG AGCGTCAAAA CTCCAAGTAA AAATAGGAAA CTTGACGAAG AAACTAAAGT 642 0 

TTCTAGGAAA GTTTATCTTT TTCACACAGA GTTTACCCCC GGTTCAATTG GGCTTGCCAA 64 80 

TTTGAACACG AGCTACAGCT TTGGCAAAAA AGACCAATTT GCTTTGGAGC ATTCCTTCTG 6540 

CATT AAATTG TCTATTTTTG CTCCTGCTGT TACGCTCTTT GTATCATGTA TTAACTAGCA 6600 

AGTGCAACTT GCAAACTACT AGTAAGAGGA GAAAAACAAA ATGGTTATGA CTGACCCAAT 666 0 

CGCAGACTTC CTAACTCGTA TTCGTAATGC TAACCAACCT AAACACGAAG TACTTGAAGT 6 72 0 

ACCTGCATCA AACATCAAAA AAGGGATTGC . TGAAATCCTT AAACGCGAAG GTTTTGTAAA 6730 
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AAACCTTGAA ATCATTGAAG ATGACAAACA AGGCGTCATC CGTGTATTTC TTAAATACGG 6 8 40 

ACCAAATGGT G AG AAAG TT A TCACTAACTT GAAACGTGTT TCTAAACCAG GACTTCGTGT 6 900 

CTACAAAAAA CGTGAAGACC TTCCAAAAGT- TCTTAACGGA CTTGGAATTG CCATCCTTTC 69 60 

AACTTCTGAA GGTTTGCTTA C T GAT AAAG A AGCACGCCAA AAGAATGTTG GTGGTCAGGT 7020 

TATCGCTTAC GTTTGGTAAA ATCAAGATAC AAAGCTCCTA AAGAACAAAG CAAAATTAGG 7080 

AAGTTGGAGA AG TTTGTTT A CAAACAAGCC AACTTATCTA TTTTGCACAG TTCTTAGAGC 7140 

CTGTTCAGTT CAGCTCTTCA ACTAAATAAG TATCTGAACC CCCTGAAAAC TCGCCGTTCT 72 00 

GCCCTGACAA TTTAACAGGA GAAAATAAAC ATGTCACGTA TTGGTAATAA AGTTATCGTG 7 2 60 

TTCCCTGCTG GTGTTGAACT CGCTAACAAT GACAACGTTG TAACTGTAAA AGG ATCT AAA 7 320 

GGAGAACTTA CTCGTGAGTT CTCAAAAGAT ATTGAAATCC GTGTGGAAGG TACTGAAATA 73 80 

ACTCTTCACC GTCCAAACGA TTCAAAAGAA ATGAAAAC? A TCCACGGAAC TACTCGTGCC 7 44 0 

CTTTTGAACA AC ATGG TTGT TGGTGTATCA CAAGCATTCA AGAAAGAACT TGAAATGCGT 7 500 

GGGGTTGGTT ACCGTGCACA GCTTCAAGGA TCTAAACTTG TTTTGGCTCT TGGTAAATCT 7 560 

CATCCAGACG AAGTTCAAGC TCCAGAAGGA ATTACTTTTG AACTTCCAAA CCCAACAACA 762 0 

ATCGTTGTTA GCGGAATTTC AAAAGAAGTA GTTGGTCAAA CACCTCCTTA CGTACGTAGC 7 680 

CTTCGTTCAC CAGAACCATA TAAAGGTAAA GCTATCCGTT ACGTTGGTGA ATTCCTTCGC 7 74 0 

CGTAAAGAAG GTAAAACAGG TAAATAATGT TGACTCCTTC ATCATCAACC ACCAACCTAT 7 800 

TTTCCAACTT TGTGCATAGC ACACGATTTA AAACT AAAG A GGTGAAAACT GTGATTTCAA 7 860 

AACCAGATAA AAACAAACTC CGCCAAAAAC GCCACCGTCG CCTTCCCCCA AAACTCTCTG 7 92 0 

GAACTCCTGA TCGCCCACGT TT G AACGT AT TCCCTTCTAA TACAGGCATC TACGCTCAAG 7980 

TCATTGATGA CGTAGCCGGT GTAACGCTCG CAAGTGCTTC AACTCTTGAT AAAGAAGTTT 804 0 

CAAAAGGAAC TAAAACTGAA CAAGCCGTTG CTGTCGGTAA ACTCGTTGCA GAACCTGC^A 8100 

ACCCTAAAGG TATTTCAGAA GTGGTGTTCG ACCGCGGTGG ATATCTATAT CACGGACCTG 8160 

TGAAAGCTTT GGCTGATGCA GCTCGTGAAA ACGGATTGAA ATTCTAATAG GAGGACACTA 82 2 0 

GAAAATGGCA TTTAAAGACA AT GC AG TTG A ATTAGAAGAA CGCGTAGTTG CTGTCAACCG 8280 

TGTTACAAAA GTTGTTAAAG GTGGACGTCG TCTTCGTTTC GCACCTCTTG TTGTTGTTGG 834 0 

TGACCACAAT GGTCGCGTAG GATTTGGTAC TGGTAAACCT CAAGAAGTTC CAGAAGCAAT 84 00 

CCGTAAAGCA GTAGATGATG CTAAGAAAAA CTTGATCGAA GTTCCTATGG TTGGAACAAC 84 60 

AATCCCACAC GAAGTTCTTT CAGAATTCGG TGCAGCTAAA GTATTGTTGA AACCTGCTGT 35 2 0 
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AGAAGGTTCT 


GGAGTTGCCG 


CTGGTGGTGC 


AGTTCGTGCC 


GTTGTGGAAT 


TGGCAGGTGT 


8580 


GGCAGATATT 


ACATCTAAAT 


CACTTGGTTC 


TAACACTCCA 


ATCAACATTG 


TTCGTGCAAC 


8640 


TGTTGAAGGT 


TTGAAACAAT 


TGAAACGCGC 


TGAAGAAATT 


GCTCCCCTTC 


GTGGTATTTC 


8700 


AGTTTCTGAT 


TTGGCATAAG 


AAAGGCGATA 


AAATGGCTCA 


AATTAAAATT 


ACTTTGACTA 


8760 


AGTCTCCAAT 


CGGACGCATT 


CCATCACAAC 


GTAAAACTGT 


TGTAGCACTT 


GCACTTGGCA 


8820 


AATTGAACAG 


CTCTGTTATT 


AAAGAAGATA 


ACGCTGCTAT 


CCGTGGTATG 


ATCACAGCAG 


8830 


TATCTCACTT 


AGTAACAGTT 


GAAGAAGTAA 


ACTAATGAaG 


TTTTAGGGGA 


TGTGCACTGT 


8940 


ACCATCCCCT 


AAAACTAGAT 


ATAGTCATCT 


ATGATGACAT 


CGTATAGGCG 


AGTTGATGGG 


9000 


GGAGACAACC 


TTTTCTCCCT 


TATCGGCCCT 


AGCATTTTAC 


AAAAG AGG AG 


AAAATAAAAA 


9060 


TGAAACTTCA 


TGAATTGAAA 


CCTGCAGAAG 


G TT CTCGT AA 


AGTACGTAAC 


CGCGTTGGTC 


9120 


GTGGTACTTC 


ATCAGGTAAC 


GGTAAAACAT 


CTGCTCGTCG 


TCAAAAAGGT 


CAAAAAGCTC 


9180 


GTAGCGGTGG 


CGGAGTTCGC 


CTTGGTTTTG 


AAGGTGGACA 


AACTCCATTG 


TTCCGTCGTC 


9240 


TTCCAAAACG 


TGGATTCACT 


AACATCAACG 


CTAAAGAATA 


CCCAATTCTG 


AACCTTGACC 


9300 


AATTGAACGT 


CTTTGAAGAT 


GGTGCTGAAG 


TAACTCCAGT 


TG TTCTT AT C 


GAAGCAGGAA 


9360 


TTGTTAAAGC 


TGAAAAGTCA 


GGTATTAAAA 


TTCTTGGTAA 


CGGTGAGTTC 


ACTAAGAAAT 


9420 


TGACTGTGAA 


AGCAGCTAAA 


TTCTCTAAAT 


CAGCTGAAGA 


AGCT ATC ACT 


GCTAAAGGTG 


9480 


GTTC ACT AG A 


AGTCATCTAA 


GAGAGGTGAC 


CTATGTTTTT 


TAAATTATTA 


AGAGAAGCTC 


9540 


TTAAAGTCAA 


CCAGCTTCGA 


TCAAAAATTT 


T ATTT AC AAT 


TTTTATCGTT 


TTGGTCTTTC 


9600 


GTATCGGAAC 


TAGCATTACA 


GTTCCTGGTG 


TGAATGCCAA 


TAGCTTGAAT 


G CTTT AAGT C 


9660 


GATTATCCTT 


CTTAAACATG 


TTGAGCTTGG 


TGTCGGGGAA 


TCCCCTAAAA 


AACTTTTCGA 


9720 


TTTTTGCCCT 


AGGACTTAGT 


CCCTATATCA 


CCCCTTCTAT 


TGTTGTCCAA 


CTCTTGCAAA 


9730 


TGGATATTTT 


ACCCAAGTTT 


GTAGAGTGGG 


GTAAACAAGG 


GGAAGTAGGT 


CGAAGAAAAT 


9840 


TGAATCAAGC 


TACTCGTTAT 


ATTGCTCTAG 


TTCTCGCTTT 


TGTGCAATCT 


ATCGGGATTA 


9900 


CACCTGCTT? 


TAATACCTTG 


GCTGCACCTC 


AATTGATTAA 


AACTGCTTTA 


ACTCCACAAG 


9960 


TTTTTCTGAC 


GATTGGTATC 


AT CTT AAC AG 


CTGGTAGTAT 


GATTGTCACT 


TGGTTGGGTG 


10020 


AGCAAATTAC 


AGATAAGGGA 


TACGGAAACG 


CTCTTTCCAT 


GATTATCTTT 


GCCGGGATTG 


10080 


TTTCCTCAAT 


TCCAGAGATG 


ATTCAGGGCA 


TCTATGTGGA 


CTACTTTGTG 


AACGTCCCAA 


10140 


GTAGCCGTAT 


C ACT T C AT C T 


ATCATTTTCG 


TAATCATTTT 


G ATT ATT ACT 


GTATTGTTGA 


10200 


TTATTTACTT 


TACAACTTAT 


GTTC AAC AAG 


; CAGAATACAA 


, AATTCCAATC 


C AAT AT ACT A 


10260 


AGGTTGCACA 


, AGGTCCTCCA 


, TCTAGCTCTT 


1 ACCTTCCGTT 


1 . AAAAGTAAAC 


CCTGCTGGAG 


10320 
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TTATCCCTGT 


TATCTTTGCC 


AGTT C G ATTA 


CTGCAGCcTG 


PPPPTATTPT 


1 Lnu 1 I I I I Kj 


ixj J oU 


AGTGCCACAG 


GTCATGATTG 


GGCTTGGGTA 


AGGGTAGCAC 


A APAP ATPTT 
AAVjnun Lull 




10440 


TCTCCAACTG 


GTATTGCCAT 


GTATGCTTTG 


TTGATTATTr 


l<»i 1 lnLnl X 


l_ -l iLirtl AClj 


105 00 


TTTGTACAGA 


TTAATCCTGA 


AAAAGCAGCA 


GAGA 5c PPT AP 


A A A A P L\r"T , r*C 


io C C TATA Tc 


10560 


CATGGAGTTC 


GTCCTGGTAA 


AGGTAfAGAA 


GAATATATGT 1 


P*T*> AAA i^TTPT 


TCGTCGTCTT 


1062 0 


GCAACTGTTG 


GTTCCCTCTT 


CCTTGGTGTG 


1TTTPPATTT 




ACC 1 AAAbrt 1 


10 680 


GTATTTGGTC 


TTTCTGATGT 




PP.TP. P. A A P A A 


rTrTr^r A TP 




10 7 4 0 


ACAGGTATCG 


AAGGAATCAA 


GCAATTGGAA 


GGTTArCTA" 


TP A iarr^ia 




10800 


TTCATGGACA 


GAACAGAATA 


AAAGTATTT A 


PTf!l ATP A r*T 




Q»G AGTG G AGG 


108 60 


TTTAAACTCT 


GACATTTGTA 


AGAGTTGGAT 


CTCCCCTCTT 


PT A TTTTP'^T 




10920 


GGTGAAAAGA 


CTTTTTCCTT 




ATA A A AT A A r* 
/*» l n/w\ 1 /vrtvj 




C_ ATG AATCTT 


10980 


TTGATTATGG 


GCTTACCTGG 


TGCAGGTAAG 




rimi a a at* 


C (j TACjAA C AA 


11040 


TTCCATGTTG 


CACATATCTC 


AAPAGGTGAT 




L. 1 >o<-. AA I COC 


AAATCAAACT 


11100 


GAAATGGGTG 


TTCTTGCTAA 


GTCATATATT 




AA I I Cjt* I l CC 


TO AC G AAGTT 


11160 


ACAAATGGAA 


TCGTAAAAGA 


ACGCGTTTCA 


PAAP ATP AT A 


TT A A If* A A Af* 


AOG A ilC 1 1 A 


11220 


TTGGATGGTT 


ACCCACGTAC 


AATTGAACAA 


PPTPATPPPT 


tpp 


AT iGOCTGAA 


11280 


CTTCGCATTG 


AACTAGAAGG 


TGTTATCAAT 


ATTPAAPTPA 




LL IT I 1 CjCjAA 


11340 


CGTTTGAGTG 


GGCCTAT'CAT 




irTTA' 1 AAA 


C . I » CCACAA 


GGTCTTTAAC 


1 140C 


CCACCAGTTG 


ACTATAAAfiA 


rtVjrvrMj>\ 1 1 


t a fr" a srrTr 
1 ALv.nnt.u 1 vj 


A A /** A T*r* A »T» A A 

AAO A I G AT AA 


G CCTG AG AC A 


11460 


CTAAAACGTC 


GTTTCGATCT 


TAATATTGCT 


CAACCAGAAC 


CAATCATTGC 


7CACTACCCT 


11520 


GCCAAAGGTT 


TGGTTCATGA 


CATCGAAGGT 


AATCAAGATA 


TCAATCATGT 


CTTCTCAGAT 


11580 


ATTGAAAAAG 


TATTGACAAA 


TTTGAAATAA 


AGCGTTTTTC 


ACACTTGCAA 


AAATCCGCTA 


11640 


CAAATGTTAT 


ACTGAGATAG 


TCTGACTTAT 


AATTGTTGTC 


TCTGTGTCTA 


GAGGCATCGA 


11700 


ATCGAAATTT 


ATGGAGGTGC 


TTTTGCGTGG 


CAAAAGACGA 


TGTGATTCAA 


GTTGAACCCA 


11760 


AAGTAGTTGA 


TACAATGCCG 


AATCCAATGT 


TTACGGTTGA 


ACTTGAAAAT 


GGACATCAGA 


■ 11820 


TTTTAGCAGG 


G 










11331 


(2) INFORMATION FOR SEQ ID NO: 66 











( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10726 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
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<D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CCCGGCATTT GAAAGCTATT CGTGAAGGAT TTATGATGGC AATGCCTTTG ATTTTAGTCG 60 

GCTCTTTATT TCTTATTCTA ATCAGTTGGC CTCAAGAGGC TTTTACAAAT TGGCTGAATA 12 0 

GTGTTGGATT GCTAAGTATC TTGACAACTA TGAATCAGTC AACAGTAGCG ATTATCTCCT 180 

TGGTCGCTTG TTTCGGTATT GCCTACAGGT TGTCGGAAGG ATATGGTACA GATGGTCCGT 2-10 

CGGCAGGGAT CATACCCTTA TCCAGTTTTG TATTGATGGC ACCTCCTTTT TCGAGTATGG 300 

TTTATGATAA AAATGGGGAG CAGGTCAAGC AGTTATTTCC CGGCGCAATA CCATTTTCTA 3 60 

GCCTGAATGC ATCTTCTTTG TTTATGGCGA TTACTATTCG ATTGGTTACA GC AGAGATTT 4 20 

ATCGTATGTT TATCCAGCGC GGAATTACGA TAAAAATGCC AAGTGGTGTC CCAGATGTAC 4 80 

TAAGTAAATC ATTTTCAGCT CTTTTATCTG GTTTTACTAC TTTTGTTTTG TGGGCTTTGG 540 

TCTTAAAAGG TCTTGAACCG GCAGGAGTTG CAGGAGGTCT CAACCCACTC CTAGGTGCAA 600 

TTGTTGGAAC ACCGCTTAAG TTAATTGCAG GAACGCTTCC AGGTATGATT CTATGTGTTA 660 

TTGTAAACTC ATTCTTTTGG TTCTGTGGAG TTAATGGGGG ACAAGTTTTA AATGCTTTTG 720 

TAGACCCAGT TTGGTTACAA TTT AC T AC AG AAAAC C AAG A AGCTGTGGCT GCAGGACAAA 7 80 

CACTCCAACA CATTATTACA TTACCGTTTA AACATTTATT TCTATT7ATT GGTGCCGGTG 34 0 

G AG CG ACT AT TGGTCTTGCG ATTTGTCTCT TCCTATTTAG TAAGACTCGT GCGAATAAAA 900 

CATTACGTAA GCTAGCTATT ATACCGTCTA TTTTTAATAT CAATACAGCT ATTCTATTTA 960 

CGTTTCCAAC AGTTTTAAAT CCGATTATGC TGATTCCCTT TATTGCTACT CCTACAATCA 10.: 0 

ATGCCTTGAT TACCTATGTA TCAATCGCTG TAGGATTAGT ACCCTATACA ACAGGTGTAA 1080 

TCCTTCCGTG GACAATGCCA CCGATTATAG GAGGCTTCCT TGCAACAGGG GCTAGTTGGC 114 0 

GAGGAGCTCT ATT AC AAGT T GTTTTGATTT TGGTTTCTGT AG C AATTT AT TATCCATTCT 12 00 

TCAAAATTGC AG AT AAACG C AATCTTCAAA AAGAAAAAGC TACTGTTGGA GGGAAATAAG 12 60 

ATGGTTATCA GAGTATTTGA TCAACAGAAA AAT ACTT AT T CTAGCTTTGC CTTAGAGGAA 13 20 

TTAAGTTACT ATATGAATCG GGTCTTTAAG ACTAACATAG AGCTTGTCGA GGAGAAGGAA 1380 

GCCCATATTT TTGTAGGATT AGTCAATAAA GAGGACAGAA AAGACCATGT TCTTATCTCA 1440 

TT AG AC AAGG GTAAGGGGAG AATTGAGTCT AATACAATTG TAGGTTTACT TATTGGAATT 1500 

TACCGAATGT TTCATGAATT TGGGGTTGTG TATACTAGAC CAGGGCGCAG ACATGACTTT 15 60 

GTTCCAGAGT TACGATTTGA AG ATTT TTT A GATAAACAGC TAT CT AT AG A TGAAACAGCC 1620 
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AGTTACTATC 


AT AGGGG ACT 


ATGTATAGAG 


GGAGCGGATT 


CATTTCAAAA 


TATACTAGAT 


1690 


TTCATTGATT 


GGCTACCTAA 


GATTGGGATG 


AACACTTTTT 


TCATCCAGTT 


TGAAAATCCT 


1740 


TACTCTTTTT 


TGAAACGTTG 


GTATGAACAT 


GAATTTAATC 


CATATCTAAA 


TAAAGAACAA 


1800 


TTTTC AAAT G 


AATTAGTACA 


AGAATTGAGT 


GATAGGTTGG 


ATAAAGAATT 


GCAAAAAAGA 


1860 


GGTCTTATTC 


ATCATCGTGT 


TCGTCATGGA 


TGGACAGGTG 


AAGTTTTAGG 


TTACTCTTCA 


1920 


AAATTTGGCT 


GGGAATCAGG 


TCTTAGTATT 


TCAGAGGAGA 


AG AAACCCT A 


TGTCGCTGAA 


1980 


ATAAACGGGA 


AACGAGAATT 


GTTTAATACG 


GCTCCGATTT 


TAACCAGCCT 


GGATTTTTCA 


204C 


AATCCAGATG 


TAGCTGATAA 


GATGGTAGAA 


ATTATCAAGG 


ATTATGCCAA 


GAAAAGACCT 


210C 


GATGTTAACT 


ACTTACATGT 


ATGGTTGTCG 


GATGCTCGTA 


ATAATATTTG 


TGAATGCGAA 


2160 


AACTGTAGAC 


AAGAATTGGT 


TTCGGATCAG 


TATATTCGTA 


TTCTCAATCA 


ATT GG AT AGG 


2220 


GCTTTAACGA 


GTGAGGGATT 


AGATACAAAG 


ATTTGTTTTC 


TGCTTTATCA 


TGAGTTGTTA 


2280 


TGGGCACCTC 


AGAAAGAAAA 


ATTAGATAAT 


CCTGAACCCT 


TTACCATGAT 


GTTTGCACCG 


2340 


ATTACAAGAA 


C ATT TG AAAT 


GAGTTATGCA 


GATGTAGATT 


TTCACAATTC 


CATACCTACG 


2400 


CCTAAACCTT 


ATATCCCTAA 


TAAAATTATA 


CTTCCGAATT 


CTCTTGAGGA 


AAATTTATCT 


2460 


TATCTTTTTG 


AGTGGCAAAA 


AGCATTTAAA 


GGAGATAGTT 


TCGTATATGA 


CTATCCTTTA 


2520 


GGGCCTGCTC 


ATTATGGCGA 


TTTAGCCTAT 


ATG AAAATT A 


GTCAAACTAT 


TTACAGAGAT 


2580 


GTATCTTATC 


TTTCCAACCT 


ACATTTGAAC 


GGGTACATTT 


CGTGTCAAGA 


ATTACGTGCC 


2640 


GGATTCCCTC 


ATAATTTTCC 


TAATTATCTC 


ATGGGGGAAA 


TGCTCTGGAA 


GAAGACAAGA 


2700 


AGTTATGAAG 


AATTGATTGA 


AGAATACTTT 


TCTGCTTTGT 


ATGGGGAAAA 


TTGGCAGTCT 


2 7 60 


CTTCTTCAAT 


ATTTAOAAAA 


ATTATCC.YTT 




GTGATTATTT 


TAATGCAATT 


2820 


GGCAGCCGTC 


AAACTGATGT 


TTTAGCGAAT 


CATTATTATA 


TAGCTTACAA 


TCTAGCTGAT 


2880 


AATTTTTTAC 


CAATTATTGA 


GGAAAATATT 


TCT AAGTT AT 


TAAATAGTCA 


AAAGGATGAA 


2940 


TGGAAACAGC 


TCAGTTATCA 


TCGTGAATAT 


GTTGTTAAGA 


TGGCGAAGGC 


TTTATATCTT 


3000 


CAAGCAACTG 


GAAAAACAAG 


GCAAGCTCAA 


GATGAATGGA 


GAAATGTGTT 


GAATTATATC 


3060 


CGTGGGCACG 


AATTGCTATT 


TCAATCTAAT 


TTGGATGTTT 


ATCGTGTAAT 


TGAAGTAGCA 


3120 


AAAAATTACG 


CTGGTTTC C A 


CTTATAAATC 


ATAAGTATAG 


AAAATGAACT 


AAGGTATTCA 


3130 


GAGAAGATTG 


ATCCTAAATA 


TTATGAAATT 


TAAGGATTTT 


TAAGATATTT 


AGGGTCAACT 


3240 


TTCTATTTAT 


ATCGTAGCGA 


ACTCATTTTA 


ATAATGATGT 


GTAAAAGATG 


GATCAAGATT 


3300 


GAGGAGGAAG 


AAAGATGAAA 


TCAAAAGAAG 


AAAT AAAT AT 


GCTTGGTTTT 


ACAATTGTCG 


3 360 
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CTTACGCAGG 


AGATGCAAGG 


TCAGATTTGA 


TGGATGCTTT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


ATTTTG AAC A 


GGCAAGAGAA 


TTGGTTG ACT 


CTGCAAACGA 


CT C AAT AG TG 


TCTGCCCATC 


3480 


GAGAACAGAC 


TAATTTATTA 


GCGGAGGAGG 


CATATGGAGA 


TAATTTTGAA 


GTGAGCTTTA 


3540 


TTATGATTCA 


TGGTCAACAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGT AAAGT 


3600 


TTTTTATTGA 


TGAATATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACKTATTGGT 


TTGCAATGAG 


366C 


GATTAGTCAT 


GG AAAATTT A 


CAGGTTAAAG 


CCTTACCGAA 


GGAGTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GAGGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3790 


GGGATGTTTA 


TTTGCAAGAA 


AATAGTCCGT 


TCTTACCAGA 


TCCAGCTAGT 


G ATTTTT ATT 


3840 


ATCGTTACGA 


AGAGGATATA 


GCTTTGGCGG 


CAGAACATGG 


TTTGCAGGCT 


TTGCGTTTAT 


3900 


CTATTTCTTG 


GGTTCCTATA 


TTTCCTGATA 


TAGATGGGCA 


TGCTAATGTA 


TTAGCTGTTC 


3960 


ATTATTACCA 


TAGAGTTTTT 


CAGTCTTGCT 


TAAAACATAA 


TGTGATTCCG 


TTTGTTTCTT 


4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAATGT 


TAGAAACAGG 


GCATTGGTTC 


AACAGAGAGA 


4080 


ATATTGATCG 


TTTCATACGA 


TATGCTCGCT 


TTTGTTTCCA 


AGAATTTACA 


GAAGTCAAGC 


4140 


ATTGGTTTAC 


AATCAATGAA 


CTCATGTCTC 


TTGCTGC AG G 


TCAATATATA 


GGACGTCAGT 


4200 


TTCCTCCAAA 


TCATCATTTT 


CAATTATCTG 


AAGCAATTCA 


AGCGAATCAT 


AAT ATGTTGT 


4260 


TGGCGCATGC 


TCTTGCAGTC 


CTCGAATTTC 


ATCAATTACG 


GATTGAGGGA 


AAGGTAGGTT 


4320 


GTATTCATGC 


TTTAAAGCCA 


GGCTATCCTA 


TTGATGGGCA 


AAAACAAAAT 


ATTTTGGCAG 


4380 


CTAAACGGTA 


TGATGTTTAT 


AATAATAAAT 


TTCTATTAGA 


TGCAACTTTT 


TTGGGCT AC T 


4440 


ACAGTGAGGA 


CACGCTTTTT 


CACTTGAATC 


AAATATTGGA 


AGCTAATAAT 


TCTAGCTTTA 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA 


AGAGAGCTGC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


GGATGAATTA 


TTATCC77CA 


CAATTTATTC 


G TG AAT AC AA 


AGGTG AAAAT 


AGACAAGAAT 


45^0 


TTAATTCAAC 


AGGAATAAAA 


GGACAGTCTT 


CTTTTAAATT 


AAATGCTCTA 


GGTGAATTTC 


4680 


TAAAAAAACC 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TATTTATCCT 


CAAGGGTTAT 


4740 


TTGATATGTT 


GCTTCGTATC 


AAAGAAGAAT 


ATCCTCAACA 


TCCGGTCATT 


TATTTAACTG 


4800 


AAAATGGTAC 


■ AGCCCTTAAA 


GAAGTTAAGC 


CAGAGGGCGA 


G AAT GAT ATT 


ATTGATGACA 


4860 


GTAAGAGAAT 


CCGTTATATT 


GAGCAACATT 


T AC AC AAAGT 


TTTAGAGGCT 


C GAG AT AG AG 


4920 


GAGTCAATAT 


TCAAGGCTAT 


TTTATATGGT 


CTTTGCAAGA 


TCAATTTTCT 


TGGGCGAATG 


4980 


GCTACAATAA 


GCGATATGGT 


CTTTTCTTTG 


TTGATTATGA 


AACACAGAAG 


AG AT AT ATT A 


5040 


AGAAAAGTGC 


TCTTTGGGTA 


AAAGGGCTAA 


AACGGAATTA 


AGGTTAGCGA 


TTTGACTGAT 


5100 


GTTTAATATG 


TTTTAAATAT 


GAGGTTGAAT 


TTTTTATAGG 


AGGAGTTTTA 


TGGATAAGCT 


5160 
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AGTCCCTGCC ATTGAAAAGC AACAAGGGAA ATTTCAAAAA ATTTCTACTA ATAACTATAT 52 2 0 

GATGGCTATT AAAGATGGAT T C ATTGCT AC TATGCCTTTA ATTATGTTT? CAAGCTTTTT 5230 

GATGATTATT ATTATGATTC CTAAAAATTT CGGAGTAGAG TTACCGAGTC CAGCTATTGT 534 0 

CTGGATGAGA AAAGTGTATA TGTTAACCAT GGGAGTTTTG GGTATTATTG TTTCAGGGAC 5400 

TGTTGGAAAG TCATTAGTTG GAAATGTTAA CAGAAAAATG CCTCACGGAA AGGTAATAAA 54 60 

TGATATTTCT GCAATGTTGG CAGCCATATG TAGTTATCTG GTATTAACTG TAACGCTTGT 5520 

AGTTGATGAG AAGACGGGAT CTACAAGTTT GTCGACAAAC TATTTAGGAT CTCAAGGATT 5580 

GATAACTTCG TTTGTCAGTG CCTTTATTAC TCTAAATGTT TACCGATTCT GTATTAAGCG 564 0 

AGACATTACT ATTCATTTAC CTAAGGAAGT TCCTGGGGCT ATATCACAAG CTTTTAGAGA 5700 

TATTTTCCCT TTTTCTTTTG TTTTACTTAT TAGTGGTTTG TTAGATATTG TATCTCGGTT 57 60 

TAGTTTAGAT GTTCCTTTTG CCCAAGTATT TCAACAACTA TTGACTCCTA TTTTTAAGGG 58 20 

GGCAGAATCA TATCCTGCTA TGATGTTGAT TTGGTTTATG TGTGCTTTGC TTTGGTTTGT 5880 

TGGAATTCAT GGACCATCTA TTGTCTTACC TGCTGTTACA GCTTTGCAAC TGAGCAATAT 594 0 

GGAAGAGAAT GCTCAACTTC TTGCAAATGG GCAGTTCCCT TATCATTCTT TAACACCTAA 6000 

TTTCCCGAAT TATATCGCTG CTATTGGACG AACGGGGGCT ACCTTTGTTG TACCATTTAT 6060 

TTTGATTTTC TTTATGCGGT CTAAACAATT AAAATCGGTA CGTAAACCTA CAATTACTCC 6120 

TGTTTTATTT GCGGTAAATG AACCTCTTCT ATTTGGTATG CCTGTTATTT TGAATCCCTA 6180 

TCTTTTTGTC CCTTTTTTCA TGACTCCACC AGTGAATGTA TTTCTACCAA AGGTCTTTAT 6 24 0 

TGATTTCTTT GGAATGAATG G ATTTT AT A T CCAGTTACCT TGGACCTTTC CTGGTCCCTT 6 300 

GGGATTGTTA ATTCCAACCA ATTTT CAACT TATCTCCTTT GT ATTTT TAT CTTTGATTTT 6 3 60 

AGTTGTCGAC ATATTGATTT ATTTGCCATT CTGTAGAGCG TATGATAGAC AGTTACTCCT 64 20 

GAAAGAAGAT ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGATACAA GTGAAATAAT 64 80 

TCCTGGTGAG ATAGATGAAA TAAAAAGTAA GG AG TTG AAA CTACTGGTTC TTTGTGCAGG 6 540 

GTCTGGAACA AGTGCGCAAT TAGCCAATGC AATTAACGAG GGGGCTAACT TAACAGAGGT 6 600 

TAGAGTGATT GCG AATTC AG GAGCGTACGG AGCTCATTAT GATATTATGG GTGTTTATGA 6660 

TTTAATTATT CTGGCCCCAC AAGTTCGGAG TT ATT AT AG A GAGATGAACG TGGATGCAGA 67 20 

AAGATTAGGT ATTCAGATAG TTGCTACCAG AGGAATGGAA TATATTCATT TAACAAAGAG 67 80 

TCCAAGTAAA GCCTTACAAT TTGTATTGGA GCATTACCAA GCTGTGTAGT AAGTTTTTCC 6 840 

ATCTTTTATT TGAGTAAAGA TTTTGTTTAC AGATAGGCTT GGATTTAAAA ACGTTCCCCC 6900 
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TTTTTTAATA TAAGAATCCC TCTTTCACAA TTGTAAAAAG AGGGATTTTG TATTTTATCT 6960 

CTTAGACCAA GTTCTCTTCA TAAAGAGAAG GAGGATTGGG TAAATCTCCA AGCGCCCTGC 702 0 

AATCATTGCA AAGGATAGGA GAATTTTTGA GATGGGACTA AAGATTGAGA AACTAGAAGT 7080 

GGTTCCTAGA ATAGGCCCGA TATTATTGAA ACAGCTAAAG ACAGCGCTGG TCACGACCAG 7140 

AAAATCATTG CTATCTAGGC TG AC AAT AAA GATAAGCGCT AGCAAAATCA TAGCATAGAT 7 200 

GACAAAGTAC TTGAGAATCT TATGCTGGGT ATCTTTGTCA ATCACCGTTT TATTAACATG 7 2 60 

GAGGGTCAAA ACACGGTGGG GCGATAGGAT TGACAAAATT TGCTTTTTGG CAATTTTTGA 73 20 

AAGGATGAGG CCTCGAATAA TCTTGAGTCC ACCTGCAGTT GATCCAGCAG AGCCACCGAT 7 3 90 

TGCCATGAGG AAAAGGAGGA TAAACTGGGA GAAGAGGGGC CAGTTGGTAA TATCTCCATA 7 44 0 

TCCAAAACCA GTTGTTGTAA TGATGTTGGA AACCTGGAAG AAGGTC ATTT CAAAGCTCTT 7500 

TGAAAACCCT GGGTAGAGGT AGAGGGTGTT GAGGCTAATC AAGCCTGTAG AAACCAGTAC 7560 

AATGACCAAG TAAGCCCTAA GCTCTTCATC TCCAAAGAAG GCCTTGATGC GACGGAGCAT 7 620 

GAGGTAGTAG T AG AGGTTG A AATTTACTCC AAAAACCAGA ACTCCGATAC TGACCAGATA 7680 

GGTAATCAGT GAGCTGCCAT AGTGGGCAAT TCCCTCGTTA TAGACGGTAA AGCCTCCACT 77 4 0 

TCCCGCTGTC CCCATAGCAA TAACAAAACT ATCGTAGAGA GGCATACCGG CT A GAT AAT A 7 800 

GATGATGACA AAG AGGGAG A AGAGAGCTAC ATAAAGGACA TAGAGAATCT GGGCAGTGTT 7 860 

TTTTAGTTTG GATACAACCT TGCCAAAAAC AGGACCTGGA ACCTCAGCCT TCATCACCTC 7 9 20 

TAGGTGGCTA TTTTTGGCAT TGTCCATAAT AGCAAGTGCA AAAAC AAGC A CTCCCATCCC 79 90 

TCCAATCAAG TGGGT AAAAC TTCGCCAGAA GAGGAGGGAA CGGCTGAGAA CCGAAACGTC 3040 

GTTCAAAATA CTTGCTCCAG TAGTTGTAAA TCCAGAACTA ATTTCAAAAA AGGCATCAAT 3100 

AAGGCTGGGG A7TTCCCCAG AAAAGACAAA GGGGAGACCA CCAAACAAAG AC C AAAGG AT 9150 

CCAACAGAGG GCAACGATCA AGACTCCCTC CTTGGCATAA ATCCGTTGAT TTTTTCCCTT 5220 

CTGTAAACTC CCTGAACCGC CTAACAATAC GAGAATCCCT ATGGTCGAAA AGAGGGCTGT 92 80 

AAACACTTGC CTCGATTCAC GGTAATAGAC AGCAATCGCA ACAGGAACCA AAAGAAGAAC 8340 

AGCTTCAATC AAAAGTAATT TTGAAAGGAG GTAACGAATC ATACTTTTAT TCATTTCTTA 84 CO 

CCTCGCGATC AAGTCATAAA TCTTGGTGAT GTTTGGCAAC AAGGTTGTTA CTAGGAGCTT 84 60 

GTCTCCAACT TCCAACATAT CCTCCCCAGT TGGGAAAATA GTCTTGCCCT TTCGAATAAT 8520 

CGCTGCAATA AGAACCCCTT TTTTCAATTT CAGTTGAGAA AGAGGTTTGG CAGTCATTTT 8 580 

ATTGGCTTCC TTGATATGGA ATTGCAGGGT TTCGATTTGG CCATTGGCTA GATGGTGCAT 864 0 

AGCTTGAAGG TCTGAATACT GGGCATTAAC TCGACCACGA ATAAAGTGCA TAATCGTATC 37,00 
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TACAGCGATG 


CTTTTAGGTG 


TGATGATACT 


TGAAAAATCA 


GGCGCATTGA 


TAATCTCGAG 


8760 


GAGACTGGTA 


CGATTGACCT 


TAGTAATATT 


TTTCTGTACA 


CCTACCCTGT 


CAAGGAACAT 


9820 


AGATGTAATC 


AGATTTTCCT 


CATCGACTCC 


TGTTAGAGTC 


GCAACGGCAT 


CATAGTGTTG 


8880 


AGCACTTTCT 


TCCAGCAGGA 


TATCTTTTGC 


GGTTCCATCT 


CCTTCAACGA 


TGTAGAGATT 


8940 


TGGGAATTTC 


TCGCTAAAGA 


AGCTGGCGAT 


TTCAGGATTG 


ATTTCAATGA 


CTTTTGTATC 


9000 


GATACGACTA 


TCTTTGAGAA 


T AC CAAGT AG 


ATAATAGGCA 


ATTCT AC CTG 


CCCCAACGAT 


9060 


GAGAAGGCTC 


TTCACGGCGC 


GTGATTTAAA 


ATAATTATGG 


AAGAGTATCA 


TATCGACACG 


9120 


GTTACCAGTG 


ACAAAGATTC 


TATCTTTATC 


CTGT AC AGTC 


ATGTCACCGC 


TTGGAATGAT 


9180 


AATTTGATGA 


TCCCTCTCTA 


TCGCACAGAC 


AATGACATTA 


CCAAATTTTT 


TACGAAAATC 


9240 


AGAAATGGGC 


ATTTGGCAAA 


GACCGCTGGT 


GGACTTGACG 


ACAAATTCCA 


TCAGGCTAAC 


9300 


GCGTCCACCA 


GCAAAGCGTT 


C G AC AG AC AG 


GGCGTTGGGG 


AAGTC AATG A 


TATTCGCGAT 


9360 


AGCGCGGGCA 


GCCAAGAGCT 


CAGGATTAAC 


GATAAGAGAA 


AAACCGAGAA 


TATTCTTTTC 


9420 


CTTGAAATAA 


GAGTTAGAAT 


ATTCAGGGTT 


CCGCACCCGA 


ACCATAGTTT 


CTTTAGCTCC 


9480 


CATTTTCTTG 


GCTAGAACTG 


CTGCAATCAT 


CTTGACTTCA 


TCGTGCTCAG 


TCAGGGCGAT 


9540 


AAAGATATCA 


CAATCTTGGA 


CGCTGGCTTG 


CTCAAGAATG 


GCAAAATCGG 


CCCCGTTACC 


9600 


AAGGATACCA 


ATGATATCAA 


AGCGACTGAC 


AATATGATTG 


AGAACAGCTT 


CGTCTTGCTC 


9660 


AATCAGCAAA 


ACATCATGCT 


TTTCTGCAAC 


CAAGGAGCGA 


CAGAGGGCAA 


AACCAACTTT 


9720 


TCCCCCTCCG 


ACAAGGATAA 


TTTTCATAAT 


AAAACCTACT 


TTTTCATCAT 


GTAACTATCA 


9780 


TACCCTTTTT 


CAAGAAAAAA 


TGCACCTACT 


AGCTAATAAC 


AAGAGTTTTT 


AGTGAAAATT 


9840 


CCCTATAAGG 


TAAAACTATA 


CCCTAACCAA 


TTGAAATAGC 


TATTAGCGAC 


TTTCTCTGAA 


9900 


ATATGGTATG 


ATAAAGGATA 


TACAAGGAGA 


TAAAATGAAT 


AATAATTTAC 


TGGTATTACA 


9960 


r\ I LnUft\. X * 1 




ATGGTGCGGT 


ATCGGCTATG 


ATTGGAGTGG 


CTTT AG AAG A 


10020 


GTCTCCAACC 


TTAAAAATAC 


ATCACTTGAC 


GCACGATATC 


ACCCCTTATA 


ATATTTTTGA 


10080 


GGGGAGCTAT 


CGTCTCTTTC 


AGACGGTGGA 


TTACTCGCCT 


GAGGGAACGA 


CGTTTGTATC 


10140 


GGTTGTCGAT 


CCAGGTGTCG 


GTTCGAAACG 


TAAGAGTGTA 


GTTGCCAAGA 


CTGCAAAAAA 


10200 


TCAATACATT 


GTC ACGCC AG 


ATAATGGGAC 


GCTTTCCTTT 


ATCAAGAAAC 


ACGTTGGCAT 


10260 


TGTAGCCATT 


CGTGAGATTT 


CTGAGGTGGC 


CAATAGGCGT 


C AAAAC AC AG 


AGCATTCTTA 


10320 


TACCTTCCAC 


GGTCGTGATG 


TCTATGCCTA 


TACTGGTCCT 


AAACTGGCCA 


GTGGTCACAT 


10380 


TACTTTTGAG 


GAAGTAGGGC 


C AG AG CT C AG 


TGTGGAACAG 


ATTGTAGAGC 


TTCCAGTCGT 


10440 
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AGCGACCATC ATAGAAGATC ATCTGCTGAA GGGAGCCATT GATATTCTGG ATGTGCGTTT 10500 

CGGTTCGCTT TGGACCTCTA TCACACGGGA AG AATTTT AC AAGCTGGAAC C AG AATTTG G 10560 

TGATCGTTTT GAAGTGACCA TCTATCATGC TGATATGCTG GTCT ATC AAA ATCAGGTTGT 10620 

CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGGCAACCs ATcTTTACrc TCAGCaTCTC 10680 

CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 107 2 6 
<2> INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGG GATTGGGGCT TATCGTGAAG 60 

TTCGCATCCA ACCCCATGGA TATCTCTGTG ACGTGACAGA CCACGACCGT ATCCAAGCCA 120 

TGGTCAAGCA AATCGAACAA GAGGTTGGTC TCATTGACAT CCTCGTTAAT AACGCTGGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCGCTGA TTTCCGTAAG GT CATC GAT A 24 0 

TTG AC TT AAA CGCACCATTT ATCGTTTCAA AGGCAGTTAT TCCTTCTATG ATAAAGAAAG 300 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGACCGA ACTGGGACGT GAAACACTTA 3 60 

GCGCTTATGC TGCTGCTAAA GGGGGCTTGA AAATGTTGAC CCGCAACATT GCGTCTGAAT 4 20 

ACGGTGGAGC CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 4 80 

CAGCACCTCT TCGTGAATTG CAAGAAGATG GTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

TTGCAAAAAC ACCTGCTGCA CGTTGGGGAA ATACTGAAGA TTTCATGCCC CCTCCT^TCT 6 00 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGGCCACAT CCTATATGTA GATGGCGGTA 6 60 

TCTTAGCCTA CATCGGAAAA CAACCTGAGT AAAAATAGAA AGAAGATCTT ATGAAAATCG 7 20 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACGAT AGTCTAAAAG 7 80 

AAGC G AC AG A TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGGA GAAGAAGGAG 84 0 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGGCTGC CATCCTTTTA AATACAAAGG 9 00 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CCGGTGTAGG GCCTATGCTT CCTTTAAACA 9 60 

CCTTCCCTGG TGTTGTCTGT GGTCTAGCAG TGGACCCAAC TGACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGGTAACCCC TTGTCTATCC CTTATGCCAA AGGATTTGGC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTGCTCA AGAAATGGGC GGTGGCTACC 1140 
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CAAGAGAACG TGTAATCCCT GAACAACGCA ACCCTCGTAT CTTAAACGAG GTGAAACAAA 1200 

TCACCCACAA TGATTTGATG ACCATCCTTA AAATAATCGA CCAACACTTC CTCAAAGACA 12 60 

CCATCTCTGG CAAATACTTC CAAGAATACT TCTTTGAAAA CTGCCAAGAT GATGAACTTG 1320 

CTGCTTATTT GAAAGAAGTA TTAGCCAAGT AAAGCTATTC TAAACCAGAA AGGAACTAAT 13 80 

GGATGACGAA AATATTACTG TTTGGCGAAC CATTAATTCG AATTTCACCA TTAGATGCCA 14 4 0 

CCAGTATCGG CGATCATGTT GCCAGTTCGA CTTATTTTGG CGGATCAGAA ATTAACATCG 1500 

CTTGTAATTT GCAAGCCCTG GGTATCTCAA CGAAAGTTTT TACCGCACTC CCTGCCAACG 1560 

AGATTCGACA TCGTTTTCTC ACATTCTTGA AACAGCACCA AATCGATACC AGTTCAATCT 1620 

GTCGGCTTGG CGATCGAATC GGCCTCTACT ATTTGGAGAA CGGCTTTGGT TGTCGTCAAA 1680 

GTGAAGTTTT CTACGATCGT AAGC AT ACG A GTATCAGCCA GATTCGGCCA AACATGCTAG 174 0 

ATATGGATTC TCTCTTTCAG GGGATTAGCC ATTTTCATTT TAGTGGAATC ACCCTAGCTA 1800 

TCGGTCAAGA GGTCCCTGCG ATCCTTCTCC TACTCTTGGA AGAAGCCAAG CGCCGAGGAA 1860 

TTCTCCTTTC AATGGATCTC AATCTGAGAA CAAAGATGAT TTCAGTCCTA GAAGCCAAGT 1920 

ATGAATTTTC TAAGTTTGCA CGTTTTACTG ACTATTGCTT CGGTATTGAT CCTCTCATGA 19 80 

TTGATGACCA AAATCTAGAG ATGTTTCCAA GAGACAGTGC TAGCCTAGAA GAGGTGGAAA 2040 

ATCCCATGCG ACT1TTAAAA GAAGCCTATG GTTTCAAGGC CATTTTCCAT ACCCTCCGCT 2100 

CTAGTGATGA GCAAGACAAA AATGTCTATC AAGCCTATGC TCTAGAAGAA CTATTTGAAG 2160 

AGTCTGTCCA ACTAAAAACT GCAGTCTATC AACGAATTGC TAGCCCCGAT CCCTTTATAT 2 2 20 

CTGGTGCCCT TTACCAACTA CTCCATCATT CCTCCCTAAA AACTACCATT GACTTTGCAG 2 2 80 

TTGCGAGCGC AACTCTCAAA TGCACTCTTC CAGGAGACCA TCTCTCCACT TCCTCAACTA 2 3 40 

GTATTGAAAA TTTACTGGCA AATCCACAAG ATATCATTCG TTAGGAGAAT TACATGACCA 2 4 00 

AATCAGATAC GATTATTCAA CTAAAAAAAC AAAAAATTGT CGCTGTTATT CGAGGAAATA 2 4 60 

CAAAGGAAGA AGGACTACAA GCCTCGATTG CTTGTATCAA GGGCGGTATC AAAGCTATTG 2 520 

AAATCGCCTA TACCAATCAG TATGCAGGAC AAATCATCAA GGAACTTGTA GACTTGTATC 2 580 

AGGACGATCA GAGTGTTTGT ATCGGTGCAG GTACTGTGCT TGATGCCGTA ACTGCTAGAG 2 64 0 

ATGCCATTCT AGCTGGAGCA AATTACGTTG TTTCTCCATC TTTCCATGCT GAAACTGCGA 2700 

AAATGTGCAA TCTCTACAGC ACACCGTACA TTCCAGGCTG T AT T AC C CT C AC AG AG AT C A 27 60 

CGACTGCACT TGAAGCCGGT AGTGAAATCA TCAAACTCTT CCCAGGTAGT ACTCTCAGTC 2820 

CAGCATATAT CTCTGCAGTC AAGGCACCGA TCCCACAAGT TTCCGTAATG CTAACCGGAG 2 880 
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GAGTCGGCCT AAACAACATC CCTCAATGGT TCGCTGCTGG TGCAGATGCC GTTGGAATTG 2 94 0 

GTGGCGAACT CAATAAACTC GCTTCCCAAG GCAACTTTGA CCGCATCAGC GAGATTGCCC 3 000 

AACAGTATAT TACACTCAGA TAAAATCATA ACTACCCGTC TAACGGGTGG TTTATCTCAG 3060 

AGCTATAAGC CCAAATCATC AGCCAGCGCC TAAAGACCCT GGCTTTCACG TTGTTCAACC 3120 

CTTATTGCTC TTGACTCGTC ACTTG C CTCT TTAAGAGACT TTGGTATTAC TTACCACTAT 3180 

CCCTAAAGGG ATCCTCATAT TCTTTTACAC TCAATTTATC TAGTGCTATA GTAGATTGAA 3 240 

ACTGGAATAG TACACCTCTG CTTCTAAAAC ATTGTTAAAA ATCGATTTGA CTGTCCTGAT 3 3 00 

CGATTTTGTC CTGTTCTTAT TTCATTTTAC TAT AT AT C AT ACTTTACTCG TTCT C AAATT 3 3 60 

TTCATACTCA TGAAGAAATC ATCCACTCGA TAATTTCTTT AATCTTGACT ATATTTCTTA 3 4 20 

ATTGTGGCTT CATTAAGCCC T ACTGG ACT T ACATAATAAC CTTCCTCCCA GAAATGCCGA 34 80 

TTCCCAAACT TGT ACTTG AG ATTGGCGTGT TTGTCAAACA TCATGAGTGC ACTTTTGCCT 354 0 

TTTAAATACC CCATAAAACT TGAAACACTT AGCCTCGACG GAATACTGAC TAACATGTGT 3 600 

ACATGGTCTG GCATTAAGTG ACCCTCGATC ATTTCAACAC CTTTATAACT ACACAAGCGA 3 660 

TGAAATATTT CGTCTAAACT ACTTCTATAT T GAT TAT AG A TGACTTTTCG TCTATACTTA 3 72 0 

GGGGTGAACA CAATATGATA GAACACCTCC ACTTTGTGTA TGATAAACTA TGAGTCTTTT 3 7 80 

GTGCCATATT TTTTCTCCTT TCG C TT T ACA ATTGGATTGA AC AC CTTT AT TGTATCGCGT 3 84 0 

TTGGAGTTTT TTTGGTATAA CCTTCGACGC GCACCCGTAT AGCGGGTGGT TGTTTTGTCT 3 900 

CGCACCTCAC GGAGCGAGAC GG AC T AAT AT AGTGGAGTGA AATAGGATAC GAACAAATTG 3 960 

ATTAGGAAAA TCAAATGAAT TTATAGAAAT CTTTTAGCAG TTATAACGTT CTATTCTACT 4020 

TTCAAAACGC TATAGTCACA TAATAATGAA GTAAAAAA GG ATAAGTATCA ACTTATCCTT 4 0 80 

TTTTAAAAGA AAAATCCGAA GAT AT T T G G C CTTCTTCGGA TTTTTTCTA? TTTCCACAGT 4 -40 

TTCATGTAAT TCATCTAGAT GATGAACAAA TTAGTTGTTC TTTCCTCTAC GG AAT AG AT A 4 2 CO 

AAATGCCCCA AGTAGCAAGA ACCCTAGACT TGCCAAGATT GACTGACCTT CTCCTGTCTG 4 2 60 

AGGGAGATTC TTTTGATCCG AATGGTTCTT TTCCTCTTCA GATTTTTCCT TTTCTTTTGA 4 320 

ATTCTG T ACT TGTGGCTGAG CTG CTTG CTC TAGCTTTTTA AAGACTTCCT GATCTGGAGC 4 3 80 

TGATTCCTGG GTTTCAGGAT TATACTAGGC AATCTTATAT TCATCCCCTT CTTTTCGAAT 4440 

GGTATAGACT CCACGTTTCA AAACTTGGAA TTGGTTGGAA ATAGTAGAGA C AG AAT CAT C 4 500 

AT AT TTC AC A ATGCCCCAAA CTCCTTGTTT AGCATCATAA ACAGACTGAA GGGTTTCGTT 4 5 60 

ATTTTCGATG AGGCTACTTT CTAACTCTTT TATCATTTGA TTCAAGGTGC CACGATCCAC 4 620 

GTTAGGAATG AG C AT AT AG C CATAAGAATC TCTATTTTGC TTATGAGCCT GACTAATCGT 4 68 0 
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AAGAAATTCA TTTTCAACTT CCTTGTCTGA CTGTCCTTCA TTGATATCCT TCCAGGCTCC 47 4 0 

C TTTTG C AAA GCCTTACTCA TACTGATTGA ACTCTTCTTA AAGAAAAAGT AACCAATATT 4 8 00 

CTTTTTCGAA TCGAACGATT CTAAAAACAC ACTTTGGGTT TCAGGATAAT CCTTTTCTTG 4 860 

TTCTGTAAGC GAGGCTTCTT TATCATTGAC ATAGACTTTA TATCGATTAC CTGATTCCAG 4 920 

TTTTCTCTGG TCAATTGTAG TTGCAGCACT ATCTGTTGAA GTGTTTTGGA TATTGCTTCC 4 980 

TAAAAAGGCG ATCTTATCCT TTAGCATAAA CCAGCTCTTA TGAGCAGTCA ATGTTTGATT 504 0 

CCAGTTGGTG AAATCCATGG TTGCTGTCGC ATTGGCATCA TCTAGTTTGC TCGTTCCAAC 5100 

GAAAGCAGAC GGTAAAACTT TACCTGTATC GCTATCCGCT CTCTTAGCAT CCGTCTCTGT 5160 

TGTACCAGGC ATCTTATATG GATTAACTGT TGGCCAGTAG CCATCCCTAT AGTGACTCAA 5 2 20 

ATCGCCATTG TAAAGATAGA ACATCCCATC ACTCGTATAC CAACCACGTT TATTTTCCTT 52 80 

GTTCATGTCT TCGTAATTCA AGGTACGACT GGAAAAGAGT GACAACCCAA ATCCAAACCC S3 40 

TTTCTCTGCA TTGTACATGG CTGTTTTATC CATCTTGTTA AAGGCAGATA GGTAACTTGG 54 00 

TCTTGGAACA CTTGCCACTC CTGCATCACT TAACAAGGAT TGCATCAAAC TCATATCCTT 54 60 

ATAAGTCTTC AAATTCTTAA AGACATCATA ATAACTATCC GATTGAACAA TGGTCTTCAC 552 0 

AAGACTCTGC AAACATTCTT TGGTTTCTCC TTCAGACATA TCCGCTATTC GGTGAATCCC 5 5 80 

TCTTAGTACT TCTACTGCGG CCACGTGCCC CTCGCTATTT GCACGACTGA TCGAGCGTCC 564 0 

ACG ACT CAT A TCCATCAACT CTCCATTCAC CAGCAAAGGA GCAAACGATT TATCAATCCA 5700 

GTGGTACATG GTTTCCATTT TATCTTTATC GATTGCATTC TTGGTCTTTT GAATGACTGG 57 60 

CAACAGTTGA GACAGGCCAT CAATCAAAAC ATTCCCATAA GCACCCGTAT AGGCAACATT 5820 

GGTGTCGTCG ATATAGGATC CATCTTCATA AAAACCTTCA CCTTGGTCTA CCAACTTGAA 5 380 

CACTTGCTCA ATCGAGCGAA TGGTAGAAGA AATTTCTTGA TCATCCTTAC GCAGTAAACC 5940 

AGCTATTACT TTTACCCTTC CCATATCAAC TAAGTTTCCA CCTAGAGCCT TGAATGGGTT 6000 

ATCAGTCGTC TTTCGGAAAT GTTCGGGATC TGG T AC AAA T TTTTCAATCA CATCTGTATA 6060 

TTTTTTAATT TCCTCATCAG AG AAGT ATTC TTTCATCAGA GACAAGGTAT TGTTGATGGC 5120 

ACGAGGTGTA CCGATTTCAT AATCCCACCA GTTCCCAACA ATGCTCTTTT CACTATTCTA 618 0 

GACATGTTTA TGCATCCATT CCATGGAATC CCTGACTGTT CGAACGACAG TTTCATCTTG 6240 

ATAATAACCA GAAGAAGCAT TGGTCACTTG CTTGGCCATC TCCTCCAATT TCCGATAAGT 6300 

GGCAGTCAGA TTTGCAGACG TTTTATAATT TGAAAATTTT TCCCACAAAT AGGTGCGGTC 63 60 

CGCCTGACTT GAAATACTGG ATAGGCTATC AGCTACCTTT CCTTCCAATT CCTGGTTTAA 64 2 0 
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TTTGGCCATC TGTTCATTTT TAGAATCATA GTATTGATTC CCAGCGATGA TGCCATTCCA 6 4 80 

GTCATCCAAA CGGTCTGTGT ATGCATCCTT AACAGAGGCC AG AATCTTC A AAGGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT GTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT ACCTT AT AAG TATAGTCCGC 6660 

AAGAGAAAAA AC AT GTTTTT TTCCAATTGG TAAATCAATC TTTTCCTCAA GCTGTTTATC 67 2 0 

TGTTTGAGAA TCCTCAGAAA GCTGGTCTGC TACCTCTACC AGCTCAATAT CCTT AAAGG A 67 80 

AACAGTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG ATTTTATCAA CATCTAAAGT 684 0 

CGGGCTATAG TCTGCTTCAA TGGTCTGCCA GTC CTTTGTT CCTGACGTCG TTGCAGAATT 6 900 

CCACAATCGC TTGTCCTTAC CACTTTCCTC AATGATACGA ACTTTGGCAA TCCCGATTTT 6960 

ATT AT CTGTT TT AATCTTG A AACGCAGTTT ATACTTTTTC TTAGCTTCAA TAGGAACCAT 7020 

ACGGTGAAGC GCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7080 

CTCAATGACT CGAGTTGAGG CATCTGCACT AT TCTTC TGG TCTACCCAAG CTGACCACCC 714 0 

CCTGAGCTTT GCTTCCTGTC CGG 7163 
(2) INFORMATION FOR SEQ ID NO: 68: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9244 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS : double 
( D) TOPOLOGY: linear 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGTTATAACA TACATGTAAG CGGTACCCAA AATGGTGCCA ACTCAAAATT TTTAAGGACG 60 

AAAATACATC TCTTC AC AT C CAATTCACGT CTTCTCAGAA ATTGGGAAAC TGAAAAAAGT 120 

TATGTTGCAC CGTCCAGGCA AG GAG TT AG A AAACTTGTTG CCGGACTATC TTGAAAGGCT 180 

TCTTTTTGAT GATATTCCTT TCTTGGAAGA TGCTCAAAAA CAACATGATG CATTTGCCCA 2 40 

AGCTCTTCGC GATGAAGGAA TTGACGTTCT CTACCTAGAA CAACTCCCTG CTGAATCATT 3 00 

GACCTCTCCA GAAATCCGCG ATCAATTTAT CGACGAATAC TTAGACGAAG CCAACATCCG 3 60 

TGATCGTCAA ACCAAGGTTG CTATTCCTGA ATTGCTTCAC GGCATCAAGG ACAACCAAGA 4 20 

ATTGGTTGAA AAAACAATGC CTGGGATTCA AAAAGTTGAA TTGCCAGAAA TTCCTGACGA 4 80 

AGCTAAAGAT CTAACTGACT TAGTTGAATC AGAGTATCCA TTTGCAATTG ACCCGATGCC 540 

AAACCTCTAT TTCACTCGCC AC CC ATTTGC AACAATTGGA AACGCCGTAT CGCTTAACCA 600 

CATGTTTGCA GACACTCGTA ACCGTGAAAC ACTCTACGGT AAGTATATCT TCAAATACCA 6 60 
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CCCAATCTAT GGCGGAAAAG TGGATTTCGT CTACAACCGT GAAGAAGATA CGCGTATCGA 720 

AGGTGGAGAC GAGTTAGTTC TTTCTAAAGA CGTCCTTCCA GTAGGTATCT CTCAACGTAC 780 

AGACGCAGCT TCTATCGAAA AACTTTTGCT CAACATCTTC AAGAAAAATG TTGGCTTCAA 840 

GAAAGTTTTG GCCTTTGAAT TTGCTAACAA CCGTAAATTC ATGCACTTGG ATACTGTCTT 900 

CACTATGGTA GACTATGACA AGTTCACTAT TCACCCAGAA ATCGAAGGCG ACCTTCACGT 960 

TTACTCAGTT ACTTACGAAA ACGAAAAACT TAAAATCGTT GAAGAGAAAG GTGACTTAGC 1020 

TGAACTTCTT GCTCAAAACC TTGGTGTAGA AAAAGTTCAT TTGATTCGTT GCGGTGGTGG 1080 

CAATATCGTA GCAGCTGCGC GTGAACAATG GAACGACGGT TCTAACACTT TGACCATCGC 1140 

ACCTGGTGTG GT AG TTGTTT ATGACCGCAA TACCGTGACC AATAAGATTT TGGAAGAATA 12 00 

CGGGCTTCGC TTGATTAAGA TTCCCGGAAG TGAATTCCTT CGGGGCCGTG GTGGACCTCG 12 60 

TTGTATGTCT ATGCCATTTG AACGTGAAGA AGTGTAATCG CTGTTCGATA TTCGTCAATA 13 20 

GAAAATGTAA AAAATAGAAA GAGGAAATAA TAAAATGACA AATTCAGTAT TCCAAGGACG 13 80 

CACCTTCTTA GCAGAAAAAG ACTTTACCCG TGCAGAGTTA GAATACCTTA TTGGTCTTTC 144 0 

AGCTCACTTG AAACATTTGA AAAAACGCAA TATTCAACAC CACTACCTTG CTGGCAAGAA 1500 

TATCGCTCTC CTATTTGAAA AAACATCTAC TCGTACTCGT GCAGCCTTTA CAACTGCGGC 15 60 

TATCGACCTT GGTGCTCACC CAGAATACCT CGGAGCAAAT GATATTCAGT TGGGTAAAAA 162 0 

AGAATCTACT GAAGATACTG CTAAAGTATT GGGACGTATG TTTCACGGGA TTGAATTCCC 1680 

CCGATTCACC CAACGTATGC TTGAACAATT GGCAGAATTC TCAGGCGTTC CAGTATGCAA 174 0 

CGGTCTAACT GACGAATGGC ACCCAACTCA AATG CTCGCT G ACT ACTTG A CTGTTCAAGA 1800 

AAACTTCGGT CGCTTCGAAG CCTTCACATT GGTATACTGT GGTGATGGAC GTAACAACGT 18 60 

TGCCAACAGC TTGCTCGTAA CAGGTGCTAT CCTTGGTGTC AATGTTC AC A TCTTCTCACC 1920 

AAAAGAACTC TTCCCAGAAA AAGAAATCGT TGAATTGGCA CAACCATTTG CTAAAGAAAG 19 80 

TGGCGCACAT GTTCTCATCA CTGAAGATGC TGATGAAGCA GTTAAAGATG CAGACGTTCT 2 040 

TTACACAGAC GTTTGGGTAT CAATGGGTGA AGAAGACAAA TTCGCAGAAC GTGTAGCTCT 2100 

TCTTAAACCT TACCAAGTCA ATATGGACTT AGTTAAAAAA GCAGGCAATG AAAACTTGAT 2160 

CTTCCTACAC TGCTTGCCAG CATTCCACGA TACTCACACT CTTTATGGTA AAGACGTTGC 22 2 0 

TGAAAAATTT GGTGTAGAAG AAATGGAAGT AACAGACGAA GTCTTCCGCA GCAAGTACGC 22 8 0 

TCGCCACTTC GATCAAGCAG AAAACCGTAT GCACACTATC AAAGCTGTTA TGGCTGCTAC 2 34 0 

ACTTGGTAAC CTTTATATTC CTAAAGTATA ATTTTAGATA ATAAACCGTC TACCAACAGC 2 4 00 
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TATGAGGCCT GCGACTAATA GCTTTAGTCC GGTCCTCTTT TATGTAATGG TAATCTATTA 2 4 60 

TTTCTTATAA AATATGTGAA AAATCATTAA ATTGAAATCT AAACGCATTC TATTGAGTGT 252 0 

GATAAAGGAG AATTTATGGC AAATCGTAAA ATTGTACTAG CTTTGGGAGG AAATGCGATT 2530 

CTTTCTTCTG ACCCATCAGC AAAGGCTCAA CAAGAAGCTT TAGTTGAAAC AGCTAAGCAT 2 64 0 

CTTGTAAAAT TGATTAAAAA TGGAGATGAT CTGATTATCA CTCACGGTAA TGGACCTCAA 2700 

GTTGGGAATC TCTTGCTCCA ACATTTGGCA TCAGACTCTG AAAAGAACCC TGCCTTCCCA 27 60 

CTCGACTCAC TTGTCGCTAT GACAGAAGGT AGCATCCGTT TCTCCTTGAA AAATGCTTTG 2 820 

CAAAATGCTC TCTTGGATGA AGGCATCGAA AAAAATGTTG CCTCTGTTGT AACGCAAGTT 2 8 80 

CTCGTAGATA AAAATGATCC ACCTTTTGTT AACTTGAGTA AACCAATCGG TCCTTTCTAT 2 9 40 

TCAGAAGAAG AAGCAAAAGC AGAAGCCGAA AAAAG CGG AG CGACTTTCAA GGAAGATGCT 3 000 

GGCCGTGGCT GGCGTAAGGT CGTTGCCTCA CCAAAACCTG TTGACATCAA AGAAATTGAA 3060 

ACCATCCGTA CTCTTTTAAA TAATGGTCAA GTCGTCGTAG CTGCAGGTGC TGGCGG T ATT 312 0 

CCCGTCGTCA AAGAAAACAA TGGACATTTG ACTGGTGTCG AAGCGGTTAT TCATAAAGAC 3L80 

TTCGCTTCCC AACGTTTGGC AGAATTGGTT GATGCAGACC TCTTCATCGT TTTGACAGGT 32 40 

GTACATTATG TATTTGTTAA CTACAACAAG CCAAACCAGG AAAAATTGGA ACATGTGAAT 3300 

GTTGCCCAGC TGGAAGAATA TATCAAACAA GATCAGTTTG CACCAGGTAG CATGCTTCCA 3 360 

AAAGTAGAAG CAGCTATCGC TTTTGTCAAT GGTCCTC C AG AAGGAAAAGC AGTT ATT ACT 34 20 

TCCCTTGAAA ATCTAGGCGC CTTGATTGAA TCTCAAAGCG GAACAATTAT TGAAAAAGGA 34 80 

TAAGTTGTTT TACTAATAAG ATGTATTCTA TTTCTAGTAT CTTTATATCA AATT AG AAA T 354 0 

TATTCTTGAA AACATGTACA ATATTTCAAA AG AT ACT ACT TTTAGACTTT AATATGGTAA 3 600 

AAC AAA TATA AATAGAAAGC GTTTTCTTGA ATGTTTATTT AAGAAAGTAG TTCGTTTTTT 3 660 

ACACTTTGTT AGACATCAGG AGGAAAAACA AATGAGTGAA AAAGCTAAAA AAGGGTTTAA 372 0 

GATGCCTTCA TCTTACACCG TAT TAT TG AT AATCATTGCT ATTATGGCAG TGCTAACTTC 3780 

GTTTATCCCT GCGGGGGCCT TTATAGAAGG TATTTACGAG ACTCAGCCTC AAAATCCACA 3 84 0 

AGGGATTTGG GATGTCCTCA TGGCACCGAT TCGGGCTATG CTAGGTACTC ATCCAGAGGA 39QO 

AGGTTCGCTC ATTAAAGAAA CGAGCGCAGC GATTGATGTA CCCTTCTTCA TCCTTATGGT 3960 

TGGTGGTTTC CTTGGCATTG TCAACAAAAC TGGTGCTCTT GACGTAGGGA TTGCCTCTAT 4020 

CGTGAAGAAG TATAAGGGCC GCGAAAAAAT GTTAATTTTG GTACTGATGC CTTTGTTTCC 4080 

CCTCGGTGGT ACAACTTATG GTATGGGTGA AGAAACAATG GCCTTCTATC CACTCCTTGT 414 0 

GCCAGTTATG ATGGCCG7TG GTTTTGATAG CCTGACTGGT GTTGCAATTA TTTTGCT CGG 4200 
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TTCT C AAATC 


GGCTGTTTGG 


CATCTACTCT 


GAATCCATTT 


GCGACAGGTA 


TTGCTTCAGC 


4260 




GTTGGTACAG 


GGGACGGTAT 


CGTACTTCGT 


CTGATCTTCT 


GGGTTACCTT 


4320 




AGTACTTGGT 


TTGTTTACCG 


TTATGCGGAT 


AAG ATT C AAA 


AAGATCCGAC 


4380 




GTTTATAGTA 


CTCGCAAAGA 


AGATTTCAAA 


CACTTTAACG 


TAGAAGAATC 


4440 




GAATCTACAC 


TTAGCAGCAA 


ACAAAAATCA 


G TTCT CTT CT 


TATTTGTGTT 


4500 


UnLAl 


TTr.ATGC'AT 


TGAGCTTCAT 


TCCATGGACA 


GACCTTCGCG 


TTACCATTTT 


4560 


I LjA 1 \j A*— 111 


A AT*Af*TTCGT 


TGACTGGTCT 


TCCAGTTATT 


GGTAATATTG 


TCGGTTCATC 


4620 


1 At. 1 I V_ * 


CTAGGTACTT 


GGTACTTCCC 


AGAAGGCGCA 


ATGCTCTTTG 


CCTTTATGGG 


4680 


I A J t-t- i \jA t 1 


C. CYT r;Tr a T^TT 

*o u 1 U i in» * * 


ATGGTCTTAA 


AGAAGATAAG 


ATT AT C T CTT 


CCTTCATGAA 


4740 




HACTTGCTCA 


GTGTTGCCTT 


GATCGTAGCG 


ATTGCTCGTG 


GTATTCAAGT 


4300 


LAI v_A 1 K*i\t\K- 


ci a r*r:fTr a tc; a 


TTACCGATAC 


AATCCTCAAC 


TGGGGTAAAG 


AAGGCTTGAG 


4860 


CGGTCTA H_ I 


1 LALnnu IV. i 


TTATCGTTGT 


AACTT AT AT C 


TTCTATCTAC 


CTATGTCATT 


4920 




TrTTfATCTG 

lull IV. IU 


GTCTTGCCAG 


CGCAACTATG 


GGTATCATGG 


CTCCACTTGG 


4980 


nunn 1 1 1 Vj i n. 


AA TGTCCC^C 


CTAGCTTGAT 


f ATCACTGCT 


TACCAATCTG 


CTTC AGGTGT 


5040 






CATCTGGTAT 


TGTGATGGGA 


GCTCTTGCAC 


TTGGACGTAT 


SlOO 


LAALA 1 i v->^«J L 


ACTTGGTGGA 


AATTCATGGG 


CAAACTCGTA 


GTCGCTATTA 


TTGTAGTGAC 


5160 


t_A * ^.w^V- 1 — I 1 


fTTCTCCTTG 


GAACCTTCCT 


TCCATTCCTA 


TAAAATAGTG 


AGTGAGGTGA 


5220 


* IwLAlunnn 


ATACATATAA 


CAAATCAAGT 


TAAAGATCAA 


TTTCTTATAT 


CATTAAAAAC 


5280 


L I 1 IjA ILL v— 


t* AT C CTT C AG 


TACTCAATGA 


AGGACAAAAT 


GGAACACCTT 


TTGGACAAGC 


5340 


AA 1 k_t»/v\Vj/\ 


GTCCTAGAAA 


AAACTTTACA 


GATTTGTCCA 


GACATAGGTT 


TCACTACCTA 


5400 




AAAGGTTATT 


ACGGATATGC 


AGAAATCGGT 


CAGGGAGCAG 


AGCTTCTGGC 


5460 


CATTCTCTGT 


CATTTGGATG 


TTGTTt-L A I <- 




GCAGATTGGC 


AGACACCGCC 


5520 


ATTTGAAGCA 


ACTATCAAAG 


ACGGCTGGGT 


ATTCGGACGT 


GGTGTCCAAG 


ATGATAAAGG 


5580 


CCCTTCGCTC 


GCAGCTCTCT 


ATG C AGT AAA 


AAGCTTGCTG 


GACCAAGGTA 


TTCAGTTCAA 


5640 


AAAGCGCGTA 


CCCTTTATCT 


TTGGTACCGA 


TGAGGAAACC 


CTCTGGCGCT 


GCATGGCACG 


5700 


CTACAATACC 


ATCGAAGAAC 


AGGCCAGTAT 


GGGCTTTGCA 


CCTGACTCAT 


CTTTTCCTCT 


5760 


GACCTATGCT 


GAAAAAGGGC 


TTCTACAGGT 


CAAACTTCAT 


GGCCCTGGAT 


CGGATCAACT 


5820 


AGAGCTTGAA 


. GTAGGAGGCG 


CCTTTAACGT 


TGTACCACAC 


! AAGGCCAACT 


ACCAAGGTCT 


5830 


CCTCTATGAA CAGGTTTGTA 


, ACGGTCTCAA 


. AGAAGCTGGT 


■ TATGATTACC 


AAACCACTGA 


S940 



WO 98/18931 



PCT/US97/19588 



560 

ACAAACCGTA ACGGTTCTCG GAGTGCCAAA GCATGCTAAG GATGCTACTC AAGGTATCAA 6000 

TGCTCTCATC CGACTAGCTA CCATTCTTGC TCCTCTCCAA GAACACCCTG CTCTCAGTTT 6060 

TCTTGCAACA CAAGCAGGTC AAGACGGCAC AGGAAGACAA AT CTTTGGT G ATATAGCAGA 6120 

TGAACCTTCT GGTCACCTAT CCTTTAATGT CGCAGGTCTC ATGATCAATC ATGAACGTTC 6180 

TGAAATCCGT ATTGACATTC GGACTCCTGT CTTAGCTGAC ASGGAAGAAC TAGTAGAGTT 62 4 0 

GCTTACAAGA TGTGCACAAA ACTACCAACT CCGCTACGAA GAGTTTGACT ATCTAGCGCC 6 300 

TCTATACGTC GCAGAAGACA GTAAACTCGT TAGCACACTG ATGCAAATCT ACCAAGAAAA 6 3 60 

GACTGGCGAT AACAGTCCTG CTATTTCATC CGGTGGTGCC ACTTTTGCTC GCACCATGCC 64 2 0 

AAATTGTGTA GCCTTCGGCG CCTTATTCCC AGGAGCGAAG CAGACAGAAC ATCAGGCAAA 64 SO 

TGAATGTGCC GTTCTAGAAG ATTTGTACCG TGCTATGGAT ATTTATGCCG AAGCCGTCTA 6 54 0 

TCG ACTTGC A ACTTAATCAG GCAACTGTTT CTACCAAAAA AAATCGACCG ATTAATGAAC 6 600 

TGCACCCCAA AAGTTAGACA G AAT AAATCT AACTTTTGGG GTGTTTTATT ATGAAATTGA 66 60 

GTTATGAAGA TAAAGTTCAG ATCTATGAAG TAAGAAAGCA AGGACAAAGC TTCAAACAGC 67 2 0 

TTTCAAAAAG ATTTGGTGTG GATGTTTCTG GTCTAAAGTC ATCTGAATCT TTGAGATGAG 67 80 

CTTTATAAAT CGCTTTTTTC AGTTTTTGCA CTGGTGTTTC GATAAACTCA AACTTTTTAG 68 40 

CCGTGGTATT GCCTGATTTT ATAGTATATT G AAACT AG AA TAGTACACCT CTCCTTCTAA 6 9 00 

AACATTTTTA GAAATCGATT TGACTGTCCT GATCGATTTG TCCTGTTCTT ATTTCATTTT 6 96 0 

ACTATATTTG AGCCACTTCG TCTTTAACGG CTTTATTCAT AAGCTCTTGT AATTTTTCTT 7 020 

T AC TAT C AAT T A CTTCTG AT TTTCCGTTCT AATTTATTGT AATAGGTTTT AACTTACCTA 7 080 

ATTTCTCG A C ACGCTCATTA ATTTGATCTT TTTTGAAGGC TGCTTATGTT TTTCCTAAGA 7 140 

TTTTTTCAAA AATATATTTA TCAGATAGCG GTTTGTCTTC TTCTTCAGCT TGGTTTTTCT 7 2 00 

ATTAATTTGA AACATAAGGA AC AAATC CTT CATAGTAACC TAATGCTCCC ATAAGTTCAA 7 2 60 

AAGCTTGTTT TCTAATTCAA ACCATTGCAA CTCAGATTTC AGCTTTTCAG ATAAATCCTG 7 3 20 

CT C AT CC AAA TAATGACTTG AAATTAGTGC TGAACTCGTT TCTGTATCCT GTACAGGCTG 7 3 80 

AGCACCCATA CCAGCAAAAA ATAAACTCGT TCCTAGCAAG ACCGAACAAG CTCCTATTGC 7 4 40 

ATATGGCCTC AAAGAAAAAC GCTGCTTTCT CTCAAATTGA AATTCTTTCA TCCCATCTCC 7 500 

CATCATTCAT TATTACTGTA TATTTTGTAT ATCAGAAATA GTTTGTATTC ACAAATCTTT 7 560 

CTAGTTATTC CCTTATCATT CCTAATTAAG GGAGATAACA TACAATAATT TTTAGTTAAA 7 6 20 

TGTATATCGA TGTT TTTTGT TTTTCTTAAT AAACGCAATA C AAAAAG AG C CTGTTACCAA 7 680 
CCTCTTTCTA CTCAATGAAA ATCAAAGAGC AAATTAGGAA ACTAGCCACA GGTTGCTCAA ' 77 4 0 
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AACACCGTTT 


TGAGGTTGCA 


GATAGAACTG 


ACGAAgTCAG CTCAAAACAC 


TGTTTT G AGG 


7 800 


TTGCAGATAG 


AACTGACGAA 


GTCAGTAACA 


TCTATACGGC 


AAGGCGACGC 


TGACGTGGTT 


7860 


TGAAGAGATT 


TTCGAAGAGT 


ATTAGTCTAT 


TATTTCTTCT 


CAGCGCGAAG 


GGCTGACAAG 


7920 


ATTTGTGTTC 


GGATATCATC 


CACACCATTT 


GGAGTATTTG 


G T AAAAAG AT 


ACTTTGATTT 


7980 


CCTTTAGAGG 


CAAAGGTATT 


CAAGGTATCC 


AAATACTGGT 


TGGTCAAGAG 


GATAGACATG 


80 4 0 


ATTTGTTCTT 


CTGTCATGCC 


AACATTGGCT 


TCCTTGAGTT 


CGGTGATAGA 


CTCTGCCAAT 


8 100 


CCATCCACAA 


TCGCCTTACG 


TTGTTGGGCA 


ATCCCCACAC 


CATGAAGGCG 


* 1 i i 1 \. I 


9 1 OU 


GCTTCTGCTT 


CAGCTGCAGT 


GACAATTTTA 


ATCTTGTCAG 


CTTCCGCCAA 


TTCTTGTC CT 


8 2 20 


GCGACCCGCT 


TACGTTGCGC 


CGCATTGATT 


TCATTCATGG 


ATTGCTTAAC 




8280 


GGTTCGACCT 


T GGT AAT C AA 


GGT TTTCACG 


ATAATGTAGC 


CGTAAGTGGT 




h i d n 


GCTACTTGGT 


GTTGAACTTC 


AAGGGCAATC 


TCATCTTTTT 


TCTCAAACAA 




otuu 


GTTAATTTTG 


GAACAGAAGA 


GCGAAGAGCA 


TCTTCGATAT 


AAG ATTT AAT 


CTC, AHA TTPT 


84 60 


GGACGTATGA 


GTTTATAGTA 


AGCATCTGTC 


ACGCTCTGCT 


CGTTGACACG 




ft ^ "> n 


GCTACATTCA 


TCATAACGAA 


CACATTGTCC 


TTCCTCTTAG 


TCTCAACCAC 


AAT A TPAPTT 


8 580 


TGCAACAAGC 


GCAACTGAAT 


CCGTGCTGCA 


ATCGAGTCAA 


TCCCAAAAGG 


CAACCGAATA 


8 64 0 


TGAATACCGC 


TATTAGCAAC 


CTTTTGGTAT 


TTCCCAAAGC 


GTTCAATAAT 




87 00 


TGCTGACGAA 


CCACATAAAC 


TGTACTCAGT 


GTGACTATCA 


CCAATAGGAG 


CACACAAAfA 


87 60 


ATCAGAAAAA 


TCATGAAAAA 


TATTCCCATA 


ATGGAACCTC 


C A C AAC" A' T * m 


* i * V_ 4 i rt 1 


A ft 5 ft 


TATAGCACAT 


TTAAAGAAGG 


CTGTGCCGTT 


TTTACTGCGA 


TTTTTCCTGA 


AATGTCAATA 


8880 


ATTAGAGGTG 


AATTGTCCTA 


TTGTCGTCCA 


ATCTCTTCCT 


AAAATAACTC 


TTTATAAAAC 


8940 


GCAATCGTTT 


CTTCTAAGGT 


TGGCATAAAT 


GGATTTCCTG 


GTGCGCAGGC 


AT C AAT C AAG 


9000 


GC ATTCT TAG 


AAAGG T ATTC 


AAAGTCGAAA 




CAATACCAAG 


TTCAGTCAGT 


9060 


TTCTTAGGAA 


TACCTACTGT 


CTCAGAAAGC 


TTCTCAATCT 


CAGCAATCGC 


ATAATCGGCA 


9120 


CATTCTTGAT 


CTG ATTT AC C 


TTCT AC ATG A 


AGTCCCAAGG 


CTTTGGCAAC 


ATTGCGGAAA 


9180 


GCTTCTGGTA 


CACGTTTAGC 


ATTTTCACGT 


TCTATAACTG 


GTAGCAACAT 


GGCACAGCAC 


9210 


ACGG 












9244 



(2 J INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS : 

<A> LENGTH: 8898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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( 0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GATCTGAACT TTAT CATC AT AACTT AATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 60 

TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTGGACT GACGTTTTTT TGTATGCTTA 120 

TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTGGTGCG ACTGCTTTTC CAAAGAAGGC 180 

TGCTAAGACA AG AATTGT C A AAACATAAGG TGCAATTTGA AG AT AAAC CG CTGGCACTCC 240 

TTGTAGGAAC GGCAATTGAG AACCGATAAC AGCCAAACTT TGTGAAAGTC CAAAGAAGAG 300 

A CT AGAAAGC ATAGCACCGA TTGGATTCCA TTTCCCAAAG ATCATCGCAG CAAGGGCGAT 3 60 

AAATCCAGGT CCAACAATAG TTGTCACTGA GAAGTTAACT GAGATTGATT GCGCATAAAT 420 

CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTGTA 4 80 

G AC GTTG ATT CCCAAGGTAT CCGCTGCTTG AGGATGTTCA CCCACAGAGC GGAGACGAAG 54 0 

ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAAGGCAA TCGCCAGATA 600 

ACCAAGTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC A CTGGG AT AT TTGCCAAGAC 660 

TGGGAAATCA AAGCGTCCAA AAGTTTGACT TACCTTCTCC GTTTGTCCTT TGTTATAAAG 72 0 

AACTTTAACT AAGAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCG TACCGCTGAC 780 

AACATGGTCT GCACGGAAAT GAACCGTCCC TGCTGCGTGG ATGATAGAGA AAAC ACT ACC 84 0 

AACCAATCCT GCTACAAGCA AGGATAGCCA TGGAGTTGCT GCTCCAAATT GTTCTGCAAA 900 

TTCAAGGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC AT AATTCCTT CAAGGCCAAC 960 

GTTTACCACA CCACCACGTT CAGAGAAAAC ACCACCGATA CTTGTAAAGA TGAGAGGTGC 102 0 

TGAGTAAATC AGCATAGAAG ACACCAACAG GGGGAGCAAG GTTATAATAG ACATCTTTAC 1080 

TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCGTT CGATAAGGTA ATGAACACTG 1140 

ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CAGATGGTAC CTGCGCCGCA 1200 

TTCATACCAG GAGCCCCAAC TTGGAGAACG CCAAATAGGA AGGCTGCAAA G ACT AT ACC A 1260 

ATTGGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCGT TAAATCCGAT AGCTAATGAC 13 20 

GAACCTTGAA C AT AG ACGTT CTGGAAGGTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 13 80 

GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC CCTTGGCAGA AATACCAGCA 14 40 

TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 1500 

TTGAGCATGA ACCAAATAAC TGCAACGGCA ATGATGGCAA AGAAAATACC AATATTCATC 1560 

CGTCAGTTAC CAGTCAACTC AGCCAACCAA GGTGTCTGAT AGGTTGCATT AGCCCCAACA 152 0 
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CGAATGGTCG AATC TGT ACT 7TGCATGAAG TCTTTAGGGA AAGCATGGAT AAAGGCATTC 1680 

CCTACATACA AG AC AATGT A GTTCATCATG ATGGTTACAA TAACCTCTGA CGTCCCTAGA 17 4 0 

TAGGCCCTAA GAATACCTGG AATCGCTCCG ACAATCCCAC CAGCAATCAA GGCAATCACG 1300 

ATGGTTGCTA GAATCATCAA GGGACGGGGC ATATCTGGAT GCGACAGGGC AAACCAACCA 1860 

CTGAGAATCC AACCTGCCAA AGCCTGACCA GGAAGTCCGA CGTTAAAGAA ACCAGCTCGA 1920 

CTGGCAACCG CAAAACCAAG ACCAATCAAG ACCAGACGAC CCATAGCACG GAAGATTTCT 1980 

CCAATCCCAC GCAGACTGCC AAAGGCTGTA TAGAACAATT CTTCCTACCC CCAAATAGCA 2 04 0 

TCATAACCGA AGATCCACAT GACAATGGCT CCGAGTAAAA TTCCTAGGAA TACACAAATC 2100 

AAGGGAACCG AAATTTGTTG TAATTTTTTA GACATCACTC TTCTCCTTTC CCAAGTTTCC 2160 

ACCAGCCATC AAGACACCAA GTTCTTGTTT ATTG GTTGTT TCTGGTGATA CAATACCTTG 2 2 20 

AATCTTACCA TCGTGGATAA CGGCAATACG GTCTGAGACG TTTAAAATCT CATCCAATTC 2 2 80 

AAAGCTGACA ACAAGGACAG CCTTGCCATT ATCACGCTCT TCAATCAAGC GTTTGTGGAT 2 3 40 

ATACTCAATC GCACCGACAT CCAACCCACG AGTTGGCTGG CTAACGATAA GGAGATCAGG 2 4 00 

ATCTCGATCA ATTTCACGAG CAATAATTGC TTTTTGTTCA TTTCCTCCTG AGAGTGCAGC 2 4 60 

TGCACGAACT AATTCACTGG CAGCGCGAAC ATCAAACTCT TCCATCAGCT TTTTAGCATA 2 520 

AGAAGTAATA TTTGAATAAT TCAAAATTCC ATTTTTACTA TGTGGTTCTT TATAGTAGGT 2 580 

TTGAAGGGCA ATATTTTCAG ATATCATCAT TTCCAAAATC AAG C CATC AC GGTGACGGTC 2 64 0 

TTCTCGAACG TGCCCAACAC TTAGTTCTGT AATCTGACGT GGGTGCAAGC C7ACAATTGA 2 7 00 

ATCTCCTTTT AGCTCAATGC T A CC AG ATT C AACCTT AC G A AGACCTCTAA TGGCTTGAAT 2 7 60 

CAGTTCAGAC TGACCATTTC CATCAATCCC CGCAATACCA ACAATCTCTC CAGCACGAAC 2 8 20 

ATCCAAGGAC AGATTTTTAA CAGCTGGAAC ACCACGGTTT TCATTGACCA CCAAATCTTT 2 8 80 

GATAGACAAA ACCACTTCTT TTGGTTTAGA GGCTTCCTTC TCTGTTTTAA AGCAAACAGA 29 4 0 

ACGTCCTACC ATCATTTCCG CCAAATCAGC ATTGGTAGCC CCTGCAATTT CAACGGTTTC 3 000 

AATTGATTTC CCACGACGGA TAACTGTAAC ACGGTCAGAA ACTGCTCGAA TTTCATCCAA 3 060 

TTTGT GGGT A ATCAAGATAA TTGATTTTCC TTCTTTGACA ACATTTTTCA TAATAGCCAT 3120 

CAACTCATCA ATTTCTGATG CACTCAAAAC AGCCGTTGGT TCGTCAAAGA TAAGGATATC 3130 

AGCCCCCCGA TAAAGTGTTT TTAAAATTTC TACACGTTGT TGGGCTCCAA CTGAGATATC 3 2 40 

TGCTACCTTG GCAGAAGGGT CAACAGCTAA GCCATAACGT TCAGAAAGAG CCTTGATTTC 3 3 00 

TTTGCTAGCT CCACCGATAT CTAGCACACC ATTTTTAGTC AATTCACTAC CTAAAATGAT 3 3 60 
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GTTTTCAGCC ACTGTGAAGC CTTCAACCAA CATAAAGTGC TGGTGAACCA TCCCGATTCC 3 420 

CAAGCTAGCT GCTTTACATG GGGAGTCGAG ATTGACAACT TGACCGTTGA CCGCGATTTC 34 80 

ACCACTAGTT GGTTCAAGAA GCCCTGCTAA CATGTTCATT AGCGTGGACT TACCAGCCCC 3 540 

ATTTTCTCCT AAAAGTGCAT G AATTTC AC C TTTTCGTAGG TGCAAGTTGA TTTTGTCGTT 3 600 

GGCAACAAAT CCACCAAACA CCTTCGTAAT ATCACGCATC TCAATGACAT TTTCGTGTGC 3 660 

CATCTCCTCT TCCTTTCAGA GTCTTATTTT ATTTCAATAA AACTTGCTAG TTTCTCTAGT 37 20 

AGCAAGCTTT ACTTAGACAA AATGACTTTG TCTCAACTCT TAAAAAAGCG GCCCTTGGCC 37 80 

GCTTCCTAAG AAATGACTTC CATCCATTAT TTTTCAGGAA CTTTTACGCT TCCATCAAGG 3 84 0 

ATTTTAGCTT TTGCATCTTC GACAGCTTTT TTACCTTCTT CTCAAACGTT TGTTACTCCC 3 900 

AAGTCAACCC CTTTATCCTT CAATGAGTAA ACGATCACTT GACCGCCAGG GAATTCTCCT 3960 

CTTTCTCCCT TGTTAGAAAT ATCTTTTACA GTTGTACCAA CTTGTTTCAA ACT AG AT AC A 4020 

AGAACAAAGT TTGATTCTTT GCCATCTTTA GAAGTGTATT TACCTTCTCC TTCTTGCTCA 4 080 

CGATCAACAC CGATAACCCA AACTTTTTCA TTTTCAGGAC GGCTTTCGTT GAGAGATTTT 4 140 

GCCTCTGCAA AGACACCTGC ACCTGTACCA CCAGCTACTT GGTAAACAAT ATCTCCACCG 42 00 

GCTGCGTATT GTGCGGCTGC AATTGTTTTA CCTTTAGCCG CATCACCAAA TGAACCAGCG 4 2 60 

TAGTCAACTT GGACTTTGAT AGATGGGTCT ACTGACGCAA CACCAGCCTT GAATCCTGCT 4 3 20 

TCAAAACGAG ACATAACTTC AGATTCGATA CCACCTACAA AACCAACTTG TTTTGTCTTA 4 3 80 

GTTGTTTTTG CTGCAGCCAC ACCTGCAAGg TAACCTGACT C ATT AT C AG C GAAAGTTACG 4440 

CTCGCAACAT TCTTTTGGTC TTTAATCACA TCATCAATCA AGACATAGTT CAACTCAGTG 4 500 

TGTTCTTTTG CTGCATCTTT AACTGCATTA TTAAGGGCAA AACCAACACC GAAGATTAGG 4 560 

7TG7AAC7TC CAGCCCC7TC TTG C AAGTTG TTACC3TAGT CACCTTCACT TCTTGATTGG 4 620 

AAGTAAGTGA AACCGTTATC TTTTG AAAG A TTGTGTTCTT TACCCCAAGC CTGCAAACCT 4 680 

TCCCAACCTG ATTGGTTGAA TGATTTGTCA TCAACACCAC CAGTATCAGT GACGATTGCT 4 740 

GCTTTTGTCT TC AC AT C AG A AGATGAAGCT GCGTTACGAG AAGAGCGGTT ACCACATCCA 4 300 

GCAAGTCCAA CTGCTGCCAC TGCAACTAGG CCAAGACCTA GCCATTCTTT CTTGTTCATT 4 8 60 

ACTGAACCTC CTAAATAAGA TGTGCAACGA TGTTGCAAGT ATGGATTGGT TCGCCACAAG 4 920 

GACCGTGCCA CTCAGAGAGC GACTCACACT AGTTTAAGTC TGTAAAAGAG TATGGAAGTA 4 9 80 

ATTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC GACTAAGGTC ACTTTTAGAT 5040 

CTTGTTCAAA AAATTCAGCC ATCACTTGGC GACAAGCACC ACATGGCGAG ATCGGTTTTT 5100 

CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG GCCTTCAGAT ATAGCCTTAA 5160 
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AAATAGCTGT 


TCTCTCACCG 


CAATTGGTCA 


AAGGATAGCT 


AGCATTTTCA 


ATATTCACTC 


5220 


CCGTGTAAAC 


ACTTCCGTCT 


TTAGCTACTA 


AAACTGCTCC 


GATAGGAAAG 


TGAGAATAGG 


5280 


GGACATAGGC 


ATGTTTGCTG 


GTTTCAATTG 


CCAGTTCAAT 


CAACTCAGTA 


GTCGCCATCT 


5340 


GCCAATTCTC 


CTTTTAAAAT 


AGCTACCCCA 


GCTGACGTTC 


CGATACGGGT 


CGCACCTGCT 


5400 


TCCACAAAGG 


CAAGAGCATC 


TGCATAAGAA 


CGAGCTCCAC 


CGGCGGCCTT 


GACACCCATA 


5460 


TCAGATCCAA 


CTGTTTCACG 


C ATT AATGT A 


ACATCTGCTA 


TCGTAGCACC 


ACCAGTTGAA 


5520 


AAGCCAGTAG 


ATGTTTTGAC 


AAAGTCAGCC 


CCAGCTTTTT 


GGCCCAATTC 


GCAAAC AAC A 


5530 


ACTTTTTCTT 


GGTCTGTCAG 


AAGGCAAGCT 


TCAATAATGA 


CTTTCACTAA 


CTTATCACCA 


5640 


CTTGCTTCCA 


CTACTGCGCG 


AATATCTGAC 


TCAACCAAGG 


CTAAATTACC 


TG A TTTG AG A 


5700 


GCTCCAACAT 


TGATCACCAT 


ATCAATCTCA 


TCTGCACCAT 


TTTGGATAGC 


TTCTTTTGTC 


5760 


TCAAATCCTT 


TCACGGCTGA 


AGTTGTTGCT 


CCCAAAGGGA 


AACC T ACT AC 


TGTGCAAACC 


5820 


TTAACATCTG 


TGCCTTCAAG 


TCCTTTTTTA 


GCATGTTCAA 


CCCAGGTCGG 


ATT AACGCAA . 


5880 


ACACTGGCAA 


AGTCATACTC 


TCTAGCCTCA 


GACAACAAAC 


TATCAATTTC 




5940 


GCATCTTGTT 


TTAAAAGCGT 


ATGATCTATA 


TATTTATTTA 


ATTTCATTTC 


GGTTTTCCCT 


6000 


CCATTTAGGA 


GATGATTTCT 


ACAATTTCAC 


GGATTTTTTT 


CACTTCATCA 


CTTATTTTAA 


6C60 


CACATTTTTG 


GAAATCTGTA 


ACTAGTTGAG 


GTGGAATTTT 


TTCATTTGTG 


TATACTTTTG 


6120 


CAACAATTTC 


ACCCTTTTGA 


ACGGAGTCTC 


CAATCTTCTT 


TTCAAAAACA 


ATTCCTGTTT 


6180 


CATAGTCCAA 


GCCATCAGAC 


TTAACTGCAC 


GACCAGCACC 


CAGCCTCATG 


GCATAAAGAC 


6240 


CAAAGTCCAT 


AGCTGGAAGA 


GCTGAAATGA 


CACCCCTTTC 


CTGAGCAGGG 


ATTTCCACCA 


6300 


CATGAGCTAC 


ATTTACACCA 


CCATAGAGCT 


CTTCCAAGTC 


TCCACCTTGC 


GCTTGCACCA 


6360 


TTTCCT C AAA 


CTTAGCCAGT 


GCTTGACCAT 


TCTCAAGATG 


TTGGTGAACT 


TCTTCAACAG 


6420 


* TTTG I r AAC 


ATTTGCCAAA 


CCAAGCATAA 


TTTGAGCCAA 


TTCACAAATA 


AAGTGGGTAA 


6480 


TATCCTGACG 


TCCTTGACCT 


TGCAAAATCT 


CCAATGCTTC 


AAGGATTTCC 


AGACGATTTC 


6540 


CAATCGCTCG 


TCCCAAAGGC 


TGGCTCATAT 


CCGTAATCAC 


TCCTACTGTC 


TTCCGTCCAA 


6600 


CAACCTTACC 


AAGATCTACC 


ATAGTTTGAG 


CCAACTCACG 


CGCCTCATCA 


ACCGTCTTCA 


6660 


TGAAGGCACC 


CTCACCGACA 


GTCACGTCTA 


GCAAAATAGC 


ATCCGCCCCT 


GCCGCAATTT 


6720 


TCTTGCTCAT 


CACCGAACTC 


GCAATCAAAG 


GAATCGTGTC 


GACAGTTGCG 


GTCACATCAC 


6780 


GAAGGGCATA 


GAGAACCTTA 


TCTGCTTTGA 


CCAGCTGGTC 


TGATTGCCCA 


ATGACAGATA 


6840 


CTCCAATATC 


CTGAACCTGA 


CGAAT AAAAT 


CCTCTTGACT 


ACGTTCTACT 


TGATAGCCCT 


5900 
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TAATGGACTC CAATTTATCA ATTGTTCCGC CTGTATGGCC AAGACCACGA CCACTCATTT 69 60 

TTGCTACAGG CACACCGAAG CTAGCAACAA GAGGAGCTAA AATCAAGGTT ACCTTATCGC 7 020 

CGACACCACC AGTAGAATGC TTGTCAACTT TCACACCATC AATGGCTGAC AGGTCAAACT 7080 

CTTGCCCAGT CTTAACCATA TTCATCGTTA AATCAGAGAT TTCTCGAGTC GTCATTCCTT 7140 

TAAAATAAAC AG C CAT AGC A AAGG C AG AC A TCTGATAATC AGGAACAGTT CCTGATACAT 7 200 

AGCCTTCTAT CAGCCATTCA ATTTCACTTG AAGTCAGTTC TTGACCGTCT CGTTTTTTTT 72 60 

GGATTAAATC AAC TGCTCTC ATTCTTTCAC AC TT CT AAGG ATATAGTATC CCTTGTCTTT 73 20 

TTT AAGG ATT TCACAATTGC CAAACACATC TTCCATCTTA GACTTGGCAC TTGGAGCTCC 7 3 80 

TTGTTTTTTC TGGATGACGA TGGTCAAATC TCCACCAATT TCCAAGAAAT CTTTACTTTT 74 40 

CTCGATGATT TCATGAACGA CTTGCTTGCC CGCACGGATA GGAGGATTGG AAATGACATG 7 500 

GTCAAATCGC CCTTGAACTC TTGCATAAAT ATTAGATTGA AATATCGTCG CTTTTGCATT 7560 

ATTTTTT T CA GCATTTCTCT GAGCTAAATC CAGGCCACGA GTCTTAATAT CAACCATGGT 7 620 

CGCCTGAACT CCGTAAACCT TGACCAAGGA CAAACCTAAT GGACCATAAC CACAGCCTAC 7 680 

ATCTAGGACT GTCTCTCCTT GGTTGACATC CAGACACTTG AGCAAGAGTT CACTTCCAAA 7 740 

GTCAACCATT TTCTTCCTAA AAACACCCGC AT CTGT C AAA AAAGTCATTT TTTCTCCCAA 7 300 

CAAGTCCACT CT C AACT CAT GAATGTCGTG AGCAGCGTCA GGATTTTCTG CATAGTACAT 7 860 

TTTACTCATG ACACTATTTT A CC AT AATTT GACTCAAATT GTAAATCCTT TACAAATTGA 7 920 

T AAT AAAACG AAAAAGACCG AAGAAAGCAA GTCACGAAGC CATTTTCTTC AATCTCTTTC 7 980 

AACACTTATA AATAATAAAC CATTTAGAAC TATAAATATC ACAGTCCAGA TAAAAACAAA 804 0 

AAGTTTATCA TCTATAATCA CCCAGATTAT TATTTCTATT GCTTAACCTT AAAATACTTT 8100 

ATTATCAACA AAATTCCTAA CAAAATGTTT AGATAAAAGC CCAACTGATA CGTTTATGTC 8150 

AGGATTTCCA AACTTGTCCA AAGTCGTATC AAATCTTCTA GTCACATGTG GAAGAAATAA 8 220 

CCCTCTGTCG CAATCCGTAG GACTAAAAAG C AAT AACT AC CCGCAGCAAT CCATTTCGTC 8 2 80 

CATCGTTTTT TAGTAAGAAA GCAATTAAGA ACGAACAAAT AAAGACAGCT GTTACAATAG 8 3 40 

C ATGTTC CAT CAAAAAAGTA AAACCGTAAT AGGTTTCCAC AAAGCATCTA CCATTATCTG 8400 

CATTGGTTCC TTTTATAAAA GGTAAAGCAA AACTTAAAAT AAAAC AG AG T TCCAATATGT 8 4 60 

AACGTTTTAA GATTTTCATA GTACACCTCC TATAAGTTGT GAACTAAAAA GCCCCCTTTA 8 5 20 

TAAGCTTATA AATCAGTAGA ATCTATCTCC TATTTCATCA ATAAATTGAT C ACTT AT ACT 8580 

ATATACCATT GACTTACCAC ATTCAAGAAA CCGCTTTATT TTTTTAGCTT TTTATGGTAT 8 64 0 

GATAGACAAA ATATCTAGGG G AAAAC AAAT GACCAACGAA TTTTTACATT TTGAAAAAAT 8 700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAAGACAACA CCTCCTTTGA CAGAAGAAGA 87 60 

ATTGG AATCT ATCAAGAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 8820 

TCTCCCCTTG GCTCATTTGA TTC AGATTT A CAACCGAACT AAGGAAGATT TAGCCTTTTC 8 8 80 

AAAAGGAATT TTCCTCCA 889 8 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 13188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(Di TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
TATCTTAACG aGGATTGCGT TTATCGTCAG TCTTATTGCC CTAATTGTGG GAACAATCCC 60 

TTAAATCATT TTGAAAATAA TCGGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 120 

GAGTTTGAAC TAAACAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTCCTTAT 180 

GCAACGATGA TGAAGCGTGT GCAGGCAGAT AATAATCCTA ATTTCTTTTT TTTAACTTAC 240 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AG C AATTT GT T AC AC CG AAA 300 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGGTTG GATTGGTTGT 360 

AACATTGATT TATCACAAGT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA AGATGGACAA 4 20 

GTTAGAGATC CAGAAAAAGT TACAAAAGAA TTTAAGCAAG CTTTATTTTT AAGGAAGAGC 4 80 

TCTCTGTCAT CAAGAGCTTG GACAATAGAA ATTCTAAATT GTATAGATAA GATAGAGGGT 54 0 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 600 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 66 0 

ATAATAGAAT TTAAAGGTAG AGGAAAGTAT CGGAAATTAT GAAAACGAAA C AACTTGT*^ 720 

CATCAGAAGA GGTGTATGAT TTCTTAAAAG TCATCTGGCC TGATTATGAA ACTGAAAGCC 7 80 

GTTACGATAA CCTAAGTTTA ATCGTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGGT 84 0 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 9 00 

GGTGGGAAGT AGGAGATAAA TTGCCAGAGT GGCTACATAG CTCCTATCAT AGATTATTGT 960 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA GTATACAGTA GAAATTACAG 102 0 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTGAAAATCC AGATGAAGCC GAACGACTTG 10 80 

TAAGAGAAAA GTATAAGAGT TGTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT 


ATATGAATAG 


GTAGATGTTT 


TTATTTTGTC 


AACAAAAAAG 


AGGCTCGCAC 


1200 


CTCTTTTTCT 


TATTTCTTTT 


TATGATTTAA 


TACGGCATTG 


AGGACAATAG 


CGAGTAGGCT 


1260 


GGCTACGACG 


ATTCCCTTTG 


AGAAGAACAT 


TTGGAAGGCT 


GTCGGCATGC 


TGACAAAGAG 


1320 


ATTACTGTTG 


TTGAGACCGA 


CACCTGCAGC 


GATTGAAACA 


GCTGCGATAA 


GGAAGTTGTG 


1330 


TTCATTGTTA 


GCAAAGTCAA 


CACGGGCGAG 


GATTTGCATC 


CCTTGAATTG 


ATACAAAACC 


1440 


AAACATTACC 


AGCATGGCAC 


CACCGAGGAC 


GGAGCTTGGA 


ATGATTTGGG 


CAAGGGCGCC 


1500 


AAACTTAGGA 


AGCAGTCCAA 


GGAGAACCAG 


GAAACCAGCT 


GCGTAGTAGA 


TTGGCAGGCG 


1560 


TTTTTTGATG 


CCTGACAATT 


TAACCAAACC 


AACGTTTTGT 


GAAAATCCGG 


TGTAAGGGAA 


1620 


GGTGTTAAAG 


ATTCCTCCGA 


GAAGTACGGC 


CAAACCTTCT 


GCGCGGTATC 


CGTTGCGAAG 


1680 


CCGCGTGCTG 


TCGATTGGAT 


CCTTTGTGAT 


ATCAGACAAG 


GCCAGATAAA 


CACCAGTTGA 


1740 


CTC AACC AT A 


GACACCGTTG 


CGATGATACA 


CATCATGACA 


ATAGATGAGA 


TTTCAAACGT 


1300 


TGGCATCCCA 


AAGTAGAGTG 


GAGTTGGGAC 


ATGGACAAGT 


GCAGCTACCG 


CAACAGGACA 


i860 


GAAGTCCACC 


AAGCCCATAG 


TAGCAGCAAT 


GGCAGTTCCA 


ACAACCAGAC 


CAATCAAAAT 


1920 


AGAGATAGAC 


TTCATAAATC 


CTTTGGTAAA 


GATGTTGATC 


AAGAGGATAA 


TCAGAACAGT 


1980 


AATAGCTGCA 


AGCAAGAGAC 


TTTGACCAGT 


TGGCTCTGGA 


ACCTTATTTC 


CCATATTTCC 


2040 


AATAGCGACA 


GGGATCAAGG 


TTAAACCAAT 


CGTGGT AAT A 


ACAGATCCTG 


TTACGATACA 


2100 


TGGGAAGAGA 


TTGGCTACTT 


TTGAGAAGAT 


CCCTGAAACA 


AGAACCACCT 


AAATCCCAGA 


2160 


TGCGATAAGG 


GCACCAAACA 


TAGCGCCACT 


ACCATGGCTT 


TGCCCAATCA 


TAATCAAGGG 


2220 


AGCGACCGAC 


TGGAATGCAA 


CTCCAAGAAC 


GACTGGGAGT 


CCAATCCCAA 


AGTATTTGTT 


2280 


GAGTTGGAGT 


TGGAGGAAGG 


TTGCCACCCC 


ACACATGAAG 


ATATCTCTAG 


AAATCAGGTA 


2340 


GGTCAACTGC 


TCAGCTGAAT 


AGCCAAGGGC 


TCTCGCAATC 


ATGATGGGAA 


CCAGCATAGA 


2400 


TCCTGAGTAC 


ATGGCTAGTA 


AGTGCTGCAA 


GCCAAGAACG 


GCTCCTTGCG 


AGTGTTTTTC 


2460 


TTGAGTTTGC 


ATTAGAGATC 


TGCCTCCTTA 


AATACGACTT 


GACCATTTTC 


AAAACAATCC 


2520 


AAACGAGCAA 


GTGATAGGAC 


AGGGTAGCCT 


GCTTTTTCAA 


GCAAATCACG 


ACCATCTTGG 


2580 


AAGGATTTCT 


CAATCACGAT 


ACCGATAGCT 


TGGACTGTGG 


CACCGGCCTG 


TTCCATGATT 


2640 


TGAATCAACC 


CTTTAGCACC 


TTGGCCATTA 


GCAAGGAAAT 


CGTCGATAAT 


CAAAACCTTG 


2700 


TCCTCTGGTG 


AGAGGAATTT 


TTCAGCGATA 


GAAACGGTGC 


TGGTCACCTG 


CTTGGTAAAG 


2760 


GAGTAGACTT 


GACCAGTTAA 


GATGCCTTCG 


TTCATGGTGA 


TGTTCTTAGC 


TTTTTTGGCG 


2920 


AAAATCATGG 


GAACGTTTAA 


GGCTTCAGCT 


GTAAAAACGG 


CTCGGGCAAT 


ACCCGACGCT 


2880 


TCAATGGTTA 


CGACCTTGG7 


AATGCCAGTA 


GTAGCAAATT 


TTTCCGCAAA 


AACCTTACCA 


2940 
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ATCTCTCGCA TCAAGCTAAA GTCAACTTGG TGGGTTAAAA AGGAATCTAC CTTGAGGATG 3000 

TTATCACCCA AGATATGCCC ATCCTTGAGG ATGCGCTCTT CTAATAATTT CATAAGACCT 3 060 

CCTAAAGTCT AAAAGTT AAT TTACTTGTTG TTTAAATATT TCTATAGTGA TCCCTTTTGC 3120 

TAATACTATA TATTTGATAA AACTATTACG AGCGAAGCGA GTCTTATCAA ATATTTCCCG 3180 

TTGTAGTGGT AT CAT AG AC A ATAATCTTGT TATTGTCTAT GACGGGATTT TTGAGAGTAA 3 2 40 

AATAGTTCGG GGAACTATTT TAGCCTAAGC CTAGAAATGA AAGAGCTAGG GGCTCAAAAA 3 3 00 

TTAGGGATGA AATTC CCTGG ATTCCTGAAA TTATTCACAG GATAATTTCA CCTCCCGTCC 3 3 60 

GCACTAATTA AGGGAAATAT TAAAAAAAGA CCTACTTAAT CTCTAAGTAA GTCCCCTAAA 34 20 

TAGACATGGC AAAAACGGCC ATATCTCACT GCTGACTTAC TTATTGTTAG GTGTTCCGGC 34 80 

ACCTTGTAGA AACGTCGTGC CAATTCACGA CATAAACAAG TAAAACGATA TTCAATTTTA 3 54 0 

AATAGGCTTG AGCCAATGTT TTTATTTTAC ACT AAAT AAC TTTAGAAATC AACTATTTTG 3 600 

TTAGTGTTTT GGTTTAAAAA ACGAACAAAA AGAAGAGAGG GTGAACAAAA ACTCCATTGT 3 6 60 

AAGCTAACAG T TAT ACT AAA TGAAAATCAA AGAGCAAACT AGGAAGCTAT CCACAACCTC 37 20 

AAAACACTGT TTTGAGGTTG TGGATAGAAT TGACAGAGCC AGT AT C AT AT ACCTACGGTA 37 80 

AGGCGACGTT GACGTGGCTT GAAGAGATTT TCGAAGAGTA TTAGAAGATT TTTCCATCAT 3840 

AAAAGGCATA CTATCAAGCT TTTAGACACC TGACAATATG CCTTTTTCTA ACTTTAAAGA 3 9 00 

CTTTTCCCAA TTTTTATTAT TCTACTCGCT AAATCTTAAA AAATAGCCAT CTGGATCCAA 3 9 60 

AACTGCAAAT TTATGAGGAT AGATATAGGG ATC ACT G AC A CGAAACTTTC TTTTGGTCAA 40 20 

GGGACGATAA ATAGGATAGT TTGCCTTCAT CACTCTTTAA TAGAGTTTTG AAACATCCTT 4080 

TATGCCAAAG GAGAGATTGA CTCCACGACC AAAGGGATAG GTCAGTTCAG CTAGTTGATC 414 0 

CTTTGTTCCC TCCTCTAACA TTAGTTGACA CTCTTCAAGA GAAAGAGAAA GTTTTCTTCT 4 200 

GGACGTTGGT ATTCAATCCT AAAACCCAGT AAACCACAGT ACAACGACCC GGACTGTTCG 4260 

ATATTCGATA CAAGCAACTC GGGAATGACC GCATTGTAGT CCATATAGAA AATCCTTACA 4 320 

AGTCAATTTC CAAGACAATC GGTGTATGGT CTTGGCGACC ACCTGAGTCA ATCATATCAG 4 380 

ATTTAGTGAC CTTGTCAGCG ATACGGTTAC TTGTGAGCCA GTAGTCGATT CTCCAGCCTG 4 440 

T ATTGTTG AT TTTAGAAGTT TTGCTCCCTT GTGCCCACCA AGTGTAGCGT TCAGGAACAT 4 500 

CGCCATGAAC ATGGCGGAAG GTGTCTGTAA ATCCAGTTGC CAAAAGGTTG GTAAATCCAG 4 5 60 

CACGTTCCTC GTC AGT AAAT CCAGGTGAAC GGCGGTTGCT AGCAGGATTT GCAAGGTCGA 4 620 

TTTCATTGTG GGCTACGTTG TAGTCACCGG TCGCAAGGAC TGGTTTTTCT TTGTCTAGTT 4 6 80 
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CAGCCAAATA 


CTCAGCATAT 


TTGGCATCCC 


AGACTTGGCG 


TTCTTCCAAG 


CGTTTGAG AC 


4740 


CGTCACCAGC 


GTTTGCACTG 


TAAACTTGGG 


TTACGAAAAA 


TGCATCAAAT 


TCTAGAGTGA 


4800 


TGATACGACC 


TTCCAAGTCC 


ATGGTAGAAG 


GGGCACCGAT 


TTCTGGGAAC 


CTG AT AG T AG 


4860 


GTGTAAGTTC 


TTTCTTATAA 


AGGAACATGG 


TTCCAGCATA 


GCCTTTACGG 


GCAGGCTCTT 


4920 


GGGAAGAGCG 


CCACGTGTTT 


TCGTAGCCTG 


GGAAGAGTTC 


TTCTAAAATT 


TCCACGTGTT 


4980 


TCTTTGTAGG 


TCCTTTCGCA 


GAAAGCTTGG 


TTTCTTGGAT 


AGCAATGATA 


TCAGCATTTT 


5040 


CAGCGACCAA 


GGTTTGTAGG 


ACTTCTTGGG 


ACAATTTGGC 


ACGAGCTGAG 


TCACTAGTTA 


5100 


GGGCAGCGTT 


TAGGGAATCA 


ATATTCCATG 


AGATAAGTTT 


CATAAAGTTA 


CCTTTTTCAT 


5150 


TC AG ATT AT A 


GATTTTATTA 


TACCAAAAAA 


ACATCTATTT 


CCCCAACGTA 


TGGTTTGAAA 


5220 


AATTACTCTC 


TTTCGTTTAT 


AATTAAGAAT 


GATTTTATGA 


AAGGGAGTGA 


AAATACATGA 


5280 


AATTCTACTC 


TT ATG ACT AT 


GTACTCAGCC 


AAATCGGTCA 


GCAAAATGGT 


ATCATGGTTG 


5340 


GCTTTGGGAT 


TGTTCTATTA 


GCTGTGACAG 


TTTTTTTTGC 


TTTCAAGGCA 


TACCATAATA 


5400 


AAAAGGGAAG 


CGAATTTCGT 


G AGTTGGT C A 


TGATTTCACA 


TCTGGCCTTA 


TTTACCTCTG 


5460 


CTTTTGGTCA 


GCATCACGAC 


TTATCAAAAC 


AATCAAGTTT 


CTAACAATAA 


ATTTCAAACT 


5520 


TCACTTCATT 


TCATCGAGGT 


TGTTTCCAAA 


GATTTGTGAG 


TAGACAAGTC 


AGAAGTCTAT 


5590 


GTTAATACTT 


CCACAAACAC 


AGATGGCGCA 


CTT AT C AAGG 


TGGGAGATCG 


CTATTATCGT 


5640 


GCCCTAAATG 


GAAGTGAGCC 


AGACAAGTAC 


CTGTTAGAGA 


AAGTCCAATT 


GTATAAGACA 


5700 


GACGCAATTG 


AACTGGTGCA 


TGTGAACAAA 


TGACACTTAA 


TTATATCGAA 


ATTTT AATC A 


5760 


AACTGGTCTT 


G ACT CT C AAA 


TACCTCAACA 


ACAATGTTCA 


CTTTGTG AAA 


CGTTTCATTG 


5320 


ATGGTAAGCC 


AACTCTCCTT 


ATCAAAAATG 


GGAATATTGA 


CCCACAACCC 


TGTCGTTCAG 


5880 


TTGGTTTGTC 


TGCATCGGAT 


GTATCCCTCA 


AACTTCGTAG 


C C AAGG G ATT 


TTCCAGATCA 


5940 


AGCAAGTCAA 


ACGAGCTGTG 


CAAGAGCAAA 


ATGGGCAACT 


CATCGTTCTG 


CAAATGGGAG 


6000 


ATGAAAATCC 


TAAGTATCCA 


GTTGTGACTG 


ACGGTGTGAT 


TCAAGTAGAT 


GTCTTGGAAT 


6060 


CGATTGGTCG 


TAGCGAAGAG 


TGCTTGCTTG 


AT AAC CT C AG 


TAAACAAGGG 


CATGACAATG 


6120 


TAGCCAATAT 


C TTT ATTGCT 


GAATATGACA 


AGGGTGCTGT 


TACAGTCGTA 


ACTTATGAAT 


6130 


AAGAAAAACC 


TGGGGTCTTG 


TACT CTTCG A 


AAATCTCTTC 


AAACCGCGTC 


AACGTCGCCT 


6240 


TGCCGTATGT 


AGGTTACTGA 


CTTCGTCACT 


TCTATCTACA 


ACCTCAAAGC 


AGTGCTTT G A 


6300 


GCAGCCTGCG 


GCTACTTTCC 


TAGTTTGCTC 


TTTGATTTTC 


ATTCAGTATT 


GGCCTCAGGT 


6360 


TTCCATTTGC 


AATC AG AAAG 


GGATTTTATG 


TCCATTATTC 


AAAAACTTTG 


GTGGTTTTTC 


6420 


AAGTTAGAAA 


AACGCCGTTA 


TCTAGTCGGA 


ATTGTGGCCC 


TGATCTTGGT 


TTCCGTCCTC 


6480 
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AATCTCATTC 


CTCCTATGGT 


TATGGGGCGG 


GTCATTGATG 


CCATCACATC 


GGGGCAATTA 


6540 


ACCCAGCAGG 


ACCTCCTTCT 


TAGCCTATTT 


TACTTGCTAC 


TTGCAGCCTT 


TGGTATGTAC 


6600 


TATTTGCGCT 


ATGTGTGGCG 


TATGTATATC 


CTTGGCACCT 


CTTATTGCTT 


GGGACAGATC 


6660 


ATGCGGTCTC 


CCTTCTTTAA 


G C ATTTC AC A 


AAAATGTCGT 


C AG C CTTTT A 


TCAAACCTAT 


6720 


CGGACGGGTG 


ATCTGATGGC 


ACACGCAACC 


AATGATATCA 


ATGCCTTGAC 


TCGTTTAGCA 


6780 


GGTGGCGGTG 


TCATGTCTGC 


GGTGGATGCC 


TCTATCACGG 


CTCTGGTGAC 


TTTGTTGACC 


6840 


ATGCTCTTTA 


GCATCTCATC 


GCAGATGACT 


CTTGTTGCCA 


TTCTCCCCCT 


ACCTTTCATG 


6900 


GCCTATACGA 


CTAGTCGCCT 


AGGGAGAAAG 


ACTCATAAGG 


CCTTTCGCGA 


ATCCCAAGCT 


6960 


GCTTTTTCTG 


AACTCAATAA 


CAAGG T AC AG 


G AGTCCGT AT 


CAGGTATCAA 


AGTGACCAAG 


7020 


TCTTTCGGTT 


ATCAGGCAGA 


CGAGTTGAAG 


TCTTTTCAGG 


CAGTCAATGA 


ATTAACCTTC 


7080 


CAAAAGAACC 


TGCAAACCAT 


GAAATATGAT 


AGTCTCTTTG 


ACCCTATGGT 


TCTCTTGTTT 


7140 


GTTGGTTCGT 


C CT ATCTTTT 


AACGCTTTTG 


GTTGGCTCCT 


TGATCCTTCA 


GGAAGGGCAG 


7200 


ATTACAGTTG 


GGAATCTAGT 


CACCTTTATC 


AG CT ATTTGG 


ATATGCTGGT 


CTGGCCTCTT 


7260 


CTGGCCATCG 


GTTTCCTCTT 


TAATACTACT 


CAGCCAGGGA 


AG G TTTCTT A 


CCAGCGGATT 


7320 


GAAAATCTTT 


TGTCTCAGGA 


ATCTCCTGTA 


CAAGACCCTG 


ACTTTCCTCT 


GGATGGTATT 


7380 


GAAAATGGGC 


GTTTGGAGTA 


TGCCATTGAC 


ACCTTTCCTT 


TTGAAAATGA 


GGAAACACTG 


7440 


ACGGATATTC 


ACTTTAGTTT 


GGCAAAAGGG 


CAAACACTGC 


GCTTGGTTGG 


GCAGACAGGC 


7500 


TCTGGGAAAA 


CGTCCTTAAT 


CAACCTCCTC 


TTGCGTGAAT 


ACGATGTGGA 


TAAGGGTGCC 


7560 


ATTTATCTAA 


ACGGTCACGA 


TATT ZGGGAC 


TATCC7CTGA 


CAGACCTTCG 


CAGTCTCATG 


7620 


GGCTATGTTC 


CTCACCACCA 


G TT T CTTTTT 


GCCACTTCAA 


TCCTAGACAA 


TATCCGCTTT 


7680 


GGCAATCCTA 


ACTTGCCCCT 


TTCAGCGGTC 


GAGGAAGCTA 


CTAAGCTAGC 


CCGGGTTTAC 


7740 


CAAGATATTG 


TAGACATGCC 


TCAAGGATTT 






nuunu 1 v_ /^v^ i 


f o uu 


CTTTCTGGTG 


GTCAAAAGCA 


ACGGTTGGCT 


ATGAGTCGGG 


CTATGATTTT 


AGACCCTGAT 


7860 


ATCTTGATTT 


TGGATCATTC 


CTTATCCGCC 


GTAGATGCCA 


AGACAGAGTA 


TGCGATTATC 


7920 


GACAACCTCA 


AGGAGATGCG 


AAAGGACAAG 


ACAACCATTA 


TCACTGCCCA 


TCGCCTCAGT 


7980 


GCTGTTGTCC 


ATGCAGATTT 


TATTTTAGTT 


CTACAAAATG 


GTCAAATTAT 


CGAACGAGGC 


8040 


ACGCACGAAG 


ACTTGCTAGC 


TTTGGATGGC 


TGGTATGCCC 


AAACCTACCA 


GTCTCACCAC 


8100 


TTCGAAATGA 


AAGGAGAAGA 


AGATGCAGAA 


TAAACAAGAA 


CAATGGACTG 


TATTGAAGCG 


8160 


CTTGATCTCT 


TATCTCAAGC 


CTTATGGACT 


CCTGACCTTT 


TTGGCACTCA 


GTTTTCTCCT 


8220 
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AGCGACGACG GTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 8 2 80 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTATG GTCTCTACAT 8 3 40 

CCTACAAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG AT GCCTTTGC CAATATGGAG AAACTGG GC A TGTCTTACTT 84 60 

TGACAAGACG CCAGCAGGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACG AT T AG 8 520 

TG AT ATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8 580 

CCTTTATACC ATGTTGGTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTC A ATCTCTATCG AAAAAAGTCA GTGAAAATCA TCGAGAAAAC 8700 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA GCTGGCAGAG AATATCGAGG GAATCAGGAT 37 60 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGGCAGAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTT7TTCA GACCTGCCAT 8880 

GAGTTTGCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT AC CGTGGTTT 894 0 

TTCTATCGGG ATAACGGTCG GGACCATGTA TGCCTTTATC C ACT AC AT C A ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGCCTA TGGTTTCTGC 9060 

AGGTCGTGTC TTTGCCCTGA TAGACGAGAG CACCTATCAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATGTG TGTTTCTCAT ATGACGGTAA 9180 

ACATCCGATT CTGGATG AC A TTTCTTTCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9 24 0 

AGGTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATGCCCT TTTATGAATT 9 3 00 

CCAGTCAGGG AGAGTTCTCT TGGATGATGT CGATATCAGG GATTTCAGTC AAGAAGAGCT 9 3 60 

CAGAAAAAAC ATCGGTTTCG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9 420 

CAATATCGCC ATGTACCAAG AAACCAGTGA TGAGCAGGTT CAGGCTGCCJG CAGCCTTTCT 9 4 30 

GGATGCAGAT TCCTTTATTC AAG AACTTCC TCACGGGTAC GACTCCCCTG TTTCCGAGCG 9 540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC TTTCCTACAA CAGTCGCCAG 9 600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC G AC AG C C AA T ATTGACTCTG AAACAGAAAG 9 660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTG CTATCGCTCA 9720 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATGTC TTGCATAAGG GACGCATTAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTGCGAGGA ACCTATCACA AGATGTATAG 9 840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9 900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9 9 60 

TTACAGATTT CTTTCACCGC CTTTTCCATT TTGTCGTATA ATCAAAAATC TTGACAAATA 10020 
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GT AT AAT AAA AACAAACGAG AACAGCATCC TGAAATGGGA AGACTTGCCT GTGGAAATGA 10030 

AATCAAGCGA GGTTGAGTCT TACTACCAGC TTGTCTCTAA AAGGAAGGGT TCGCTGATTT 10140 

TCAAGCGTTG CTTGGACTGG GTTTTGGCCT TGGTCTTACT GGTTCTGACC TCTCCCATCT . 102 00 

TTCTCATCTT GAGCATTTGG ATCAAGTTGG ATAGCAAAGG GCCAGTGATT TACAAGCAAG 102 60 

AGCGTGTGAC CCAGTACAAC CGTCGGTTCA AG ATT TGG AA GTTTCGTACC ATGGTGACGG 10320 

ATGCGGATAA AAAAGGAAGT CTGGTGACTT CTGCTAACGA TAGCCGCATT ACCAAGGTTG 103 80 

GAAATTTCAT CCGACGTGTC CCTTTGGACG AACTGCCTCA GTTGGTCAAT GTCCTTAAAG 10440 

GTGAGATGTC CTTTCTCCGT ACACGACCTG AAGTGCCACG TTATACAGAG CAGTATAGCC 10500 

CTGAAATGAT GGCAACCTTG CTCTTGCAAG C AGGG ATT AC CTCTCCAGCC AGCATCAACT 10560 

ACAAGGATGA GGACACAATT ATCAGTCAAA TGACGGAGAA AGGTCTGTCA GTTGATCAGG 10620 

CCTATGTCGA GCATGTTCTT CCTGAAAAGA TGCGCTATAA CCTCGCCTAT CTCCGAGAGT 10680 

TTAGTTTCTT TGGGGACATC AAAATCATGT TTCAAACCGT GTTTGAGGTA CTAAAATAAA 10740 

GTAGTCATAA GAAAATGAGT ACAGATAAAA GGAGCAAATC AATGCCAAAT TACAATATTC 10800 

CATTTTCACC CCCTCATATC ACAGAAGCAG AAATTACTGA AGTAGTGGAT ACCCTCCGTT 108 60 

CTGGTTGGAT CACAACAGGT CCTAAAACAA AAGAACTGGA GCGCCCCTTG TCTCTTTACA 10920 

CACAGACACC TAAGACTGTT TGTCTCAACT CTGCGACAGC CGCTCTGGAG TTGATTTTAC 10980 

CCGTTTTCCA AGTGGGACCT GGTGATGAAG TCATCGTTCC AGCCATGACC TATACGGCTT 1104 0 

CATGTAGTGT CATTACGCAC GTGGGAGCAA CCCCTGTCAT GGTGGATATC CAAGCAGATA 11100 

CGTTTGAGAT GGACTATGAC CTGCTTGACC AAGCTATCAC TG AG AAAACT AAGGTGATTA 11160 

TTCCAGTAGA GCTCGCAGGG ATTGTTTGCG ATTATGACCG TTTCTTCCAA GTCGTGGAGA 11220 

AAAAACGTGA CTTCTTT AC C GCTTCAAGCA AGTGGCAAAA GGCCTTTAAC CGTATTGTCA 112 80 

TTGTCTCTGA TAGTGCCCAC GCTTTGGGAT CTATTTATAA AGGACAACCT TCTGGTTCTA 11340 

TCGCTGACTT TACTTCCTTC TCATTCCATG CAGTTAACAA CTTTACAACG GCAGAAGGTG 114 00 

GAAGTGCGAC TTGGAAAGCC AATCCAGTGA TTGATGACGA AGAGATGTAC AAGGAATTCC 114 60 

AAATCCTTTC CCTTCACGGG CAAACTAAGG ATGCTCTTGC CAAGATGCAA CTGGGGTCAT 115 20 

GGGAATACGA TATCGTTACA CCAGCCTATA AGTGCAACAT GACCGATATC ATGGCTTCAC 115 80 

TTGGTTTGGT ACAATTGGAC CGCTATCCAA GTTTGTTG C A ACGCCGTAAG GACATTGTGG 11640 

ACCGCTATGA TAGTGGTTTT GCAGGTTCTC GCATCCATCC TTTGGCACAC AAGACTGAAA 11700 

CTGTCGAATC TTCACGCCAC CTCTACATCA CCCGTGTAGA AGGAGCAAGC CTAGAAGAAC 117 60 
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GCAACCTCAT 


CATCCAAGAA 


TTGGCTAAAG 


CAGGAATTGC 


AAGTAATGTT 


CACTACAAAC 


11320 


CGCTTCCTCT 


CTTGACAGCC 


TATAAGAATC 


TTGGATTTGA 


TATGACGAAC 


TATCCTAAGG 


11390 


CCTATGCCTT 


CTTTGAGAAT 


GAAATTACCC 


TCCCTCTTCA 


TACTAAATTA 


AGCGATGAAG 


11940 


AAGTAGACTA 


TATCATTGAG 


ACTTTCAAAA 


CAGTTTCTGA 


AAAAGTGCTA 


ACTTTATCAA 


12000 


AAAAATGACA 


AACT ACAGT C 


AAGCGAAAGT 


GATCCTGCCC 


CTAAAAAGTC 


TAATTGAGTG 


12C60 


T AAAAACT GT 


TGTTTTCAAT 


TGATAATAGT 


TTACACCTGT 


AGTTGAGGCC 


CCTTTCTCCT 


12120 


CAGAGAGAGA 


ATTTTT AT AG 


GATT TTCCTT 


TCTTGTGGGA 


GTCCCGTGGT 


TTGAAATAAG 


12130 


ATGTGAGCAA 


TTTAGTGTAG 


CATTTAGAAT 


CCTTACTAGA 


CATCATTTAG 


AAAATCTAGT 


12240 


GTCTTGTTCT 


AGTTTTCAAT 


TCACCCTATT 


TTTTGAAAGA 


CGTGAGTTTC 


CATGAGTCAG 


12300 


ATTGTGGAAA 


CTCGCGTCTT 


TTTTTGTTTT 


CAGAATATTG 


TTCAAAATTT 


TGTGCCTGTC 


12360 


TTTCATCTTC 


TAGTCATTCT 


TTTGCATGAT 


AG AATTT AT A 


GCATGTTGAT 


ATTATAATAA 


12-120 


TACAAATATT 


CTATATGTTT 


AGTGATGCTT 


GCTATACATT 


ATT AG ATCT C 


CTGCGAGACA 


12480 


ATCT AT AAAA 


CACTTGTCTA 


CGATTACCTA 


TATGCCCTAT 


TCCAGTATTT 


TAGAAGCACT 


12540 


GCATCTATTT 


TTATCGAGGT 


TAAATCTAGC 


TTTTATAGAA 


GGTCTATTTA 


AGAAATATAT 


12600 


TGTAGTGTTT 


TAGTTTCAAT 


CCGCCATATG 


AGCGATATTC 


AGCTAAATAT 


CCCTGGCGAA 


12660 


TGCTTGTATG 


ACAAGGTATT 


TGTTCTTTCA 


TTT AT AATTT 


ACAACATATC 


AACAAATTTA 


12720 


AATATAGTAA 


ATGGGATATT 


TTATATTCAA 


GCTAAGAAAG 


ATAGCATCAC 


TTTTGAATGG 


12780 


AAGGCTAAAG 


AG C AAACT AG 


GAAGTTGGCC 


AT AG AT AGCT 


CAAAACCCTG 


CTTTGAGGTT 


12840 


GT AG AT AT AG 


TAAAATGAAA 


TGAGAATAGG 


ACAAATTGAT 


CGGGACAGTC 


AAATCGATTT 


12900 


CTAACAATGT 


TTTAGAAGTA 


G AGGTGT AC T 


ATTTTACTTT 


CAGTCTACTA 


TAGAACTGAC 


12960 


CAAGTCAGTA 


ACCTAGACTT 


AGGGCAAGGC 


GGCACTGACC 


TAGTTTGAAG 


AGATTTCCGA 


13020 


AGAGTATAAA 


TTTTAATATT 


TTCTTGTCTT 


ATTCCTTGAC 


AATT C AATTT 


GGAAAATATA 


1 3080 


TGATAAAGAT 


AATGACAGCG 


GTGTCATTCT 


ATCTATTTTA 


AGAAAAGTAA 


T AATC AATTG 


13140 


TTAAAAATAG 


TAAAAAAATT 


GGACGTTCTG 


ATGAAATATT 


TTGTTCCG 




13183 


(2) INFORMATION FOR S 


EQ ID NO: 71: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 71: 
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AACGAGTGCA TCAGTCTCAG CAAGCACCAG TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 

TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 

AACAAGTGCA TCGGCTTCAC CAAGCACAAG TGCTTCAGCC TCAGCAAGTA TCTCAGCGTC 180 

TGAATCGGCA TCAACCAGTG CCTCCCCTTC AGCAAGTACT AGCGCCTCAG CAT C AG CGT C 2 40 

AACAAGTGCT TCGGCTTC AG CGTCAACGAG TGCCTCTCAG TCAGCATCAA CGAGTACGTC 300 

AGCCTC AG C A AGCACATCAG CTTCTGAATC TGCATCAACC AGTGCGTCAG CCTCAGCATC 3 60 

GACAAGCGCC TCACCTTCAC CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 4 20 

AGCCTC AGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 4 80 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 

AGCAAGTACT AGTGCATCAG CATCAGCATC AACGAGTGCA TCGGCTTCAC CAAGTACCAG 600 

CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTC AGCCTC 6 60 

AGCAAGCACC ACTCCCTCAG CTTCAGCAAG T AC C AG TGCG TCAGCCTCAG CGTCGACAAG 72 0 

TGCGTCGGCT TCAGCAAGTA CCTCACCGtC TGAATCAGCA TCAACGAGTG CATCAGCTTC 7 SO 

AGCATCAACA ACTGCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACCAG 84 0 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCCGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCACCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAACCG CCTCAGCTTC 102 0 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 10 8 0 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC ACTCCGTCAg CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 12 60 

CACCAGTGCG TCAGnCTCAC CAAGTACCAG CGCCTCACCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 13 80 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 144 0 

AGCTTCAGCA AGTACCAGTG CGTCCGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 15 60 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 162 0 

AACGAGTGCT TCGGCTTCAC CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 16 80 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 17 4 0 
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GACAAGTGCT TCCCCTTCAC CAT C AACG AG TGCGTCGGCC TCAGCAAGCG CAAGTACCTC 1800 

AGCGTCAGc C TCCGCCTCAA CCAGTGCGTC GGCTTCAGCA AGCACAAGTG CGTCAGCCTC 1860 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCTGAGTCAG CATCAACGAG 1920 

TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 19 80 

AGCATCGACA AGCGCCT CAG CTTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 204 0 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 2100 

TACTAGTGCA TCAGCTTCAG CATCAACGAG TGCATCGGCT TCAGCATCAA CCAGTGCCTC 2160 

GGCTTC AG CG TCAACCAGTG CGTCAGCTTC AGCAAGTACC AGTGCTTCAG TCTCAGCATC 2 220 

AACAAGTGCT TCAGCCTCAG CATCGACAAG TGCCTCCGCT TCAGCAAGCA CATCAGCATC 2 280 

TGAATCAGCG TCAACCAGTG CTTCGGCTTC AGCAAGTACC AGTGCTTCAG CTTCAGCATC 2 34 0 

AACCAGCGCC TCGGCCTCAG CAAGCACCTC AGCTTCTGAA TCCGCCTCAA CCAGCGCCTC 2 400 

GCCCTCACCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCAG CCTCAGCATC 24 60 

AACGAGTGCT TCGGCTTCAG CAAGCACAAG CGCCTCGGGT TCAGCATCAA CGAGTACGTC 2 520 

AGCTTCACCC TCAACCAGTG CTTCAGCCTC AGCATCAACA AGTGCGTCAG CCTCAGCAAG 2 5 80 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 2 64 0 

AGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGTGCGTCAG CCTCAGCATC 2 7 00 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCTTCACCC TCAGCGTCGA CAAGTGCGTC 27 60 

GGCCTCAACC AGTGCATCTG AATCGGCATC AACCAGTCCC TCAGCCTCAG CAAGTACTAG 2 8 20 

TGCATCGGCT TCAGCATCAA CCAGTGCCTC GGCTTCAGCG TCAACCAGTG CGTCAGCTTC 2 8 30 

AGCAAGTACC AGTGCTTCAG TCTCAGCATC AACAAGTGCT TCAGCCTCAG CATCGACAAG 2 94 0 

TGCCTCGGCT TCAGCAAGCA CATCAGCATC TGAATCAGCG TCGACAAGCG CCTCAGCTTC 3 000 

AGCAAGTACC AGTGCGTCAG CCTCAGCGTC GACAAGTGCG TCAGCCTCAG CAAGTACTAG 3 060 

TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCGGCG TCAACCAGTG CATCAGAGTC 3 120 

AGCAAGTACC ACTCCGTCAg CTTCCGCATC AACAAGTGCC TCGGCTTCAG CAAGCACCAG 313C 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCCTCAGCC TCAACCAGTG CGTCAGCCTC 3 2 40 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCCGCTTCAG CAAGTACTAG 3 300 

CGCCTCAGCC TCAGCGTCAA CAAGTGCATC GGCTTCAGCG TCAACGAGTG CGTCTCAATC 3 3 60 

GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCAACAAG 3 4 20 

TGCATCGGCT TCAGCATCAA CCAGTGCGTC CGCTTCAGCA AGTACTAGCG CCTCAGCCTC 3480 

AGCGTCAACA AGTGCATCGG CTTCAGCCTC AACGAGTGCG TCTGAGTCAG CATCAACGAG 3 54 0 
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TGCGTCAGCC TCAGCAAGCA CATCACCTTC TGAATCTGCA TCAACCACTG CGTCAGCCTC 3 600 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 3 660 

TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC ACCCTCAGCA AGTACCAGTG CGTCAGCCTC 37 2 0 

AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC 3780 

ACCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 3 840 

AACCAGTGCA TCAGAGTCAG CAAGTACCAG TGCGTCACCT TCCGCATCAA CAACTGCCTC 3 900 

GGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTG CTTC AG CTTCCGCGTC 3 9 60 

AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CAAGTGCCTC 4 02 0 

GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 4 080 

CACCAGCGCC TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 4 140 
TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTGCTTCAG CCTCAGCAAG 
TATCTCAGCG TCTGAATCGG CATCAACCAG TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC 

AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC 4320 

AACGAGTACG TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC 4 3 80 

ACCCTCAGCA TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAG CCTCAGCAAG 4440 

TACCAGTGCT TCAGCCTCAG CGTCGACAAG TGCGTCGGCC TCAACCAGTG CATCTGAATC 4500 

GGCATCAACC AGTGCGTCAG CCTCAGCAAG TACTAGCGCC TCAGCCTCAG CATCAACCAG 4560 

TGCGTCCCCT TCAGCAAGTA CT AGTG CATC AGCTTCAGCA AGTACTAGCG CCTCAGCCTC 4 5 20 
AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 
TGCGTCGGCT TCAGCAAGTA CCTCAGCCTC TGAATCAGCA TCAACAAGTG CCTCGGCTTC 
AGCATCAACG AGTGC ATC AG CTTCAGCATC AACAACTGCT TCAGCTTCAG CAAGTACCAG 

TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG CCTCTGAATC 4 8 60 

CGCATCAACA AGTG C CTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG CGTCAACGAG 4 92 0 

TGCGTCTGAG TCAGCATCAA CCAGTGCGTC ACCCTCAGCA AGCACATCAG CTTCTGAATC 4 9 80 

TGCATCAACC AGTGCGTCAG CTTCCGCATC AACAAGCGCC TCGGCCTCAG CAAGTACAAG 504 0 

TGCTTCAGCC TCAGCATCAA CC AGTGC ATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 5100 

AGCGTCAACC AGTG C CTCGG CTTCAGCAAG TACCAGTGCG TCAGCTTCAG CAACCACAAG 5160 

TGCGTCAGCT TCAGCATCAA CCAGTGCTTC GCCTTCGGCA TCAACAAGTG CCTCAGCATC 5220 

AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACT AGTGC A TCAGCATCAG CATCAACCAG 52 80 



4680 
4740 
4800 
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TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA TCAACGAGTG CATCAGCATC 5 3 40 

AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA TCAGTCTCAG CAAGCACCAG 54 00 

TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5 4 60 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 55 20 

TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5 5 30 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCmTCAC CATCAACGAG 5 64 0 

TGCATCGGCT TCAGCAAGTA CCAGCGCCTC ACCTTCACCA AGCACCAGTG CGTCAGCCTC 57 00 

AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC TCAGCTTCAG CAAGTACCAG 57 60 

TGCGTCAGcT CAGCATCAAC AAGTGCTTCA GCTTCGGCCT CAACAAGTGC GTCAGCTTCA 5 82 0 

GCATCAACGA GTGCGTCGGC TTCAGCAAGC ACCAGTGCCT CCGCCTCAGC AAGCACCAGT 58 80 

GCTTCAGCTT CAGCATCAAC AAGTGCGTCA GCTTCAGCAA GTACATCAGT TTCAAATTCA 5 94 0 

GCAAACCATT CGAACTCACA AGTTGGAAAT ACTTCTGGAT CGACAGGTAA ATCCCAAAAA 6000 

GAATTGCCTA ATACAGGTAC TGAGTCGTCA ATTGGATCTG TGTTACTTGG AGTTCTAGCA 6 0 60 

GCTGTTACAG GTATTGGATT GGTTGCGAAA CGCCGTAAAC GTGATGAAGA AGAGTAAGAC 6120 

AACCTGTAAA GTTACGCTAA ACTAACTCGC GCAC AT AAA T CAAGGAGAAA ATTGCTAGTG 6180 

GATGATAAAA TAACAGTCAT TGTACCAGTA TACAATCTGG AAAACTATCT GAGGAAGTGC 6240 

CTAGATAGTA TTATTACTCA AACATATAAA AATATTGAGA TTGTTGTCGT TAATGATGGT 6 3 00 

TCTACGGATG CTTCAGGTGA AATTTGTAAA G AATTTTC AG AAATGGATCA CCGAATTCTC 6 3 60 

TATATAGAAC AAGAAAATGC TCCTCTTTCT GCCGCACGAA ACACCGGTCT GAATAATATG 64 20 

TCCGGAAATT ATGTGACCTT TGTGGACTCG GATGATTGGA TTGAGCAACA TTATGTAGAA 6 4 80 

ACTCTATA7A AAAAAATAGT ACAGTA7CAG GCTCA7ATTG CACTTGG7AA TTATTATTCT 6540 

TTCAACGAAA GTGAAGGAAT GTTCTACTTT CATATATTGG GAGACTCCTA TT A TG AG AAA 6 600 

GT AT ATG AT A ATGTTTCTAT CTTTGAGAAC TTGT ATG AAA CTCAAGAAAT GAAGAGTTTT 6660 

GCTTTGATAT CTGCTTGGGG TAAACTCTAT AAGGCAAGAT TGTTTGAGCA GTTGCGCTTT 57 2 0 

GACATAGGTA AATTAGGAGA AGATGGTTAC CTCAATCAAA A GG TAT ATT T ATTATCAGAA 6 730 

AAGGTAATTT ATTTAAATAA AAGTCTTTAT GCTTATCGGA T T AG AAAAGG TAGTTTATCA 6840 

AGAGTTTGGA CAGAAAAGTG GATGCACGCT TTAGTTGATC CTATGTCTGA ACGTATTACG 6900 

CTACTAGCTA ATATGGGTTA TC CT CT AG AG AAACACTTGG CAGTTTATCG T C AG AT GTTG 6960 

GAAGTCAGTC TCGCCAACGG TCAAGCTAGT GGTTT ATCT G ACACAGCAAC GTATAAAGAG 7 020 

TTTGAAATGA AACAAAGGCT TTTAAATCAG CTATCGAG AC AAGAGGAAAG TGAAAAGAAA 7080 
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GCCATTGTCC 
ATTTGTTATC 
TGGATTAAGC 
GTAACTTCTG 
TATTTCATAG 
GTAACGAAAA 
GTTACAGATT 
TTGGTAAACA 
AATGAATGGC 
C AT AAATGG T 
GATTATCAAT 
AAACCGTGGA 
CTTGAATGGA 
TATCCAATAA 
ATTGAGACAT 
GTTAGTGATC 
CACTATTTGG 
AATCATGGCG 
TTATCCTTTG 
GTTCAAGCAA 
GAGATGCTCT 
ATTTCTC CGT 
ATAGACCTAA 
TTTTTGGsGT 
TATATTATAG 
TCTATAGTTT 
ATTTGTTGTT 
TGGTTTTTGA 
ATTGT AG AT C 



TCGCAGCAAA 
ATAATCGTTC 
AATTAAATAA 
AGCAAATTTC 
CTGATTTCGT 
ATCTGGATGA 
TTGGGGGCAG 
ATGCTTTTTG 
ATGATAAGGT 
TGGAATTGGA 
TCCCTGACGG 
AAGATTTGGC 
CAGAATTGGG 
AGGAACCTTT 
TGGTTCAATC 
GATTGCCTCA 
TAGATGTCGA 
AAAAG AC AG A 
AAAATACTAA 
TOATTGAAAA 
GATTTTGACG 
TC AT CAT AT A 
AGATATTTAA 
CAGTACAATA 
CTGTTTGATA 
TTAATGCAGA 
ATAATAATTG 
TGATGAATAA 
ATCTTCTTAA 



CTATGGCTAT 
GATTCGTTTT 
GCGCTTAGAG 
ATGTT AT AAA 
G C AAGAAG AC 
CTTGTTTGCT 
AGCTTATTTT 
GAAAAAAGAG 
GGATCAGGCA 
CTTTGATTAT 
TCAGGATTAT 
GGCCCAAACC 
ACAAAACCAT 
CACTTGTCTA 
CTTGCCTCAT 
GATGACAATT 
TAATGAATTG 
AG AAATT CTC 
AACCTATGAA 
ATTGAGAGAA 
GTTAGTGATC 
TGAAAGTTGT 
TATGAACTGC 
TTAGGGTGTG 
CGAAATTTAT 
TAATGATTAT 
TTTAAAATTT 
GCCATTCAAC 
AAAGTTTCAT 



579 
GTAGACCAAG 
TATCTGATTC 
AAGTTTGACT 
TCGGATATTA 
AAGGCCCTCT 
ACAGACTTAC 
GGTCAAGAAA 
AATATGACCC 
GATCAGAGCA 
AATCATATTG 
CCTGCTATTA 
TATCGTGAAG 
CATTTACATC 
ATCTATACTG 
ATTCAGTTTA 
TATCCAAACG 
GTAGAAACCA 
GATCAATTTG 
GTAGGTCAGC 
ATAACCAAA7 
AGATTGAAGA 
TCAAACATCA 
ACCCCAAAAG 
ATTAATTATC 
TATAAGGAAA 
GTAGATAAAT 
TATGTATTTA 
ACTATACAAT 
TTACCGTTAA 



TTTTAACGAC 
ATAGCGATTT 
CAGAAATTAT 
GTTACACAGT 
ACTTGGACTG 
AAGATTATCC 
TCTTTAATGC 
AAAAATTAAT 
TCTTGAATAT 
TCATTCATAA 
TTCACTATCT 
TTTGGTGGTA 
CATTACAAAG 
CCTCAGACCA 
AGATAGCAGC 
TGACTATATT 
GTCAAGTACT 
CTAATCTTGG 
AGGCATATCC 
G AAG AAAAAT 
GTTGCATTAT 
GAGTGCTTTA 
TTAGACAGAA 
TTTTTAGGTG 
TTATGTTAAT 
TAGAAACTGC 
ATGATGATAT 
CTGAAATCGT 
A'GAATTTAAG 



AATCAAGTCT 
TCCAAATGAA 
TAATTGTCGG 
CTTTTTACGC 
TGATCTAGTT 
TTTGGCTGCT 
CGGTGTTCTC 
TGATGTAACC 
GCTTTTTGAA 
ACACTTTGCT 
TTCTCATCGG 
CTATCATCGG 
ATCTCACATC 
TATTGAACAA 
TAGAGTAATA 
TAACGGAATT 
TTT AG AT ATT 
CAAGCCTATC 
TGTTGACCAA 
CATTTAGTAG 
TTTTTATAAA 
TAAAATATAA 
AAAATCTAAC 
AAAATGATTC 
GAATACAAAA 
AATTAAATCT 
TGCGTCAGAG 
TAATGTAAAG 
TTATGCCACT 



7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7900 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
9230 
8340 
8400. 
8460 
8520 
8580 
8640 
8700 
3760 
9820 
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TTCTTTCCTT ATTTTATACC TAATTTTCTC AAAGAAAGTC GTGCTTTATA CCTACATTCT 8880 

GACATCATTG TTACAGGAAG TTTAGACTAT TTATTTGATA TAGAACTAGA TGGTTATGCC 894 0 

TTCCCACCAC TAGAAGATTC TTTTGGTGAT GTTCCTTCTA CCAATTTTAA CTCCGGAATG 9000 

TTATTAGTTA ATCTACATAC TTGGAGAGAT GAAGATGCTT GTTCGAAACT GTTAGAACTG 9060 

ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGGAA TTTTAAATAT CTTATTCCAT 9120 

GATAGATGGA AAAGATTAGA CCGAAATTTT AATTTT ATGG TGGGGATGGA TAGCGTCGCA 9180 

CACATAGAAG GAAATCATAA ATGGTATGAG ATTTCTGAGT TGAAAAATGG AGATTTACCT 9 2 40 

AGTGTT AT AC ATTATACTGG GGTAAAACCT TGGGAAATAA TTTCCAATAA TCGCTTTAGA 9300 

GAACTTTGCT GGTTTTATAA TCTGTTAGAA TGGTCTGATA TTTTATTGAG AAAAGACATT 9 3 60 

ATTAGTCGTA GTTTCGAAGA ACTTGTATAC AGTCCTAAAG CTCATACAGC AATTTTTACA 942 0 

GCTAGTTGTG AGATGGAGCA TGTAGAATAT TTGATAGAAA ATTTACCAGA GGTACATTTT 94 80 

TCTATACTAG C AC AT AC AT A TTTTGCGTCT AGTGTCGTTG CTTTATTAAG ATATAGCAAT 9 54 0 

GTTACGATTT AT CCTTG TTT TTCTCCATTT GATTATCGAA AAATTTTGGA TAATTTAGAT 9600 

TTTTATTTAG ATATTAATCA TTATAAAGAA GTGGATAATA TTCTATCCCT TGTTCAACAA 9 660 

CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTAGTCATG ATATAGGCAA TCAAACTAAT 9 7 20 

ATATTTTCTT CAACCGAACC AAACAAAATG GTAGAGGCTA TTAGACAATT TATAGGAGAA 97 80 

TAAGTTTATG GCAGACGAAC TAATTACTAT TGTAGTTCCA ATCTACAACG TTGAGAATTA 9 84 0 

TTTGCGAATG TGTTTGGATA GCATTCAGAA T C AG ACGT AT CAAAATTTTG AGTGTTT ATT 9 900 

AATCAATGAT GGCTCTCCAG ATCATTCATC CAAAATATGT GAAGAATTTG TAGAGAAAGA 99 60 

TTCTCGTTTC AAATATTTTG AGAAAGCAAA CGCCGGTCTT TCATCAGCTC GTAACCTAGG 10 02 0 

7A7TGAATST TCGGGGCCCG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGCTTGGAAC 10030 

ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG 1014 0 

GGCGTTATAA TTCTTATGAT GAAACACGCT ATCTGTATAT GACTTATGTT ACGGATCCAG 10200 

ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG 102 60 

TCAGAAATGG GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TT ACT AC AAG 10320 

ATTTACCATT TCCTATAGGA AAAATTG C AG AGGATACTTA CTGGACATGG AAGGTACTTC 103 80 

TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT 10440 

CTCATACTTT ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG 1C500 

AAGAAAAGAT AGCT AT TTT A GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT 10560 
AT AAAAAT AG ATTACAAAGA GTGATAGCAA AATTAGAAGA AC AAAAT AT G CAGTTCACAG ■ 10 62 0 
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AG ATT T AC AG AAGAATGATG GAAAAATTGT CTTTACTT CC GTAGATAGTA ATAAAAAATG 10680 

AGATAGCGTA ATATGAAACT ACATTTAACA AATTTATACG GCATGGCTGG TGATAGTACG 10740 

GTTATCTTAG CTCAAAATGC TGTTCAAAAG ATAGCTAGTC AACTGGGATT TAGAGAGCTT 10800 

GGTATTTATT TTTACAACAT TGCTTCAGAT AGTCCTTCTG AAATGAATAA GCGTCTGGAT 10860 

GCTATTATGG CCAGTATCTC TATTGGGGAT ATTTTAGTCT TTCAG7CTCC AACCTGGAAT 10920 

GGTTTTGAAT TTGATCGTCT CTTGTTTGAT AAGCTAAAGG ATATGCAGGT GAAAATTATT 10980 

TGCTTTATCC ATGATGTTGT TCCCCTCATG TTTGATAGTA ACTATTATCT CATGAAAGAT 1104 0 

TATCTGTATA TGTATAATCT ATCAGATGTT TTGATAGTGC CCTCAGAGAG AATGAAAACA 11100 

CGCCTGATGG AAGAAGGATT GACGACTAAG AAGATTCTTG TTCAAGGGAT GTGGGATCAT 11160 

CCTCATGATT TATCCTTATA CACCCCTGCT TTTAAAAAAG AACTTTTTTT TGCTGGAAGT 11220 

TTAGAGCGTT TTCCAGACTT ACAAAATTGG TCTCAAGATA CGCCTTTGAG AGTATTTTCA 11280 

AATAAAGGGG AAGCTAGTTC TAGTGCTAGA AGTCTCACCA TCGAAGGATG GAAAAAAGAT 113 40 

GAGGAATTGT TCCTAGAATT ATCAAAGGGT GCATTTCCCC TTGTCTGGGG AACCcATCAA 114 00 

AATGAGGGAG AAAGT AAC C A AT ACT AT AC C TTGAATATAT CTCATAAGGT CAGTACCTAT 114 60 

CTAACAGCCC GCATTCCAGT CATTGTACCA AGTAGCTTGT CAACTGCTAA ATTTATAGTA 11520 

GATCAAGGCT TGGGCTTTAT GGCGG AT AGT CTGGAAGAGG TTCATGAGAT AGTTGATAAA 11580 

ATGAATCTAC AAGAATATCA ACAAATGACG AATCGTATCA AGACCTTTAG CTATTTGTTA 11640 

AAAGAGGGCT ATTTCACTAA AAAGTTATTC GTAGATGCAA TCTATCACTT GGGAATTGAT 11700 

TAAGGGAATC AAATCAACAA AACAATTGTA CTAGCAGGGG ATCGCAATTA CACCAGGCAG 11760 

TTAGAAACAA CGATAAAATC TATTTTATAC CACAATCGAG ATGTTAACAT TTATATTTTG 11820 

AATCAAGATA TCATGCCAGA TTGGTTTCGC AAACCACCAA AAATAGCTCG CATGTTAGGT 11880 

AGTGAGATTA TCGATGTTAA ACTACCTGAA CAAACTGTGT TTCAAGATTG GGAAAAGCAA 11940 

GATCACATTA GTAGCATTAC TT ATG CT AG A TATTTTATTG CAGATTATAT CCAAGAAGAT 12 000 

AAGG TTTT AT ATTTAGACAG TGATTTGATT GTAAATACTT CTTTAGAGAA ATTATTTAGT 120 60 

ATTTGTTTAG AAGAAAAATC ACTCGCAGCA GTTAAAGATA CAGATGGAAT TACATTTAAT 12120 

GCAGGTGTTT TATTAATCAA CAATAAAAAA TGGCGTCAAG AG AAA TT AAA AGAACGACTA 12180 

ATTGAACAGA GCATTGTTAC AATGAAGGAA CTTGAACAAG GCCGTTTCGA GCATTTTAAT 122 4 0 

GGTGATCAAA CGATTTTTAA TCAGGTCTTG CAAGATGATT GGTTAGAACT AGGTCGAGCT 123 00 

TATAATTTAC AAGTAGGGCA TGATATTGTG CCTTTGTATA ACAATTGGCA GGAACATCTG 12 3 60 
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GCTTTTAATG 


ATAAACCAGT 


GGTGATTCAT 


TTTACGACCT 


ACAGAAAACC 


CTGGACTACC 


12420 


TTGACAGCCA 


ATCGTTATCG 


TGATTTATGG 


TGGGAATTCC 


ATGATTTGGA 


GTGGAGTCAG 


12480 


ATTTTACAAC 


ACCATATGGG 


AGAATTTGAA 


CTAATATCGC 


CTCTAGATAA 


GGAATTTTCT 


12540 


TGCTTAACCT 


TAACGAATTC 


CCAAGATTTA 


GAAGGAATAG 


AAGAGCTAGT 


TACAGCTCTA 


12600 


CCTGAGGTGG 


TATTTCATAT 


CGCAGCTTGG 


ACGGATATGG 


GAGATAAATT 


AAAAAAATTA 


12660 


GCTGTATATA 


ATAATGTGAG 


ATTGCATCCA 


CAAATTGTTC 


CACCGGTCTT 


AGATAAGCTG 


12720 


AAAAAGTCAA 


CAAATCTATA 


TTTGGATATC 


AATCATGGTA 


CTGCAGATGA 


GAACTTTTTA 


12780 


AAATCTTTGC 


AAGAACAAGA 


AAAAACGCTA 


CTAGCTTTTC 


AATCGACTCA 


GCACGGAGAG 


12840 


TT AGG AC AAA 


TCGTTTTCGA 


AAATGGGAAA 


GTTTCCTTTA 


TGATTGATAC 


GATTAAAGAT 


12900 


TTTAAGAAAA 


ACGGACATCT 


TACCTGTTTT 


CGACAACTTC 


CAAGTTTAAC 


TTGTTTAACG 


12960 


TTTACGGCTT 


CTCAGTATAT 


CGAACAATTG 


GATTACTTGC 


CTGGACAGTT 


GCC AAATGTT 


13020 


GTTTTTCAAA 


TTGCTGCTTG 


GACAGCTATG 


GGGCCAAAAT 


TATATGATTT 


GTCTAATCGT 


13080 


TATCCTAATA 


TTCAGCTCTA 


TCCGGCAATT 


TCTAGAGATA 


AGCTAGACGA 


GTTGAAGGAG 


13140 


AAGATGGATG 


CTTATTTAGA 


TATCAACCTA 


CTGACTTCAA 


CATCCGATAT 


CGTTGCAGAA 


13200 


ATGGCTCATC 


TATCTAAACC 


TATACTAGCC 


TTTTATAAAT 


CTCAAAATGC 


GAATAATGCC 


13260 


CAAAGGTTGT 


ATTCAAGTGA 


ACATCCTGAA 


CGAATGTTGG 


CTCATTTGCA 


AAAATTGATA 


13320 


ACT AAGG AT A 


TGCTAGAAAA 


ACCGCTTGAT 


ATAATCCAGG 


TGAAAGGGAT 


AGATGAAACC 


13 380 


TTGGATTATA 


TTATTGAACA 


CAACTCTTCT 


TTAGTTCGTT 


TTGGAGATGG 


GGAAATCAAT 


13440 


ATGCTTGCAG 


GGCATTCAAT 


TCCCTACCAG 


GATTATGATG 


AAGAGTTGGT 


TTCAATCATG 


13500 


AGGGACATTA 


TCGGCCAAGA 


AAGTCGAGAA 


GATTTAGTAG 


TGTGCCTTCC 


TGATGCTTTT 


13560 


ACAGATCGTT 


TTAGCTTTAC 


A 7CGTGGGCG 


ATTCCATTTT 


GGAAACATCA 


CATGGATC AT 


13620 


TATATGGATT 


TTTACAGAGA 


GTTATGCAGT 


GATTCATCGT 


ATGGCTCAAC 


CTTTGTATCT 


13680 


CCCCCTTATA 


TCGATTTTGA 


AGACAAGAGT 


CAAGCTAAAG 


CTCAATTTGA 


AAAATTGAAA 


13740 


ACCATTTCCC 


AAAACCGTGA 


CTTACTGATA 


GTCGAAGGTG 


CGACTTCTCG 


TTCAGGTGTC 


13800 


GGAAATGATT 


TATTCGATGA 


GGCAAATTCT 


ATTAAGCGAA 


TTATCTGTCC 


TTCTCATAGT 


13860 


GCCTTTTCTA 


GAGTTCATGA 


ACTTGAACAA 


G AAATTG AAA 


ACTATGCTGG 


TGGTCGCTTG 


13920 


ATTTTATGTA 


TCCTTGCACC 


TACAGCAAAA 


GTTCTG AGTT 


ATAATCTATG 


CCAGATGGGC 


13980 


TATCAAGTTT 


TGGATGTAGG 


CCATATTGAC 


TCAGAGTATG 


AATGGATCAA 


AATGGGAGCT 


14040 


AAAACTAAGG 


TTAAATTTTC 


TCATAAACAT 


1 ACTGC AG AAC 


ATAATTTCGA 


CCAAGATATT 


14 100 


GAATTTATTG 


ATGATGAAAC 


CTATAACAGT CAGATTGTTC 


; CACGAATATT 


' AAACTAGACT 


14160 
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ATTTAAAATA AATGATAAGG ATTTAAAATG AGAAATACCA AACCCCCTGT AGTATTTGCA 14 22 0 

GGTGATTACG CTTATATTCG ACAAATCGAA ACGGCGATGA AGTCACTCTG TAGACACAAT 14 2 80 

AGTCATTTGA AAATTTATCT GCTAAATCAG GACATTCCTC AGG AATGGTT TAGTCAAATA 14 3 40 

AGAATATATT TACAAGAGAT GGGGGGCGAC TTGATTGACT GCAAGTTAAT TGGCTCACAG 14 400 

TTTCAAATGA ATTGGTCTAA TAAATTACCT CATATCAATC ATATGACATT TGCACGCTAT 14460 

TTTATTCCAG ATTTTGTAAC AGAAGATAAA GTTCTCTATC TAGATAGTGA TTTGATTGTG 14 520 

ACTGGTGATT TCACCGATTT GTTTGAATTA GACTTAGGTG AAAATTATTT GGCAGCAGCT 14 580 

CGTTCTTGCT TTGGAGCAGG AGTCGCCTTC AATGCTGGTC TTCTCTTGAT TAACAACAAA 14 640 

AAATGGGGAT CTGAAACTAT TCGACAAAAA TTGATTGACT TAACAGAAAA AGAACATGAG 14 700 

AATGTGGAAG AAGGAGACCA GTCAATTTTG AATATGTTGT TTAAAGATCA ATATAGTTCC 14 7 60 

CTTGAAGATC AATATAATTT TCAAATAGGA TATGATTATG GGGCGGCAAC CTTT AAAC AT 1482 0 

CAATTCATTT TTGATATTCC GCTCGAACCA CTGCCACTAA TTTTACACTA TATTTCTCAG 14 880 

GATAAGCCTT GGAATCAATT TTCTGTTGGA CGTCTAAGAG AACTTTGGTG GGAATACTCT 14 940 

TTGATGGATT CGTCTGTTAT TTTAAATGAA TGGTTTTCAA AGAGTGTGAA GTACCCTAGT 15000 

AAATCACAAA TATTTAAGTT GCAATGTGTT AATTT AAC G A ATTCTTGGTG TGTCGAGAAA 15060 

AT CG ATT ATT TGGCGGAGCA ATTGCCAGAA GTTCATTTTC ATATTGTTGC TTATACAAAT 15120 

ATGGCAAATG AACTACTAGC TTTAACGCGT TTTCCTAATG TTACCCTATA TCCAAATTCC 1518 0 

TTACCAATGT TATTGGAACA AATAGTAATA GCTTCAGATT TCTATTTGGA TTTGAATCAT 15240 

GATCGAAAAT TAGAAGATGC ATATGAGTTT GTGCTTAAGT ACAAAAAACC AATCATACCT 15300 

TTCGACAATA CTTGCTCTGA AAATCTTTCT GAG ATTTC AT ATGAAGGTAT CTATCCAAGC 153 60 

TCCATTCCGA AAAAAATGGT TGCAGCAATC AGATCTTACA T G AGGT AG AG AACAGTATGA 15420 

GAAAATCAAT AGTATTAGCG GCAGATAATG CCT ATCTT AT TCCTTTAGAG ACGACTATAA 15480 

AGTCTGTATT GTATCACAAT AGAGATGTTG ATTTTTATAT TCTCAACAGT GATATAGCTC 1554 0 

CTGAATGGTT TAAATTATTG GGGAGAAAAA TGG AAGTTG T GAATTCTACA ATTCGCAGTG 15 600 

TACACATTGA TAAAGAACTT TTTGAAAGCT ATAAAACAGG ACCTCATATA AATTATGCTT 1566 0 

CTT AC TTT AG ATTTTTTGCG ACAGAAGTGG TTGAATCTGA TACGGTATTG TAT C TG G ATT 15720 

CCGATATCAT TGTAACTGGG GAACTAGCTA CTTTGTTTGA GATAGATCTC AAAGGATATT 15780 

CAATTGGTGC TGTTGATGAT GTCTATCCCT ATGAAGGACG AAAATCTGGA TTTAATACTG 15840 

GTATGTTACT AATGGATGTT G C AAAGTGG A AAGAACATTC TATTGTCAAT AGTTTATTGG 15900 
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AATTAGCGGC 


CGAGCAGAAT 


CAAGTTGTTC 


ATCTTGGGGA 


TCAGAGTATT 


TTAAATATTT 


15960 


ATTTTGACGA 


TAATTCGCTA 


GCCTTAGATA 


AAACATATAA 


TTATATGGTG 


GGTATTGATA 


16020 


TTTATCACCT 


TGCTCAAGAA 


TGTGAACGTC 


TAGATGACAA 


TCCACCTACA 


ATTGTTC ACT 


16080 


ATCCTAGTCA 


TGATAAACCT 


TGGAATACAT 


AT ACT AT AT C 


T AG AC T ACCT 


GAAT7ATGGT 


16140 


GGGTTTATAG 


AGATTTGGAT 


TGGT C AG AG A 


TTGCTTTTCA 


ACCTTCCGAT 


TTAAATTATT 


16200 


TTGAAAGAAG 


CAATCAGTCT 


AAAAAACAAG 


TGATGCTTCT 


GACATGGAGT 


GCAGATATAA 


16260 


AACATTTAGA 


GTATTTAGTA 


CAACGGTTAC 


CTGATTGGCA 


TTTTCATTTG 


GCTGCACCGT 


16320 


GTGATTGTTC 


TGAGGAGCTG 


ACCTCTCTAT 


C AC ACT AT AC 


GAATGTAACA 


GTATATCAAA 


16380 


ATGTATTACA 


TAGTAGAATT 


GATTGGCTAT 


TGGACCATTC 


TATAGTTTAT 


TT AG AT ATT A 


16440 


AT AC AGGTGG 


AGAGGTTTTT 


AATGTAGTTA 


CAAGGGCACA 


AGAAAGTGGC 


AAGAAAATCT 


16500 


TCCCTTTTGA 


TATCACACGT 


AAAAGTATGG 


ATCATGGACT 


CTATGACGGT 


ATTTTTTCTG 


16560 


TGGAGAGACC 


AGATGATTTA 


GTGGATAGAA 


TGAAGAATAT 


AG AG AT AG AG 


TAATGAGTGA 


16620 


ATTAATTACT 


GTTGTGGTAC 


CGATATACAA 


TACGGGAAAA 


TATTTAGTGG 


AGTGTGTCGA 


16680 


GCATATTCTG 


AAGCAAACCT 


AT C AAAAT AT 


AGAAATTATT 


TTAGTTGATG 


ACGGTTCTAC 


16740 


GGATAATTCT 


GCGGAAATTT 


GTGATGCTTT 


TATGATGCAA 


GATAATCGTG 


TCCCAGTATT 


16800 


GCATCAAGAA 


AATAAGGGGG 


GGGCAGCACA 


AG CT AAAAAT 


ATGGGGATTA 


GTGTAGCTAA 


16860 


GGGAGAGTAC 


ATCACGATTG 


TTGATTCAGA 


TGATATCGTA 


AAAGAAAATA 


TGATTGAAAC 


16920 


TCTTTATCAG 


CAAGTCCAAG 


AAAAGGATGC 


AG ATCTT GTT 


ATAGGGAATT 


ACTATAATTA 


16980 


TGACGAAAGT 


GACGGGAATT 


TTTATTTTTA 


TGTAACAGGG 


CAAGATTTTT 


CCGTCGAACA 


17040 


ATTACCTATA 


CAAGAAATTA 


TGAACCGTCA 


AGCAGGAGAT 


TGGAAATTCA 


ATAGCTCGCC 


17100 


CTTTATA7TG 


CCGACATTTA 


AGTTGATTAA 


AAAAGAATTA 


TTCAATGAAG 


TTCACTTTTC 


17160 


AAATGGTCGC 


CGCTTTGATG 


ATGAAGCAAC 


TATGCATCGC 


TTTTATCTTT 


TAGCCTCTAA 


17220 


AATCGTCTTT 


ATAAACGATA 


ATCTCTATCT 


GTATAGAAGA 


CGTTCAGGAA 


GCATCATGAG 


17280 


AACGGAATTT 


GATCTTTCCT 


GGGCAAGAGA 


TATTGTTGAA 


GTGTTTTCTA 


AGAAAATATC 


17340 


GGATTGTGTC 


TTGGCTG GTT 


TGGATGTCTC 


CCTTCTGCGT 


ATTCGATTTG 


TCAATCTTTT 


17400 


AAAAGATTAT 


AAGCAAACTT 


TAGAATACCA 


TCAATTAACA 


GATACTGAGG 


AATATAAAGA 


17460 


TATTTGTTTC 


AGATTAAAGT 


TCTTTTTTGA 


TGCAGAACAA 


AGAAATGGTA 


AAAG TT G AAA 


17520 


TAAAACAATT 


GTTATTTACC 


ATATCACAAA 


CAATGAAGGT 


GAGGGGAGTG 


TTTTATGACT 


17530 


AAGATTTATT 


CGTCAATAGC 


AGTAAAAAAA 


GGACTATTTA 


CCTCATTTCT 


ACTGTTTATC 


17640 


TATCTATTGG 


GAAGTCGTAT 


TATTCTCCCT 


TTTGTTGACC 


TAAATACTAA 


AG ATT TTTT A 


17700 
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CGAGGTTCAA CAGCCTATCT AGCCTTCTCA GCCGCCCTAA CAGGTGGGAA TCTAAGAAGT 177 50 

TTATCAATTT TTTCTGTTGG ATTATCCCCT TGGATGTCCG CCATGATTTT ATGGCAGATG 17 92 0 

TTTTCTTTTT CTAAACGGTT GGGTTT AAC A TCTACGTCTA T AG AAAT AC A AGATCGCCGT 17 88 0 

AAAATGTACC TGACCTTGCT AATTGCTGTG ATTCAATCCT TGGCAGTTAG CTTGAGACTG 17 94 0 

CCAGTACAAT CCTCCTATTC TGCAATATTG GTTCTTCTAA TGAATACAAT ATTGC TG AT A 18000 

CCAGGAACAT TTTTTCTTGT TTGGTTGTCA GATTTAAATG CGAGTATGGG GATTGGAGGT 18060 

TCTATTGTAA TCCTCCTATC CAGTATGGTT TTAAATATTC CTCAGGATCT TTTGGAAACA 1812 0 

TTTCAGACAG TACACATTCC AACAGGGATT ATTCTGTTAC TTGCTTTATT AACCCTTGTC 18190 

TTTTCTTATT TACTTGCCCT TATGTATCGA GCTCGCTATT TGGTTCCTGT TAATAAAATT 18240 

GGCTTACACA ATCGATTTAA ACGCTATTCT TATCTCGAAA TCATGTTGAA TCCTGCAGGT 18300 

GGGATGCCTT ATATGTATGT GATGAGTTTT CTTAGTGTAC CAGCTTATTT GTTCATCTTG 183 60 

TTGGGATTTA TTTTCCCTAA TCATTCAGGG TTAGCGGCTT TATCAAAGGA ATTTATGGTT 1842 0 

GGAAAGCCTT TGTGGGTCTA TGTTTATATT TCCGTCTTAT TTTTATTTAG TATCATTTTT 18 480 

GCTTTTGTTA CGATGAATGG AGAAGAGATT GCAGACCGTA TGAAAAAATC TGGAGAATAC 18 540 

ATTTATGGTA TTTATCCAGG TGCGGATACT AGTCGATTTA TTAATCGATT GGTCCTTCCT 18600 

TTCTCAGTCA TAGGTGGTCT* CTTTAATGTG ATTATGGCAG GTGGTCCCAT GCTTTTTGTT 18660 

TTGTTTGATG AAAAGTTATT ACGATTGGCA ATGATTCCAC GCTTATTTAT GATGTTCGGG 18720 
GGCATGATTT TTACGATTAG AGACGAGGTC AAGGCTTTAA GGCTAAATCA GACCTATAGA . 187 80 

CCTTTGATTT AGGAGACTTT TATGTATTAT TTTATTCCAG CTTGGTATGG GTCACAAAGA 18840 

ACATGGCATG CACA7ATCAC TCCATGGTAT TTTTCTCATT TTCGTCTACA GTTTGATGAT 19900 

ACCTTTCACC AGATTCGGCT CTTTCAACAC CAAGATATAG ATTCTCGTCT ATTAGTATTA 18960 

GCTTACCAGC CTCATCTACG TTATTTTTTA TATAGACATG GTGTGTTAGA AATGG AT ACT 19 020 

TATTCCGTTT TTCATCTTAT GCAAGATTTT CATAATCTCC ATACCCAAGT TTTAAGCATT 19080 

AGAGATATTG AGTGGGATGA TGACTGTGAA TTTATTTATA GTCCCTTTAC CATTATCCTT 1914 0 

CAAAAAAATG GGAAGAAATT TGCTAAGGTT GAACATGGAG TTGAAGGCTT CATCAGTGAT 19200 

ATACAGTATT 1*TG AAC C AAA TGGTCAAATA CATATGCACC ATATCGTGGA TGATCGTGGG 192 60 

TTTGTATCGA GCATTATCTT TTTTGAAGAT GGGCAAGCAG CCTATCAAGA ATATCTGAAC 193 20 

CTCAAGCGAG AGTGGCAATT CAGAGAGCCT TTAAAAGAAG GAGGACAGGT AGAAGTCAAT 19 3 80 

CCAATTTTGG GTTATCGCTT TAAAATGCTT ACCTATCAAA ATATGGGAGA TCTGCTGGCA 19440 
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GAATTTTTTC 


ACAATTATCT 


GCAAACGTAT 


GTGAAGGATC 


AGGATATTTT 


TATGCTTCCT 


L95C0 


TCTCATTCTC 


ATCATGACCA 


GTTGGTACTA 


GATCGTTTAC 


CTAGTACTAA 


TCCTAAACTG 


19560 


TTGAGTCTGT 


TCATTGCACG 


T AATCCTC AA 


GATACCTTTA 


GGGATTTAGA 


TGTAACTTTT 


19620 


GAAAAATCGG 


ATTTGATTTT 


GGTGGATAGA 


GAGGATAGTT 


TACGATTGTT 


GCAGGAGTTG 


19680 


TATCCTGAAC 


GAATGCATCA 


ATGTT ATC AT 


TTATCATCTT 


TTGACACCCG 


ATTACCATTC 


19740 


GGACGAAGCC 


AAACTAAGAA 


AGAATCCATC 


ATTTATTTTC 


AACTGGATTT 


TGAGCAGGGG 


19800 


ATTGATAATC 


AAGCTCTGCT 


TCAAGTCTTG 


TCCTTTGTCG 


CTGAAAATAA 


GGATACTGAG 


19860 


GTGATTTTTG 


GAGCCTTTGC 


TGCTAGTCAG 


GAGCAAATGA 


ATGAGGTTGA 


AGGGATTGTT 


19920 


GAG TCTTTC A 


TCCAAGAAAA 


CATTCAATCC 


CAAAATCTGG 


GAAAGGCGAT 


TGATTATGGT 


19980 


GATGCAGAAA 


ATCCTCTGGA 


AGAAAATCAA 


CACCAGGACT 


TACGCTTACA 


GTTTGTTAAC 


20040 


TTGAATGATG 


AGTTAGATTT 


GATAAAAACA 


CTAGAATTTG 


TCCCTTTGAT 


TGTGGATTTA 


20100 


AATAGACATC 


CTCATCTCTA 


CACACAGATT 


GCTGGGATTA 


GTGCAGGAAT 


TCCTCAAATC 


20160 


AACCTAGTTG 


AAAC CGTCT A 


TGTTGAACAT 


TTAAAAAATG 


GTTATTTGTT 


AGCAGATGTT 


20220 


ACAGAATTTT 


CTAAGGCTGC 


AC ATT ATT AC 


ACAGATAGGT 


TGAACGAGTG 


GAATGAGTCC 


20280 


TTGATATATT 


CAATTGATAA 


GATTAAGGAG 


C AC AC AG G AC 


AACAATTTCT 


TGGAAAATTA 


20340 


GAGAAATGGA 


TAGAGGAGGT 


TAAAAATGTC 


AAAGGAACTT 


AATATTTTAC 


AG AT AGG ACT 


20400 


TGCCAATTGG 


GAAAATCACT 


ATGACATACC 


TGAAAATATG 


AGTTGCTATT 


ATTTTTACCC 


20460 


AAACTCATCA 


AAAGCCCTTC 


GTGAAATAAT 


TGAAAAAGAG 


GATATTAACC 


GTTTTCATGC 


20520 


AGTTTTAATA 


GAAGAT.GGTC 


AGTATTCCAG 


AGACTTATTT 


TCCTATGTAA 


AATATTTTGA 


20580 


AC CTT AT ACT 


TT ATT TT AT A 


ACCAGAATCT 


AC AAAT AAAT 


CATAGAGAGG 


TTGTGGATTT 


20640 


TCTAAAAAAA 


CGATGTGCAC 


AAGCAATTCA 


TTTTTTAAGT 


CCCCAACAAC 


TAATCAATGA 


20700 


TTTAAGTAAA 


TCTCTTTTTG 


GCGGTGGGTA 


TGGTGATAAA 


CTCTTTCCTC 


CGACGATACA 


20760 


AGTCAATCCA 


AATTTTACAG 


GAGCTATTTC 


TTATCAAGGA 


TTGGATTATG 


TCAGTTTGGA 


20820 


AGGTGAGTTT 


GGGCAAGATT 


TTGCCCAGCT 


TGCCTATTGG 


GCTTATAATA 


TTATGGTGCA 


20880 


AAAAACACTC 


CCTATTGAGT 


TGTGGCTTGA 


ATATGAGAAG 


GAAGGCAATT 


GTGACTTTCG 


20940 


TTTAGTAATC 


CGTAAAATGT 


GGAGTGGGTC 


TGTTGATGAT 


TTCTTTGAAG 


AAGTAATAGT 


21000 


ATCTGAAAAA 


GACTTGGAGC 


AAGCACTTTT 


TATGGATAGT 


CGAGACGGAG 


ACTACTTTCT 


21060 


CTCCATATCT 


GTTGAAGCAA 


GAGGTCGTGG 


AACTATCAAA 


CTAGGTAATC 


TTCACCAACG 


21120 


ATGGAGTCGA 


AAACAATTTG 


GT AAGTTTG T 


ACTTGGTGGA 


AATATCCTAC 


ATGATTCCAA 


21180 


GCGTGATGAA 


ATAAACTATT 


TCTTCCATCC 


AGGTGATTTT 


AAACCGCCTT 


TGACTGTCTA 


21240 
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TTTTGCACCT 
ATGTCCCTTC 
GGATGAGCTA 
TGATCATAAG 
TGGTGCTTCT 
TATAGCTAGT 
AATTCATCAT 
GAAAATTTTT 
TGAAGAAATG 
GAAGATTTTA 
TTGGTTTTTG 
ATTATTACAC 
ACAACTGTAT 
GGTGAGATTC 
CAGCTTCCCT 
CCTGAAAATG 
AAAAAG GT AG 
CGAGTTTCTC 
GAGATCAGAA 
TTTAAAGCAG 
AGCTAAAACA 
CCTTGAAGCT 
TGATGTTCAA 
GGGGGAAGGT 
AGGAGTGATG 
TCAAGTTTAT 
GGAGATGAAA 
7AGTAATTTA 
TTTACGACCG 



TATCGACCTG 
ATTTTATTTT 
GAGGGAAAAG 
G ATTTG AT TT 
TTTGAACCCC 
CGTGGACGTT 
ACAGGTGGGA 
AAACAAGCAA 
GATCCACAAG 
TCTAAAGGAA 
CACTTTTATA 
AG AG AC ACT C 
CTTATTACCC 
TAAAGACTTG 
TATT AAAAAG 
GAGTTTATAC 
AAAAAG TGAA 
TTTTAAGTGC 
GTGTTTAGAC 
ATTAATGCCC 
GTACAGTTTC 
TTTGCTGTGG 
GTCATGGGAG 
AAGACCTTGA 
GTTGTGACTC 
CGTTTTCTAG 
GCTGAAGAAA 
GGTTTTGATT 
TTTAACTATG 



CAGAAGGATT 
CTGATCCACG 
TAAAGGATAC 
TATCAGGTCT 
ATGCCATCAT 
TGGACGCACC 
CAAGTTCTCA 
ATTTTTCAAA 
CCTATGAACA 
CTGCTGGACG 
GAATGGTCTT 
TATTCATTGG 
TGACAAAAGT 
GTTTTCTAGT 
AAAGCAGCAG 
CAAGATAACT 
AGATTTCATA 
TGGATTTGAG 
GTTTAGGTCA 
TGAAAGGCAA 
GTCAGCGTCT 
TGCGTGAAGC 
CTATTGTCAT 
CAGCTACCAT 
CTAATGAGTA 
GATTGACCAT 
AGAAGCTTAT 
ATCTAAATGA 
TGATTATTGA 



587 
CGAGGGTTAC 
TTTAGAGGGG 
GATCACTCAC 
TTCTATGGGA 
AGTTGGTAAG 
GGGTGTCTCT 
AGATATGACG 
GACAACCTTT 
ATTAGTGTCT 
ACACAATGAT 
AGAGACTGGT 
GGAGAAGTTG 
GTTCCTCTGT 
GTCAATTACC 
TATCAACTAT 
TTTTTTGACC 
TTTACTTATC 
TCCTTAACTT 
AGATTTCCAG 
GATGTCCTCT 
TTCCGAGGGA 
AGATAAGCGG 
GCACTATGGA 
GCCTGTCTAT 
TTTATCAAAG 
TGGTGTACCA 
CTATGCTTCG 
TAACCTAGCC 
TGAAATTGAT 



TTTATGATGA 
GGAGCTTTTT 
TATCTTGATT 
ACGTTTC CGG 
CCCTTGGCTA 
AACTTAGCTT 
G AGTTGG ATC 
GGTTTATCCT 
TATCTGTGTA 
GATACAGATA 
TTTGGAAGGG 
GTGGGACTTA 
ATAATCCTCT 
AGGCTGCACG 
CACTGGTTTT 
GCTATGGTGA 
CAGAAGATAG 
TTTATCATTT 
CTTAGCAAAG 
CTTTCGGATC 
GAAAGTCTAG 
ATTTTACGGA 
AATGTTCCTG 
TTGAACCCTT 
CGTGATGCCG 
TTTACGGAAG 
GATATCATCT 
TCGAATGAAG 
GATATCTTGC 



AAACTCTTGG 
ATCTCGGAAC 
ATTTAGGCTT 
CTCTCTATTA 
ATTTAGGAAC 
TTGATTCTTT 
AGCGTTTTTG 
ATATGAAAGA 
ATACAGGTGC 
CCAATATTTC 
AG AAAAG ATG 
TATGTATGGA 
ATTGCCATCT 
AACCCAACCT 
TGACTGTCAG 
TATTTTAGAA 
TTATACTTAT 
TTCTATCAAG 
TG AAAAAG AT 
AAGAATTAGT 
ACGATATTTT 
TGTTTCCT^A 
AGATGAATAC 
TTTCAGGAGA 
AGGAAATGGG 
ATCCAAAGAA 
ACACAACCAA 
AAGGTAAGTT 
TTGATAGTGC 



21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
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AC AAACT CCT CTGATTATTG CGGGTTCTCC TCGTGTTCAG TCTAATTACT ATGCG ATC AT 2 3040 

TG AT AC ACT T GTAACAACCT TGGTCGAAGG AGAGGATTAT ATCTTTAAAG AGGAGAAAGA 2 3100 

GGAGGTTTGG CTCACTACTA AGGGGGCCAA GTCTGCTGAG AATTTCCTAG GGATTGATAA 2 3160 

TTTATACAAG GAAGAGCATG CCTCTTTTGC TCGTCATTTG GTTTATGCGA TTCGAGCTCA 2 32 20 

TAAGCTCTTT ACTAAAGATA AGG ACT AT AT CATTCGTGGA AATGAGATGG TACTGGTTGA 2 3 230 

TAAGGGAACA GGGCGTCTAA TGGAAATGAC TAAACTTCAA GGAGGTCTCC ATCAGGCTAT 2 3 340 

TGAAGCCAAG GAACATGTCA AATTATCTCC TGAGACGCGG GCTATGGCCT CGATCACCTA 234 00 

TCAGAGTCTT TTTAAGATGT TTAATAAGAT ATCTGGTATG ACAGGGACAG GTAAGGTCGC 2 3 460 

GGAAAAAGAG TTT ATTG AAA CTTACAATAT GTCTGTAGTA CGCATTCCAA CCAATCGTCC 2 3 520 

GAGACAACGG ATTG ACT AT C CAGATAATCT ATATATCACT TT AC CTG AAA AAGTGTATGC 2 3 580 

ATCCTTGGAG TACATCAAGC AATACCATGC TAAGGGAAAT CCTTTACTCG TTTTTGTAGG 2 3 640 

CTCAGTTGAA ATGTCTCAAC TCTATTCGTC TCTCTTGTTT CGTGAAGGGA TTGCCCATAA 2 37 00 

TGTCCTAAAT GCTAATAATG CGGCGCGTGA GGCTCAGATT ATCTCCGAGT CAGCTCACAT 2 3 760 

GGGGGCTGTG ACAGTGCCTA CCTCTATGGC AGGACGTGGT ACGGATATCA AG CTTGGT AA 2 3 320 

AGGAGTCGCA GAGCTTGGGG GCTTCATTGT T ATTG GG AC T GAGCGGATGG AAAGTCAGCG 23 880 

GATCGACCTA CAAATTCGTG GCCGTTCTGG TCGTCAGGGA GATCCTGGTA TGAGTAAATT 2 3 940 

TTTTGT ATC C TTAGAGGATG ATGTTATCAA GAAATTTGGT CCATCTTGGG TGCATAAAAA 2 4 000 

GTACAAAGAC TATCAGGTTC AAGATATGAC TCAACCGGAA GTATTGAAAG GTCGTAAATA 24 060 

CCGGAAACTA GTCGAAAACG CTCAGCATGC CAGTCATAGT GCTGGACGTT CAGCACGTCG 2 4120 

TCAGACTCTG GAGTATGCTC AAAGTATGAA TATACAACGG GATATAGTCT ATAAAGAGAG 24180 

AAATCGTCTA ATAGA7GGTT CTCGTGACTT AGAGCATCTT GTTCTGGATA ?C ATTG AG AG 24 2 40 

ATATACAGAA GAGGTAGCGG CTGATCACTA TGCTAGTCGT GAATTATTGT TTCACTTTAT 24 300 

TGTGACCAAT AT T AGTTTT C ATGTTAAAGA GGTTCCAGAT TAT AT AG ATG TAACTGACAA 24 3 60 

AACTGCAGTT CGTAGCTTTA TGAAGCAGGT GATTGATAAA GAACTTTCTG AAAAGAAAGA 2 44 20 

ATT AC TTAAT CAACATGACT TATATGAACA GTTTTTACGA CTTTCACTGC TTAAAGCCAT 24 4 80 

TGATGACAAC TGGGTAGAGC AGGTAGACTA TCTACAACAG CTATCCATGG CTATCGGTGG 24 540 

TCAATCTGCT AGTCAGAAAA ATCCAATCGT AGAGTACTAT CAAGAAGCCT ACGCGGGCTT 24 600 

TGAAGCTATG AAAGAACAGA TTCATGCGGA TATGGTGCGT AATCTCCTGA TCCCCCTGGT 24 6 60 

TGAGGTCACT CCAAAAGGTG AAATCGTGAC TCATTTTCCA TAAAAGGAGA AAAT ATG A C A 2 4 720 

A7TTACAATA TAAATTTAGG AATTGCTTGG GCTAGTAGCG GTGTTGAATA CGCTCAAGCC 24 7 80 
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TATCGTGCTG GTGTTTTTCG GAAATTAAAT CTGTCCTCTA AGTTTATCTT TACAGATATG 2 4 840 

ATTTTAGCCG ATAATATTCA GCACTTAACA GCCAATATTG GTTTTGATGA TAATCAGGTT 2 4 900 

ATCTGGCTTT ATAATCATTT CACAGATATC AAAATTGCAC CTACTACCCT GACAGTGGAT 24960 

GATGTCTTGG CTTACTTTGG TGGTGAAGAA AGTCACAGAG AAAAAAATGG CAAGGTTTTA 2 5020 

CGTGTATTCT TTTTTGACCA AGATAAGTTT ' CTAACCTCTT ATTTGGTTGA TGAGAACAAG 2 5080 

GACTTGGTTC AACATGCCGA GTATGTTTTT AAGGGAAACC TGATTCGGAA GGATTACTTT 25140 

TCTTATACGC GTTATTGTAG CGAGTATTTT GCTCCCAAGG AC AATGTTGC AGTCTTATAC 25200 

CAACGAACTT TTTATAATGA AGACGGGACT CCAGTCTATG ATATCTTCAT GAATCAAGGG 2 52 60 

AAGGAAG AAG TTTATCATTT CAAGGATAAG ATTTTCTATG GAAAGCAAGC TTTTGTGCGT 253 20 

GCCTTTATGA AATCTTTGAA TTTCAATAAG TCTGATTTGG TCATTCTCGA TAGGGAGACA 2 53 80 

CGTATTGCAC AG GTTGTGTT TGAGGAAGCA CAGACAGCAC ATCTACCGGT AGTTGTTCAT 25440 

G C GG AG C ATT ATAGTGAAAA TGCTACAAAT GAGGACTATA TCCTTTGCAA TAACTATTAT 25500 

GACTATCAGT TTACCAATGC AGATAAGGTT GACTTCTTTA TCGTGTCTAC TGATAGACAA 2 5 560 

AATCAAGTTC T AC AAG AG C A ATTTG C C AAA TAT ACT C AG C ATCACCCAAA GATTGTTACC 2 5 620 

ATTCCTGTAG GCAGTATTGA TTCCTTGACA GATTCAAGTC AAGGGCGCAA ACCATTTTCA 2 5 680 

TTGATTACGG CTTCACGTCT TGCCAAAGAA AAGCACATTG ATTGGCTTGT GAAAGCTGTG 2 5740 

ATTGAAGCTC ATAAGGAGTT ACCGGAACTA ACCTTTGATA TCTATGGTAG TGGTGGAGAA 2 5 800 

GATTCTCTGC TTAGAGAAAT TATTGCAAAT CATCAGGCAG AGGACTATAT CCAACTCAAG 2 5 860 

GGGCATGCGG AACTTTCGCA GATTTATAGC CAGTATGAGG TCTACTTAAC CGCTTCTACC 2 5920 

ACCCAAGGAT TTCCTCTGAC CTTGATGGAA GCTATTCGTT CAGGTCTACC TCTAATTGGT 25 9 30 

TTTGATGTGC CTTATGGTAA TCAGACCTTT ATAGAGGATG GGCAAAATGG TTATTTGATT 2 6040 

CCAAGTTCAT CTGACCATGT AGAAGACCAA ATCAAGCAAG CTTATGCCGC TAAGATTTGT 26100 

CAATTGTATC AAGAAAATCG TTTGGAAGCT ATGCGTGCCT ATTCTTACCA AATTGCAGAA 26160 

GGCTTCTTGA CCAAAGAAAT TTTAGAAAAG TGGAAGAAAA CAGTAGAGGA CGTGCTCCAT 2 6220 

GATTGAACTT TATGATAGTT ACAGTCAAGA AAGTCGAGAT TTACATGAAA GTCTAGGCGC 2 62 80 

TACTGGTCTT TCTCAACTTG GAGTGGTCAT CGATGCAGAT GGTTTTCTGC CTGATGGTCT 2 6340 

CCTTTCTCCT TTTACCTATT ATCTAGGTTA CGAGGATGGA AAACCTCTCT ATTTTAATCA 2 64 00 

AGTTCCCGTT TCAGATTTTT GGGAAATTTT AGGAGATAAT CAGTCTGCTT GTATTGAAGA 2 64 60 

TGTGACGCAG GAGAGGGCTG TCATTCATTA TGCTGATGGA ATGCAGGCTC CCTTGGTTAA . 2 6520 
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ACAGGTAGAC 


TGGAAAGACC 


TAGAAGGTCG 


AGTACGTCAG 


GTTCACCACT 


ACAATCGCTT 


26580 


CGGAGCTTGT 


TTTGCTACAA 


CGACTTATAG 


CGCAGATAGC 


GAGCCGATTA 


TGACACTTTA 


26640 


CCAAGATGTC 


AATGGTCAAC 


AAGTTTT ACT 


GG AAAACC AT 


GTGACGGGTG 


ATATCTTATT 


26700 


GACTTTGCCA 


GGTCAGTCCA 


TGCGTTACTT 


TGCAAATAAA 


G TTG AATTT A 


TCACCTTCTT 


26760 


TTTGCAAGAT 


TTGGAAATAG 


ATACCAGTCA 


GCTTATCTTT 


AATACTCTAG 


CGACTCCTTT 


26820 


CTTGGTTTCC 


TTCCATCATC 


CAGATAAATC 


TGGCTCGGAT 


GTCTTGGTAT 


GGC AGGAACC 


26980 


TCTCTATGAT 


GC C ATT CC AG 


GTAATATGCA 


GTTGATTTTG 


GAAAGTGATA 


ATGTGCGTAC 


26940 


TAAGAAGATC 


ATCATTCCAA 


ATAAGGCGAC 


TTATGAGCGC 


GCTTTAGAGT 


TAACTGACGA 


27000 


GAAATACCAT 


GATCAGTTTG 


TGCACTTGGG 


TTATCATTAC 


CAGTTCAAAC 


GT GAT AATTT 


27060 


CCTAAGACGA 


GATGCCTTAA 


TCTTGACCAA 


TTCAGATCAG 


ATTGAGCAAG 


TAGAAGCAAT 


27120 


CGCAGGAGCC 


TTGCCTGATG 


TCACTTTCCG 


TATTGCAGCG 


GTGACAGAGA 


TGTCTTCTAA 


27180 


GCTCTTAGAC 


ATGCTTTGCT 


ATCCT AATGT 


GGCCCTTTAC 


CAGAACGCTA 


GTCCACAGAA 


27240 


GATTCAGGAG 


CTGTATCAAC 


TGTCGGATAT 


TTACTTGGAT 


ATAAACCACA 


G T AATG AGTT 


27300 


GCTACAGGCA 


GTGCGTCAGG 


CCTTTGAGCA 


CAATCTCTTG 


ATTCTTGGCT 


TTAATCAGAC 


27360 


GGTGCACAAT 


AGACTTTATA 


TCGCTCCAGA 


CCATCTATTT 


GAAAGTAGTG 


AAGTTGCTGC 


27420 


TTTGGTTGAG 


ACCATTAAAT 


TGG CCCTTTC 


AGATGTTGAT 


CAAATGCGTC 


AGGCACTTGG 


27480 


CAAACAAGGC 


CAACATGCAA 


ATTATGTTGA 


CTTGGTGAGA 


TATCAGGAAA 


CCATGCAAAC 


27540 


TGTTTTAGGA 


CGCTAACATG 


TCAGAGGAAG 


ATTTATTTTA 


CAAAGACGTT 


GAAGGCCGCA 


27500 


TGGAAGAGTT 


GAAACAAAAA 


CCCATCAAGA 


AGGAAAAAGA 


AACCCGAGGG 


GAAAAGATTA 


27660 


GTAAGACTTT 


TTCACTTTTA 


CTGGGTTTGA 


TGATTCTGAT 


TGGTTTGCTC 


TTTACTTTGC 


27720 


TGGGAATTTT 


GAGGTAGATC 


TATCATTGAA 


ATACTAATTG 


TTTTAGCTAT 


TATCCTATCT 


2 7730 


CTTGCTTTGA 


TTGTATTGGT 


AACTATACAA 


CCCCGTCAAA 


ATCAACTATT 


TTCCATGCAT 


27840 


GCCACTAGTA 


ATATTGGTAA 


ACCAAGCTAC 


TGGCAGAGCA 


ACACCTTGGT 


CAAGGTGCTC 


27900 


ACTTTATTGG 


TGAGTTTGGC 


TTTATTTATT 


CTACTATTAA 


CCTTTATGGT 


GATTACTTAT 


27960 


AAATAAAAGA 


AAACTTCAGA 


TATTCACCTT 


TTGTGGATTG 


GTCTGAAGTT 


TTCTTTTTTA 


28020 


TACTCAATGA 


AAATCAAAGA 


GC AAACTAG G 


AAGCTAGCCG 


CAGGCTgCTC 


AAAACACCGT 


29030 


TTTGAGGTTG 


TAGATATAAC 


TGACGAAGTC 


AGCTCAAAAC 


AC CGTTTTG A 


GGTTGTAGAT 


28140 


ATAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


28200 


AGCTCAAAAC 


AC CGTTTTG A 


GGTTGTGGAT 


AGAACTGACG 


AAGTCAGCTC 


AAAACACCGT 


28260 


TTTGAGGTTG 


TGGATAGAAC 


TGACGAAGTC 


AGCTCAAAAC 


AC CGTTTTG A 


GGTTGTGGAT 


28320 ■ 
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AGAACTGACG AAGctCAGTA ACATATATAC AGCAAGGCGA CGCTGACGTG GTTTGAAGAG 2 83 80 

TATTACTGTC TATATTTTTG GTAAAAATCA ACTTTTACTT GGATGAAGGT TTTGGCTTCA 28440 

CGTAGGAGTT GAAGAAGGGT GGCGCGGGTT TCAAATTCTT CTCTTGTCTT GGGCAGACTG 2 8 500 

CGGTTC CGG A AG ACTTC C AG ATAACGTTCA ATTTCATCTA GC AAATCAGA AGCAGGATTG 285 60 

GTCTGGCTCA GTTGACCTGC AATTTTTGAA AAGAGTTGCG CTAAGATCAG GCTTTCACTG 28620 

GCGGCAAGGT CACAAGTGTT AATCTGTTGG GCCATGTTTC TCAGGATACG ACTTTGTCGC 28680 

TGTCTCATCT CAAAGTAGTG GAT ATGGT AG TCTGTCTGGT GAAAGAGGTG GTCAGAGTGA 287 4 0 

TCCAAATAGA CCAGTCTGAG GGCTTCTTTC AAAAGCGTGT CTAATTCTGC TACCAGCTGT 28800 

GCT CGGTTGC GTCCGTCTCC TCTGGATAAA TAGTATTTGA AGCGCTGGAG GATATCTTTT 28860 

AACTTTTCTT CCACCAGCGT GTGGTAGTGC TGGATTTCCT CTTCTCGTGA AGGCATATAG 28920 

AGATTAACAA G C AAGG C AAA TCCTGTACCA AT AGCAAAG A GAAGGAATTC ATTGACTAGA 28980 

AGGTCTGGAG AGGTTGACTC TTGAACCAAG AGATGGCTAA CCAAAACAGT GCTTGGTGTG 29040 

ATGCCAATTT CCCAGCCCAT CTTGTAGGCT AAAGGAACGT AGAAGGCCAG ATAGAGGCCG 29100 

AGACTCCAGA TATGAAATCC GCTCAAGTGA AAAGCT AG AA CACCGATAGC CAGAGCTAGA 29160 

AGCATAGAAA AAAGACGATT GCGAGCCAGT TTTAAACTAC TTCTACCCGT ATCAGATAGG 292 2 0 

CTCAAGAGAG CGATAATTCC AGCCGAAACT GCTGACGAAA GATTGAGAAA ATAAGCAACC 292 80 

AGGCAGGCAA GACAGGTAGC TAAGATGAGC TTGGTCGTAC GTTCCCTAAT AGACATAAGA 293 4 0 

ATTTCCTAAT AAGTTAGAAT AAAAGCGTAA AAGACAAGAC ATGACCAGGC TTGCCTTGAT 29400 

GAGTTATTTT TTACGGGTTG CTGCGTATTC GGCAACGGCG GTAAAGAGGA CATCTGTAGA 294 60 

AGAGTTAAGG GCTGTTTCAC ATCAGTCTTG GATGACACCA ATCACAAAAC CAACCCCAAC 29520 

AATTTGT ATG GCAATATCGT TAGAAATACC GAAAAGGCTA CAAGCAACTG GGATAAGAAG 2 9 580 

GAGGGAACCT CCGGCAATAC CTGAAGCATC ACAGGATGAG ATAGCTGCTA CCACACTG^G 2 9 640 

GACAAAGGCT GTGGCAAAGT CAACAGGAAT TCCAACAGTG TTAACTGCAG CAAGGGTCAA 2 9700 

AACGTTAATG GTAATCGCTA CTCCAGCCAT ATTGATAGTA GAACCGAGTG GGATAGAAAC 297 60 

AGAATAGGTA TCTGGGTTGA GTCCAAGGTC ATGGCAGAGT TTCATGTTGA CAGGAATGTT 29820 

AGTCGCAGAA CTACGAGTGA AAAAGGCTGT CACACCGCTG ACACGGAGGC AGTTCCAAAC 29880 

TAGAGGGTAA GGATTGCGTC T C AT AAAGAA GAAGGCAATC AAAGGGTTGA CCACAGGGGC 2 9940 

AACAAAAAGC ATAGTCGTTA CTAATAGAAC CAATAAAATA CCGTAGTTCC CAAGGCTTCC 3 0000 

GACTCCCTTG TCAGAAATGG TTTTAAAAAC AAGACCAACG ATTCCAAATG GAGCCAGATT 3 0060 
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GATGATCCAT 


TCGACAATTT 


TAGAAGTCAC 


GTCAGCGATA 


GTTTTTAGCA 


ATTCTTGACT 


30120 


ATTTTTACTG 


GCTTCTCTCA 


TAGCGATTCC 


AAAAATGACT 


CCCCAAGATA 


AGATTCTAAT 


30180 


ATAGTTAGCA 


GTAAGCAGGG 


CGTTGACTGG 


GTTGTCAACC 


AGTTTGAGCA 


AGAGGTTGCT 


30240 


GAGAACCTGC 


CCAATCCCAT 


CTGGTGGTGC 


AATTTCAGTA 


TTGGCACTAT 


TTGGGCTAAT 


30300 


TTCAATAGGG 


ACGATGAAAT 


TTGCTAGTAC 


AGCTACAAGA 


GCAGCGGCGA 


AAGTCCCTAT 


30360 


CATAGGATAT 


ACAAGAAAAC 


AACAGTTTTC 


ATATTGCTAT 


CTTGTCCCTT 


TTGATGTTGG 


30420 


GAAAGGGCAT 


TGGCAACGAG 


AGCAAAGACT 


AGGATAGGAG 


CAACAGCTTT 


TAGACCTCCA 


30480 


ACGAATAAAT 


CCTCGAGTAG 


CCCAATCCCT 


GAGAGATTAG 


GAAGGGTCAG 


TCCTAGGATT 


30540 


CCCCACAAAG 


CATACCAATC 


AAGATACGCT 


TCACAAGGCT 


TGCCTTATTC 


CAACCATGAA 


30600 


TGATTCTTTT 


CATAATAATC 


TCCTTTTTGT 


GTACTGATTA 


TGATTATAGT 


ATAAATGATA 


30660 


GACAAAATCA 


AGAATTTTCT 


GTCTATTTTT 


TGAATATTTA 


TGGAGAATGA 


GACTGATGAA 


30720 


AATATGGTAT 


AATGAAATAA 


AGCAGTTTTA 


TATGCAAAAA 


TTTATTCAGG 


CTTATATTGA 


30780 


AAACCTACAT 


GTGACAACCA 


TTATCGAGAA 


TATTCTAACC 


AAGGTCATTT 


CTCTTTTACT 


30840 


GCTTTTAATT 


GTATTTTATA 


TTGCTAAAAA 


AATGCTTCAT 


ACCATGGTGC 


AGAGAATTGT 


30900 


CAAACCTTCT 


CTAAAAATGT 


CTCGTCATGA 


TCTTGGACGC 


CAAAAAACCA 


TCTCACGTTT 


30960 


ACTAGAAAAT 


GTGTTTAATT 


ATACGCTATA 


TTTCTTTTTA 


CTCTACTGCA 


TTTTGTCGAT 


31020 


TTTAGGTTTG 


CCAGTTTCTA 


GTTTGCTGGC 


TGGAGCTGGT 


ATTGCTGGGG 


TAGCGATTGG 


31080 


TATGGGAGCC 


CAAGGCTTTC 


TGTCTGATGT 


CATCAATGGC 


TTTTTCATCC 


TCTTTGAACG 


31140 


TCAACTGGAT 


GTGGGAGATG 


AGGTCGTTCT 


G AC AAATGG A 


CCGATTACTG 


TATCGCGTAA 


31200 


GGTTGTCAGT 


GTGGGAATTC 


GTACGACACA 


CCTTCGTACC 


GAGGAGCAAG 


CCCTTCACTT 


31260 


TGTCCCTAAC 


CGAAATATCA 


CAGTTGTTAG 


CAATTTCTCA 


C 3 C AC AG ACT 


AGACCTGTTA 


31320 


TTT T AAGT AA 


TTTGTGGTAC 


AATAGAGGGA 


GTTTAATAAG 


GAGAAAAGAT 


GCTTTTAGAA 


31380 


AAGCAGTTGG 


GCAATCGTTG 


TACCTGGATA 


GACCTAGACC 


TAGGAAAGTT 


GAATAAACTA 


31440 


GAAGACCTTT 


CTCAAATTTA 


CGGTTTGGAC 


AAGGAAACCA 


TTGAATACGC 


ACTGGATAGA 


31500 


AACGAGCGCG 


CCCACATGGA 


CTACCACCGT 


GAAAGTGAGA 


CGCTTACCTT 


TATCTATAAT 


31560 


GTCTTAGACG 


TAAAAAAGGA 


CAAGGCCTAC 


TAT G AG ACT T 


TTCCCATGAC 


CTTTATTGTC 


31620 


GAGCATCGTC 


GCCTGATTAC 


CATTAGTAAT 


ACCAAGAACG 


CCTATGTCAT 


TGAACAGATG 


31680 


ACTCGTTATC 


TGGAGAACCA 


TGACACGCTT 


TCCATTTATA 


AG7T7CTCTT 


TGCCAGTCTG 


31740 


GAAATCATCA 


GCAATCCCTA 


CTATCCTGTC 


ATTGAGCACA 


TGGACAAGAG 


TAGGGATGAG 


31800 


GTCAATGACC 


TCTTGCGCCA 


GCGAACTACC 


AAGAAAAACC 


TCTTTGTCCT 


GTCTGATTTG 


31860 
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GAGACTGGTA TGGTTTATCT GACGGCAGCT GCCAAACAAA ATCGGATTTT GTTAGAGCAT 31920 

ATTCAAGGTC ATGCCTTGTA TCGT AGTTTT GATGAGATTG AGAGAGAACA GTTTGATGAT 3L930 

GCCATGATTG AGGCTCATCA GCTGGTATCC ATGACAGACC TAATCTCTCA G ATTTT AC AG 3 2040 

CAGCTTTCAC CCTCTTACAA CAATATTCTA AACAATAATC TGAATGACAA TTTGACAACC 3 2100 

TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCG TGACAGGCTT TTTCGGAATG 3 2160 

AATGTTCCCT TACCTTTAAC AGATGAGCCC CATGCTTGGC TCTATATCAG TTTGGCTAGT 3 2 220 

GCAGGTTTGT GGATTGTTTT ATCCTTGTTA CTAAGGAAAA TTGCGAAAAA AAGTTAAGAA 3 2230 

AAGGAGCCAG AATCGCGATT GAAAATTATA TACCAGATTT TGCTGTGGAA GCAGTCTATG 3 2 340 

ATCTCACAGT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC TGTTTTGGTC GATTTGGATA 3 2400 

ATACCCTCAT TGCTTGGAAC AACCCTGATG G AACGC C AG A GATGAAGCAA TGGCT ACATG 3 24 60 

ACCTTCGGGA CGCGGGTATT GGCATTATCG T AGTGT C AAA TAACACCAAA AAACGCGTTC 3 2S20 

AACGAGCAGT TGAGAAATTT GGGATTGATT ACGTTTACTC GGCCTTGAAG CCCTTCACAT 3 2580 

TTGGTATTGA CCGTGCTATG AAGGAATTCC ACTATGACAA AAAGG AAGTG GTCATGGTTG 3 2640 

GTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTCC ACGGATTCCC TCAATTTTAG 3 2700 

TCAAACCCTT GGTCCAACAT GACTCAATCA AAACG C AG AT TAACCGAACT CGTGAGCGTC 3 27 60 
GTGTTATG 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 
CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 
AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 180 
TTACCGTAGA AAAGGGTG AC TACGTTGCCA TCATGGGTGA GTCTGGTTCT GGTAAATCAA 2 40 

CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 300 
GAACTGACAC CGCAACTATT AAAAATTCAC AGGCTTCTAG TTTCCGGCGT G AAAAGCT AG 360 
GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TGTTAAGGAC AATATCTTGC 4 20 



32768 



6C 
120 
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TTCCGCTTGT 


CTTGTCAAGA 


AGACCTATAA 


CGGAGATGAT 


GAAGAAATTG 


GTGGTG AC AG 


430 


CTGAGAATCT 


GGGTATTAAC 


CAATTGCAAG 


AGAAGTACCC 


TTACGAGATT 


TCTGCTCGTC 


540 


AGAAACAGCG 


TGTAGCAGTA 


GCCCGCGCCA 


T CATC AC AG A 


ACCTGAAATT 


CTCCTTGCGG 


600 


ACGAGCCAAC 


AGGAGCCCTT 


GATTCCAAGT 


CATCTGCAGC 


CTTACTTGAT 


GTCTTTAATG 


660 


AAATCAATGA 


GCGTGGGCAA 


ACCATCCTCA 


TGGTAACCCA 


CTCAACAGCA 


GCTGCTAGCA 


720 


GGGCCAAGCG 


TGTTCTCTTT 


ATCAAAGACG 


GCATTCTTTA 


CAACCAAATC 


TACCGTGGAG 


730 


AGAAGACAGA 


GCGTCAGATG 


TTCCAAGAAA 


TCTCTGATAC 


CTTGACTGTC 


ATGGCAAGCG 


840 


AGGTGAATTA 


GTATGTTTCG 


ATTAACCAAT 


AAGTTAGCGG 


TATCGAACTT 


GATTAAAAAC 


900 


CGCAAACTCT 


ACTATCCCTT 


TGCACTGGCT 


GTTCTCTTGG 


CAGTCACCAT 


CACCTATCTC 


960 


TTTTACTCCC 


TAACCTTCAA 


TCCAAAGATT 


GCGGAAATCC 


GTGGAGGAAC 


CACCATTCAA 


1020 


GCAACACTTG 


GATTTGGTAT 


GTTTGTCGTT 


ACCCTTGCGT 


CAcCATTATC 


GTCCTCTATG 


1080 


CCAATACTTT 


TGTCATGAAA 


AACCGTTCCA 


AGGAACTGGG 


T AT AT A TGGC 


ATGTTAGGCT 


1140 


TGGAGAAGCG 


CCATCTAATC 


AGTATGACCT 


TTAAGGAGTT 


AGTGGTATTT 


GGCATTCTAA 


1200 


CTGTTGGAGC 


GGGTATCGGT 


ATTGGAGCCT 


TGTTTGACAA 


GTTAATTTTC 


GCTTTCCTGC 


1260 


TCAAACTAAT 


GAAACTGAAG 


CTTGAGCTCG 


TTGCTACCTT 


CCAAATGAAT 


GTTGTCATTG 


1320 


CAGTACTTGT 


TGTCTTTGGA 


TTGATTTTCC 


TAGGCCTCAT 


CTTCCTGAAT 


GCTCTTCGAA 


1380 


TCGCCCGTAT 


GAATGCCCTC 


CAGCTCTCGC 


GTGAGAAAGC 


AAGCGGAGAG 


AAAAGAGGTC 


1440 


GCTTCCTACC 


TCTCCAAACG 


ATTCTTGGTT 


CCATAAGTTT 


AGGGATTGGC 


TATTATCTTG 


1500 


C C CTT ACGGT 


AACCGATCCT 


CTTACAGCCC 


TAACAACTTT 


CTTCCTAGCT 


GTTTTGCTGG 


1560 


TTATCTTTGG 


TACTTATCTA 


TTGTTTAATG 


CAGGGATTAC 


ACTCTTCCTA 


C AAAT CTT AA 


1620 


AGAAAAACAA 


G AAAT ACT AT 


TACCAACCTA 


ATAACCTCAT 


ATCTGTTTCC 


AACTTGATTT 


1680 


TCCGTATGAA 


GAAAAATGCG 


GTTGGACTAG 


CAACCATCGC 


TATTTTGTCA 


ACAATCGTTT 


1740 


TGGTAACCAT 


GTCAGCAGCG 


ACAAGCATTT 


TCAATTCCGC 


AGAAAGCTTT 


AAAAAAGTTC 


13C0 


TAAATCCTCA 


TGATTTTGGG 


GTTTCAGGGC 


AAAATGTTGA 


AAAAGAAGAT 


TTGGACAAAC 


1860 


TCTTGAGCCA 


GTTTGCAAGT 


GACAAAGCTT 


ATAGTGTCAA 


AGAGAAAGAA 


GTACTTCGTT 


1920 


ACAGTAACTT 


TGGTATTGCA 


AATCAAGAAG 


GAACCAAGTT 


AACTATTTTT 


GAAAAAGGAC 


1980 


AAAACCGTGT 


CCAACCCACA 


AC AGTTTT C A 


TGGTATTTGA 


CCAAAAAGAT 


T ATGAAAAT A 


2040 


TGACTGGTCA 


AAAACTGTCT 


CTATCAGGAA 


ATGAGGTCGG 


TCTCTTTGCC 


AAAAATGACG 


2100 


GACTGAAAGG 


AC AG AAAGCT 


CTAACTCTAA 


. ATGATCATCA 


ATTTTCTGTC 


AAAGAAGAAT 


2160 


TTAATAAAGA 


, TTTCATTGTG 


AACCATGTTC 


1 CAAATAAGTT 


TAATATCTTG 


1 ACT ACT G ATT 


2220 
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APAATTACCT 




GATTTACAAG 


CCTTTTTGGA 


TCAATTCCCA 


GATTCGGCTA 


2230 


T f*T AT AA T C A 


rTo^r t AC GGT 


GGTATGAATG 


TAAATGTCAG 


TGAAGAAGAA 


CAACTCAAGG 


2340 


L LuL 1 VjAv»L»M 


rtTATf^AAAAC 


TACCTCAATC 


AATTTAATGC 


TCAATTAGAC 


ACAGAAGGTA 


2400 


^ CV ft T , r , T w T ,r P & 

GCTAIu* I LA 


TrrT & nr* a at 

1 VjVj I ftutrtn 1 


r*T a a n at n 


CTAGTTCTCA 


GATGAGTGCC 


CTCTTTGGTG 


2460 




TATPnnTATT 


TTCCTATCCA 


TT ATCTTT AT 


GGTCGGAACT 


GTTCTGGTCA 


2520 


i L * At_ 1 nLnn 


rtL rt/irt 1 1 i V» L 


GAAGGCTACG 


AAGACCGTGA 


ACGCTTTATT 


ATCTTGCAGA 


2580 


AALj I LUU 111 


V_rvj Al_ v_ nnnnu 


TAAATTAAGC 


AAACCATCAA 


CAAACAGGTT 


TTAACTGTTT 


2640 




111 bL 1L 1 1 1 


GCCTTCATAC 


ATCTCGCCTT 


TGCCTACCAT 


ATGCTTAGCC 


2700 


TGATTTTAAA 




GT ACTGG AT A 


CGACTATGAT 


GTTGATTGTG 


ACCTTGTCTA 


2760 






nrrTATGTGC 


TGATTTTCAT 


GATTACTTCA 


AGAAGTTATC 


2820 




f*C ft A A TnT A A 


A A A AfiATACC 


TCGACTTCAA 


AATCGAGGTA 


TTTCTTGTAT 


2880 


It 1 AAA 1 Ljv_ I 


rj u ft r.TTr.T 


CfGAGCAGGA 


AGGTAACTCC 


CATGGTCAAG 


AGACCAATAG 


2940 


CAAutj 1 TC,(.U 


AA I A I nwL 1 




GCGCTTTTCC 


AAGTCTAGCA 


CTTGTGTAAC 


3000 


Lftb IGAtjAAvj 


nAuub v. ML A 




GGACGGTAGC 


AGGGATGCGG 


TAATCACTTG 


3060 


f* ft ft ft ft ft rn/-"/— T> 
O AAAAA 1 LjLj 1 


l_ AL 1 UML/\oL 


ATTfiCAGGCA 


AACTTCTAAG 


GAAAAAGGCA 


ACGAAGCTAG 


3120 


X ft ft T*f t** ft /"* 


Vj l\»H.AAuUA 


TTnnT AAAT T 


CTTCATACTC 


AATCCC AT AT 


TTTTCCTCTA 


3180 


CC AG AG CCTT 


OAU 1 uurt 1 * I 


TT A AG AAA G A 


TCTTATTGGT 


CAAGAGTTGG 


G C AG AAGTTT 


3240 


1 Li AA 1 .L I LL 


A I i. * - UuA . /\ 


TAAGCAGCAT 


AGAGGGATTT 


TTTGGCTAGT 


TCCCTATCTT 


3300 


/**(~rp/-HT1 f. /->/-• » Ik 

(ju It; Abl».AA 


Urtb ^ i, I 1 l\- I 


rr.rnAAACGG 


CAGCTTCCTC 


GGTATCTTTT 


GG AGTTG AAA 


3360 


t_ jurt * Av»A 1 .\ 


1 * 1. V— v— /as*. u r% 


GCCATTGAAA 


AGGC AC CAGC 


TAAGATAGCC 


GTAAAACCTG 


3420 


A 1 AAAAAVjM i 


A ATPPAfi ATA 


TTGGTCGTGG 


CACTGGCAAC 


TCCGATAACC 


ACACCAGCAA 


3480 


TGGAAATAAT 


TCCATCGTTA 


GCATCAAGAA 


CACCCGCACG 


CAGGATATTT 


AAACGACCTG 




CAAAATTTGA 


ATCAATTTCG 


TGATTTGTTT 


CTGACGCTAA 


ATTTCAAGTT 


CAAGTTAGCC 


3600 


A7CAAGAAGT 


CTTCTCTCGG 


TGACTTGTAG 


TCCAAGCATT 


TTTTAGGATA 


GTTGTTAATC 


3660 


CACTTTTCCA 


TGAATGCGAC 


TTCTTTGGGA 


GTCATTTTCT 


TGGTTCCCTT 


AGGTAACCAT 


3720 


CTACGAATGA 


GCCTGTTGTG 


ATTCTCATTA 


GTTCCCCTTT 


CCCAAGAGGC 


ATAGGGATGT 


3780 


GCATAATAAA 


TGTGCTCCTC 


AGAAAATACA 


TTAGACAAGC 


GATTGAATTC 


CGTTCCATTA 


3840 


TCTGCCGTGA 


TGGAAAGAAT 


CTTGTGTTGT 


TTTAAGATGA 


GTTTT AG AGC 


CTGATTGACC 


3900 


ACATCAGCAC 


TTTTATTTGG 


AATCAATCGG 


ATGATCTGAT 


GTCTACTTTT 


TCGATCCGTC 


3960 
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AAGACAAGCA 


AGCAGTAGTT 


TTTCGCTCTC 


GTAAGTAGAA 


CTGTATCAAT 


CTCATAATGC 


4020 


CCATTCTCCA 


AGCGAAAATT 


CATAGCTTCA 


AGCCGCTGTT 


CGATGGATTG 


ACCAGCAGGT 


4080 


TTAAACTTGG 


TG C TGGCCTG 


TTTCTTAAGC 


GCTTTTCCTT 


TTCTAGGGTA 


AAGCAGATCC 


4140 


TGTTTGCTTA 


ACCCCAATTT 


TCCATGATGA 


ATCCAATAGT 


AAATGGTTGA 


AATTCCCACG 


4200 


TTAACCCCTT 


TAGCCATCAC 


CATCATTTCA 


GGCGAAAATT 


TTTGGTTATG 


ATAGTGGAGA 


4260 


ATCTTTTCCT 


TTAGTTCCTT 


GGTC AAGCTT 


GATTTCTTGA 


CCGAGCGCTT 


GCGATTGTTT 


4320 


TCATAAGACT 


GTTGAGCATA 


GTCGGCAGAA 


TAAACCTCTT 


TGAAGCGCCC 


TTTTCCAAGA 


4380 


CATTGTCGGA 


CTGTCCCACG 


CTTGATTTCA 


GTG TGG AT AG 


TTTGAGGAAC 


TTTTCCAAGC 


4440 


AGAGAGGCAA 


TTTCTCTATT 


TGATTTCCCT 


TCTTTTTTCC 


ATCTTTCGAT 


TAAGCCACGG 


4 500 


CTATCGATTG 


TCAAATGTTC 


GCCTTTTCTA 


GTATAATGGT 


TTTGCATCTC 


TGTGCCTTTC 


4560 


TTGTGTTTGT 


GGTTGAACAA 


CAAGTATAAC 


ACAGAGGTGT 


TTTCTTATGC 


CTACAAGAGC 


4620 


TATCGGCTAG 


TTGAACCATC 


TAATTTTTAG 


GAGGGCTGGG 


TCGCTAACTT 


CATTATAGAA 


4680 


CTTTCATTTA 


CGAACATATA 


GTAAAATGAA 


ACAAGAACAG 


AACAAATCGA 


TCAGGACAGT 


4740 


AAAATCTATT 


TCTAACAATC 


TTTTAGAAGC 


AGAGGTGTAC 


TATTCTAGTT 


TCAATCTATT 


4800 


ATATTTTTGT 


TTTTTATCAA 


AAAATACTTT 


ACAAGTTCTT 


AAAAACATGA 


TATAGTAATA 


4860 


AAGCTT AG AA 


AATGAGATGA 


TGTTTTCTAG 


CAAATATAAA 


CCCGAGTAAA 


AAATGCCTAC 


4920 


GCACAGGCAG 


GGTTGAATGC 


CGAAGCGTGG 


TTGAAAAGCC 


AC ATT ATTGA 


TAGGGTTAAA 


4980 


AGCCTACTTT 


TATAAGTTGA 


TGTTAGGACA 


CTTGTCCTAA 


TTCATAAATT 


TTTAGTCTGC 


5040 


TCAAAGCACA 


CGTCATCTTG 


TGAAACGATC 


AAT AAAGT AC 


GTAATATTTG 


CTACTAGAGA 


5100 


GTTAGGAAAC 


ATCGGGAACA 


GACATACTCA 


ACAGAAACCA 


AAATAAACAC 


GTCAGAAGAT 


5160 


TGCAGAGCAG 


CTGAAAACCT 


GCTCTTTTTT 


CATGAGTCAA 


CCTTTACTTC 


CTTAGTTTTC 


5220 


ATAAGGTCCT 


AAAAATATTG 


AAAGGAGTAT 


CTTTTGAAAG 


AGTTAGATCA 


AAACCAAGCC 


5280 


CCAATTTATG 


AGGCCTTGGT 


GAAGTTACGC 


AAGAAAAGGA 


TTCTTCCCTT 


TGATGTTCCA 


5340 


GGTCACAAGC 


GTGGACGGCG 


AAATCCAGAA 


CTTGTCGAAC 


TCTTAGGAGA 


AAAATG TGT A 


5400 


GGCATTGATG 


TCAATTCGAT 


GAAACCTTTG 


GATAATTTAG 


GCCATCCTAT 


TTCGATTATT 


5460 


CGTGATGCAG 


AGGAGCTGGC 


TGCAGATGCT 


TTTGG AG CT A 


GCCATGCCTT 


TCTAATGATT 


5520 


GGTGGAACAA 


CTTCATCGGT 


GCAGACTATG 


ATTCTGGCAA 


CCTGCAAGGC 


AGGAGATAAG 


5580 


ATTATTCTGC 


CACGAAATGT 


CCATAAATCT 


GCTATCAATG 


CGTTGGTTCT 


ATCTGGTGCC 


5640 


ATTCCCATCT 


ATATCGAGAT 


GAGTGTAGAT 


CCTAAGATTG 


GTATCGCTTT 


AGGTCTTGAA 


5700 


AATGACCGAG 


TAGCACAGGC 


CATAAAGGAC 


CATCCAGATG 


CTAAGGCTAT 


CCTAATCAAC 


5760 
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AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 58 20 

GCTGGCATGA TGGTTTT ACT AGATGAAGCC CACGGAGCGC ATTTGC AT TT CACTGATAAA 5 880 

CTTC CAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAC CAGTTTCCAT GCATAAGTCT 5 94 0 

G GTGGG AG TT TGACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA TAATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGCTAGT 606 0 

TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG AGTCGTTTGA GAAAGTCATT 6120 

GAGCTATCTG AGTATGCCCG CCGTGAAATC AATGCTATCG GTGGCTACTA TGCCTACTCA 6130 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 62 4 0 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6 300 

ATTCAGATCG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCGACCGC 6 3 60 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 64 20 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 64 8 0 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTGTTGG ACAGGTCTGT 654 0 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TGGCTCCTGG TGAACGCATT 66 00 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAAGGAAC CTGGTTGCTC CCTCCAAGGG 6 6 60 

ACGGAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA CTATAAGAAA 6720 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGGC GATGATTTTA 67 80 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6 840 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA ACTCAGGAAA GATGATATCA AGACAACGAG 69 6 0 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 702 0 

TTTTT AG TTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 7140 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTGTA AACTTTATAT CCTTGTTCAT 7 200 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGG T ATT TTCAAGAGCG ATGAGACAGT 72 60 

CTTGGTTGAG CTGTCCAAGA GACAGATCTA AGAGTTCAAA ACCAGCTTTA TTATTTGAAA 7 320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTGTTTAT 7 380 

TTTTAGCGAC ATC AATGCCC ACATAAAGCA TGGGAGT ATC TCCAG AT AT A GT ATTTCAAG 7 4 4 0 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7 500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGG TTAGAGCCTT TCGGAAATCG 7560 

TCAAGCGATT GGAGGAAATG AACTAATCCA CACTCCCTTA TTCCAAGTAT ACCACTTGGG 7620 

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7 680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 774 0 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACG CCftGCTTTTG GGAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 7 8 60 

CGTTCACGTT CCCATGCCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 792 0 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7 980 

GGAACCCGAT GAGATGTTGG TCGAGGTCTG TCGTGAGTAT TTCCCAGACT TTGCTGCAGG 804 0 

GCTAGATGAT C CTCGTGTT A CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACGATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC AT ACGG AAGG 8160 

ACTCTTTACC AAGGAATTCT ACCGCAATAG TTATCGAGCT CTGAAGGAAG ACGGCATCAT 8220 

GATTTACCAG CATGGGAGTC CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAAGCATGCA 8280 

CCGCAAGGTC AATCAAGCCT TTCC AATC AG TCGGGTTT A T CAGGCCCATA TTCCAACTAG 8340 

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 3400 

TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 84 60 

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TG AG G AC ATT TTAGAAGAAG AGG AAGG AAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GGGG GCGTTG CCCAAGTTGC TATTTCAAAG 8580 

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATCATTG CTAGCCGTAC CAAGTCAAAA S64 0 

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCAG C A 8700 

CTTGATGCTG ACAAGGTTGA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 87 60 

CTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820 

GGTGTTCACT ATATCGATAC AGC CAACT AC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8 880 

CGTGCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 894 0 

TGGCAGTGGG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 9000 

GGTTTTGACC CAGGTGTAAC TACTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTG ACCACGGTTA TCCATTTGCA 912 0 

ACCAACTTTA AT CC AGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGGAA 918C 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT GGCCAAGAAC 9 300 
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ATTCCAGCTG TC AAACG CAT TCCTTTCTTT ATGACTTTTG GTCAATCTTA CTTGACGCAC 9 360 

ATGAAATGTC TTGAAAATGT TGGACTCCTT CGTACGGATA CCATTAACTT TAACGGCCAA 94 20 

GAAATTGTTC CAATTCAATT TTTGAAAGCC TTGCTTCCAG ATCCTGCCAG TCTTGGGCCA 9 480 

CGTACAGTCG GAAAAACCAA TATTGGATGT ATCTTTACAG GTGTCAAAGA CGGTGTCAAA 9 540 

AAGACTATCT ATATCTACAA TGTCTGCGAC CATCAGGAAT GTTACGCAGA GGTTGGTTCG 9 600 

CAAGCTATTT CTTATACGAC AGG AGTTCC A GCCATGATTG GGACAAAATT AGTCATGAAC 9 660 

GGAACTTGGA AACAAGCTGG AGTGTATAAC CTTGAGGAGT TAGATCCAGA TCCATTCATG 97 20 

GAAGCTTTGA ATGAGTATGG TTTGCCATGG GTTGTGGTTG AAAATCCACA AATGGTGGAC 9 7 80 

TAATGAACTT AGAACAAGTA CCAACACCAG CCTATGTTAT TGACTTGGCC AAGTTAGAAG 9 84 0 

CTAATTGCCG CATTCTACAA TATGTACAAG AAGAGGCCGG TTCCAAGGTC TTGCTTGCCC 9 900 

AGAAGGCATA TTCCCTCTAC AAAACTTATC CCTTGATTAG CCAGTATCTA TCAGGTACGA 99 60 

CAGCTAGTGG ACTCTATGAG GCCAAATTGG CAAGGGAAGA ATTTCCTGGT GAAGTCCATG 10020 

TATTTGCGCC TGCTTTCAAG GATGCAGACT TGGAGGAATT GCTAGAGATA ATGGACCATA 10080 

TAGTCTTTAA CTCAGAGAGA C AG T TGCGT A AACACGGTCC GCGTTGTCGA GAGGCTGGTG 10140 

TCAGTGTTGG TTTGCGCCTC AACCCTCAGT GTTCAACTCA AGGcAGATCA CGCGCTCTAT 10200 

GACCCTTGTG CACCAGGTTC TCGCTTTGGA GTTACTATAG ACAAGATTCC GAGTGATTTG 10260 

CTAGATTTGG TTGACGGACT TCATTTTCAT ACCCTTTGCG AGCAGGGAGC AGATGATTTA 10320 

CAAACAACTT TGAAAGCAGT AGAAGAACAG TTTCCTCCCT ACTTACATGA GGTAAAATGG 10380 

CTCAATATGG GTGGTGGTCA TCATATTACA AGAGAAGGTT ACGATGTGGA TTTGCTGATT 1044 0 

TCAGAAATCA AGCGTATCCG AAAAACTTAC AATCTT G AAA TCTATATCGA GCCTGGTGAA 10500 

GCCATTGCGC TTAATGCCGG TTATTTAGCA ACTG AGG TAT TAGATATTGT AGAAAACGGT 10560 

ATGG AAATCT TGGTTTTAGA CGCCTCTGCG ACCTGCCATA TGCCTGATGT ACTTGAGATG 10620 

CCCTATCGTC CACCTTTGAG AAATGGCTTT GAGTCACAGG AAAAAGCCCA T AC CT AC AG A ] 0680 

CTTTCTTCTA AT AC CTG T CT GACGGGCGAT GTGATTGGTG ATTATAGTTT TGAAAATCCA 1074 0 

GTCCAAATCG GAGACAGACT TTATTTTCAA GACATGGCCA TTTATTCTTT TGTCAAAAAT 10800 

AATACCTTTA ATGGTATTGG ATTCCCAAGT CTCTATCTCA TGGACGAACA GGGAGACTGT 10860 

AGCTTACTCA AAGCTTTTCG CTATCAAGAC TTTAAAGGGA GATTATCATG ATGGACAGTC 109 20 

CAAAAAAATT AGGCTATCAC ATGCCAGCAG AGTACGAACC CCATCATGGT ACCCTCATGA 10980 

TATGGCCGAC TCGACCAGGA TCATGGCCTT TTCAAGGAAA GGCTGCTAAA AGAGCATTTA , 1104 0 
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CTCAGATTAT 


CGAGACCATA 


GCAGAAGGGG 


AAAGAGTCTA 


TCTTTTGGTG 


GAGCAGGCCT 


1I1C0 


ATCTATCTGA 


AGCCCAATCC 


TATCTTGGAG 


ACAAGGTTGT 


TTATTTAGAC 


ATTCCCACCA 


11160 


ATGATGCCTG 


GGCGCGTGAT 


ACTGGCCCAA 


CCATTCTCGT 


CAATGATAAA 


GGTAAGAAAT 


11220 


TAGCCGTGGA 


TTCGGCCTTC 


AATGCTTGGG 


GAGGCACCTA 


TGATGGTCTT 


TATCAAGATT 


11280 


ATGAAGAGGA 


TGACCAAGTA 


GCCAGTCGTT 


TTGCTGAGGC 


CTTGGAAAGG 


CCTGTCTATG 


11340 


ATGCTAAACC 


TTTTGTACTG 


GAAGGAGGCG 


CAATCCATAG 


CGATGGTCAA 


GGAACTATTC 


11400 


TCCTAACTGA 


AAGTTGCTTG 


CTTACTCCTG 


GTCGCAATCC 


TAACTTGACT 


AAAGAGGAGA 


11460 


TTGAAAACAC 


ATTATTAGAA 


AGTCTTGGTG 


CTGAAAAAGT 


TATTTGGCTT 


CCTTATGGTA 


11520 


TTTATCAGGA 


TGAAACCAAT 


GAACACGTCG 


ATAATGTTGC 


TGCCTTTGTT 


GCTCCTCCTC 


11580 


AG CTTGTTTT 


GGCTTGGACA 


G ATG AC G AAA 


ATGATCCCCA 


GTATGCCATG 


TCAAAAGCAG 


11640 


ATCTCGAACT 


CTTAGAACAG 


G AAA C AGATG 


CAAAAGGTTG 


TCACTTCACC 


ATTCATAAAT 


11700 


TGCCTATCCC 


TGCAGTTCGA 


CAAGTTGTGA 


CAGAAGAAGA 


TTTGCCAGGC 


T AC ATCT ATG 


11760 


AAGAAGGAGA 


AGAAAAGCGA 


TACGCAGGTG 


AACGACTAGC 


AGCTTCCTAC 


GTAAACTTTT 


11820 


ATATCGCCAA 


CAAGGCTGTC 


TTGGTTCCAC 


AGTTTGAGGA 


TGTAAACGAC 


CAAGTGGCCT 


11880 


TAGATATCCT 


CAGCAAGTGT 


TTCCCAGACC 


GTAAAGTTGT 


CGGAATACCA 


GCCAGAGATA 


11940 


TTCTCTTAGG 


TGGTGGCAAT 


ATCCACTGTA 


TCACCCAACA 


AATTCCAGAA 


TAGGAGAAAA 


12000 


AG ATG AG AAA 


TGTAAGAGTT 


GCAACCATTC 


AGATGCAATG 


CGCTAAGGAT 


CTGGCAACAA 


12060 


ATATCCAAAC 


C GC AG AGCGT 


TTAGTACGTC 


AGG CTGCTG A 


GCAAGGAGCC 


CAAATTATTC 


12120 


TCTTGCCCGA 


CTTGTTTGAA 


CATCCCTATT 


TCTGTCAGGA 


ACGTCAGTAT 


GACTACTACC 


12180 


AGTATGCCCA 


ATCTGTAGCG 


GAAAATACTG 


CCATTCAGCA 


TTTTAAGGTG 


ATTGCT AAGG 


12240 


AACTACAAGT 


TGTTTTACCA 


ATCAGTTTCT 


AT G AAAAAG A 


TGGTAATGTC 


TTGTATAACT 


12300 


CTATTGCCGT 


CATTGATGCA 


GATGGGGAAG 


TGCTGGGCGT 


TTATCGAAAG 


ACCCATATAC 


12360 


C AG AT G ACC A 


TTATTATCAA 


GAAAAATTCT 


ATTTCACGCC 


TGGTAACACT 


GGTTTCAAGG 


12420 


TCTGGAATAC 


TCCCTATCCT 


AAGATTGGTA 


TCGGTATCTG 


TTGGGATCAA 


TGGTTCCCTG 


12480 


AAACAGCGCG 


CTGTCTTGCA 


TTGAATGGTG 


CTGAATTGCT 


CTTTTATCCT 


ACAGCTATCG 


12540 


GTTCAGAGCC 


AATTTTGGAT 


ACAGATAGTT 


GTGGTCACTG 


GCAACGTACT 


ATGCAAGGGC 


12600 


ACGCAGCAGC 


GAA7ATTGTT 


CCAGTCATCG 


CAGCCAATCG 


TTATGGTTTA 


GAGGAGGTTA 


12660 


CTCCTAGTGA 


GGAAAATGGC 


GGACAGAGCT 


CCAGTCTTGA 


CTTCTACGGT 


TCCTCCTTTA 


12720 


TGACGGATGA 


AACAGGAGCT 


ATTCTAGAAC 


GAGCTGAAAG 


ACAAGAAGAA 


GCTGTTCTGT 


12780 


TAGCTACTTA 


. TGACCTAGAC 


AAGGGAGCAA 


> GTGAACGCCT 


AAACTGGGGC 


TTGTTTCGAG 


' 12840 
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ATAGAAGACC 


AGAAATGTAT 


AGACAAATTA 


CAGATTAGTG 


TGGCAGAAAT 


G AG AG ATT C A 


12900 


TTCTGCTAGA 


CTAACTTCTT 


ATTAGTAACT 


AT AAG AT ACT 


ATGGCATCTA 


GTAAATCGAT 


12960 


TTTTATGATT 


CGCTATTCTT 


GTCTATTGAT 


TAGTCCGTAT 


TTTAAAATAT 


TAGCAAAAAA 


13020 


GCAAATAGCA 


GTAACTTCTG 


TCTATTTGCT 


TTTCTTTTTT 


ATAGAATATA 


TTTCTCAATA 


13080 


GCACGCGCAA 


CGCCGTCTTC 


TTCGTTGCTT 


GAGGTAACGG 


CATCCGCAAG 


AG ATTTG AT A 


13140 


TAATCGCTGG 


CATTTCCCAT 


TGCAATCCCA 


ACCCCTGCAA 


ACTGGACCAT 


TTCGATATCG 


13200 


TTATTAGCAT 


CGCCCATGGC 


CATAATCTCT 


GAGGAATCAA 


TCTTCAAAAT 


CTCAGCTAGT 


13260 


CGTGAAAGAG 


CAGTAGCCTT 


TGTCGTTCCA 


AGCGGCATTG 


CTTCATAAAT 


GACAGGCTGC 


13 3 2 0 


GAACGAACTC 


CACTGAATCG 


TTGGCAAAGC 


TCTTCAGCAA 


AACGCTGCTC 


AAAATCGTCT 


13380 


GTTTGTTCTT 


TTGTTCCTAA 


ACACATACCT 


TGGAACATCC 


GGAACTTTCC 


ACT ACT CG CT 


13440 


TCTTCAAGAG 


AAATTTCAGT 


CAGGTCTGAA 


AATACTAGTT 


TAGCATCATT 


TTCAATAACT 


13S0O 


TGATTGGGCT 


TGTCACCGAG 


AACAAAATAA 


TGTGACTCGT 


CAAAAAGTGT 


CAACTGAACA 


13560 


TCACTCTTTT 


CAGCAAGGTC 


ATAGAGGTAT 


TCGATGTCAG 


CTGGACTCAG 


TTCTTTCCAG 


13620 


TCAACTAGAC 


TCCAATCACT 


GGTCTGGTGA 


GTTGAACAAC 


CGTTGTTAAC 


AATAATATAT 


13680 


TCGTTCTGGA 


GGTCAAGCTC 


CACTTTTTTG 


TAGTAGGGGA 


GGACACCGAA 


AAGGGGGCGA 


13740 




G AAC C AGTTT 


GACACCTTTT 


TCAATCCCTT 


TGTGAATAGC 


AGT AATGTGT 


13800 


GCTTGTGGGA 


TTTCCTTGGC 


TTCATTGAGG 


AGGGTGCCGT 


CCATATCCAA 


GGCTAGTAGT 


13860 


TTAATCATAG 


GTCTTCCTCT 


TTATCTTTGC 


TATTATTATA 


GCATATTTTG 


GAGAAGAAAT 


13920 


TGATAGAAAG 


CTTGAGACTA 


ATTG ATTTT A 


TAGTTTAAGA 


TGTTTTGATG 


ACAATTCATG 


13980 


ATTTG AAG AC 


GATATTTCGC 


AAAGATATGC 


T AT ACT ATG T 


TTGTCAATGT 


TCCAACTAGA 


14 0 4 0 


CAAATTAAAA 


AACCAACTTA 


ATATAATAGT 


TTTTTTGTAA 


GTAGGTATGA 


GTAGCACATT 


14100 


ACTCAACTAA 


TCTG AAvj AA I 




ATATATCATG 


ATTTT AATGA 


CAAAAAATAT 


14 160 


AAATCTAACA 


AATGAAGAAT 


TAGAGCTGAT 


ACAAGGTGGA 


GCAGATCCAT 


ATGGT AAAAA 


14220 


TCCTAATGGT 


AGGTACGATT 


GGGAAATAGA 


ACCAGTATTA 


ACTCTGCTGG 


TTCATGGATT 


14230 


TTGTCCCAGA 


GGCACCTATG 


ATTCAGGATA 


TATTGGAGGA 


GGTAATCATC 


TTTGCAAAGG 


14340 


AAGTGCTGCG 


AG ATT TT AAG 


TAAAA7TTAT 


TAGGAATATG 


AAGAAACAAG 


GGGAGAAAAC 


14400 


AGAGGATTTA 


AT ATG AAAAA 


ACGAGCTATT 


CAAATTTTAC 


TAGCATTGTC 


CTTAATTTTT 


14460 


TACAAATCAA 


CTTGGTTTTG 


GAGGCTTTTC 


AATTATCTCG 


CAAAGCCCTA 


TCTACCAGCA 


14520 


AGTCGTGAAT 


TTTTTCAGAT 


TCTGCTTTTG 


ATGGAGAGCG 


GACTTCTTTT 


CTTAGCGGTC 


14580 
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ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14 640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGT AT 14 700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 14 7 60 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14 820 

ATTTATCGAG GtCTGCTTAT GACAACCTGT TGCAAAAACT C ACCTTGGT A CG 14 872 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10223 base pairs 

(B) TYPE: nucleic acid 

(C) 3TRANDEDNESS : double 

( D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTCGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 6 0 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGT A TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTT ACTG ACT TCGTCAGTTC TATCTCCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTT CT ATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 300 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCT ATCTG CAACCTCAAA 3 60 

ACAGTGTTTT GAGCAGCCCG TCGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 48 0 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 54 0 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 660 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 7 20 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT ACCCCCGACC CTGCATTTCC TTTGTTACTC 7 80 

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 84 0 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGGACTACCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 1020 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AACAGCATCT 114 0 
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TAGTCTGATA ACCACCTGCA ACCAAGGCTT TATCCACCTC AATCATGGCA ATCTTGAGAA 1200 
TATCTCCTCC CGAACCCTGG AT AGGTG AGT TGATAGCAGT TCGCTCCGCA AAACCACGAA 
TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGCCTCTCTA 
CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTC AT C CAT GTACTTTTTA ATACCTGGAA 
AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 
TAG AC AAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 1500 
GACGGTCGTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 
AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC TTAGAAATAT 
GCGCCAAAAC GCGCAATTCA AT CTGTG AAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 
ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GG AAT ATTTT 
GCAAGTTTGC ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC ACATAGCGAG 
TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAAGCC AATTACATAA GTAGATTGAA i8 60 

TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGAGCAATA GGAGCGAGAC 
GCTCTAAAAC ATCCACTCCT GTCGAATAAC CTGTCTTGCT TTTCTT AGTG TATTCTAGAG 
GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGACTTG ACATTAAACT 
CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC T C ATTTT C AG 
CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 
GGACAAAAGC CAGAGCTTGC TCCATATCAT AAACAAGCTC TAATTGCCCA TTTTCGCTGA 
GTTTTTCAAG TAAAATAGCC TCTGTTTCTA CCAAAACAGC AAGTTT AC AA CCTAAGTCTT 
CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCCTAG AAAGTTTCAT 
CATCAACCAA GTAAGTCTGA C C AT AAAG AC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 
CAGTCGAAAG GAGGTATTTA GCCAAACGGA * TGTCAAAAGC ACCCCCCTGC AAATCCACAC 
CAAAACGTTC CAAAAGAACT TTAACCTTCT TAAACTCATA AACTCTCAGA GATGTTTTTT 
CTAAGAAATC CTTG AAAAT C GGCTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 
TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGGTAATTC TCACCAAAAA 
GCTCAAAGTG GAAGATAGAC TCTTCACTCA GC AT ATCTTG ACTGATTTGG TCAACAATAG 
TAAAATCCAA ACTCTCAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTGTT 
TGAAGCCCAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA CTATAGACCA 
AGTCCTCTAA ACCAATCCCA ATCGGTGCCT TGGTATCAAT GGTCGCTACT GTTTTAGACA 



126C 
1320 
1380 
1440 



1560 
1620 
1680 
1740 
1800 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2 94 0 

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3000 

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 306 0 

GATCGATAAA CTGAGCTCCT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCCTAAAGG 3 120 

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 3180 

GAATCAAATC CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTCCTAGCT 3 240 

TATCCAGCGT CCCAATGATG TCATCCCCCT CATACTGAGC CAGATCATAG TGACGAATCC 3 300 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3 3 60 

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 3 42 0 

CAAAAGCCAC CAAAATATGA CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 34 80 

GAAAACCATA AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 3 54 0 

GATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3600 

TCTTATCCAT ACACCCATTA TAAACGAAAG AATCAAAAAA T AC C ATTGGG AAGACCTAGA 3 6 60 

GCAACTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG CCAGAGAATT TAGTAAACCT 37 20 

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 37 80 

AAAATATTTC TATAAATTAA TTTGA C TTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3840 

TCACTATAGA AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATGA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AG AAAT C ACT TGTCAATTTA TTCGCGATAT 39 6 0 

GCTGGACTTG CCAGCAAAAA ATGTGACCAT TTTGG AGGG A AGCGATATTC AC GT ATT ACT 4 0 20 

CTCCATGCCT TACTCGGTGC AGGATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 4 080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CACAATTTTT TCATCAATCA 4140 

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 4 200 

AG GTG AT ACT CACTAGAGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 4 2 60 

TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 43 20 

AGGTGAGGTA TTGGCGATTA CCAACAATGG ACAGGAAAAC CATCTGGTTA AGATGGCATT 4 3 80 

CTTGG AAT T A AAAAATACAG AGAAACCACC AAAGACAAGG TTCGCAAGCC ATGCTTGGAG 44 4 0 

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACT AC AT ATG 4 5 60 

CGAGAAGAAC AAGT CTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4 62 0 

CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG . CAGAAAGGAA ACTTTTTGCC 4 68 0 
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TTCCTAGACA TAGTACGCCA AGGTCTTCTG ACTTCTGAGG TTGCCAGCCA GCAATTAGGT 474 0 

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4 800 

AAATCCCCTA TGGATATTAT GCACCCTATT TTGTGTAGAA AAAAAGTCCC ATATGACCTA 486 C 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4 920 

AC AAAATT A C TAGACATTAA AGACCCTAAT CTCCAGATTT TAAACATCAT CAATAAGGAT 4 980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTGCCC TGAGTGCCGA 5 040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5X00 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCCATTCA ACTGCTATCA CTGTTCAAAA 5X50 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 52 20 

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 52 80 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5 3 40 

GTGGAGAAAG TGGTCG TTTT TCATGAATAC GTACGATAGC ATCCCCTAGG AGATGAGCGA 5400 

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CACATAGATC GT AT CCAAAA 54 60 

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 5520 

GGTGCGTACA GCTTG CATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTGCCGCAT 5580 

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 564 0 

CCTTACCGAT GATATTCATA ACTTCACTAG T ATTC ATCTT ATCAACGCTA CGACCTTTAT 5700 

CAATAATAGC GATAGATGTT TTCAAAAATT CTGCCAACTT ACGAGCACCA GTCACCCCTC 57 60 

CATGGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACCC TCAAAATAAT 582 0 

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5 880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 59 40 

CGACAACTTT TGAAGTGATT GGCTCACGCG CTCTCCCCTT TCTATCCTGA CGTGCATACC 6 000 

CATAGTAAGG CATGACAACA TTGACAGATT CTGCACTCCC ACGCTTCAAA GCATCTACCA 60 60 

TAATCAAAAT TTCAAG C AG A TTGTCATTTA CAGGCGAACT AGTTCATTCT AAGATAAAGA 6X2 0 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGCC 6X80 

GAACACTTCA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 62 4 0 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATCATTTCC TCCGGTATAT 6 3 00 

ATGTATAACT TGTGCTTTTC ACAAGATTTT GCATCTACCA TTGTAGCGCT TTTT GCACT A 6 3 50 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 64 2 0 
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AAATATTCTA CGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TGATAAAAAC 64 80 

C AT AG AG AGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6 54 0 

C AAAAGG AG A GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

GTTTCTTGCC AACCTTCTGT CCTTTATAGC CTCTGTAAAG CAAGTAAACA CCTACTACAG 66 60 

CTAGACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GAT AC G CT CT AAAATTCGCG 67 20 

AAATAGTAAA GTCAAAGCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 67 80 

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AGATTTTAAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGCTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 6960 

AATCATGGAT ATAAGTATTA GC AAAAC T AG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7020 

ATAAAGACAG AATC AAAAC C TTCTTTGCCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7080 

ATAAAGAGCC GGAATCATGT AAAAGAGAGC T AG AAAG ATA GAAACTCCCA TTAGTCCATA 7L4 0 

TTCCGCACCT CACCTCGAAC CGTCCGTATA GTAGATCGTT TCATACTGTA AACCAATACA 7 2 00 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC AT AC ACTTTC TTTCTAAATG 7 260 

AAGTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA AC ATT GT AAA TGGCACCAGA 7 3 20 

GATAATGACA TCGTCTGGT A TTAGGGGAGA CTCGATAAGC AGTTG CAT AT CCTCGCGTAC 7 380 

ACTCTCTTCT AT ATCTT GG A AGGGCAAGAA GTCCTGGTCT GACACATCGA CACCCAATTC 74 4 0 

TTCCTTCAAA TACTCCTGAA AAGGTTCATT TTCAAAGGTC TGACCACCAC AGTCTGTATG 7SO0 

ATGCAATACC ACAATTTCTC TTGTCCCCAT TTGTTGCTCG GAAATAACTA G AG AACG AAT 7 5 60 

CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 7 6 20 

ACCTAGAGCT TGCG C AACGT GCAAACGTGA GTCCATACAG GTCACAATGC CTACTCTGGT 7 680 

TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 77 4 0 

CATAAACTCT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 7 800 

TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7 8 60 

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7 920 

GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 79 8 0 

CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCCA CAAAACATTC 804 0 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 3 LOO 

AATCGC AGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTG ATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AG C CAT AAT C AAGCTAGCAC GGTTAAAATC 82 2 0 
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AAGTCCTGTC GTAGTACGCT TGGCATTTCC AAACATCCTC GGTGTTACCA AAGCCTGAAC 8 280 

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGCAGGAAC CAGTCGCCCC 8340 

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CCCCTCTCAA 84 60 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC AC CGT AT CCA CCATATGCTC 8520 

CAACATACGA GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 8580 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 3640 

AGACCCCTGC ACCCCTGAAA TCTCAGGAGA CATGATGGTC TGGATGGAAT CAATAATGAG 8700 

AAAGTCTGGC TGG AT ACGCT CCACTTCTGC ACGAACACTC TGCATATTGG TCTCTGCATA 87 60 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CGTAGTTTAA TCTGCTGCGC 8820 

AGACTCCTCC CCACTGACAT AGAGAACTGT CCCCACTTGG GACAACTGGG TTGAGACTTG 8880 

TAGGAGAAGA GTTGATTTCC CAATCCCAGG ATCCCCACCG ATAAGGACGA GACTTCCTGG 8940 

TACCACTCCG CCTCCAAGCA CACGGTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000 

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GCGCTTGGTT TTCTCACCTG TCAAGGACAC 9060 

ACGCGC ATT C TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 9120 

GTTGCCGCAA CGTCCCAGAT ATTTAGGCGA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC TAACTCACAC TCAATCACTT 924 0 

GGCAAAAATC AATCTTCTCA TTTGGCACAA ACTGGCGCAT GAGCATTCCA TGACCAACAA 9 3 00 

CTACCACAGT CTGATGTTCT CGATACTTAG ACATACATTC TAGAAACCGA GACTTCATTT 9 3 60 

CCGTAGCTGT CTCATATTCA ATAGCACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 94 20 

ACAGTCTTCT AGCTGTTTCA AACTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 94 80 

ATAAAGGCTC TACTCTTAAA GGAAGACCCG TAGCACAGAC CACATACGAA GCCGTTTCTA 954 0 

AAGCTCTTCT GACTGCAGAA GATACGATTA TTTCAGCTGA C GAG AG T AAA GGATTTTTGC 9600 

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TCCCAAATCT ATCCCAAATC 9 6 60 

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 97 20 

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACGCCATC AGCTGCATCT 9 7 80 

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9 840 

ACAACCTCTT GGTCTGTGAT -ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900 

TTTCCATAAT AATCCCCATC AATCACTCCA ACTGAGTTAA TTAAAACCAA GCCCTTCTTA 9 960 
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CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTCCAT ATAAGCCTTA 10020 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGT ACGC AC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTCGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATT C AAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

C TAG ATT ATT CT AT ACTT AT TTA 10223 
(2) INFORMATION FOR SEQ ID NO: 74: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNES S : double 
iD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTG TCTTG CTTGCCATCG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG ATACAGCAGC TCGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCGATCCA 240 

GCACCACAAA CTGGACAAGA TCGTTCAACT GAGCCAACTA CTTCT ACT AG TCCAGTAACA 300 

ACTGAAACTA AGCCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 3 60 

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 4 20 

TCT G AAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAACAAAGA TGACTACGGC 4 80 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTACCTT CCTCATCAAC 54 0 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTCTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 6 60 

ACTGTTCGCG TC AACT ACT A CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 7 20 

TGGGGAGATG TG AAAAATC C AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 7 80 

ACAGGCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG ACAATTTGGA 8 40 

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 9 60 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 102 0 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140 
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CAAGCTGGTA 


AG AAAGT G AC 


CTACAGCGGA 


GATTTCTCTG 


ACACAAAACA 


TCCTTATACT 


1200 


GTTAGCTACA 


ATTCCGACCA 


ATTCACTACC 


AAAACAAGCT 


GGCGCCTGAA 


AGATGAGACA 


1260 


TACAGCTATG 


ATGGCAAACT 


GGGAGCTGAC 


CTAAAAGAAG 


AAGGAAAACA 


AGTTGATTTG 


1320 


ACCCTTTGGT 


CACCAAGTGC 


TGATAAGGTT 


TCTGTTGTTG 


TCTACGACAA 


GAATGACCCT 


1380 


G AC AAAGT AG 


TTGGAACTGT 


CGCTCTTGAA 


AAAGGGGAAA 


GAGGAACTTG 


CAAACAAACT 


1440 


CTAGACAGCA 


CAAACAAACT 


CGG AATC AC A 


CATTTCACTG 


GCTACTATTA 


TCAATACCAA 


1500 


ATCGAGCGTC 


AAGGTAAAAC 


TGTTCTTGCA 


CTCGATCCTT 


ACGCTAAATC 


TCTTGCTGCT 


1560 


TGGAATAGCG 


ACGATTCCAA 


GATTGACGAT 


GCCCATAAAG 


TGGCTAAAGC 


CGCCTTTGTA 


1620 


GATCCAGCTA 


AACTCGGACC 


TCAAGACTTG 


ACTTATGGTA 


AGATTCACAA 


TTTCAAGACT 


1680 


CGTGAAGACG 


CCGTTATCTA 


CGAAGCTCAT 


GTGCGTGATT 


TCACTTCACA 


TCCTGCCATT 


1740 


GCAAAAGACT 


TGACCAAACC 


ATTTGGGACT 


TTTGAAGCCT 


TCATTGAAAA 


ACTAGACTAT 


1800 


CT C AAAG AC T 


TGGGTGTAAC 


CCATATCCAG 


CTCCTTCCAG 


TCTTGTCTTA 


CTACTTTGTC 


1360 


AATG AATTG A 


AAAACCATGA 


ACGCTTGTCT 


GACTACGCTT 


CAAGCAACAG 


CAACTACAAC 


1920 


TGG GG AT ATG 


ACCCTCAAAA 


CTACTTCTCC 


TTGACTGGTA 


TGTACTCAAG 


CGATCCTAAG 


1980 


AATCCAGAAA 


AA CG AATC GC 


AGAATTTAAA 


AACCTCATCA 


ACGAAATCCA 


CAAACGTGGT 


2040 


ATGGGAGCTA 


TCCT AG ATGT 


CGTTTATAAC 


CACACAGCCA 


AAGTCGATCT 


CTTTGAAGAT 


2100 




ACTACTACCA 


CTTTATGGAT 


GCCCATGGCA 


CACCTCGAAC 


TAGCTTTGGT 


2160 




TGGGGACAAC 


CCACCATATG 


ACCAAACGCC 


TCCT AATTG A 


CTCTATCAAA 


2220 


TACCTAGTTG 


ATACCTACAA 


AGTGGATCGC 


TTCCGTTTCC 


ATATGATGGG 


AG AC C ATG AC 


2280 


CCCGCTTCTA 


TCGAAGAAGC 


TTACAAGGCT 


GCACGCGCCC 


TCAATCCAAA 


CCTCATCATG 


2340 


CTTGGTGAAG 


GTTGGAGAAC 


CTATGCCGGT 


GATGAAAACA 


TGCCTACTAA 


AGCTGCTGAC 


2400 


CAAGATTGGA 


TGAAACATAC 


CGATACTGTC 


GCTGTCTTTT 


^ r\\jr\ 1 vjnv. t\ I 


CCGTAACAAC 


2460 


CTCAAATCTG 


GTTATCCAAA 


CGAAGGTCAA 


CCTGCCTTTA 


TCACAGGTGG 


CAAGCGTGAT 


2520 


GTCAACACCA 


TCTTTAAAAA 


TCTCATTGCT 


CAACCAACTA 


ACTTTGAAGC 


TGACAGCCCT 


2580 


GG AG ATGTC A 


TCCAATACAT 


CGCAGCCCAT 


GATAACTTGA 


CCCTCTTTCA 


CATCATTGCC 


2640 


CAGTCTATCA 


AAAAAGACCC 


AAGCAAGGCT 


GAGAACTATG 


CTGAAATCCA 


CCGTCGTTTA 


2700 


CG AC TTGGAA 


ATCTCATGGT 


CTTGACAGCT 


CAAGGAACTC 


CATTTATCCA 


CTCCGGTCAG 


2760 


GAATATGGAC 


GTACTAAACA 


ATTCCGTGAC 


CCAGCCTACA 


AGACTCCAGT 


AGCAGAGGAT 


2820 


AAGGTTCCAA 


ACAAATCTCA 


CTTGTTGCGT 


GATAAGGACG 


GCAACCCATT 


TGACTATCCT 


2880 
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TACTTCATCC ATGACTCTTA CGATTCTACT GATGCAGTCA ACAAGTTTGA CTGGACTAAG 2 9 40 

GCTACAGATG GTAAAGCTTA TCCTGAAAAT GTCAAGAGCC GTGACTATAT GAAAGGTTTG 3000 

ATTGCCCTTC GTCAATCTAC AGATGGCTTC CGACTTAAGA GTCTTCAAGA TATCAAAGAC 3060 

CGTGTCCACC TCATCACTCT CCCAGGCCAA AATGGTGTGG AAAAAGAGGA TGTAGTGATT 312 0 

GGCTACCAAA TCACTGCTCC AAACGGCGAT ATCT ACGC AG TCTTTGTCAA TGCGGATGAA 3180 

AAAGCTCGCG AATTTAATTT GGGAACTGCC TTTGCACATC TAAGAAATGC GGAAGTTTTG 324 0 

GCAGATGAAA ACCAAGCAGG ACCAGTCGGA ATTGCCAACC CGAAAGGACT TGAATGGACT 3 3 00 

GAAAAAGGCT TGAAATTGAA TGCCCTTACA GCTACTGTTC TTCGAGTCTC TCAAAATGGA 3 3 60 

ACTAGCCATG AGTCAACTGC AGAAGAGAAA CCAGACTCAA CCCCTTCCAA GCCTGAACAT 34 20 

CAAAATGAAG CTTCTCACCC TGCACATCAA GACCCAGCTC CAGAAGCTAG ACCTGATTCT 34 80 

ACTAAACCAG ATGCCAAAGT AGCTGATGCG GAAAATAAAC CTAGCCAAGC TACAGCTGAT 3540 

TCACAAGCTG AACAACCAGC ACAAGAAGCA CAAGCATCAT CTGTAAAAGA AGC GGTTCG A 3600 

AACGAATCGG TAGAAAACTC TAGCAAGGAA AATATACCTG CAACCCCAGA TAAACAAGCT 36 60 

GAACTTCCAA ATACAGGAAT CAAAAACGAA AACAAACTCC TATTTGCAGG AATCAGCCTC 3 7 20 

CTTGCCCTCC TTGGTCTCGG TTTCTTACTA AAAAATAAAA AAG AG AACT A AACTAGCCCT 3*780 

CCTATAGAAA AATCCCCCAA GCATTATAGC TCGGGGGATT AATTTTTGTA CAATATTTGT 38 40 

TGTCCTAATA AACTTGATTA GGATTTTTTA TTAAGCCTCT TTCATAGCAA AATAAGCTCG 3 900 

TACTTTGGGT GCAACTTGTG TTCCGAAGAG TTCAATAGCT CTCAGAACCT GGTCATGAGG 3 9 60 

C AT AG AAC C A AGCGGTAGAT GAAGCATGAA GCGGTCCAAT CCTAAATCCT CTATCATGCG 402 0 

AATCAATTTT TCGGCCACCT GATCTCGATT GCCAACAAAC ATGGCGCCAT TTGGCCCTAC 4 080 

CTGCTCCAAA TATTGCTCAT AACGCAATTC CTGCCAGTGC GCACCCTCTT TGGAAATAGC 4 140 

ATCCACCACT TGCTTAGTCG GATGGAAATA ATCTTTCACC GCCTGCTCAC CATCTTCCGC 4 200 

AATCCACCCC C AAG AATGG G CTCCCACTTT CAAGTCTTTG TCAGCATGGC CCCTTCGCTT 4 2 60 

CCAATCTCAC GATAAGCCTG AATCAACTTT TTAAAATAAC GTGGATTACC ACCAATAATA -13 20 

GCATATACAA TCGGTAGACC AGCCTGAGCA ATCTTCACTG TTGATTCGAC ATGACCACCT 4 380 

GTACCTATCC ACAAGGGCAA TTTGTC CTG A ACTGGACGAG GATAAACTTC TTTACCAGCA 44 40 

AT CGTTTC AG TCAATCGACC TTGCCAGTCT AACTTGGTCT TTTCATTGAC TAACTGAAGC 4 500 

AAGTCTAATT TCT CATC AAA AAGAGAGTCG TAGTCTTTCA AGTCATAACC AAACAGAGGG 4 5 60 

AAAGATTCCG TGAAAGAGCC CCTTCCAGCC ATAATCTCCG ATCGTCCATT TGACAAAGCA 4 62 0 

TCGATAGTGG CATACTGTTG GAACAAACGA ATCGGGTCCA TGCTTGACAG AATGCTGACT 4 630 
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CCACTGGTCA AACGGATTTT CTTCGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 474 0 

GCTGATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 4 800 

TTGTCACCCA GCTCAATCTC TGCCACCAAC TGGCGAATCC GTTCAGCATG ACTGTAAGTT 4 8 50 

TGTCCAGTCC CTTCAAGCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4 920 

GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 4 980 

AGTACAAGCA ACCGATTTGC TCATTAGAAA AACCCTAGAT AACTAGACTT TTTTAGCTTA 5040 

TTCTACCCTT ACTGACTTGG CAAGGTTACG TGGTTTGTCC ACATCGAGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTGCGTTGG TACGACCATT GAAATTGGTG AGAGGTATGG 5160 

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATCTTTGGCT AC A TTCTCTT CTGCGATAGT 5220 

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 52 80 

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

CTGCTTGAGT TCTCCTGCAG CAAAGCCTTC AC AC TGG AT A TAAGAAATCT CTTTGAGTTT 54 00 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT C CG ATGT AAA AGGCGTTACG 5460 

AGTTGTTTCA AGAAGTTCAC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 5520 

TTCG AT AG AC TGAGCTACGA TTGACAATTC ATGAACCAGG TCAAAGGCTT GCGCTTTACC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCG A TTTGCGCTGT 5640 

ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 57 00 

GGCTTCACGT G AG AGGGTTG AACCTGGAAC GTTT GTC ACT GTTAAGCTTG G AATTC C CAT 57 60 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCAGATT GGCTGATAAA 5820 

GATGAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5880 

AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 594 0 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCC TCTTGAACAG CCTTAATGAT 6000 

ATCTGGCTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATCA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 6X2 0 

ATCTGACAAG T C AAGTTC AG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 624 0 

TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG ACCATCTTAT AGCCTTCTCC 6 300 

AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 6 3 60 

GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATCTCA AGGGCTTTTT TGAAGGCTTC 64 20 
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AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 64 80 

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC AAGGTATTCT TCCTTGATTT CAAGATAGTT 654 0 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TGTCCGATAC CAGTTGTTCC 6660 

CTCAACACCA CCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 67 20 

GTTATCAGCA CCATCTACGA CAAAAATTCC CG CAGAAT C A TAGCCACGGT ATTCAAGCTT 6*7 60 

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA T AT ACG AC AC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGACT 6960 

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 702 0 

AATCAGTCAG AGTC CTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080 

TGATTTCTTC AGACCACCTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 72 00 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 72 60 

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACGGAATTTT TCTTCCCCTT CAACACCCAT 7 3 20 

ACCATTCTCA GTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 73 80 

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGGTGA GAATTGATTT TACGTCCAGG 7 4 40 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGACTGGA CTCTCTGGAT GCTTAGCAAA 7 500 

TCGAGGAGCC ATAACACCCA AAGGTTGATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7 560 

ATCACGAATG AGTTCCAACT CTTCCTCTCT ACCATCAGGT AAAAGACCGT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GG ATCTAAGA AAGATTGGGC 7680 

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCAGCA GGATGCTGGC TACGTGGATA 77 4 0 

AGCCGGTGTC AAGTTGAGCA CAATCCCAAT CTTGCAATCA GGCAAAAGTT CATGGCAAGC 7 800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7 8 60 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7 92 0 

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7 980 

ATAGTCTTCA TAGGCTGAGA CTGTCG CCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 804 0 

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CCAATTCCTT TAG CCTT AAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCG AAAGG C T GTGTGACCAG TCTCTAACAA B2 2 0 
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AAGCTCAATA 


TCCCGCTCCC 


AATTTTCATA 


AAAAGTCGAT 


CTCTTATCTG 


AACCAATCCC 


8280 


ATTATAGTAA 


CGATTTGGCT 


CCACTTGGAA 


CCAGTAATCC 


CAGAGATTGT 


CTCCCTTACC 


8340 


GTCACCAGCT 


ACACGTCCTT 


CTGTCTGCGG 


TCCAGAAGTA 


GAGGATCCCC 


AGACAAAATC 


8400 




CTTAGCATAC 


ATTTACCTCT 


TTATCTACTC 


ATTTCTCCCA 


TTATACAGAA 


8460 


AAAACAAGGT 


AAAAACTAGT 


TACATTTTTT 


CCTTGTTTTT 


CTTCTGATTA 


TACTTTTTAT 


8520 


TTPTTGCTTA 


GGATTTCAAG 


CGTTTCAAGC 


ACGTTATCTG 


CATGAACCTC 


AATGGTGTCA 


8580 


fTAGTTGCCT 


TGATCTTAAC 


TTCTACAATG 


CCATCGGCCG 


CTTTTTTACC 


AACAGTGATA 


8640 


CGGATTGGAA 


GACCAATCAA 


GTCACTATCG 


CTAAATTTAA 


CACCGACACG 


TTCGTTACGC 


8700 




AGACTTCATA 


ACCAGCTCCC 


ATCAAGCTTG 


CTTCAAGTTT 


TTCTGTCAAG 


8760 




CTTCATCCTT 


GACATTGACA 


GTAATCAAAT 


GCACATCAAA 


TGGTGCCAAT 


8820 


TCTTTAGGGA 


AATTGATTCC 


CCAAGCGTAA 


CGGTATTCAC 


CTTTTGGCGT 


TTTGTTAACA 


8830 




CGTGTTGCTC 


CATCACTGCT 


GAAAGAAGAC 


GGCTGACACC 


GATACCGTAA 


8940 




TGATTGGCAC 


AG C ACG AC C A 


TTTTCATCCA 


AGACATCTCC 


TCCCATGCTT 


9000 




CAGTTCCGAG 


TTTGAAAATA 


TGACCGATCT 


CAATACCACG 


CGCAAAGTTA 


9060 


AGGACACCTT 


GTCCATCTGG 


GGAAATTTCA 


CCCTCACGAA 


CTTCACGGAT 


ATCCACATAT 


9120 


TC^CCAGTAA 


AATCACGGCC 


TGGGTTCACA 


CCAGTCAAGT 


GCTACTCATC 


TTCGTTAGCA 


9180 


CCGACAACTG 


CATTGCGAAC 


ATCTTGTACC 


TTACGATCTG 


CAATAATTTT 


AATATTCTCT 


9240 


GGCAAACCAA 


CTGGTCCAAG 


TGAACCAAAT 


CCTGCTTGAA 


CAACATTCGC 


CACTTCTTCT 


9300 


TCGCTAGCAA 


CGTCAAAGAA 


ATCTGCTCCC 


AAGTGATTTT 


TCAACTTGAC 


TTCGTTGAGT 


9360 


TGGTCATTTC 


CAACTAGAAG 


CCCTGCAACA 


AGCTCACCAT 


CTGCAATGTA 


GAAGAGGGTT 


9420 


TTAATCGTTT 


GTTCTTCTGG 


AACATTGAGG 


AAGGCTGCAA 


CTTCATCAAT 


TGATTTAACA 


9480 


TCTGGCGTTG 


C AAC ACG AG T 


AACTTCTTCT 




PACGGTTGCT 


TGGTTTGTAC 


9540 


TCGTTTGTTG 


CCATTTCTAA 


GTTAGCTGCA 


TAGCTAGACT 


CACTTGACTA 


AGCAATGGTA 


9600 


TCTTCACCAG 


AGACTATCCA 


TTTGAGCAAT 


TCTGCCTTGA 


T T T CT T CTT G 


CACTTCTGCA 


9660 


GGAATTTCGT 


CAAATGAGGC 


AACTG ACTTG 


TCCAAGACAA 


CCCAGCGGTC 


AAGGTCTGTA 


9720 


CGAGCAGATG 


TAATGGCCAT 


AAATTCTTGG 


CTATCCTTAC 


CACCCATGGC 


TCCACCGTCA 


9780 


CCAATAATAG 


CCTTGAAGTC 


TAAACCACTA 


CGAGTGAAAA 


TACGCTCATA 


GGCTGCTTTG 


9840 


TACTCATCAT 


AAACACTATC 


CAAACTATCA 


TAGTTAGCGT 


GGAAAC7ATA 


AGCATCCTTC 


9900 


ATGATAAACT 


CACGTGTACG 


AAGAAGTCCA 


TTACGCGGGC 


GTTTTTCATC 


ACG AT ACTTG 


9960 
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GGCTGAATTT GATAAAGGTT GAGTGGCAAT TGCTTGTAAG ATTTAACAGA ATCACGGACA 1002 0 

ATAGCTGTAA AGGTTTCTTC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 100 80 

TTTAGTTTCT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 1014 0 

GCACTAAGAA GGGCTGGAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTGGCGC 10200 

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA AT AAAC ACCT 102 60 

GCTGAAACTT GGCGAACATA ACCAGCACGC AAC AT AAG AG CATGGCTGAT AACTTGAGCA 10 3 20 

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTGTTT CAT AATATTCCTC 103 30 

GATTATCTAA AAAAGAGTCG C AT AATGT C A TTCCAAGTCA CAGCAATCAT C AAG AC AAC C 10 440 

ATGATGACCA CTCCGGCCAA G GTG AC AT AG GTTTCAATTT CTTGTTTCAA TGGTTTGCCG 10 500 

CGGATGGCTT CTAGGATATT GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 105 60 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 106 2 0 
C C ATTTTT AG CAGCATCACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC - 10680 

AAATCTGGTT GGAAAATCAG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

GCAGTTGTAA AACCACCTAC AAACATGGAT AGAAAATCTG ACTTAACCCC CGGTTGAACA 10 800 

CCTAGAAGGT AACGACCTTG ACTATCTTTG GGTGTAACAC TGACTTGTTT GTCACTCCCC 10 8 60 

TTTTCAGAAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 10920 

GCTTGGATCA AGCTTTCCCA CTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTCTG CC 10980 

ATTTCTGGTA CTCCTACCTT GGCCAAGGCA CCTTGGGGCA TGATATGCAA CTGATTGGTA 1104 0 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC GACACCTAAG 11100 

ATAAAATTGT TCATAGGACC TGCAAAATTG GTAATCAGTT TGCCCCAGAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCACTAC CATCTGCTTC CACAACCGTT 11220 

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT GATAAAGACC 11280 

TTGTCTTCAA AATCAAACTG GGTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTTI A 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA TCATCAGCAA GTGTCAAACT AACAGGCGTT -.14 00 

CCTGTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 11460 

CGCAAGATTC GAATGGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520 

CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GT AG AAGTG A 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAACCAGAA TAAAGGTTAA AATTCCGAGC 11640 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TCGAAATACA 11700 

AGCAACATAC TATCGAAACG ATCCAAAACA CCACCATGTC CAGGGATAAA TTTCCCAGAA 117 60 
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TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 11820 

ATGCTAAAGA AAATAGCAAA GACTGACATC TTGTAAATTC CATATGGAAG AGCAACTGTA 11880 

CTGTCAACTA TCATAAGGAT AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940 

CCCTCAAGGG TTTTATTACG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12000 

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAACGCTAA GAGAGCCTTG 12060 

TCCAAACCTG CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 1212 0 

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCGTTCCT 12130 

AACATGATTG AAATCAAAAC ACTATACGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 12240 

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GACCCCCTCC 12 300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 12 360 

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATT GGT AG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12 480 

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT ACCTTCCTGC AAGGCCGCTT 12540 

CGTCAAAATC AGGCCATAAG GTGTCCGTAA AATAAACCTC ACT AT AGG CT CCCTGCCATG 12600 

GAAGGAAATT GCTCAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT GGGTCTCGTA 12660 

AGTCCTTAGG CAAATCCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATGTCAC 12720 

CTGGGTTGAT TTTGGCATCT AAAACATCCT GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12 780 

CAGCACGTCC ACCATAGTTA AGAGCAAAAT TAAGAATCAA TCCTGTGTTG TTCTTAGTCA 12840 

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGCTTAGG CAGGCGGTCT CTCTCCCCAA 12900 

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC AT AATT AT C A TAAAACTCTA 12960 

CTGGCAAGTT CATGATAAAC TTGACTTCCT GATCTGGACG CGTCCAGTTT TCCGTAGAAA 13020 

AAGCATAGAC CGTAATAACC TTGACGCCCA GTTTGTTGGC TGCCTTGGTC ACGGTTTGCA 13080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTG CATACGTTTT TTAGCCCAAC 1314 0 

GGCCATTGCC ATCCATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA ACATGATCTT ATTCCTATTC AAAAATCTAT 13 260 

CGTTT C ATT A TACCATATTT CCCCATTTTC TTCTATCACT AAGCTATTTA TTCTCAGGCA 13 320 

CCAAGCCCAT TTTTCAAAAA AATAAGCCGC CTGATTGGGC GACTTTATTT TTATAGGGAG 13 38 0 

ATTATTATGA AAAAGTTTTA GGAGTTTAAG TTAAGGTCTT CTTAACTTAT GAACTTAGTG 13440 

TACACTCCCT AGCTTAAAGT TTCCTTAAGT ATTTTTAAAA ATCAAATTTT TCCATTTCTC 13 500 
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CTGCCAATTT TTCTTGCATA AACGTGTTTC ATAGAGTTCC ATTCGGTCTT CATTTTCTAA 13 560 

GAAATGAGGA GTTGGACGAA CTTGAAAATT CAAAATATCC TCCAAACCAT AAGGTACATA 13 620 

GAGTTCAAAA TCT AATTCTT CATTCAAGCG CAGTCCAACT GCCGTACACC GTTCTGGATA 13 680 

CTTACTCATA G CATC ACG AG AACTGGT ATA GGAAGCAGTG TGAGGACTGT GCTGATGCAT 1374 0 

ATAGACCTGA TTTTTCAATT CCCACTGGTA CTGAGGAAAA TCCTCTCTCA GCTTTTTCTC 13 800 

CAGTAATAAG GTTTCCTCAT AAGAAAAATC TGGATCAAAG AAAATCACAT CTATATCTGT 13 860 

TTCATGATCA AAAGGGGATT TGTCTGACAA AAGATTCCAG ATGAAATTTC TGACAGAACC 139 20 

TGCTGCCAAC CACGAGTCTT TCAAACCAAG GTCTCGGATG ATCGTCAGAA TGGCCATCAT 13 980 

ATCTGGACTT TCTCTAAAAG CCTCTAAGAT TTCTTGCTTA TTTTTCACTG TATTCATAAC 140 4 0 

CTAAGTGCTC ATATGCCTTA GCAGTCGCCA CCCGTCCAGA CCGTGTCCGC ATGATAAAAC 14100 

CTTTTTGAAT CAAGTAAGGC TCATACATGT CTTCAACTGT CTCACGCTCT TCGGCGATAT 14160 

TCACAGAAAG AGTTCCTACA CCAACAGGTC CTCCACTGTA CAT CTC AATC ATGGTGCGAA 142 2 0 

GG ATTTTTT G ATCCACATAG TCCAAACCTT CATGGTCAAC ATCCAGCATA GTCAAAGCCT 14 2 80 

TATCGCTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 14 3 40 

GCTTGAGGAG ACGATTGGCA ATACGAGGGG TTC C ACG ACT ACGTAGGGCC AACTCAGATG 14 400 

CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTCCCGT CCGCTCCACA ATTTCTGTCA 14460 

AGTCAGCATG AGCATAATAC TCCATATGAC CTGTAATCCC AAAACGTGCC CGTAGTGGAT 14 520 

TTGAGACCAT ACCAGCCCGA GTCGTCGCAC CAATCAAGGT AAAAGGAGGC AACTCCAAAT 14580 

GAACACTGCG ACTGCCTTCA CCAGCCCCAA TCATAATATC GATGTAGAAC TCCTCCATGG 14 640 

C ACT AT AAAG CACTTCTTCC ACTGACATGG GTAAGCGATG AATCTCGTCA ATAAAGAGGA 14 700 

CATCTCCAGG CTCTAAATCA TTC AAAAT CG CTACCAAATC ACCCGCTTTT TCGATAACAG 14 760 

GACCAGACGT TTGCTTGAGA TTGACTCCCA GTTCATTGGC AATGACAAAA GCCATGCTTG 14820 

TTTTCCCAAG CCCTGGAGGG CCAAATAAGA GCACATGATC CAGCGCTTCA TCCCGCATTT 14880 

TAGCGGCTTC G AT AAAG AT C TGAAGTTGAT CCTTAACCTT ATCCTGACCA ATATATTCAC 14 94 0 

GTAAATACTG AGGACGGAGC GTGCGTTCTA CTAACTCCTC ATCACCCATC ATCTC ATT AT 15000 

CTAAAATTCT ACTCATGGCT CTATTATATC AAAAAAAACA AGCCACAAAC AAAAAAGCCA 15060 

CCTGATTGGG TGACTCCTAA GTTTAGCACT TATGTGGTAT AATATTATAC GGCACTTCTA 1512 0 

CACCGCCTAC GAAAGGAGGT G AG AT AGCCC ATGATGGAAT TACTACTCAA AACTATTATC 15180 

GGACCAATTG TGGTCGGTGT CGTTCTTCGT AT AG T CG AT A AATGGCT AAA C AAGG AC AAA 15240 

T AGTGTC AAA AAAGACCTCA AGCTTATTTG GTCGTGAGCT TGGGGTCTTT TCTAGCCTAT 15300 
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GATATAGAAC TAGTACTCAA TTCCTTTTTA TTATCCCATA GTTCACGAAT TTTGTCAAAA 153 60 

CTTTACATTT TCTTCAACCG CTGTACGACA AGACGGTTAA GATTAAGAGA ACGTTAGGGA 15420 

TTCTATCAAT TTCATAGAAA TTTTGATTTC GTAAACGAAG AGACAATCTT AC ATGT C ACT 15480 

TCTCATTTAA TACGCCACTA CTAGACAAGC AAAATC AT T A TTACAGTAGT TCCAGTCCTT 15540 

CAATTAACAG TCACTTACAA TCAAATTGAG TTTCAACTAG CTGAAGCGAC CACAGACCTA 15600 

TTTCTTAGTC ATATTCGCTA AAAAAATCCC CGCCAAAATC TCAAAAAGTC CCCGCCAATT 15660 

CCCCGACCAA AATCCGAAAA ATACCGAAAA ATATCGAAAA ATTATTTTTA GAATAGTCCC 15720 

AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG GTGGAGTTAA GGGAGATTAT 157 80 

TATGAAAAAG AAAAGTTTAG GATTTTATTA AAT AAAGTT A GGAGGTCTTT ATTTAATAAC 1584 0 

TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT CTAAAATTTT ACTATTTTGC 15900 

AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG GATAATCTCT AGCCCCTCTC 15960 

ACACCACCGT ACGTGCCGTT TGCCATACGG CGGTTCAACT AACTTTT AAC GCATGTCGTT 16020 

CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC CAGGACTGGT TTTGATATAG 16080 

CACGTTTAAG TACCGACTTC TGAGCTACTA ATTGATAATG GTCGCCCCAG CCAGATACCT 16140 

TATCTGCTAT CCATTTAGGA ACTCCTAACT TAAGCAATCC CCATAATCGT CTCGATTTCT 16200 

TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG CAAGCGCTCA TCTATGCTGC 162 60 

CC ACT AT ACT TTTCATATTT CCCAATGAGC AATACTTTAT CCATCCTCGA AT AG AC AAA T 163 2 0 

TCAGTTCCTC AATACGTCTT GTTAGGTCTA TACTCCATTT CCTCTCTGTT AGTTTCTTCA 163 80 

ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACC GCTTTTCCAA CCATCTGATA 16440 

ATTTCCAGAA CCCAAAACCT AGATATTTCA A CTCTCTTGG TCATGTTTAC TTTCAAACCT 16 500 

AGCCGTTTCT CAATAAACGA CTGACTGAAT ACATC 16535 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 8136 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
< D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 

GATGCATACT T ATTTG C AAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 12 0 
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ACGTGGTTAC CGTCGTATGG TTGCGGCTAT GGCTGATTTA GGATTTTCAG GAACTATGAA 180 
GGCTATCTGG GATGACCTCT TTGCCCATCG TAGTTTTGCC CAGTGGATTT ATTTGCTGGT 240 

TTTAGGAAGT TTTCCTCTCT GGCTGGAGTT GGTTTACGAA CAT CGT ATTG TTGACTGGAT 300 

TGGGATGATT TGTAGCTTGA CAGGG AT T AT CTGTGTAATC TTTGTATCGG AAGGTCGAGC 360 

. AAGTAATTAT CTTTTTGGCT TGATTAACTC TGTT ATTT AC CTTATTTTGG CCCTACAGAA 4 20 

ACCCTTTTAT GGTGAGGTGC TGACGACACT TTACTTCACA GTCATGCAGC CAATTGGACT 4 80 

TCTAGTTTGG ATTTATCAGG CACAGTTTAA GAAGGAAAAG CAGCAGTTTG TCGCGCGTAA 5 40 

ACTGGACGGC AAGGGCTGGA CAAAGTATCT TTCCATTAGT GTGCTTTGGT GGTTGGCCTT 600 

TGGCTTCATT TATCAGTCTA TTGGTGCCAA TCGTCCCTAT CGTGATTCAA TCACAGATGC 6 60 

AACCAATGGG GTAGGGCAAA TCCTCATGAC AGCTGTTTAC CGTGAACAGT GCATATTCTG 720 

GGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGGA GAAAGCCTGC AAATTCAAGG 780 

GAAATATCTA AT TTATCTCA TTAACACTCT AGTTGCTTGG T ATC AATGGA GCAAGGCAGC 84 0 

TAAGCACAAT AC TG ATTT AC TTAACTAGGA AAAG ATGTTT GAAAGTGCTG TTTTG AG AT T 900 

TCGATTAAAA CAGATATAGT TGATAATCAA GGATTTATAG TATGAAAAAG AGGATCGGCG 960 

GGTCCTCTTT TGTTGTTGAA AAGATAAAAA ACTCAGTAAC CTAGAAATAA GACAACTGAA 1020 

GCTTTACTCT ATATTCAATT TTTAGGAATG AGAAGGTCTA GATAAAATTG CACAACTTCC 1080 

TGGTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAAGTTGGAC 1140 

ATGACCAAGT GTTGGAGGTA AGAAGTAGGC AGATTAGGGT GGGCTTCTTT TAAATTATCA 12 00 

GCTAGCACGG AATAGACATG CTGTTCTAGC TCTTTATGGA GTTGACGGAG GAAGTAGTCA 12 60 

TTTTTGGAAA ATAGCAGACT GGTGATATGG TCTTGGTTTT TATGAAAATG GAGAAAGAGG 13 20 

TGGGCGAGGT AGTCCTCGGT TGAAATGGCT TGCTCTCTTT CAAAAAGATG ATGGAAGAGC 13 80 

T AGCGG C AG A GCT GGTCC AG AAGAAGCTCC TTACTCTCAT AGTGACAGTA AAAGGTGGAT 14 40 

CGTCCCACAT CTGCGAGATC AATGATATCC TGAACAGTAG TGGCCTCGTA GCCCTTACCA 1500 

TTCAAAAGTT GTATAAAAGC TTGATAGATG GCTTTTTTGG TTTTGCTGAT ACGGCGGTCA 15 60 

ATGTTAGTCA TATGGACACT TAAGGCAAAT TGTTCAGAAC TGAATAAAGC TCACCTTTTC 16 20 

CTTCTATCCT TTCTTTGAGT TTTAGTGGAT AATGATAATG AACAAGGTGT TCATAAATCT 1680 

ATTATAACAA AGGAATGAGA AATATGAAGG CAAAATATGC TCTTTGGGTC GCTTTTTTCT 17 40 

TAAATTTGAC TTATGCCATT GTTGAGTTTA TTGCAGGTGG AGTATTTGGT TCTAGCGCTG 1800 

TTCTTGCTGA CTCTGTGCAT GACTTGGGAG ATGCGATTGC AATTGGAATA TCAGCTTTTC 18 60 

TAGAAACAAT CTCCAATCGT GAAGAAGACA ATCAGTACAC CTTGGGCTAT AAGCGGTTTA 1920 
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GCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GGGCTCTGTT CTAGTCATTT 1980 
TGGAAAATGT CACGAAGATT TTCCATCCCC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 
TAGGAATTAT TCCGATTACT AT C AATCTGT TAGCGAGTCT CGTCG7TGCT AAGGGAAAGA 
CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 

TTATCCTGAT GGCGATTCTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 2 2 20 

CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGC TCTACACTCA 2 2 80 

AG ATTT T CTT GGATGCTGTG CCAGAAGCTC TTGATATCAA GCAAGTAAAG AGTGGCCTGC 2 34 0 

AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGCACTATC GATCCTTTGG 2 4 00 

AAAAAAATGC CATTGTCCAT CTTTCTCTAA AAGAAATGGA ACATATGGAA AC TTGT AAAG 2 4 60 

AGTCTATTCG AATTTTCCTA AAAGATTCTG GTTTTCAAAA T ATT AC C ATT G AAATTG ATG 2520 

CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 2 580 

AGCATCAACA TTAGAAAAAA GTGAAAAATA C TTGGGT ACT ATCTTATTTG GAATAGAGTA 2640 

ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 2700 

GATATATGAT TGTTAATGAT AAAAATTTTT CGATTAGATA CAAAATGCTT GACTTGGAGT 27 60 

CAACTCAAAG TTATATAATA AGATAAGTGA GTTAGAATAG CGTGAATTCA GTGAATGAAA 2820 

TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 2 880 

ATACGATTCG G T ATT AT G AA CGGGTTGGTC TTGTGCCACC GATTACTCCT ACTGCTACTG 2 9 40 

GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA ATTTATTAAG TCTTTTCGTT 300C 
CGGCGGG TGT CTCTGTAGAT AGTTTAGTTG ACTATATGTC GCTCTACCAA AAGGGAGATG 3 060 

AAACGAGAGA GGAGAGGCTT GGTATTTTAG AAGAGGAAAA GCAAAAATTA GAGGAGCGCT 3120 
TGTCTCAGCT ACAGACAGCT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 3 130 

GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 3 240 

TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGCTT CGTGCGTGCG 3 3 00 

TTTGTGGTTC AGATTTATGG AGGT ACCGT A AT C C AG AAAC GAAAGCTGGA CACAAAAATA 3 3 60 

GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AGCCATTACG ACCGTGAAAC 3 4 20 

CAGGTGATTT TGTGATTGTC CCTTTTACAC ATGGATGTGG TGAGTGTGAT GCCTGTCTTG 3480 
CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 3 54 0 

CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 3 600 

CTGACTATAC AG AAGGG ATG CTCAAGTCCC TTTTG AC T C T TGCAGATGTC ATGCCGACAG 3 6 60 
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GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT GTTATCGGTG 3 7 20 

ATGGGGCTGT TGGTCAATGT GCTGTCATCG CGGCTAAGAT GCGTGGAGCA TCACAAATTA 3 7 80 

TCCTTATGAG CCGTCATGAA GACCGTCAAA AGATGGCTAT GGAGTCAGGT GCGACAgcTG 3 84 0 

TTGTTGCAGA ACGTGGTCAA GAAGGAATTA CCAAGGTGCG TGAAATCCTC GGTGG AGG AG 3 900 

CAGATGCAGC ACTTG AATGT GTTGGTACGG AGGCTGCT AT AGAACAGGCG CTAGGTGTTC 3 9 60 

TTCATAATGG ACGGCCTATG GG CTTTGT AG G AGTCC C AC A CTATAATAAT CGTGCTCTTG 4 020 

GTTCGACATT TATGCAAAAT ATCTCTGTAG CAGGTGGGGC AGCTTCTGCT AC AAC AT AC G 4 0 80 

ATAAGCAATT TTTACTAAAA GCCGTCCTTG ATGGTGATAT CAATCCAGGT CGCGTCTTTA 414 0 

CTTCAAGTTA TAAACTGGAA GATATCGACC AAGCCTATAA AGATATGGAT GAACGTAAGA 4 2 CO 

CAATTAAGTC TATGATTGTA ATCG AAT AAA AAACGAATAG CACTTTTAGA ACTCTATTCG 4 260 

TTTTTTATGT TATCCTATTC TTGATTTAGG GTACTTTCTC TTAATGTCAG TCTGGTTCCC 4 3 20 

AGCATGGTCA GGCTAGGGAT TTTCCGACCC TGGAGGACTT C CTTGTT AAG AATATCCATA 4 380 

CCTGCTCGCC CCATTTCTTC AGTATAAACT GTAATACTAG AGAGGGGAGG ATAGACCTGT 4 4 40 

TTGGTCAGAC TAGTGTCGTT AAAGGAAATG AGGCTGACGC CATCTGGCAG GCTGATTCCA 4500 

GCTTCTTGGA GGGCACGGAG GGCACCGATA GCTAAACTAT CGCTGGCTGC GAAAAATCCT 4 5 60 

GGCGGAAGTT GGTCTCCCAA GCTCTGAATG CCCTCCTTCA TTAAGTCATA GCCAGACTGG 4 620 

GCAGTAAATC TT CCTT G AAA G AC C AGTTC A TCATGATAGA TTCCCCTCGC TTGACTATAG 4 680 

TTTTTGAAGT TTTCTAGACG CTTGTCCTCA ATGATTTCTT CTTGGTCTGT TGTTTCTTCA 4 7 40 

AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA GGAAATAATC GACAACCTGT 4 800 

TTCATAGCAG TGTAAAAATC CGTGATAATA CAGGTATGTC CCAGGGAAAG TGTATCGCTG 4 860 

TCTAGAAATA CAAGAGGCTT TTGGTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4 9 20 

TTTCCGATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAAGGGTG GTCATTAAAA 4 980 

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 504 0 

TAGTAGTACA GGTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGGATAAT GGCAATCTTT 5100 

TCCTTGGGTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGGTTAAAA TACGGTGTCT GGTTTCTTCT GTAACAGATA GGCTCTGGTC GCGGTTGAGG 52 20 

ACGCGGGATA CGGTCGCGAT AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 52 80 

ATAAATCCTC CTTGATTAGG TTAGTATATC ATGTTTTTCT TCTTTTTACT GATATTTTAC 5340 

TAAAATTTTA GTAAAAAGGA T TG ACCTTGG AAAATTCCTT GG AT AT AAT A GAAAGAAAAC 5400 

GATTACACGT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 5 4 6C 
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TACTACTGAA 


GCCCTTCGCA 


AAGACTTTCT 


TGCTGTTTTT 


GGTCAAGAAG 


CAGATCAAAC 


5520 


CTTCTTTTCA 


CCAGGCCGCA 


TT AATTTG AT 


TGGTGAACAC 


ACAGACTACA 


ACGGTGGCCA 


5580 


CGTTTTTCCT 


GCTCCTATTT 


CCTTGGGAAC 


TTACGGTGCA 


GCTCGTAAGC 


GTGACGACCA 


5640 


AGTCTTGCGT 


TTCTACTCAG 


CTAACTTTGA 


GGAC AAGGG C 


ATTATCGAAG 


TGCCTCTCGC 


5700 


TGACCTCAAG 


TTTGAAAAAG 


AGCACAACTG 


GACCAATTAT 


CCAAAAGGTG 


TCCTTCATTT 


5760 


CTTGCAAGAA 


GCTGGGCACG 


TGATTGACAA 


AGGTTTTGAT 


TTTTATGTTT 


ATGGAAATAT 


5820 


TCCAAATGGT 


GCTGGCTTGT 


CTTCTTCTGC 


ATCCTTGGAA 


CTCTTGACAG 


GAGTCGTGGC 


5880 


TGAGCATCTC 


TTTGATTTAA 


AATTAGAGCG 


TCTCGATTTG 


GTTAAAATCG 


GCAAACAAAC 


5940 


AGAAAACAAC 


TTTATCGGAG 


TAAACTCTGG 


CATTATGGAC 


CAGTTTGCTA 


TTGGTATGGG 


6000 


GGCAGACCAA 


CGTGCTATTT 


ACCTAGATAC 


TAATACTTTA 


GAATACGACT 


TGGTGCCACT 


6060 


TGATTTGAAG 


GACAATGTCG 


TTGTTATCAT 


GAACACCAAC 


AAACGCCGTG 


AATTGGCGGA 


6120 


CTCTAAATAC 


AATGAACGTC 


GTCCTGAGTG 


TGAAAAAGCA 


GTGGAAGAAT 


TGCAAGTTTC 


6180 


CTTGGATATT 


CAGACTCTGG 


GTGAATTGGA 


CGAGTGGGCC 


GTTGACCAAT 


ATAGCTATCT 


6240 


GATTAAAGAT 


GAAAATCGTT 


TGAAACGTGC 


TCGCCATGCT 


CTGCTTGAAA 


ACCAACGTAC 


6300 


CCTCAAAGCT 


CAAGTAGCAC 


TCCAAGCAGG 


AGATTTGGAA 


ACATTTGGAC 


GCTTGATCAA 


6360 


TGCGTCACAC 


GTTTCTCTGG 


AGCATGATTA 


TGAAGTAACT 


GGTTTGGAAT 


TGGATACCCT 


6420 


TGTTCACACA 


GCTTGGGCAC 


AAGAAGGAGT 


TCTCGGTGCT 


CGTATGACAG 


GGGCTGGTTT 


6480 


TCGTGGCTGT 


GCcATTGCCT 


TGGTTCAAAA 


AGATACTGTT 


GAGGCCTTTA 


AGGAACCTGT 


6540 


AGGCAAACAC 


TACGAGGAAG 


TAG TTGG AT A 


CGCTCCAAGC 


TTCTATATCG 


CTG AAGTTG C 


6600 


AGGTGGCACT 


CGCGTCCTTG 


ACTAGTCAAA 


AGGAGGCTCT 


ATAGTCACCT 


TAGTAAATAA 


6660 


AT7TGTAACA 


CATGTCATTT 


CTGAAAGCTC 


ATTTGACGAA 


ATGGATCGAA 


TCTATCTGAC 


6720 


CAATCGTGTT 


TTGGCACGAG 


TGGGAGAAGG 


TCTTTTCGAA 


GTTGAGACCA 


ATCTCCATAA 


6780 


ATTCATTGAC 


CTCAAGGACC 


AGCTGGTTGA 


AGAAGCCGTT 


CGATTAGAGA 


CGATTGAGGA 


6840 


TAGTCAGACT 


GCGCGTGAAA 


TCCTTGGTGC 


TGAACTGATG 


GATTTGGTGA 


CTCCTTGTCC 


6900 


AAGTCAGGTC 


AATCGTGATT 


TTTGGGCAAC 


CTACGCCCAC 


TCTCCAGAAC 


AAGCGATAGA 


6960 


GGATTTTTAC 


CAACTCAGTC 


AGAAAAATCA 


CTACATCAAA 


CTCAAGGCCA 


TTG CT AG AAA 


7020 


TATCGCTTAT 


CGTGTTCCAT 


CTGACTACGG 


AGAACTTGAA 


ATTACCATCA 


ATCTCTCTAA 


7080 


GCCTGAAAAA 


GATCCCAAAG 


AGATTGTGGC 


AGCCAAGTTG 


GTGCAAGCTA 


GTAATTATCC 


7140 


TCAGTGTCAG 


CTTTGTCTAG 


AGAATGAGGG 


CTACCATCGT 


CGAGTTAACC 


ACCCAGCTCG 


7200 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAG GAATGGCGTT TCCAGTATTC 7 2 60 

GCCCTATGCT TACTT7AATG AGCATTGTAT CTTTTTAGAT GGCCAGCATC GTCCCATGGC 7 320 

CATTACTCCT CAGAGTTTTG AACGTCTGTT GGCTATCGTA G AC C AGTTTC CAGGATATTT 7 380 

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGCCTCT ATTCT AACTC ATGATCATTA 7 440 

TCAGGGAGGC CGTCACGTAT TTCCT ATGG A ATTGGCTCCC TTGCAAAAGG CCTTCCG AT T 7 500 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACGTTT 7560 

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTGC AGGAATGGCG 7 62 0 

C C ACT ATTC A CATCCTGCAG TGCAGATTTT GGCAGAGACA GACAGGACAC CGCATCACAC 7 680 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAC TTGGACTTGG TCTTCCGAGA 77 4 0 

CAATCAGACT TCAGCAGAGT ATCCTGATGG TATCTATCAT CCCCACAAGG ATCTCCAACA 7 800 

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC TTGGCAATCT TGCCACCACG 7 8 60 

TCTGAAACAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7 9 20 

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCAGACT AACG GAT AAA 79 80 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTCGGTGCTA TCTTTGCGCG TGTACTTGAG 804 0 

GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG CCTTTATGCG CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGC 9136 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CCCATAGTGA AGAC7GCCCA TAAGAAGGTC TTCTAGGCTT AATTTAGCTT . TTCGTCCAC C 60 

TTTTGCGTGT TTAAGTTGAT AAG CTGTTTT TAACACAGCT GAACATCTCT TCAAAAGTCG 12 0 

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 180 

CATTCATAGA ACT ACT AT AC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 24 0 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGCCCCAG CTTCGATAAC TGGGATACCT 3 00 

GGTTCAAAAC CAACGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 3 60 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 4 80 
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GCAAGAATTT T AAT C ATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 540 

GGTTGCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 6 60 

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 7 20 

CATATTATGG AAAGTCACCG GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780 

ATGAACAATA TCTGTGTGAG GAATCATTAC ATTTGCAAGT CCTTTCCTAG AAATTCCATA 34 0 

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AG TTGG AGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAAGTTATTC TTGATTATCC 9 60 

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AAGAAATAAT CTAATACCA? AAGGTTTTCC 1020 

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 10 80 

TTCTATTCTT TTTTGTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT CTTATATAAA 114 0 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200 

AGACCAAACT AGGAAGCTAG CCGCAGTTGT TCAAAACACA GTTTTG AGGT TGTAGATGAA 12 6 0 

ACTGACGAAG TCACTCAAAA CATGGTT TT G AGGTTGTACA TGAAACTCAC GAAGCAACAg 13 20 

CCATACATAC GGTAAGGCGA CGCTGACGTG GTTTGAAGAG ATTTTCGAAG AGTATAAAAA 13 80 

CTAAAAAAGC AGACCATCTA AGC CTGCTTT ACTATTGATT CTTATATAAA TTTCCTGTGA 14 4 0 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGGAAGGC 1500 

TTCTTGCGCA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTT AC C AG CTTTTTGTTT 1560 

TTCAAGGAGT TTACGCTTAC GAGAAACGTC ACCACCATAA CATTTAGCAA GTACGTTCTT 1620 

ACGAAGGGCC TTGATATCAG TACGAGCGAC AATCTTGTGT CCAATAGCCG CTTCGATTCC 1680 

AACTTCAAAT TGTTGGCGAG GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACC 174 0 

TTCGTAGGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 13 00 

AAGAATATCC ATTTTC A CCA GCTTAGATGG GCGATATTCT GACAATTCGT AGTCAAAGCT 1360 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCAGCAAG 19 20 

AGGAATTTCA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAC TCACAAAGTC 19 80 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT 2 040 

TTGCGCCTTG ACATAAGGCT CTTCAATGGT CGC AATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TGGGTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTGTAAA TAACAGACGG 2 160 

AGCTGTCATG ATCAGGTCAA TATTGAACTC ACGCTCTAAA CGTTCCTGGA TAACATCCAT 2 2 20 
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ATGCAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTGAG ATGTTTCTCG 2 2 80 

TTCAAACTGA AGACTAGCAT CATTCAGTTG CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2 340 

GTACTTGTTT GATTCGATTG GGT AG AG ACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 24 00 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TGCCAAGGTA ACGGTATCAC CCACACGAGT 2 4 60 

ATCCTGAACC CTCTTGATAG ACGCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2 5 80 

GCTCATGAGC TGAATCTTAT C AC C AGGTTT GACCACTCCG TCCATGACAC GC ACTTG GAG 2 64 0 

GATAACCCCA CGGTAAGCAT CGT AAAC AG A GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2 700 

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 27 60 

AATACCAGCC TTGGCAGAAG CCAAAACTGC TTCACTGGCA TCCAAACCAA TCACATCTTC 282 0 

AATCTCTGTA CGCACGCGCT CCGG ATCTG C AGCCGGCAGG TCAATTTTAT TAATGATAGG 2 880 

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG GCAAGAGTTT GAGCCTCAAT 29 40 

TCCTTGAGCC GCATCGACCA CCAAAATAGC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000 

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AAGTTTC CCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGGC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATGCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3 180 

TTTTT CC AAA ATGCGGTCTG CTACAGTTGA TTTTCCGTGC TCAATATGGG CGATAATAGA 32 4 0 

GAACTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3 3 00 

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTGATATTTT GACAACACCA TACCCTGCTA 3 3 60 

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 34 2 0 

TTGGTCTGTA AAG AC AATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3 480 

AATCTTGCCA TCTTCTCTAG TCCAAAGCTC AGATGTACCG CGTTCTTGCT GTAACAAACC 3 540 

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3 600 

TGCTTTTCTA ACCCATACTT TTTTATAATC TCTTGTTTCA TGCCACTCGT CCAAGGTCAA 3 6 SO 

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACT AC AAATG GCATTTCACC 37 20 

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCGC TTGGCATCTT CTACTCCATC 3 7 80 

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGAGTTAATG GTTGTATCTC CACCATTGCG 3 8 40 

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCGTC 3 900 

GTTTCCGGAC AAAC AG AT AG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3 9 60 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAG AG C AAC T GGGGCTGACC 4 020 
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ATCCCAGGTT TTGACAAGCT CTTCCAGCAT CCCAGCTTTT CCGTGAACAT CTCCAATTAC 4080 
ATAATAATCT GTCATCTTAT TTCTCCCTGT TTCTCAACAA TTCTCTTGCT TGCGTCAGGG 
CTGCTTCTGT CACATCATCA CCTGCCAACA TCTTGGCAAC TTCCTCCACT CGCTCTTCGA 
CCGTCAAGAG ACGAACAGTC G AAACC GTTG AATGGTC AT T ACTAATCTTC TCAATAAAGA 4 2 60 

ATTGATAATC TGCAATCGCA ATT ACTTGT G GCAAATGGGA GATAGCCAAA ACCTGACCAT 4 3 20 

GCTGACCAAT TTTATGAATT TTCTGAGCAA TAGCTTCACC AACACGACCT GAAACTCCCG 4 380 

TATCCACCTC ATCAAAGACA ATGCTAGTCT TGCCTTCTTT ACGTGAAAAG GCAGACTTAA 4 4 40 

TGGCTAACAT GAGACGAGAT AATTCCCCTC CAGAAGCAAC CTTAACCAAG GGTTT AAAGT 4S00 
CTTCTCCAGG GTTGGTTGAA ATATAAAACT CAACCATTTT ATTTCCCTCA CGACTGAATT 4 560 

TTCCCT TACT AAAACGAACC TGAAACTGGG CTTTTTCCAT ATAAAGATCT TGCAGTTCTT 4620 
GTTTAATCTC AGCTTCGAGT TGCTGAGCCA AATTATGACG AGCAGAAGCA AGTTGACCTG 
CCAAATTGAC AAGATTGACT TCCAACTTCT TAAGCTCTGC TTCCATGTCC TCAGACGAAA 
GATTATTGCC TCTCAAGAGA TTGTATTCTT CCGTAATCTT GGCAAAATAA AG C AAAAC AT 
CATCAACAGT CCCACCATAC TTACGACTAA TAGTATGAAG GAGGTCCAAA CGATTCTCAA 
CCTGCATCAG GCGATTGCCA TCAAAATCAA GGTCCTCAAT GATAGCTTCC AAACGTTTGC 
TAATGTCTTC T AAAAC AT AG TAGGTCTCAG AC AG AT AG CT TGAAATTTCA CGGTATTCAG 4980 
GATCATACTC TTCGACACTT TCCATGTCAT TCATACCTGA ACGAACATTG GCCAGACTTC 
AAAAATCTTC ATTGTCCAAC ATACTGTAGG C ATTGGT C AG TGTATCCGCA AT ATTTTT GT 
GGTTGAGGAG TTTATCTCGC TCTTGATTGA GAGCCAAGTC TTCTCCAGCC TGCAAGTTTG 
CTGCCTCAAT CTCTGCCATT TGAAATTCCA ACATTTCGAT ACGTGCCTTG TGTTCCTGTT 
GCTTTTTCTT GACTTCCAGA ACCTGCTTGC GCATTTTCCG ATAGGCATCA AAACTCGTTT 
GATAGGTTTC TTTCAAGTCC CAAAAAGCGG CATCACCAAA TTCATCCAAC ATCTGGATAT 
GCAGTTGGGG ACGCATTAAC TCCTCATGGT CATGCTGACC ATGAATATCT ACAAGATGTT 
GCCCAATAGC T CGC AAAAC A GACAGATTAA CCATCTGACC ATTTACACGG CTGATACTAC 
GACCATTTTG CAAGATTTCC CGACGGATGA TAATTTCATC ACCTAATTCT AAACCTTGCT 5520 
CATCAAAAAT TTCCTG T AAA AGACCACTAT TCTC AACTG A GAAAAGCCCC TCAATCTCTG S580 
CCTTTGGTGC ACCATGACGA ATAACATCTG TCGTCGCACG AGCTCCCAAC ATCATATTCA 5640 
TGGCATCAAT GATAATCGAC TTCCCTGCAC CCGTTTCACC AGTCAGGACA GTCATCCCCT 5700 
T TTC AAAATT GAGGGAAATA GCCTCAATAA TGGCAAAGTT TTTTATCGAA ATTTCAAGTA 5760 



4680 
4740 
4800 
4860 
4920 



5040 
5100 
5150 
5220 
5280 
5340 
5400 
5460 



r 

WO 98/18931 PCT/US97/19588 



626 

ACATATAGAC CTACCAATTT TTTACTTCTT CAAAGATTTC CTCTGCTAGA CTTCCACTTC 5 820 

TGGCAATGAC TAAAATCGAG CTATCATCAG TCAAACACC? AAAAATCTTG TCTGCAAAAC 5880 

TCTCGATTAA CTGAGCTTTT ACAAAAGCCG TATTTCCTGG AATAACTTGG AG ATTG ATCA 5940 

TCTTATCCAT CAATTCAGCC GATTCGATAT TGTCTTCAGC CAGTTGCAGA CTTTTTACGA 6000 

TTGATTTTGG CAATTCGTAG ACATAGGTGT TGTCTCTCAA AGGAATTTTG ACAATACCTA 6060 

ACTCTTTGAT ATCTCGGGAT ACCGTCGCCT GAGTGGCAGT GATACCTGCT TCTTTCAAAT 6120 

GTTCTACAAT TTCTTCTTGC GTCCCGATTT GATAATCTGT CACCAATCTT CTAATTTTTT 6180 

CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATGCG CCCTCTCTAC TGCTTCTT T A 624 0 

ATCTCACCAA GAATCTGATT GCTTGCTGAC TTTTCTTTTT TCAAATACGC TAAAAATTCA 6 300 

AT ATTTCC AT GTCCACCTTG GATGGGAGAA AAGTCCAAGC CAAGGACTGA AAAACCTACC 6 3 60 

TCTACTCCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT GAACCTTAGC ATCTCGAATA 64 2 0 

ATTCCATTTT TCCCAATCTG CTCACGTCCT GCCTCAAACT GAGGTTTGAC AAGTGCTACC 6480 

ACCTGACCTT GATCAGCCAA GACACGGTGC AAGGCTGGCA AAATCAGACT AAGGGAAATG 6540 

AAACTCACAT CAATACTGGC AAAGCTCGGC TCCTGCTCGA AATCAGTCTT TTCAGCATAG 6 600 

CGGAAATTCA ACTGCTCCAT GCTGACAACT CGTGGGTCTT GGCGTAATTT CCAAGCCAAC 6660 

TGATTGGTAC CAACATCGAC TGCAAACACC AACTTCGCAC TATTCTGTAC CATGACATCG 67 2 0 

GTAAAACCTC CAGTAGAGGC CCCGATATCA ATCGTAGTCG CGCCATCCAC CGACAAATCA 67 80 

AAGACCTGCA AGGCCTTTTC CACTTTCAAA CCACCACGGC TGACATACTT GACTTTCTCC 6 84 0 

CCCTTGACTT TTAATTCGGT GTCATCTGGA ATTTTCTCTC CTCGCTTCTC AAACCGTTCT 6900 

CCATTAAGGA CTGCTACGAC TAGGCCAGCC ATCACACCTC CCTTGGCCTG CTCTCTCGTT 69 60 

TCAAACAACC CCTGTTTATA AGCT AGT AC A TCCACTCTTT CCTTAGCCAT TGATTCTCAA 7020 

ACTTTCTACT ACACTTACAA TCGATTCTGT TTCAAAGGGA AGCTGCTGGG CAATTTCTTC 7 080 

TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG GCAATGGACT CTTCCAAGCC 714 0 

CAACAGGGCA GGATAGGTTG ATTTTTCTGC CTGCAGATCC TTTTGAGGTG TCTTGCCGAT 7 200 

TTCCTCAAAA CT AG CTGTC A CATCCAGTAC ATCATCTCTG ACTTGAAAAG CAAGTCCAAT 7 260 

CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT G AC AATTC AG CTATAATAGC 7 32 0 

TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA TTGGCATGAA TAGTCTGAAG 7 380 

TTCTTCCAAA GACAAGTGCT GGTGTTCGCC CTCCATATCC AAAACTTGCC CTGCTACCAT 7 44 0 

ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATCAAG TCCACCTTAA TCTGACTTGG 7 500 

CAAATCTGCC TGCGCAATCA AGGCATATGA GTCTAAGAAT AAGGCATCTC CAGCCAAAAT 7 560 
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GGCCATAGCT TCACCGAATT TCTTGTGATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7 620 

ATCCATAGCA GG AAGGT CAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7 680 

AGCTACCTGC GCGTGAGCAG GTTTGATGGT AACCTGCAAG GCTTCCAGAA CTTCTAACAA 77 40 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATGAATAGAA TAGAGAACAG ACTCCCGTAA 7800 

ACTAGAGGCA AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7860 

TTTTTCTTGC TTTTTCATTC AAAATCACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 79 20 

AAGGT CTTTT CAGCCTTGTC CAGCGTAGCT TGCAGCTCTT TTG AC AAG AC CATGCCCTTT 7 9 80 

T G AAAGG C AG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 804 0 

GTTTCCAGTT CTGCTAGATT TTCCT C AAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TTCTACTTGA CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGGACTCTT CTTTTTTGAC AATAGCATAA CCACGCGCCA CGATTCGGCT 8220 

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCCCTTCCCC TCAGCAACCT TCGCGTCATA 8 280 

AACTAACGCC ATTTGGCTAC CTAAGAGCTT GTCCAACTGT CCTAAACGGT CTTGATAGCG 8 34 0 

TTGGATTTTG GTAACAGGTG ATAATTGTAC TAATTCATGA CTTCTTGCTT GAACTAATTG 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTGTTTCAAA CGCAGTTGCA GTTGGTCCAA 84 60 

CCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGTTGTCTA AAGATAACAG ACTGACTGCA 8520 

TTTTTTCAAA GCCTCTTGTT TCTTACATAG AACATTTCGG ACTGCCCTTA CCATCCCTTT 8580 

TTCCTGATTT TGCAAATGAG CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCACC 8 640 

CGCCGCTGTT GGCGTTGCAG CGCGTCGATC TGCCACAAAA TCTGCCAAGG TCACATCCG? 8700 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACCACATTCA AAAATAGCTC GTACCACAAT 8760 

TTCTTCGTTA AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATCAGCAA 8820 

ATCCAAATCG TCCCGTTGAT TAGCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8880 

TTCACCTTGA ACCTTGGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC GCCTGCTGAC 89 40 

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GTTACTACAC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TT C TTG AAA C AGGCCTTCTT CTGTCAATTT 906 0 

TTTCTTAACT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAGCTTTTTC 9120 

AATGATGATG GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTGAT 9180 

CTTCATTCCT TCTTCCAGGT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 92 4 0 

7TCAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 9 3 00 
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GTTGGAAACT TGACCAGTTA AATAGACCCG TTCCAAGTAT GGGTCTTTAT CGAATTTCAT 9 3 60 

TTTCAGATAC TTGGTCAAAG TTGTTACCGA TAAATACTTT TCCATCTCCA CCTACTATTC 94 2 0 

ATTTACTTGC TCTTTCATGG GT ATT ATT AT ACC AAAAAT A TGCCTAAAAA TCTCCATTTA 9480 

TGTACCATTA TGAGGGAAAA ATAGAAAAAG GAGGCAAGGC CTCCACATGT G ATT ATTTG C 9540 

TGTTTCGAGC TTCTTCCAAA ATCTTTGCAA TCTTGGTCGT CAACAGGTCG ATAGCCACGG 9600 

TATTGCTAAC CCCTTCAGGA ATGACGATAT CAGCATAACG CTTAGTTGAC TCGATAAACT 9 660 

GGTGGT AC AT TGGTTTGACC ACACCTAAGT ACTGGTTAAT AACGCTATCA AGGCTACGGC 9720 

CACGCTCCTC CATATCACGC TTGATACGAC GAATAATGCG CACATCGTCA TCCGTATCCA 9 7 80 

CAAAAATCTT GATATCCATC AAATCGCGCA GACGCTTGTC CTCCAAGACC AAAAT AC C CT 984 0 

C AACG AT AAA GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT GTATGCTCTG 9900 

TATAGTCGTA GGTCGGGATG TCCACCGGAC GCCCTGCCAA CAATTCCTTA ATCTGCTCGA 99 60 

TCATCAAGTC TGTATCAAAG GCAAAAGGAT GGTCATAGTT GGTTTTGACG G 10011 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 3365 base pairs 
(3} TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



CGTGTGGTCT 


TAAAAATAGA 


AGACAAAGAA 


CAAACTCTTG 


GAGG CTTTG T 


CCTTCCAGGC 


60 


TCAGCCCAAG 


AAAAAACCAA 


AACAGCTCAA 


GTTGTGGCTA 


CTGGACAAGG 


TGTTCGTACC 


120 


TTGAACGGTG 


ACTTGGTTGC 


TCCAAGTGTT 


AAAACTGGAG 


ATCGTGTCTT 


AGTTGAAGCC 


180 


CACGCAGGTC 


TTGATGTCAA 


AGATGGCGAT 


G AAAAGT AC A 


TCATCGTAGG 


CGAcTAACAT 


240 


TTTGGCAATC 


ATTGAGGAAT 


AGAAGGAGAA 


AGTAAGTATG 


TCAAAAGAAA 


TTAAATTTTC 


300 


ATCAGATGCC 


CGTTCAGCCA 


TGGTTCGTGC 


TGTCGATATC 


CTTGCAGACA 


CTGTT AAAGT 


360 


AACCTTGGGA 


CCAAAAGGTC 


GCAATGTCGT 


TCTTGAAAAG 


TCATTCGGTT 


CACCCTTGAT 


420 


TACCAATGAC 


GGTGTGACCA 


TTGCCAAAGA 


AATCGAATTG 


GAAGACCATT 


TTG AAAAT AT 


480 


GGGTGCTAAG 


TTAGTATCAG 


AAGTAGCTTC 


TAAAACCAAT 


GATATCGCAG 


GTGACGGAAC 


540 


TACGACTGCA 


ACAGTCTTGA 


CCCAAGCTAT 


CGTCCGTGAA 


GGAATCAAAA 


ACGTCACAGC 


600 


AGGTGCAAAT 


CCAATCGGTA 


TTCGTCGTGG 


GATTGAAACA 


GCAGTTGCCG 


CAGCAGTTGA 


660 


AG CTTTG AAA 


AACAACGCCA 


TCCCTGTTGC 


CAATAAAGAA 


GCTATCGCTC 


AAGTTGCAGC 


720 
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CGTATCTTCT CGTTCTGAAA AAGTTGGT G A GTACATCTCT CAAGCAATGG AAAAAGTTGG 780 

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 

AATGGTGGCT G AC CTTG AAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960 

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC T CTTG ATT AT 1020 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATCCTTGA 1140 

AGATATCGCC AT CTT AAC AG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC ACCGAGAGTG ACCGTGGACA AAGATAGCAC 12 60 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 13 20 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACCCTACT CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA C AG CTCTTGC C AATGTG ATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACACGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT G GT AT AGG AT TTAACGCAGC AACTGGCCAG TGGGTTAACA TGATTGATCA 17 4 0 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA C AAAATG C AC CATCTGTAGC 18 0O 

C AGCTTG ATT TTGACAACAG AAGCAGTCGT AG C C AAT AAA CCACAACCAC TAGCCCCAGC I860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATCATGTAA GCTTTCTATA GAAAACAACT 19 20 

TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 19 80 

GTAGTGGGTT GAAGTCAGCT AAGCT CG AG A AAGCACAAAT TTCGTCCTTT CTTTTTTGAT 2 04 0 

GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 
GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 
TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 22 80 

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2 3 40 

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 

ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGACCTTCC AGTAGCGCTT 24 60 



2160 
2220 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2 520 

ACTCATAGCA CGGCTAAGAT GTTGTACAAT GTGAAAGCGA TCCAACACGA TTTTAGCATT 2 580 

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 264 0 

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 27 00 

CGAACGGCTC TATCGTAGCG AAGAAAGTCA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2 7 60 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2 820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATGCTCAAAG 28 80 

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 294 0 

T C AGTC AT AG AAATTTTTTC AATTAACTTT TCAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCACCAACCA TCATTTTTGA ACAGTGATAG 3 060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3 240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGTGATAAA ATGTAATTGT 3 300 

TCCATATGAA TCTTTCTAAT GAGTTCTTTT GTCGCTTTTC ATTATAGGTC AT ATGGG ACT 3 3 60 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 3 4 20 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3 4 80 

TTATTAAAGG ATGACACAAA AGT T T T TGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3 540 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3 6 00 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3 6 60 

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 3 7 20 

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 3 7 80 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3 84 0 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT :'900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3 9 60 

TGTTT AC AC A ATTTATTTTA TACTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4 02 0 

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATGTTTAC AT AG GCTTCT 4 080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA 7TTTGACATC TTCTTTTAAA 4 14 0 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4 2 00 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATC AAT AT AC 4 2 60 
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4 



TTCAGTGCTT 


CTTCATCCAG 


ATAGTAGGGG 


CTAGCCATAT 


TGCAATAGTA 


AGAAGTTCCA 


4320 


TTATCACGAA 


TCATGGCAAT 


GATATTAAAC 


CAATATTTCT 


TGTGAAAGTA 


AACAATAGCC 


4380 


GGTTCTCGAG 


TGACCCAACG 


ACGACCATCA 


CTTTCGGTAA 


CAAGTGTATG 


ATCGTTGACA 


4440 


CCAATAATGG 


CGTTTTCTGT 


TGTTTTT AGT 


ACCATGGTGT 


CCCGCCAAGT 


T CGGTGG AG A 


4500 


CTCCCATCAT 


GCTTATAACT 


TTGAATTGTA 


ATAAAGTCGC 


CTTCTTTTGG 


AAGCTTCATA 


4560 


ACTAACCAAC 


TTTCTACAAT 


TTATAAGTTT 


ATCATTTACT 


ATTGTACCAT 


AAAATT AC CC 


4620 


AAAATCTGTG 


AATTTCACTT 


GGAAATATTA 


AAGATATTCT 


CTAAGAGCGC 


TTGCTATATC 


4680 


CCAAAAAT CG 


TAGCCCTTTC 


GTGCTAAAAC 


TTG AGTT AAA 


CGCTGCTTCA 


GTTCGTATCC 


4740 


TTCATACTTT 


CGGGCATACT 


TAGTATATTG 


CTTATCAAGT 


TCCTTGAAGA 


TGAGTTCCTG 


4800 


ACTCGTTTCT 


TCATCAACTT 


GACTATCCAA 


TTCGTCAAAG 


GCAATTTTAG 


CATCAAAATA 


4 8 60 


AGAGAAGCCC 


TTGTTAGTCA 


AGTTCTGGAT 


AATCTTATCT 


TGCAGGGCAC 


GAGCTGGAAG 


4920 


TTTTCCCTCA 


TATTTTTTCA 


ATAGTTTATT 


GGCT AC AC GT 


rtjAL»L.AA(w I I 




4 980 


AAAATCATTC 


AAGATTTCTT 


CTATAGTAGA 


TTTTGAAATT 


CCTTTTTGTG 


CTAATTTCTG 


5040 


AGTCAGTACA 


TAAGGTCCCT 


TGTCTCCTGA 


AAGTTG ATTG 


GCATTGATCA 


TAGCATAAGC 


5100 


GTACTGGCTA 


TCATTAATCC 


ACTTCTCTTC 


TTT^AGATTA 


GCAATGACTT 


GAGAAACGAT 


5160 


GTTTTC ATT A 


AT AT CAT ATT 


TTTTCAGATA 


TTCTCTGACC 


TCTTTTTCAG 


TACGTGCTTT 


5220 


AAAGGATAAG 


TGGTAGAGGG 


C C AG ATT C TT 


ACCATAAGAA 


AATTGAGCAA 


AGTCTTGAAT 


5280 


CTCTTTCAAT 


TCCTCTTCGC 


TTATCACCTT 


ATCTCTCGAT 


AACATAAAAC 


GAACAATTGT 


5340 


GTCTTCGGTG 


ATATAGCATT 


TGTCG 








5365 



(2) INFORMATION FOR SEQ ID NO: 78: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3636 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 
ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 
TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 
TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATCCA T G ACT T GT AA AATAATCAGA 
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GTCATTGTAG 


AAAAAATCAG 


GGTGAAGACA 


CCGAAGTTGC 


GGATAAAATA 


ACTAAAGTCA 


300 


TCCGCATACC 


ATGTTTTTTT 


AAGTTTACTG 


AACATCTTTT 


AAAAGATACC 


CAACACTACG 


36C 


CAAAGTTTGC 


AAATTCTCTG 


CAAAAGTGGT 


TCCCTTTAAT 


TTCTTACGGA 


CTTTTGAAAC 


420 


ATAGACTTCG 


AC AAC CG AAA 


TCGTTGTATC 


ACTATCAAAT 


CCCCATAGAC 


GGTCAAAAAT 


480 


CTGCGTCTTA 


GGCAAAATCA 


CATTTTGATT 


TTGAAGGAAA 


TAAACTAGTA 


AATCGAACTC 


540 


TTTCCCCAGC 


AATTCGACAG 


GAGTATCTTC 


AACTTTAACG 


GTATTGGTTG 


ATAAATTAAC 


600 


CACGATATTC 


CCATAAGTCA 


AGGTGTTTTC 


ATTAAACTTC 


CCTGAACGTT 


TGAGAAGGGC 


660 


CTGAATCCGC 


ATTTTAAGTT 


CTTCTAGGTA 


GAAAGGTTTG 


GTCAGATAAT 


CATCCGCTCC 


720 


CAGTTCAAAT 


CCATGTCCCT 


TGTCATCCAA 


ACTTTCCTTG 


GCAGTCATAA 


TCAGAACTGG 


780 


TGTCGTAATT 


CCCTTTTCAC 


GCAATTCTTT 


TAAGACTTGG 


AAACCATTTT 


TTTCTGGCAA 


840 


CATCAAATCC 


AG C AAAATC A 


AGTCATAGAC 


ACCACTCTCA 


GCTTCGTAGA 


GACCTTCTTC 


900 


TCCATCAAAT 


ACCTGCATAA 


CATCCGCAAA 


ATCGTCTAAA 


AAGTCAAATA 


CTGAATTTGA 


960 


CAGACCTAGG 


TCATCCTCAA 


CCAATAAGAT 


TTTTATCATG 


AGAAACTCCT 


CCTTATTAAA 


1020 


ACTATTATAC 


CAAATTTGCC 


TTAAAAAAAA 


CTCAACTCTC 


TGCATTTTAC 


AT G AG AT AG C 


1080 


TGAGTTTTCT 


TTTTATTTTA 


GGCTTATTTA 


TCCATTTCCC 


TATTGAAGAA 


CAACTGCTTC 


1140 


CACTGCAGCT 


TTTTCACGGC 


TAATCAAGTC 


AACACGCGCT 


GCAATTTCCT 


TGATTCCCAT 


1200 


AC CG ATGTT A 


CGGCTAAGAG 


C AAGGT C AG A 


AAGTTGCGGT 


TCAAAGAACT 


CCTTGTATTC 


1260 


CGCCAAGCGT 


TGCTGAGTCT 


TAAATACATG 


AGCACGAAGG 


ATAACAAAGC 


TATCAAAGCT 


1320 


CATATCTCCT 


CCAAGGGCTG 


CCTTAATCCA 


AGCCCAGTTT 


TCACGCGCCC 


AAGACCAAGC 


1380 


TGTTTTCTGA 


GTTGCTTGAT 


GAGCTAGCAA 


TTGGTAATAC 


CAAGCAGACA 


AGTCCTGTCG 


1440 


TTTG AC C AC A 


AATTTGTCCT 


TCCAAGAAGT 


AATCAGGTTT 


TGGATATTAT 


CCGCATCTGT 


1500 


ACTGTATGCA 


AGAGCTCCTG 


CCAACTGGCG 


TTTAAAGACA 


GCATCTGTTG 


CGTGAGTATA 


1560 


AGTATCAAGA 


TAAAGTGCTA 


ACAAGTCTTT 


AGTCTCATGA 


TGTTTCATCT 


CATTAATCAG 


1620 


AACTTGTGAG 


CGAATAGCTG 


CTGGGAGTCC 


TGCAAGATTC 


TCCTTGTGTG 


TTGCGAAGAT 


1680 


TTGGCTAGCG 


ACTTGACTAG 


CTTCTGCATC 


ATTTGAGCGA 


ATCATCATCG 


AAACAGCCAG 


1740 


CTGACGAACC 


AATTCATCCT 


CATCTGATTC 


TCCGTCTTTA 


GCTTCAAAAC 


CAAGACGGTC 


1800 


ATAGTTATGA 


CGAGCCAATT 


TAGCAACCAG 


TCCTTTGAAG 


CCTCTTTCAG 


CATCCGTTCC 


1860 


T T C ATCAAT A 


AAGCGCTCAA 


GGGCTGAAAT 


CACTTGAGAA 


ACAGCTGAAA 


CCACCAGATA 


1920 


AGACTCTTCC 


TTAGCAAGTT 


TATCAAGAAC 


TGGAAGCAAG 


TCTGCATAAG 


AAATGTGCCC 


1980 


TGCCTCAGCC 


AAC AAACG A C 


GTTCTTGAAC 


AATTTGCAGT 


TTGCTTGTG7 


TATCAAGTGT 


2040 
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CTCTAGCTCA 


GCAAGAACAG 


CTGCTAACAA 


GTCTCCTTGA 


TAGTCGGTAA 


T AT AGTGG GC 


2100 


AGT ATTTT C A 


GTGTTGAGAC 


GAAGAGCTCC 


TTCATTTTCA 


GCAAGAACAG 


CTGCGTAGCC 


2160 


Avjuun i * ± *j 


ATACTTTCAG 


TTTCGAGTGT 


ATCAGGCAAG 


CCTTTCCAGT 


TGCTATTGAG 


2220 


vjOU AV. /AN— v. 


CAGAGACGGT 


TCTTGTCTTC 


GTTCTCACCG 


ATGAAGAATT 


GTTTTTGTGA 


2280 




ACATCATTTT 


CAACTTTAAC 


AGTAAGAACT 


GGGTAACCAG 


GCTGTTCCAA 


2340 


rCAAGAATCC 


ATG AAGG CTG 


CGACATCACG 


TCCTGACGCT 


TGACCAAGGG 


CATCCCAAAG 


24C0 




AT GGT GTTG C 


TGTATTGGTG 


TTTTTCAAAG 


TAGGCGTGCA 


AACCTTTAGC 


2460 


AAAATPAfirA 


TCTCCTAGCC 


AACGGCGAAG 


CATGTGCATG 


AGACGGCTTC 


CTTTGGCATA 


2520 


GACGATACCC 


CCGTCAAAGA 


GTGTATTGAT 


TTCATCTGGA 


TGTTTAACTT 


CGACGTGGAC 


2580 


AfiAfTnAArC 
Hunt, lunAV>u 


C CATC AGT AG 


CGTCACGTTC 


AAGAGCAAGA 


GGTACTCCAC 


CTGTTTGGAA 


2640 


ATCTTCAAAG 


ATATTCCAGC 


TTGGTTCGAT 


GGTATCCACA 


CAGACGTATT 


CCATCATATT 


2700 


AnCKAAACT^ 


TCATTGAGCC 


AAAGGTCATC 


CCACCATTTC 


ATAGTCACGA 


GGTTCCCAAA 


2760 




GCCAATTCAT 


GGGCCACAAC 


AAGGGCAACT 


TGTTGACGGC 


TAGCAAATGT 


2820 




TCGACAACCA 


AGTAAACTTC 


ACGGTAGGTC 


ACAAGACCCC 


AGTTTTCCAT 


2880 


AnfArTACrr 


GAGAAGTCAG 


GAAGGGCGAT 


GTGGAGAGAT 


TGAGGAATTG 


GGTACTTAAC 


2940 


TrriTAftTAA 


TCTTCGTAAA 


ACTCGATAGA 


GCGAACAGCG 


ATATCCAGTG 


AGAAATCAAG 


3000 


ATTTrtAAAf^T 


GGATGTGCTT 


TGGTTGAGTA 


GACACCTACC 


AGGGTACCAT 


TTTTAGTTTT 


3060 




CCTTGCAAAT 


CACCAGCAAC 


AAACGCCAAC 


AAGTAAGAAG 


ACATGCGAGG 


3120 


TGTTGTCTCA 


AACTTCCAGA 


TACCTGTTTC 


CTTACGGTTT 


TCAACATCCA 


TTTCTGGCAT 


3130 


GTTTGACAAG 


GCCAATTCAC 


CTTCTGCTTG 


GTCAAAGCUA 




CAAAAflTTGC 


3240 


TTTGGCTTCA 


GGCTCATCCA 


CACATGGGAA 


AGcTTCGCGC 


GCAAAATGGC 


TCTCGAACTG 


3300 


AGT AG AC AAG 


ACCTCCTTCT 


TGACTCCATC 


AACTGTATAA 


TAAGAAGGGT 


AAATCCCTGT 


3360 


C ATGTTGT CT 


GT AATTTT AC 


CAGAAAAGGC 


AAGAACCAAT 


TCAACTTGAC 


CAGCCTCAGC 


3420 


CAATTCGATA 


TGAAGGGCTT 


CATTGTCATG 


GTCAACTGTA 


AATGG A CG AG 


CTTGACCTGC 


3480 


AACTTCTACA 


GAGGTGATTT 


CCAAATCTTT 


TTGGTGGAGG 


GAGATGCGGT 


CACTCTGTGC 


3540 


TTGACCAGTG 


ATGGTCACTT 


TCCCAGAAAA 


AGTCTTGCTC 


TCACGACTCA 


AATCTAAAAA 


3600 


TAAATCATAA 


TGTTCAGGAA 


CAAATTGCTT. 


AATGGG 






3636 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 506 6 base pairs 



WO 98/18931 



PCT/US97/19588 



634 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
ATAGCGTGTA ATAATCGATT TTAGAGGTAC CATAAGCCAC CTCCTACAAA TAGAAACCGA 60 

TATAAATCAA TGCCTTCCAC CCTTAGACTT CCCTAGTTCC TGTCTCAAGC GAAACATTTC 120 

TTTGAAACAG GAATAAGTTA ACCAATTCAT ACCAATAGCT AGCAGAATAA AAAGAAACCA 180 

AATGCCCCAT AACTTGATAT CTGTCACATT TCTCAAGACG GTATTGAAAA ACAGAACTGA 2 40 

AACAACTGTC CAAGCAAGGC TAAAAAGAGA AT AG AAGGG G ATGTAAAACC AGTAAAAATA 3 CO 

ATAAAAAATT GGAAAAAACT TACTATTTCT GTTGGCCTTT TCAATCCAGT T ATC AAAAT A 3 60 

AAAGTACGGT GCTAAAAGTA AGAATTTAAA CAAATGTTCC ATCACCGACA TCCCCCCTTC 4 20 

TTTTGATAGC GTTTTCTATT ATTTTATTAT ATCAAAAAAA TCCGGAACTG TCATTCCAGA 4 80 

TTCTACTTTT TTATTTGCGT TTTCTTGCGA TGAGATGAAT CGGTGTTCCC TCAAAAACAA 54 0 

AGGCCTTGCG GATTTGATTT TCCAAGAAAC GCAGGTAAGA AAAGTGCATG AGTTCTTCTT 600 

CATTGACAAA GATGACAAAG GTTGGTGGTT TGGTTGCCAC TTGGCTCGCA TAGAAAATCT 660 

TGAGACGTTT TCCTTTGTCT GTCGGTGTTG GGTTGATGGC AATGGC ATC C ATGATGACAT 720 

CGTTCAAGAC AGCTGATGGA ATACGTGTAT TTT G ACTTTC GCTGATTTGC TTAATCATCT 7 80 

CAGGAAGTTT GTGGAGACGT TGCTTGGTTA AAGCTGATAC AAAGATAATC CCTGCCTAAC 8 40 

GCAGGTATTG GAACTGCTCA CGGATATCTT CTTCCCACTT TTTCATAGTG TGGTTATCTT 900 

TTTCAAGCGT ATCCCACTTG TTGACCACGA TAATCATCCC TTTACCAGCT TCATGGGCAA 9 60 

ATCCTGCGAT ACGCTTGTCG TACTCACGAA TGCCTTCTTC CGCATTGATG AC CAT C AAG A 10 20 

CCACATCTGA ACGGTCAATA GCACGCATGG CACGCATAAC AGAGTATTTC TCAGTATTTT 10 80 

CATAAACCTT ACCAGACTTA CGCATACCAG CCGTATCAAT CATGGTAAAC TCTTGACCAT 1140 

CTGTATCTCT AAAGTGGGTA TCAATGGCAT CACGAGTTCT TCCAGCAACA GGACTAGCAA 120C 

TAACACGGTC TTCTCCCAAG ATAGCATTGA TCAAGCTTGA TTTTCCAACG TTAGGACGAC 12 60 

CAATCAAGCT AAACTTAATG ACATCTGGAT TTTCTTCCTC ATATTCATTT GGAAGATTTT 13 20 

CTACGATCGC ATCTAGCACA TCCCCTGTAC CGATTCCATG GACAGATGAG ATAGGCAATG 13 80 

GTTCACCCAA ACCGAGAGCA TAGAAATCAT AT AT AT C ATT TCTCATCTCA GGGTTGTCCA 14 40 

CCTTGTTGAC TGCGAGGATA ACTGGTTTGT GGGT CTT AT A AAGCTTACGA GCTACGTATT 1500 

CGTCTGCATC AGTAATTCCT TCCTTACCAG ACACGACAAA AACGATAACA TCTGCTTCTT 1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA ACGAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATACGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTC AAG AT TTTTTCTAGT TGAGCTTGGT I860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCACC TTGTGGTCGC ACGCGCCCCA 19 20 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTT GGT AT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2 04 0 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AG AGCTG CAT CGACCGAACT AATCAAGAGA 2 2 20 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGCCTCCCTT TTCTGCTCGG 2 280 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTCGC GAATGGCTTC TTGAGGTACC 2 34 0 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2 4 00 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2 460 

GCAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2 52 0 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCCT CATTTCCATG TTTCACTTTC GCAAAATGCT 264 0 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 27 00 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACCGTTGAT TGCTCGGCGC 27 60 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 28 2 0 

TGCCATCACT GATAGCATTT TT G AC AT C AA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2 8 80 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGCC CGCCACCATG AAACTTTTAC 29 4 0 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT ACCTCGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3 0 60 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 312 0 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 313 0 

CATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 32 4 0 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTC CTTTG 3 3 00 
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GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3 3 60 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT CACGTTTCAG TACGTCTCCG ACACTTTCCA 34 20 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGG 3 54 0 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3 660 

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 384 0 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

CTTTTTGAGC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA AT AG CAAAG A 4020 

GGTTTAACAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080 

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 414 0 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTGACTGA GACAGGTATT TCTTCACCAT 4 200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260 

TCCGATAAAC AGGATGCCCC AAGAAGTCTT G ACT AG AT AG AGGAGCATGG AGGGCTAGCT 4 320 

GATAAACATC ACCCTTTTG A TAGAGGGTCA AG AG ATT AAA AGCAGATAAG ATTTTCAATG 4 330 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 444 0 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAACATA AAGGCTCAGT GCCTCCTGAC 4 500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4 560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTT CT TT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CG AT AG AC AC 4 680 

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCACCCAA CTCCTCCATG ACGAGTGAAC 474 0 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT T TT ATCTCTG GAGAATTGTT CCCGTGTGCC ATCT TTTTT A ACAACCACTA 4920 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGCCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 506 6 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS : 

<AI LENGTH: 9607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



CACTTGAAGT 


ATTTGAAACA 


GCTATGGAAA 


AC AT CAT G C C 


TGTACTTCAA 


GTACGTGCAC 


60 


GTCCTGTTGG 


TGGTTCTAAC 


TACCAAGTCC 


CAGTTGAAGT 


TCGTCCAGAA 


CGTCGTACAA 


120 




TCGTTGGTTG 


GTAACAATCG 


CTCGTCTTCG 


TGGTGAACAC 


ACAATGCAAG 


180 


ACCGTCTTGC 


AAAAGAAATC 


TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 


240 


GTGAAGATAC 


TCACCGT ATG 


GCTGAAGCTA 


ACCCTG C AT T 


CGCACACTTC 


C GTTGGT AAG 


300 


AT AGG ATGCG 


AAAGCGTTAA 


GAAAGTCC C A 


GAGAAAATAG 


GGAATCGAAG 


CAGCTTGCGG 


360 




TGAGATTCAT 


CTTTTTCTCC 


AGACTTTTAG 


CTTGAGCTCA 


ACTAAATCAT 


420 


GAT GCT AGG A 


ACGGTAAGGA 


TGCAAGGTAA 


AAATAGGAAA 


CTGACGCAGT 


ATTCGACGAA 


480 


TACAAGGAGT 


TTTATCTTTT 


TCACGCAGCA 


TCCCGTTCCA 


GCTCACATCG 


GCTAACTAAC 


540 


TTTAGCCCGG 


GTTCAAATTA 


GCTAAATCGA 


I 1 f\<j LAI 1 




G CTT ACC AT C 


600 


TCGTAAGTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGAGAA 


CCGGTACGTC 


CTGCCTATCC 


660 


GTTTTTATTA 


AAATCGTGTT 


ATAATAGAAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATGGCAC 


GCGAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTGAGCGT 


ATTCTTTACT 


ACACTGGTAA 


AAT C C AC AAA 


B40 


ATCGGTGAAA 


CTCACGAAGG 


TGCGTCACAA 


ATGG ACTGG A 


TG GAG C AAG A 


GCAAGAACGT 


900 


GGTATCACGA 


TCACATCTGC 


TGCGACGACA 


GCTCAATGGA 


ACAACCACCG 


CGTAAACATC 


960 


AT CG AC AC AC 


CAGGACACGT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCGTGTATTG 


1020 


GATGGTGCGG 


TTACCGTTCT 


TGACTCACAA 


TCAGGTGTTG 


AGCCTCAAAC 


TGAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATGG AC AAA 


1140 


ATCGGTGCTG 


ACTTCCTTTA 


C TCTGT AAG C 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


12C0 


CACCCAATCC 


AATTGCCAAT 


CGGTTCTGAA 


GATGACTTCC 


GTGGTATCAT 


. TGACTTGATC 


1260 


AAGATGAAAG 


CTGAAATCTA 


T AC T AA CG AC 


CTTGGTACGG 


ATATCCTTGA 


AG AAG AC AT C 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTCCT 


1380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAC*? ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 15 60 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TC AAG AT CAT GACTGACCCA 16 80 

TTCCTAGCTC GTTTG AC ATT CTTCCGTGTT TACTCAGGTG TTCTT CAAT C AGGTTCATAC 174 0 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TC A GGTG AT A TCGCTGCTGC CGTTGGTTTG 1860 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAC CTAAAATCAT CCTTGAGTCA 192 0 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 204 0 

ACAAACGTTG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TCAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTT CT ACT CAAGCACGTG GATTCTTCAA AC GTC AGTCT 2220 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2 280 

GGATTCGAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 24 00 

GACGTTAAAG CTAACCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTCCC 24 60 

TTCAAGATTG CCGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCACCTATC 2 520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAAC CTTGG TGATGTTATG 2580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2 640 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACAGT TCTTCGTTCT 2 7 00 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2 7 60 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA T ACCTCT AAA TATGGTAAAA 2 8 80 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 294 0 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG G GAG TTT CAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA AT CTTTTT AG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAACCTT TTAAGTCCCT 3240 

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3 300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3 360 

GTTTCGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 34 20 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 34 80 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGCTATTT GAAGGGGTTG TTTTTGATGT 3 6 60 

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGG ACT AT AC 3720 

TTCAAGTTTT TCTATGCAAA AG TGGGTT AA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3 "7 80 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3840 

CGATTTCACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3900 

GATGCCAGAA GCTCAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GAGGTCGAAG AG ATTGTTG C AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC G AAGTGG AG A TGGACTTCTC AGAAGCAACC TACGTGGTCT TTGACGTGGA 4 200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 4 260 

CAAGGGGAAT GTTATTGCTG AATTTGATCA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4 320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATCCCA AACCACTAGA 4 380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 4 440 

TACCTTTGAC GTTGGCTTTA TGAATCCTAA TTATGAGCGG CATGATCTTC C AAAG ATT AG 4 500 

TCAGCCAGTT ATTGATACCC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4 560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTCGCCTTG GAACATCACC ACATGGCCAA 4 620 

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4 680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATC AGTCC AG ATTCTTACAA 4 74 0 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4 800 

CTTTAAGCTG GTTTC CTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4 860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 4 9 20 
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AG TTTTTG A C GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4980 

TGATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAG AGCAGGT 5040 

CAAGGATATG GAGGAACTCC AG AC C ATT AT CAAGACTTTG ATAGAGGTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGTTA TTGAAAACAC 53 40 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTG AAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 54 60 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

ACTCCCGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATCCTGGTCC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 564 0 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

. CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GAT ATGCCCC ATAAGGACTG 57 60 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 582 0 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTC GGG AG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 594 0 

GGTTGGTACG GTAGCTGCCA ACACTGCCTA TGGATTTGTC AAAGG TTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACCCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 62 40 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTC CT AT 6 3 00 

CGATGACGAA GCCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGCGG TAACACCTGA 6 3 60 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACC 64 20 

T GG AATGGT A GACGAAACCC AT CCG AC AAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 64 80 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTC AG GAT CTGATTAAGC AAGGAATAGC 654 0 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG CTTTACCTCA TGCATGCGCC 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6 660 

GATTTCAGAA GAGGAGAGAA ATGG CT AT AT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTT ATT ACTA 684 0 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG AC AG AGTTCC TCATCGACGG 7080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GG AG AG AACG TTGCCAAGCA 7140 

ACTGGTG CGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7 200 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 72 60 

AGAGGATAAC CAGTTGAGTT T GTTTG ATG A GTTGTTTTAA AAAATTGCTT AATAATCTAT 7 3 20 

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7 3 80 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGACTATAA GCATCCGTCT GGATAGTGAG 74 4 0 

GTTAAGGAGC AGGCCCAACA CCTCTTTAGT AATCTGCGAA TGGATATGAC AACAGCTATT 7 500 

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTCATGT TAG ACT AG AC 7 5 60 

GAAAATCGGA AGTTGCTCCA AG CGTT AACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7 620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7 680 

ACAGTTTAAA AAAGATTTTA AGTTGGCT AT GAAGCGTGGT TTGAAGGCAG AATT ATT AG A 77 4 0 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC GTGATCATTC 7 80 0 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCGTGAATCC CATACCCAGC CAGATTGGCT 7860 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 79 2 0 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAAGAAT ATTTGCGGAA 798 0 

GATAAGCCGG CTAAGAAGGT ATTTG AAT AT CAATTAGAAC TTCCTGATCG TACAATTCTT 804 0 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AG AAAAC AG A ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

G AAAAAT AT G CG AAATTGTT GGTTGCGAAC GG AATT AACG TGCAACCTGG TCACACTTTG 82 20 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 8 2 80 

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG AT G TG ATT AA C CGTG AG AAA 83 40 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 84 00 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 84 60 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 85 2 0 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GG ACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 864 0 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGC AGC CAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 87 60 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8 82 0 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTCCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8 94 0 

AAACCGCTTA GCTACAACGG AAAT AT C ATT GAACGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT CCCTTGCTAC CAGATCCAAG TCCAATTTCT 912 0 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9 300 

ATGGATATCG ATGGT AT TCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCC TAATGGGAAT 9 360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9 4 20 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 94 80 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGCCC AGTTTTATCA 954 0 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9 600 

CGAGGAA 9 607 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i> SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 14231 base pairs 
(31 TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATACTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CAT AG G AAC G TAATCGGTAA CCGATGACAA 120 

AAAT AGT ATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 24 0 
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ACTATGCGTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 3 00 

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT GGTGAATATT TGTAAATCAG 3 60 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 4 20 

GTATATATGT GACTGACTTC AT C AGTTCT A TCTACAACCT CAAAACAGTG TTTCGAGCTG 4 80 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 540 

TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTCCCTCTA AT AATTTTT G 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAG AGGGGATTAA TTTTAAAAAA 660 

TCCAGTCTTG TAAAATTTAG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 7 20 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GCTCATCAAT 7 80 

ACCTCTTGCT GAACAAAATG CTAAATCG AT ATTAAAATGA TCTAATAAAG AATTTTCCTT 84 0 

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTCACCTCG CCAGATGTGA TAAAGATTTT 900 

GGAGCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 960 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACAGTAG TACTAGAATC 1020 

TAGATAGATA C AC AT AC C AG ACCGAATAAA GTCTTT AG C G AGACTAGCGA TTAGTCTTTT 1080 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGAGGA 114 0 

CAGGGTCACG TATCCGTGCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 1200 

ACGACGTAAG GTACTTGTGC TGGAGAAAGT GATTTCTGCC AGCTCTTTTA CCGCAATTCT 12 60 

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 13 20 

CTCCTAATTT ATCATTTCAA CTATATTATA GCACAAATTG GAGGAATTTG AATTATTTTT 13 80 

ATGAATATTC GGTTAACATT TGAACATTAT TCAAGTAACC GTTCACATAT TGAAAAAATA 14 4 0 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGAGCAATC GTTGGTTACC TTTCTGAAAA 1S60 

TAAACTAGTA AT GG AAG AAA TAAATCGCTT TTCTAATCTA CCTATTAGAG TAAAAGGGCA 162 0 

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC GCTTGGCTAA 16 80 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GGAGTTGATT TTGGACTGAT 17 4 0 

TGATAATGAA GGTAAGCTGT TATTACAACC TGTTCATTAT CGTG ATG AAA G AAC AAAGGG 1800 

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAG AGACAGGAAA 1860 

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 19 2 0 

CTCTTTCTAT AAGACCAATA AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 198 0 
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AGCTAAC7TT G CT AC AG AAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTG AATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTT C A GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT T AGGCG AT AT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG AT TGTCTC AG TACCTAAGAC 22 2 0 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTCCTCTTTG GTTGGAGTGG AACTTACTTC 22 80 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2 3 40 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 24 00 

ATTTG AACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 24 60 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 252 0 

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAG CCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2 64 0 

ACTAGAAGAA CTAACTCATA AGGTTTATAA GAGGATATAT GTGATTGGAG GAGGTGCTAG 2700 

AGCCACTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 27 60 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG GACAATTAAA 282 0 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2 880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 29 40 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTGTTTTG AT AT GAG GAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 30 60 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCGTGAAA AA C AATTG AT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTGTAGTGAT 3130 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3 3 00 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3 360 

GGCAGGGAAA GATGTTCGGG T AGCTG AG T A TGCAACATAT GGCACGAAAG AATTGGCTGT 3 420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 34 3 0 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 3 54 0 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3 6 00 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATG 3 6 60 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3 7 20 

CATGGAGATG AAATAGTATT AGCTG AC GCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3 7 80 
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CTAATTCGTT GTGATGGTGT AAATATTCCA GAATTATTAG ATTCCATTCT GTATTTAATG 384 0 

CCATTAGATA GTTACGTCGA TAGTTCAATT CAGTTTATGA ACGTTGTTTC GGGTGATGAT 3 900 

ATTCCTAAGA TATGGGGTAC CT AT AG A C AG ATGATTGAAG GTCATGGTAC AG ATCTT AAA 396 0 

ACGATTACTT ATCTTAGAAG AGAAGACTTT TATGAACGTA GTAAGAAAGC TTATGCTATT 4 020 

GTTGCTACAG G AG AAACT TC ACTTTATGCT AATATTATCC TTAAGAAAGG AGTAGTTGTT 4 030 

GAAAGAGAAA ATGTTCAATA GAGGAATTTT AGTTGCCAGT CATGGTAATT TTGCTAGCGG 4L4 0 

AGCTCTCATG ACCGCAGAAA TGTTTGTTCG TGAGACAACA AATGATAGAG TTAGGACATT 4 2 00 

AGGTTTGATG C CTGG AG AG A ATATTGTAGA GTTTGAGCAT TATTTTAAAA ATCAAGTGGA 4 2 60 

TGAACTGTTA GACTCAAATC AAGAGGTTAT CGTTTT GACT GACTTCATTG GAGGAAGTCC 4 3 20 

TAATAATGTG GCTTTGTCAC GGTTTTTAAA TTTGGATTCA GTTGATATTG TAACAGGGTT 4 3 80 

TAATATCCCT CTCCTAGTGG AATTAATATC AAGTTATGAT TCAAAAATCA AT TT AG AAG A 44 4 0 

AATTGTTCAC AATGCTCAAA ATAGTTTGTT TAATGTTAAA CAACAACTTA AC GT AG AGG A 4 5 00 

GGAAGAAGAT TTATGTCTAT AGAGTTTGTT CGTATTGATG ACCGTCTGGT ACATGGTCAA 4 5 60 

GTTGTCACTA CGTGGCTAAA AAAGTATGAT ATTGAGCAAG TTATCATTGT TAATGATCGC 4 620 

AT CT C AG AAG ATAAAACACG ACAATCTATT TTAAAGATTT CTGCACCGGT AGGTTTAAAA 4 68 0 

ATTGTTTTCT TTAGTGTAAA ACGGTTTGTG GAAGTTTTAA ACTCTCTCCC AATAAAAAAG 4 74 0 

AG AAC AATGC TG AT AT AT AC AAATCCAAAA GATCTGTATG ATTCTATTGA AGGAAATTTA 4 800 

AAATTGGAGT ACCTCAATGT AGGACAGATG ACT AAAACGG AGGAAAATCA AAAGGTAACG 4 860 

GCACGTGTAC CTCTACGTCA AGAAGACAAA TAT T ATTTT A AGAAAATAGT TGATAAGGGA 4 920 

ACGAGAGTTG AAATTCAAAT G GTTCCT AAT GATAAAGTTA CAATGTTGGA AAAATTTTTA 49 80 

TAAAAATAAT TTAAGGAGGT ACAGTATATG CTATTCACAC AAGCATTACT GGTGACATTA 5040 

GTTGGGATTA TTGCCACTAT TGACTATAAT GGACCCTTAT TTATGATTCA CCGTCCGTTA 510O 

GTTACAAGTG CAATGGTTGG CTTAGTATTA GGAGATTTCA CCCAAGGTGT TCTTATTGGT 5160 

TC AG CTCTTG AATTAACTTG GCTCGGTGTA ACAGGTATTG G AGGTT AT AC TCCACCAGAT 52 20 

ACTATTTCAG GTGCGATTAT TGGTACTGCA TTTGGTATTT TATCTGGTCA AGGAGAAACT 5 2 80 

GCTGGTATCG CTATAGCAGT TCCAATTGCA GTTGCTACCC AACAGTTGGA TGT TCTTGC A 5 34 0 

AAAACTTTAG ATGTTTATTT TGTGAAAAAA GCTGATAATG ATGCTAAAAA CG GAG ATT AT 540 0 

TCAAAGATCG GTTTTT AT C A TTATTCAAGT TTGGTTTTAA TCACGTTATT TAAAATTGTA 54 6 0 

CCAATTTTCC TAG CT ATT AT GCTTGGAGGG GAATATGTGG CAGACTTGTT TGCTAAGGTT 5 52 0 
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CCACCAATCG 


TTATGCAGGG 


ACTTAACTCT 


GCAGGTGCTT 


TACTACCTTC 


AATTGGTTTT 


5580 


GGTATGCTTT 


TAAATATGAT 


GCTCAAGAAA 


AATATGTGGG 


TATTCTTGTT 


GATTGGATTC 


5640 


ATTTCTTCTG 


TGTATGGAGG 


AATGTCAACC 


ATTGGGATCT 


CACTAGTTGG 


TATTGCGGTA 


5700 


GCATACTTCT 


ACGATATGAT 


TGGAAGCAAA 


CCACAAGAAA 


CAACTTCAAG 


TAG TG AT GTT 


5760 


GAGGAGGATC 


TTGATCTATG 


ATGAATAATA 


AAGTAACTAA 


AGTTGAACTT 


AAAAAAGTTT 


5820 


TCAAACGAAG 


TTTTATGTAT 


GGTTCTTCAT 


GGAACTATGA 


GAG AATGC AG 


AACCTAGGTT 


5880 


TTCTATATAC 


AATTCTTCCA 


GTATTGAAAA 


AACTATACCC 


AGACAAAGAT 


TCAGCTTCTC 


5940 


CTGCAATGAA 


ACGTCACCTT 


GAGTTTTTCA 


ATACTCATCA 


AACAGCGGCA 


CCATTTATTC 


6000 


TTGGAGTTAC 


TTCCGCTATG 


GAAGAACAAG 


AAGGAAATGA 


AGGTGCAGCT 


TCAATTACTG 


6060 


GTATTAAAGT 


TGGCTTGATG 


GGGCCACTGG 


CTCGTCTACG 


AGATAGTTTG 


TTCTGGCTGA 


6120 


CACTAGTTCC 


TAT CTGTTTT 


AGTATTGGTG 


CGTCTTATTC 


TAAAGACGGC 


GGTGCTTTAG 


6180 


GTATCTTTAT 


CCCCTTAATA 


TTGTTTAATA 


TTATTAATAT 


T C C TGTT AAA 


TATTTCGGTT 


6240 


TGAAATATGG 


GTATACTAAG 


GGTTCTAGTC 


TTATCCAAGA 


AAATAATACA 


AAAGGAACAT 


6300 


TGAATCGCGT 


TACGAGTATG 


GCG AC AG CAT 


TAGGGCTAGT 


ACT AG TGGGT 


GGTTTGATTC 


6360 


CAT C AATGGT 


TGGTATTAAT 


TTTGGATTAG 


AATTTAAGCA 


GGGGGAACTT 


CTTATTTCTG 


6420 


TTCAAGAAAT 


GATTACAAAA 


TTAATTCCAG 


GATTTATCCC 


TATGGCTTTG 


ACTTTATTAA 


6480 


TGTGTAAATT 


AATTAGAAAA 


GGAAAGAATC 


CGGTTGTACT 


AATCTTTAGT 


GTTATGGCTA 


6540 


TTGGAGTTAT 


TCTAGTTGTT 


TTAGGAATTT 


TGAAGTAGTA 


GAAAGTGTGC 


AGCTGGTATT 


6600 


TGGGATATCA 


CCTCCATTTT 


GGAAGAGAGG 


TAAAGAGTGA 


AATTATGGTA 


TAAGAAAGCT 


6660 


GCCGCAAATT 


GGAATGAAGC 


CTTGCCGATT 


GGGAACGGTC 


ATTTAGGTGG 


TATGATTTAT 


6720 


GGTTCAGCTA 


CAAAAGAATG 


TATTCAACTA 


AACGATGAGA 


CTATTTGGTA 


TAGAGGAAAG 


6780 


T CAG AT AG AA 


ATAATCCAGA 


CTCACTATTG 


CATCTTAAAA 


AAATTCGGGA 


ATATCTTTTA 


6840 


G ATGG AG AAA 


TTCACAAAGC 


CGAAGAATTG 


ATAAAGTTAA 


CAGTGTTTGC 


TACCCCAAGA 


6900 


GATCAAAGCC 


ACTATGAATT 


ACTTGGGGAA 


CTTTACATTG 


AGCATATAGA 


T ATT C AGTCT 


-5960 


TGTGCTCTTT 


CATTGTATGA 


AAGAGAGCTA 


GATTTAGATA 


CAGCTATTTC 


TAATGTTGTG 


7020 


TTTGAGCCTA 


ATAGTTGTAA 


TTTACAAATA 


AAAAGAGAAT 


ATTTTACGAG 


TTTTAATAAG 


7080 


AATATTTTAT 


GTTGCCGTAT 


AGTGTCATCA 


GTTCAAAACA 


CATTAAATTT 


AAACATTAAT 


7140 


TTGGGTAGAA 


ATAAACGGTT 


TAATGACGAA 


GTATCTAAAC 


TGGATTCAAG 


TACAATTTTA 


7200 


ATGTCGGCCT 


CTGCTGGAGG 


TAGAAAAGGT 


GTTCAGTTTA 


AAGTAGTATG 


TCATTCTAAG 


7260 


GTTACGGATG 


GTGAAGTAAG 


TGTATTGGGA 


. GAGACAATAG 


TTATTCGGAA 


, TGCTACAGAG 


7320 
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GTATTTCTTT 


ATCTCAAATC 


AATGACGGAT 


TATTGGGGAA 


ATATAGATAT 


TTCTTCTCTT 


7380 


CAGGGAGAAT 


TTAGTAGTAT 


TGATTACTTT 


ACAGAAAAAG 


ATGAACATGT 


AAAAAAATAT 


7440 


CAGGAGCAAT 


TTAATAGAGT 


TGATTTTAAA 


CTAGACTATA 


GTAAAGGTTG 


TCTTAGCATT 


7500 


CCAACGAATC 


TACTTCTTGA 


AAACACTAAA 


AAGTATAGTA 


ACTACTTGAC 


T AACTT GTTA 


7560 


TTTCATTATG 


GAAGATATCT 


GTTAATATCG 


TCTAGTCAAC 


CGAATGGTTT 


ACCTGCCAAT 


7620 


CTTCAAGGAA 


TATGGTGTGA 


TGAATTAAAT 


CCAATTTGGG 


GTTCTAAATA 


TACGATTAAT 


7680 


ATTAATACTC 


AAATGAATTA 


TTGGATGGTA 


GGTCCATGTG 


ATTTACCAGA 


AGTAGAATAT 


7740 


CCATTATTTG 


AT ATGCTC G A 


AAGAATGAGA 


GAACCGGGAA 


GACTAACCGC 


T AAG AAAAT G 


7800 


TATGGAGCTA 


GAGGTTTTAC 


AGCACATCAT 


AATACGGATG 


GTTTTGGCGA 


TACGGCTCCC 


7860 


CAATCTCATG 


CCATGGGGGC 


TGCAATTTGG 


GTATTAACTA 


TTCCATGGTT 


ATGTACTCAT 


7920 


ATTTGGGAAC 


ACTATTTATA 


TTTCCAAGAT 


GAGCGTATTC 


TTACGGAACA 


TTTTGAAATG 


7980 


ATAAAAGAAG 


CATTTCTTTT 


CTTTGAAGAT 


TATTTATTTG 


AGGTGGATGG 


CT ACTTG AT G 


804 0 


ACAGGTCCAA 


GTGTCTCACC 


GGAAAATAAA 


TATCGCTTAA 


AAAATGGTAT 


TGAAGGAAAT 


9100 


GCTTGTCTAT 


CATCTACAAT 


TGATAATCAA 


ATTCTAAGAT 


ATTTTTGTCA 


TTCATGCATT 


8160 


GGCATTGCAA 


AACAATTAGG 


AGACAATTCG 


GATTTTATTA 


GTCGTGTGAA 


GGAGTTAAAA 


8220 


AAGAAACTAC 


CTAAAACAAA 


AATAGGTAGT 


AATGGGCAAA 


TCCAAGAATG 


GTTAGAAGAT 


8280 


TATCAAGAAG 


TAGAGCCTGG 


GCATAGACAC 


ATTTCACCTC 


TATTTGGGCT 


TTATCCTTAT 


8340 


AATGAGATTG 


ATATTCATAA 


AACTCCGGAA 


TTAGCAGAAG 


CAGCTAAAAT 


C ACT AT C AAT 


8400 


AGG AG ATT AT 


CAAACGCTAA 


TTTTT T ATCT 


TC AC AGG AG A 


GGGAGCAAGC 


GATTAATAAT 


84 60 


TGGTTAGTAA 


GTGGTTTGCA 


TGCTAGTACA 


CAAACAGGTT 


GGAGTGCTGC 


ATGGCTGATT 


8520 


CATTTTTTTG 


C GAG ACT AT A 


TCAAGGTGAA 


CCTGCTTATA 


ACCAGATTAA 


TGGTTTGTTA 


8580 


AATAATGCGA 


CTCTTGGCAA 


TTTATTTCTT 


GACCATCCAC 


CATTTCAAAT 


TGATGGTAAT 


8640 


TTAGGTTTGG 


TGAGTGGAAT 


TTGTGAATTA 


TT ACT AC AG A 


GCCATCATAA 


TTGGTT AT C A 


8700 


CTAATTCCAG 


CTTTACCTTC 


TGCTTGGTCA 


GAAGGAGAAG 


TGAAAGGTTT 


CAGAGTAAGA 


8760 


GGAGGATATA 


AGGTATCGTT 


TGCTTGGAAA 


AATGGGGATA 


TAACATTCCT 


AAAATTGGAA 


8820 


GGAGGAAACA 


AAGATCAAAA 


AGTAAGAGTA 


AGAATATATG 


GCAAAAATAC 


TGATGTACAA 


8880 


AATATTGAAT 


TGGTATTTAA 


TTCAGAAAAA 


ATTATTGAGT 


T AAATTTT T A 


GGTATAAGTC 


8940 


ATGAATAAAG 


AAAAAATAAA 


AAGAAAATTA 


ATCACAATAT 


TGTTTGTATG 


TATTGGGATG 


9000 


TTATGTTTTG 


GATTGTTAGC 


AGCAGTTAAC 


GCTGATAATC 


GTGTTCAAAT 


GAGAACGACG 


9060 
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ATTAATAATG 
TTATGGTGGG 
TATGCAGCGA 
GATACGAAAG 
ATTCCAGTTT 
TTAGATGAAC 
TGGATTTACA 
TATGGAGCGC 
GATCCGACAT 
AATACGCCAA 
GGCTTATGTG 
ACAAACACAT 
TCAATGATTG 
GAATGTGCCG 
ATTATTCCTT 
AATAGAACAA 
TATCAAGGAC 
ATTCTTCCTG 
GCAAAAATTT 
CTTTATCCAA 
ATTTATAATA 
AATAATACAA 
AATCCAAATA 
GCATTATCAG 
AACTGGATAA 
CTTACATTAA 
CATTATACTT 
CATAATGGAA 
ACACCAGATA 
AGTTCTGTAG 



AATCCCCATT 
GGAACACATT 
TTGAACTTCA 
AATTGAGAGA 
TCTTGGTTAT 
AATTCCAAAA 
ATAACCAGTT 
ATTTTATCTG 
TCTTTGAAGC 
TAAGAGATGA 
ATAACTGGGG 
TTGAAACTGG 
CTATGGAAAT 
CGTATACATT 
TCTTTAGACA 
AAGCTCTATT 
TTTATTCGAA 
TAATACATGA 
TGACTAAAAA 
AACTTTATGA 
GTAATCCTAA 
AGTCGTTATC 
ATTTACATAT 
GAAATTTTGA 
GCAAAAATTA 
AAGGGCATAC 
AT AC AG AAAA 
TG GT AG AG AT 
AATTTAATGA 
ATTACAATGG 



G TTG CTTTCT 
GAAGGGAGCA 
TCCTGCAAAA 
ATGGTATGTC 
TATGTCGGCT 
GTATAGTGTG 
AGCTCCGCAT 
GCATGATCAT 
GAGTCAAAAA 
TG CGGGT AC A 
CTCATCAACA 
AAGAGCTAGG 
GATGAATGTA 
TATGACAAAT 
TGCTATACAA 
TTGGAATGGA 
TGATGAAACA 
GAAAATTGAT 
TAGTGAGGAA 
AGGAGATGGG 
TATCAATAAA 
GTTAGATTTG 
TTTATTGAAT 
TGCATCAAAA 
TTCCATCAAT 
TGGTCATAAA 
TTGGGATGAG 
GTCTATAAAT 
TGGTAATTTG 
AGACCCTAAT 
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CCGTTGTATG 

TGGGAAGCTA 

GTCTGTAAAC 

AAGATGTTGG 

GGAGAGCGTA 

TTAAAAGGTG 

AGTGCTAAAT 

GAAAAATGGT 

TATCATAAAA 

GATTCTATCG 

GATACATGGA 

GATATGAGAT 

TATACTGGGG 

GATGTACCAA 

AATCCACCTC 

GAAGGTAGGA 

ATGC CTTT AT 

AAGGAAAAGA 

TTGTCTAGTA 

TATCCTCAGC 

AATCAGCAAG 

ACGCCACATA 

AATTACAGGA 

AGTTGGAAGA 

CCTGTAGATA 

CCTCAGATAA 

AATACCCATG 

ACTGAGGGGA 

AATATAGCAT 

AGAGCTGTGG 



GCAATGATAA 
TTCCTGAAGA 
CAACAAGTTG 
AGGAAGCTCA 
AT7SC AG TTCC 
TTTTAAATAT 
ATTTGGAAGT 
TCTGGGAAAC 
ATTTGGTGTT 
TTAGTGGATT 
AATGGTGGGA 
CCTATGCATC 
GAGGCACAGT 
CTCCAGCATT 
CAAGTAAGGA 
TTAGTTCATT 
ATAATAATGG 
TTTCATCTAT 
AAGTCAACTA 
GTGTAGGTAA 
TAATGTTGCC 
CTTACGCTGT 
C AG AT AAG A C 
AAGAAGAATT 
ATGACTTTAG 
ATATAAGTGG 
TTT AT AC C AT 
CAGGTCCAGT 
ATGCAAAACC 
ATGGTAACAG 



TGGTAACGGA 
TGTAAAGCCA 
TATTCCACGA 
AAGTCTAAAC 
TCCAGAGTGG 
TGACAATTAT 
TTGTGCCAAA 
TATTATGAAT 
GGCAACTAAA 
TTGGTTGAGT 
AAAACATTAT 
GGAACCAGAA 
TTATAATTTC 
T ACT AAAGGT 
AG AAGTTGT A 
AAACGGATTT 
GAGATATCAT 
ATTCCCTAAT 
TTTAAACTCG 
TTCCTGGTAT 
TATGTATACT 
TGTTAAAGAA 
AGCT AT GTGG 
AGAGTTAGCG 
GACAACAACA 
CGATAAAAAT 
TACGGTTAAT 
CTCTTTCCCA 
AACAACACAA 
AAATGGTAAT 
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9900 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTT AT A ATCGCACAGA TGCTGAGACT 10980 

C AACGTCT AT CTAATTTTGA TGTGATTCTA TATGACAATA AT AG AAAC G A ACTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAG AT AAAAT AACAGAAGAT 112 20 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCT GGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATCTTATT 114 60 

TTTCTATCTT C ATCAGGAG A AG AAGTT TTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TC AAAAT AG A ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAACTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 117 00 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTC ATC A TT CAGTG ACT 117 60 

CATACCAAAG AAGATTCTCC TTCATGGTGG CAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AG AG ATT ATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 119 40 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGCTTAG AATTTCTTTC 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAACTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 1212 0 

AAAAT AAGAG G AAAAT AGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 123 00 

CATTTCTCCA TCTACTATTG GCCGTGTACC AG AGGCTG C A GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATCC TATTTGGGGA TTTAATGGGA CAGAACCCCC 12 480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT C AAAAAGGG A TTCCAGCCTT 12 54 0 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12 600 
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AGAAAAACTT 


TTACGCTATG 


CGCGTGCAGC 


TCTTGCAACT 


GGCTTGATGA 


G AG ACACTGC 


12660 


TTACCTATCA 


ATGGGTAGTG 


TTTCGATGGG 


GATTGGTGGT 


TCTATTGTAA 


ATCCGGATTT 


12720 


CTTCCAAGAA 


TACTTAGGAA 


TGCGAAATGA 


ATCGGT AG AT 


ATGACGGAGT 


TCACGCGCCG 


12780 


TATGGACCGT 


GGTATTTACG 


ACCCTGAAGA 


GTTCGAACGT 


GCGCTCAAAT 


GGGTGAAAGA 


12840 


AAACGTAAAA 


GAAGGATTCG 


ACCATAACCG 


TGAAGACCTT 


GTTTTAAGCC 


GTGAAGAAAA 


12900 


AGATAGACAA 


TGGGAATTTG 


TTATTAAGAT 


GTTCATGATT 


GGACGTGACT 


TAATGGTTGG 


12960 


TAACCCAAGA 


CTTGCTGAAC 


TTGGTTTTGA 


GGAAGAAGCG 


GTTGGTCACC 


ATGCTTTAGT 


13020 


AG CTGGTTT C 


CAAGGTCAAC 


GTCAGTGGAC 


AGACCATTTT 


CCAAATGGGG 


ACTTTATGGA 


13080 


AACTTTC CT C 


AATACTCAGT 


TTGACTGGAA 


TGGTATTCGA 


AAACCATTTG 


TATTTGCGAC 


13140 


AGAGAATGAT 


TCACTAAATG 


GTGTGTCTAT 


GCTCTTT AAT 


T AT CT ATT AA 


CAAATACTCC 


13200 


ACAAATCTTT 


GCTGATGTGC 


GTACTTATTG 


GAGCCCAGAG 


GCTGTTAAAC 


GTGTAACGGC 


13260 


ACATACTTTA 


GAGGGTCGTG 


CTGCAGCTGG 


CTTCTTACAT 


CTAATCAACT 


CTGGTTCTTG 


13320 


TACATTGCAT 


GGTACAGGTC 


AAGCTACTCG 


AGATGGCAAA 


CCTATTATGA 


AACCATTCTG 


13380 


GGAGTTCGAA 


GAAAGTGAAG 


TGCAGGCTAT 


GCTTGAAAAT 


ACACACTTCC 


CACCAGCAAA 


13440 


CCGCGAAT AC 


TTCCGTGGAG 


GAGGATTCTC 


AACTCGTTTC 


TTGACGAAGG 


GGGATATGCC 


13500 


AGTAACAATG 


GTACCTCTCA 


ATCTTCTAAA 


AGGGGTTGGT 


CCAGTGCTAC 


AAATTGCAGA 


13560 


AGGTTACACA 


CTTGAACTTC 


CTGAAGATGT 


T C ACC AT ACT 


TTAGATAATC 


GTACAGATCC 


13620 


AGGATGGCCA 


ACTACTTGGT 


TTGCTCCACG 


TTTGACAGGA 


AAAGGTGCTT 


TCAACTCTGT 


13680 


CTATGACGTC 


ATGAATAATT 


GGGGAGCTAA 


TCACGGAGCC 


ATAACATATG 


GACACATTGG 


13740 


AG C AGACTTG 


ATTACCTTGG 


CTTCTATGTT 


GAGAATTCCT 


GTCAATATGC 


ATAATGTACC 


13800 


TGAGGAAGAT 


ATCTTTAGAC 


CTAAAAATTG 


GTCCTTATTT 


GG AACAGAAG 


ATCTAGAATC 


13860 


AGCAGACTAT 


CGTGCATCTC 


AGTTGTTGGG 


GCCACTACAT 


AAATAAAACT 


TGTTTATATA 


13920 


GG AG GTGAAC 


TTACGTCCCT 


CCTATCCTTT 


TAAAAAGATT 


TGTTAAACAA 


TTC AC AAAT A 


13980 


ATTGAAAACG 


AATACAAAAA 


GTAATATAAT 


GATGTTAAAT 


AGATAGCGCG 


GAGGCGCAGG 


, ;o40 


AGGAAAATTA 


TATGGCTATA 


TTTTATGTTC 


CGGCAGTCAA 


CCTTATTCGA 


AAAGGTGTTG 


14100 


TAAATGAAGT 


GGGTCCTTAT 


ATCAAGGAAC 


TTGGCTATAA 


AAACGCACTT 


TTGGTGACAG 


14160 


ATAAGTACAT 


CGAAGGCAGT 


GATATTTTAC 


CTAAGACTTT 


AAAACCACTG 


GATACAGAAG 


14220 


GAATCGAATA 


T 










14231 


(2) INFORMATION FOR SEQ ID NO: 82 











U) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: 16995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTTCTCTTA 


ACTTTTTTAG 


GATGGCATTC 


TCCGCTCTCA 


GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTGG 


CTTACGTCCC 


ATTTTAGGTA 


120 


CTCTCCCTCT 


TGTTTTCTCA 


ACAATAGTAT 


ACCCCTTTTT 


CCTGTATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AACTCTATCT 


TTACTCCAGC 


240 


CTTCATGTCA 


GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTCCTTTTTT 


GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TACCTAATAT 


360 


TAGAATTGCT 


TATCCCAAAT 


TTATTTGAAA 


GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


TTTGGGGTGT 


AGTTCATGTA 


CACCTGATAT 


GATGCGTTTT 


540 


ATAATTTTTA 


AGCCTTTTTG 


CCCAGCCTCG 


TCAAAAGTAA 


TCTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTAGTTTTAA 


GAAAAGGAGG 


660 


ATGATCTAAT 


GGAAGAAAAA 


GTATCATTGA 


AAGTCAGGGT 


TCAAAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


GGTTATGCCC 


AAT ATTGG AG 


CATTTATTGC 


TTGGGGAGTA 


TTGACTGCCC 


780 


TrTTTATrr.r 

1 v_ I k 1 rV 1 v— \jv- 


TGATGGCTAT 


CTGCCAAATG 


AACAGTTAGC 


TACTGTTGTT 


GGTCCTATCT 


840 


TAACGTATTT 


ATTGCCAATC 


CTGATTCGTT 


ACACAGGTGG 


ATATATGATC 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGAGCT 


ATTGCT ACT G 


TTGCTGCAAT 


CACAGGTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


GACTATCAAG 


AAATTTGATG 


1020 


AG AAGTT CCA 


GGAAAAAATT 


CGTCCCGGAT 


TTGAAATCTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG 


TTTTGCATTA 


TTCCTTTTCG 


CTTTCTACGC 


AATCGGTCCA 


GTCGTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTC AAT CATC 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AGCTTATGCT 


GTATTCGGTA 


1380 


AAGGTTCTGC 


TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA 


CTTTCCTTAT 


GTTATCATGA 


AGCCTACTCT 


ATTTTT AG CT 


GCTATGGCAG 


1500 
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GAGGT ATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 15 60 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 174 0 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGT AT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG C AGGTCT AG A GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT AC CAACTTCA GCACCAGTAG ATAGTCAGGA AAGTG AC CTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 222 0 

TTCGGGCT AT TTTTAGAAAC AAGAATATTC GTATTCCACT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 24 00 

CACAATATGA CAAAATCGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 24 60 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2 520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT ACAACAATTT ATCGAACTTT 2580 

AT C AG ATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2 64 0 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TACTTGAGTA 27 00 

TAGTCCCCAA GAAAGACAAG ACTTGATTAC CTATCGCCTT CTGACTGAGA GTGG TTTTG T 27 60 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2 820 

TTCAGATATT GAT AAGCGTC TTTTAGACTT TCATCTGAAA ATTGAACGAC AAAAACGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2 94 0 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGG AAA TT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3 300 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACCGTT TCCATGTATA CCAAG ATT G A 3 3 60 

CTTTTTTAAG G A C AAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTAGG 3 420 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 34 80 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540 

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3600 

TCCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3 720 

TCTAGTAGAT TACCACAGTA TTGATCTCAG TCAGTATGAT TATATTTTAT CTACCAAGCC 3 7 80 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTCCTTGA 3 84 0 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AG C TTT T G AG 3 9 60 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 40 20 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA G TTT T ACT AT GTTTGAACTC AAACTACCTA TCTCCGCATT 42 00 

GTC AATG AAA CGAGAGGAAG AAGAGGTCAA AACGTCTCTC CTAATGCTAA TGTCT AAAG A 4 2 60 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 43 20 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCT AAAT AC 4 3 80 

TATTTTTAAC GAAAAAATTA AG AAATTGG A GAACTAATAT GAAACTTGAA AAACATTTGA 44 4 0 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4 500 

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CG AGATAAAC 4 560 

AGCTATCTGT TTACATGGGT AACTTT AT CG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 620 

ATGATGTCCT CAACTCTCCT ATT AC ACT CG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4 6 80 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4 74 0 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4 800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATCCTCTT GAATAATTGA 4 8 60 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 49 2 0 
AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTT AT AGG TGAAATTCTA . 4 980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATGCTCTG 504 0 
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AATGAAAAGG 


GCAAGTATGA 


AATTGAAATT 


GCACAGAAAG 


GACAGTCTCG 


T AT AG AAGT A 


5100 


ACTAATGTGG 


CTGGCATTAA 


TAGCAAAGAA 


CATCCTGAGC 


AAGTCATTGA 


AGCGATTCAA 


5160 


AAGACGGATA 


TTATTACTAC 


TGCAATCGGA 


CCTAATATAC 


TCCCTTTTAT 


CGCCGAACTT 


5220 


CTAGCCAAAG 


GAATCGAAGC 


TCGCCGAGTT 


GCAGGAAATA 


CACAGGCATT 


GGATGTTATG 


5280 


GCCTGTGAAA 


ATATGATTGG 


CGGGTCTCAA 


TTTCTTTATC 


AAGAAGTCAA 


GAAATATTTA 


5340 


AGTCCGGAAG 


GTTTGACATT 


TGCTGATAAC 


TACATAGGTT 


TTCCAAATGC 


TGCAGTAGAC 


5400 


AGGATTGTTC 


CAGCACAAAG 


TCACGAAGAT 


TCCCTTTTTG 


TTGTGGTCGA 


GCCCTTTAAT 


5460 


GAATGGGTCG 


TGGAAACCAA 


GCGTCTTAAA 


AATCCAGATT 


TACGTCTAAA 


AGATGTGCAT 


5520 


TATGAAGAAG 


ATTTAGAACC 


CTTTATTGAG 


CGAAAACTTT 


TTTCAGTCAA 


TTCTGGACAT 


5580 


GCAACTTCAG 


CTTACATTGG 


TGCGCATTAT 


GGTGCCAAGA 


CAATTTTGGA 


AGCTCTTCAA 


5640 


AATCCTAATA 


TTAAATCTCG 


GATTGAATCT 


GT ATT AG CT G 


AAATTCGGAG 


TCTCTTGATT 


5700 


GCCAAATGGA 


ACTTTGATAA 


AAAAGAATTG 


GAGAATTATC 


ACAAAGTCAT 


TATACAACCA 


5760 


CTTGAAAACC 


CTTTCATAGT 


GGACGAGGTT 


AGTCGCGTAC 


CTCGTACTCC 


AATCCGAAAA 


5820 


TTAGGCTATA 


ATGAACGATT 


CATCCGGCCG 


ATACGTGAAT 


TGAAAGAACT 


CAGTTTGTCA 


5880 


TATAAAAACC 


TACTTAAAAC 


AGTTGGCTAT 


GTCTTTGACT 


ATCGCGATGT 


AAATGATGAA 


5940 


GAAAGTATTC 


GATTAGGTGA 


ATTGTTGGCT 


AAACAATCAG 


TCAAAGATGT 


TGTTATACAA 


6000 


GTTACAGGTT 


TAGACGACCA 


AGAATTGATT 


GAGCAAATTG 


TAGAGTATAT 


TTAATCTTTT 


6060 


TCGAAAATCT 


CTTCAAATCA 


GGTTAGCATC 


GCTTTGTCTT 


AGGCATATCT 


TGTTCTATCT 


6120 


ACAACCTCAA 


AGCACTGCTT 


TGAGCTGACT 


CCGTCAGTCT 


TATCTGCAAT 


CTCAAAACAC 


6180 


TGTTTGAGTT 


ATCTGCGGTA 


ATCTTTCTAG 


CTTGTCTTTG 


ATTTTTGTTG 


TT ATTT AT AA 


6240 


GGTAAAAGAA 


GCTGGACAAA 


AAGTCTTCAA 


AATCGGGAAA 


AGGC AGCCT A 


TCGGGTGTTC 


6300 


AAAAATCTTG 


ATAGGATGTC 


CTTTATTATG 


GAAAGCCTTA 


TTGGATTTTC 


TCCTC AG ATT 


6360 


GAGTTTTTGA 


TCAGCTTTAT 


GAGATAGGTC 


TTGCTAGAGA 


TGTAGCCCAT 


CATGTTATTT 


6420 


TTATGGACAG 


TGGGAAAATT 


GTTGAAAAAA 


ATAATGCCCA 


TCAATTCTTT 


AGTCGTCCAA 


6480 


GAGAAGAACG 


AACCAAGCAA 


TTTTGGAACG 


AATTCTTTCG 


AATGCGATCT 


ATATAGTAAA 


6540 


ATGAAACAAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


AAATGTTTTA 


6600 


GAAGTAGAGG 


TGTACTATTC 


TAGTTTCAAT 


CTACTATATA 


ACTGAAAAAT 


TAGATAAATT 


6660 


AGTTTTGGAA 


AATGACTAAC 


CAAAAGATAT 


CCAAAGTAGT 


CTAAAATTGT 


■ CTATACTTTA 


6720 


TGAGTGTTTT 


AGTTAGGAAA 


AAGGCTTGTT 


GTCTATAATT 


GTCTGCATTA 


GTCTAGATTT 


6780 


TATTTATAGA 


, AAATGTTATA 


ATAGACTGTA 


. TTTAAAAAAT 


TTTAAGGAGA 


k AATGACAGAA 


6840 
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TCTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 69 60 

TTC C AGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7020 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7080 

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 72 00 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 72 60 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7 320 

AAAAC GGCG A CACTGTTGTG AT CG ACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7 380 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TCAATTCATC CCTGGTTTCG 7 440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA AC ATTCC C AG 7 500 

AAGACTACCA AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTGACA ACT ATC CACG 7560 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC ATTGATGAAG 7620 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7 680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 774 0 

AAATC GT AG A ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7 800 

TCCTTCGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7 860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 7920 

ACCTTGTTAT CGAAGCACTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7 980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGCACAA GTTCAAAACT 804 0 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTG AT C A 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CT ATTTTCC A AAAATCTCTT TGAGCTCTGT GTCTGTAATC CCAATCATGG 8 22 0 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280 

TGACTGTTTC AG AGACAG CT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8 34 0 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 84 00 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGT G TAGATGAGAT 84 60 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 85 20 

TTCCCTGAGA CTTATGAATA. CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
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GGGGGTAAGA 
TGTATTTACC 
CAGCATCGTT 
CAAAACTGAG 
CATCAATCCC 
TACCATTTCT 
CAATTTCAAA 
GTCGAATCTG 
CCAAGCGAGT 
TGACAGAACG 
TGGAGAAGAG 
AGTCAGCATC 
GGCGATGTAT 
CAGTTGGAGC 
AAAGGGCATA 
CTGTCAGGAT 
CATACTCAAT 
CATGACTCTT 
CAGCGAAAAA 
CGATCAGGTA 
ACTCAAGGAG 
TTTCCATACA 
GACTTTCGAT 
TATCCTCAAC 
AGTCTTGAAT 
TCATCTCCGT 
AGAGTCCTGC 
CGCCATAGGT 
TTGAAAAGTA 
GCAGTTGTTC 



GACTTCATTA 
AGGAATCAGG 
AACCAAATGA 
TTGATCTTTT 
TGCCGTCCCT 
GAGGGCGGCA 
GTAGGAACGG 
ACTAGCTAGG 
CTGAGGAATC 
TAGCTGATCA 
TTGATTGGCC 
TAGGTAATCT 
GGTCGCGCTA 
AGCAAGAAGA 
AACAGCAATG 
AAAGACCTTA 
TCCCAGTTCT 
CTCTTTTCCT 
GAGGCTGTTG 
GGAGAGCTCT 
AGTAAGGGTT 
GGCCTGAAAA 
GCCTAGTTCC 
CAGCTGGTAG 
CTGAAAGGCT 
TCCGTAGTTG 
GATGCCTTCT 
ATCCACGATT 
CTTGACCAAG 
TCCATACTTG 



C C ATCAAAAT 
TCTGTG AT AG 
ATGACCCTGT 
TGTGGGGGAT 
CGGTACATAG 
CCTAAGATTT 
TCAGCTTTTT 
GTGACGATGG 
AAAGGAATAT 
CTGTCACCTA 
AGCCAAGTAT 
TCCAGATGAC 
GGCAAACCTG 
ATGCGCAGAT 
ATTCC ATT G A 
TTCTGGATAC 
TGCTCGACAG 
TTTTCAAGGA 
TCAAAGATCT 
TGGGCAACTT 
TGTTCCAGCA 
AGACTGTGAA 
TCAGCTAGTT 
GGATAATTTT 
AGTTTGTTGG 
AG ACGG AG AG 
AACTTTTCTG 
TTCTGAGCTG 
CC CTT ACT AG 
GAGTGCTGGA 



656 
CAATGACAAT 



CTCCTAAATC 
CTCTCTTACG 
TGAGCAGGTC 
GAGCCAGAAC 
TTTCAATGGT 
TTTGGGTGAA 
TTGATTCTTT 
GAAGTAGATC 
CGATGAGGAT 
CTACCATAGA 
TGGTATCATC 
TCAATTCATT 
TGCTTTTCTT 
TAACAGTTGT 
CATCACAGAT 
TAGTGATATG 
TACGAACCAA 
TGGTATCAAT 
GGCTGGGGTC 
AATCCCGTGC 
CTAGACCGGC 
GGTCAGCAAT 
CAACCACATC 
GAATGCCGTA 
TGGAGACGAA 
GGTGTTGCAA 
TCTTGAGACC 
TTGGTTTTGC 
CAATTTGCCC 



CTCGTCTTGT 
CCCATTAAAG 
ATAGTGACAC 
TTGCATGAGC 
TTGGATATCA 
GGCAGGAATA 
ATCAGCTGGC 
GCTTTGTCGA 
CGCTAGAACC 
CTTACTGTTA 
GAATTCATCC 
GTCACCTCTC 
CATGCGACGA 
CCTGAAGT C A 
CTTACCAGTA 
AGCCTGTTTT 
TTTTTGAATG 
GTGACTGCGG 
CTGCTGAACC 
TAGTTCCACG 
TTCAACATAG 
GCGGAAGCGT 
GGTAAAGC CC 
AAGGGTTTCT 
GTTGGCTAGT 
AGCCTCGCGA 
AATTTCGTCA 
AATCCCCTTG 
GCGATCATAA 
CCAAAAAGTA 



TTCGATTCGG 
ACATTGATTT 
TGAGGAGCTT 
TGATTGATAG 
CGGGCGGGAA 
TGGCCACTAG 
AAGATGCCCT 
TAAATTTTTT 
TGTCCAGGAC 
GAAGAGATAT 
ACGATGATAA 
ATTCCCAAGT 
GCAGCTCGAC 
AGTCCTTC T A 
CCAGGCCCAC 
TGAATGTTAT 
GTTTCTAAAT 
ATGCCTTCCT 
TTGTCTTCTT 
GGACGGGAAG 
GTGTCCCCTG 
TCAGGAGCCT 
AAACCCTTGA 
TC CTTGT AAA 
TTGGCCAAAA 
TTTTTGGCAG 
ATGGTATTTT 
AAATGGCTAC 
CGACTGATTT 
TAGTCTTCGC 



3640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 
10080 
10140 
10200 
10260 
10320 
10380 
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CCTCAATTAC 
CGTCCGTATC 
TAATCCGTTC 
GAG AGTTGGG 
CCATCTGCAA 
AAAC ACTGTT 
T ATTTGT AAA 
TGCCTCTTAG 
TTCCCCTGTG 
GCGGTCATCT 
GTAGTTGACA 
TTTATTTCCT 
TTTGATATAG 
ATTTTCGTAA 
CCCATCTTCC 
GAAGAGAATT 
GCTCCAAAAA 
CTCTTTTAAG 
TTCAGTGTTT 
CAAAATCTGG 
CAGTTCTCGT 
AACTGCGACA 
ACGATTGGCC 
CACAATTTTC 
GGCAACCTTA 
TGAGAGAAGG 
CTTTTGGT CG 
GTCAGGTGTG 
CTCATCTGTC 



ATCAGCCATG 
GTCGATTTCT 
AATAGTTCCT 
ATTGTTTTTA 
CCTCAAAACA 
TTAAGCAGCC 
TAAACAATCA 
GTTTCTTAAA 
ATATCTTTTG 
CCGAGGAGAA 
TCAACTGTGA 
TCAAAGCCCT 
TCTGCTAGAT 
CGAATGGTGT 
TCATGGGCCA 
TCGCCATCCG 
AAGATACGAC 
AAATTTTTAA 
TTAAAGTCCA 
CTAGCCACCT 
CCCAAATTTT 
GACAAGTATT 
TCTTGTGCCA 
TCAGGCTGAA 
ACCGAAACAG 
AGTGCCTGGT 
GTCAGAGTCT 
ACAAAGGCAG 
CCAATTAAAG 



GTTCCTGTGA 
AGGAGGAGGA 
GAAAAATAAA 
TTTTATACTC 
GTATTTTGAG 
TACGGCTAGC 
CTTCTCACGA 
ATGTTCCGAT 
CTTTGAAGGT 
GGTATTCTCC 
AGCCTTGAGC 
TGCCTGAGTA 
AAGGCTCGTC 
CGCCAGGCAT 
CCACGATATC 
CTAGGGTCGG 
TTAAAGCTAG 
ATGAATTCAT 
ATTTGGCGCA 
TGTCAGAAGC 
CAAGATTTTC 
TGCCCTCAGC 
AGTACTTGTC 
CACCTTTTTG 
CGTTGTAGCG 
GCTGAATTTT 
TAG AATC C C C 
CCACAACTGC 
GAAGATTTTG 



657 
CAATGATTTC 
TGCGATAAAA 
CTTCCATAAA 
TTCGAAAATA 
CTGACTTCGT 
TTCCTAGTTT 
TAGAAGAAGA 
ACGGGTGATT 
ACCTACGTGG 
TTCTGGAACA 
TTTTTGAGCG 
AGTGCTTTGG 
CGTTTCTTTG 
TCCAATCACG 
AAAACGGTCA 
ATCCATGGAA 
TAATGACAGA 
AACTTACCTT 
GAAGCTGAGT 
CCTTCCAGCT 
CAGAAAGAGA 
CTTTTCTTCT 
ATAATTTTTA 
AAGGAGGAGA 
GTCTCCGATG 
TTCCTTGAGA 
CACACCGAGT 
AAGCCCACCA 
TCCGCTGGTT 



AAAATCATCA 
ATTGCTGGGA 
ATTCCTTTGC 
TCTTCAAACC 
CAGTTCTATC 
GTTCTTTGAT 
GGCTGAGATT 
GGCCATAACC 
CGGCTGTCGC 
GTAAAGCTAA 
ATACTTCTAA 
AGTTTGT CAT 
TCATTGATGT 
CGCTTGACGA 
ATACGAAGCT 
TGTCCTTCTA 
ATTAGGAGGA 
TCTAAGCGTT 
CCCTGCATAC 
CCACTTGGCA 
TCACGCGCAA 
AAGCTGATAG 
GCACTGGTAA 
TAGATAGCCT 
ACCTCGTTGT 
ATAGGAGTAA 
TTT CGTAAAA 
AAGTAGGAAC 
TGCTCTACAG 



AAATCCTCTG 
TTTTCAAAAA 
ATGAATAGGT 
ACGTCAGCTT 
CACAACCTCA 
TTTCATTGAG 
GGTGATTCTC 
GGAATTTAGC 
TCGAAACCAA 
AGTTGGTGTT 
AGAAAGTTCC 
CCTTGAAGCG 
AGAGTTTATC 
TGTCCTTATT 
GTTTTACAAC 
CGCGAACATT 
ACAATCCCCA 
TTTT G G CTTT 
CATAGGCTTG 
GCTGATAACC 
TGACAGAAGA 
CATTCCTGAA 
AGGCATCAAT 
GATTATGGAG 
ACTTGCTGGG 
TCTGACGGAT 
AGTCGTGCTG 
CATTTCCCAC 
CTTGATAGCC 



10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11*700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
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AAAGAAACTG 


GCGTATTTTT 


CAGCCCCTTC 


ACCCTGAAGC 


AAG ATTTTT C 


CAGAAGTATA 


12180 


GATAGAAACC 


GTTGCTTGAG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTGCTAGG 


12240 


AGCCAGACTG 


CTTTGATAGT 


GTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCGC 


TTGGTGTGAG 


12300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTAGCG 


AAT TTTGGT A 


TAATATCGTG 


AGGTG AATT T 


TATGGCAAAT 


CTAAATCGAT 


12420 


T C AAATTT AC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCG AC AGAAA 


AATACCAAGC 


AATT AAAG AA 


CAAATGCCTA 


12540 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTTATCA 


ACTCAGCTCA 


12600 


GCCGTGAGAT 


TGAATTTGAC 


GATAAGGAGC 


AAG AG C T AG A 


AGAACTCCGT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTT TGGGG AT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGC 


12840 


AGAGCAATTC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAAGCTCAGG 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCAGGTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


ATAGCATTGG 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTGAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GGTGACCTTA 


TTTGTCTTGC 


AAATGGCCTT 


GACAATCTTG 


GCGACCATCC 


CCATGGCAGT 


13140 


TAT AC AAAAT 


CCTCTTGAAA 


AGAGTATCGT 


CGCAAAAC AC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCAGT 


TGGCTCAAAC 


AAAT CTGGGT 


GAC AAATTT A 


ATCGGATAAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATGTA 


AAAAGCT AC C 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


GAAAATATTA 


GAAACATTAC 


13380 


AGTTCGATAA 


GGTCAAGGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAC 


GGCTTGGAGC 


13440 


AATT GAG AC A 


ACTGGCTCCG 


ACT G C C AAAG 


CAGAT AAAAT 


CAAACAGGCT 


TTTGCTGAGA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCGCATTT 


T ACTATTCT C 


TCAACTAAGG 


i3560 


AAATTGCAGG 


AGT CTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGAGTTCC 


13620 


TACTCTTGAA 


ACGCGTG CTT 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATG TC AG 


CTTGGAAGAA 


TTAGCCCTTT 


GGTTTGAGAA 


ATT ACATGAT 


TTTCCGCAAT 


13740 


TACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CCGCTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAAT ACATG 


ATAGCGAGAG 


TCAGGTACGC 


GATGTTTT AC 


13860 


AAGACTTGCT 


CAAGCAAAAA 


GCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGCAGAAATG 


13920 
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GCCGTCAGGT TTTACCACTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13 980 

ATATTTCTGC TAGTGGAAAC AC CGTC TAT A TCGAACCCCG TGAGGTAGTC AAACTGAGCG 14040 

AACAAATTGC TAGTCTGCGA GCAGATGAGC GCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTCCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 14220 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14 340 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 144 00 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCGTGTTGG TATTTTTGAA GAAATCTTTG 14 4 60 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AG T C AT ATG A 14 520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14 580 

TCGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14 640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACG 147 00 

GTATTGAGAC AG C CTTTGTG CAAAATGCCA GTATCGAGTT TG AT ACTGC A ACTCTTCGCC 14 7 60 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TCCCTTTGAA ATTGCCAAAC 14 820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14 880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGC CGC AAACGTTTGG 14 940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTGCTG 15060 

ACATTGTGCA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA CTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAAGAAAAAA CGAGCTCCAA 15240 

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCACCG TGGTACCTTG ACCAGTCAAC 15 300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15 3 60 

AGTTTGATCT TGTTCAAGCC C AG C AAG AAA AACCAGTCAA GAAGAAACAC GTCAATGTTG 154 2 0 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 154 80 

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15 540 

AACTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15 600 

TGCAAAGAAA CAAACATGTC AAG AGTTTC G GCTATGCCCC ACAAAATGCT GGAGGCAGTG 15660 
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GTG CG ACT AT 


TGTCACTTTT 


AAAGGATAGC 


AGTATTCTGG 


ACTTTATAAA 


GTAAAAACTG 


15720 


TTGAACTAAT 


TTTTACTAAT 


AAACACATTG 


ACAAAAGCCA 


ACATTTTTTG 


TAAAATTAGA 


L5780 


ATCAATTAAA 


TACCAACACC 


GAATGAAGTT 


T AAT AGAAGT 


GGGGAATCGT 


TTGATTTTCC 


15840 


ATGACTGTAA 


ATGGACGGAA 


CTCTGG AG AG 


AC CGT AAAGG 


CACCGAAGGG 


CAAGGCAGGC 


15900 


AACTGCTCAA 


ACTCTCAGGT 


AAAAGGACAG 


AGCTAGGATA 


GACCGCTTTT 


TAGCATTTAT 


15960 


CTAAGCATTC 


CAGAGTACAT 


GTATCTTGCA 


TGTGCTCTTT 


CTTTTGGGGT 


TGAAACGATA 


16020 


GGAGAAGGAA 


ATGTTAGAAT 


TGCTTAAATC 


AATCGATGCT 


TTTGCTTGGG 


GACCGCCCCT 


16080 


CTTGATTTTA 


TTGGTCGGAA 


CAGGGATTTA 


CCTAACTATT 


CGGCTAGGAC 


TCTTGCAGGT 


16140 


TTTGCGTCTA 


CCCAAGGCCT 


TTCAGCTTAT 


TTTTATCCAG 


GATAAGGGAC 


ATGGTGATGT 


16200 


ATCCAGTTTT 


GCAGCTCTGT 


GTACAGCCTT 


GGCATCAACT 


GTTGG AAC AG 


GAAATATCAT 


16260 


AGGAGTTGCG 


ACGGCTATCA 


AGGTTGGTGG 


ACCAGGAGCT 


CTATTTTGGA 


TGTGGATGGC 


16320 


GGCTTTCTTT 


GGAATGGCTA 


CCAAGTATGC 


GG AAGGACTC 


TTGGCC ATC A 


AATACCGCAC 


16380 


CAAGGACGAC 


CATGGTGCAG 


TAGCGGGAGG 


TCCCATGCAT 


TATATCCTTC 


TAGGGATGGG 


16440 


AGAAAAGTGG 


CGACCACTTG 


CTGTTTTGTT 


TGCAGTAGCA 


GGAGTATTGG 


TTGCTCTCTT 


16500 


GGGAATCGGA 


ACCTTCACCC 


AAGTCAACTC 


GATTGCAGAA 


TCTATCCAAA 


ATACAACGAC 


16560 


GATTTCGCCA 


GCCATCACAG 


CTCTCGTCTT 


GTCTGTCTTT 


GTAGCGATTG 


CAGTCTTTGG 


16620 


TGGACTCAAG 


TCTATTTCTA 


AGGTTTCAAC 


TACTGTTGTT 


CCTTTTATCG 


CCATCATTTA 


16680 


TATCTTAGGA 


ACTCTTACAG 


TTATTTTCTT 


TAATATCGGA 


AAAATCCCTG 


GCACAATCGC 


16740 


TTTAGTCTTT 


ACCTCAGCTT 


TTAGTCCCCT 


TGCTGCGGTA 


GGTGGATTTG 


CTGGTGCTAG 


16800 


CGTTCGGATG 


GCTATTCAAA 


ATGGTGTGGC 


GCGTGGTGTG 


TTCTCAAACG 


AATCTGGTCT 


16860 


GGGTTCTGCT 


CCTATTGCAG 


CTGCAGCTGC 


CAAGACAAAT 


GAACCAGTAG 


AGCAAGGTTT 


16920 


GATTTCCATG 


ACAGGAACCT 


TTATTGATAC 


CCTCATCATT 


TGTACTCTAA 


CTGGTTTGAC 


16980 


CATCTTGGTA 


ACTGC 










16995 


(2) INFORMATION FOR SEQ ID NO: 83: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 
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ACTAAGCGCT 


TTAATAAACA 


GTTAGATAAA 


ATTCAAGTTT 


CGTTGATTCG 


TCAGTTTGAC 


120 




CGGAGATTCC 


TGGGGTCTTG 


CGTTTGACCT 


TGGGGGAACC 


TGATTTTACA 


180 




ATGTCAAGGA 


GGCGGGCAAG 


CGAGCGATTG 


ATCAGAACCA 


ATCCTACTAT 


240 


art nnn a Tr. a 


^■^TGG"T'{TGC^ , 


GACTCTACGT 


CAGGCAGCCA 


GTGACTTTGT 


TAAGGAAAAG 


3 00 




APTATGCTCC 


T G AAAATG AA 


ATCTTGGTTA 


CAATTGGGGC 


GACAGAGGCT 


360 




PTTTfiAC CGC 


TATTTTGGAA 


GAGGGAGACA 


AGGTACTTTT 


GCCAGCTCCT 


420 






GATTGTTAAC 


TTAGTTGGGG 


CAGAAATTGT 


TGAGATTGAT 


480 




ATGGTTTTGT 


CTTGACTCCT 


GAGATGTTGG 


AG AAGGC C AT 


TTTGG AGCAG 


540 


vj<o 1. ijn 1 nnuv> 


TPAAGGCGGT 


TATTCTCAAC 


TATCCAGCCA 


ATCCGACAGG 


AATTACCTAC 


600 




AGTT AG AGGG 


CTTGGCAGCT 


GTTTTACGCA 


AGTACGAAAT 


TTTTGTTGTC 


660 






ATTGACCTAC 


ACAGGCGAAG 


CCATGTGTCT 


CTAGGAACGA 


720 




PPifIP,PTATT 


ATTATCAATG 


GTTTGTCTAA 


ATCGCATGCC 


ATGACAGGTT 


780 




nr*TG ATT^Tr 


GCTCCTGCGA 


CCTTCACAGC 


CCAGTTAATC 


AAGAGTCACC 


840 






AATACCATGG 


CGCAACATGC 


TGCGGTAGAA 


GCCTTGACGG 


900 


L Lib 1 nfUv\n 




C CATGAAGAA 


GGAATATATC 


CAACGTCGGG 


ACTATATCAT 


960 




AfTGfTCTTG 


G TTTTG AG AT 


TATCAAACCA 


CACGGTGCCT 


TCTATATTTT 


1020 




GGAGGGGGCT 


ACAATCAAGA 


CTCCTTTGCT 


TTTCTGAAGG 


ATTTTGCTCA 


1080 




P.TTPPPTTT A 


TCCCTGGTGC 


AGCCTTTGGA 


CGTTACGGGG 


AAGGCTACGT 


1140 


PPPPP'' , lTP m 


TATGCAGCCA 


GCATGGAGAC 


TATCAAAGAA 


GCCATGAAAC 


GACTTGAGGA 


1200 


CIT Af" ATP. AG A 


G AAGCATGAT 


TCAGTCTATC 


ACGAGTCAAG 


CCTTGGTGCT 


TTACAATCGC 


1260 


1 1 I ILOIV) 


AGGATGACAA 


GCTCGTCAAA 


ATTTTT AC AG 


AGCAG GTTGG 


CAAACGCATG 


1320 


TTT TTTGTC A 


AACACGCTGG 


TCAGTCTAAG 


CTGGCGCCTG 


TTATTCAGCL 


L I I vjU 1 vjV- 1 Vj 


13 80 


GCACGATTTC 


TCTTGCGAAT 


CAATGATGAC 


GGACTCAGTT 


ACATCGAAGA 


CTATCATGAG 


1440 


GTCATGACTT 


TTCCCAAGAT 


TAATAGTGAC 


CTCTTTGTCA 


TGGCCTATGC 


GACCTATGTG 


1500 


GCAGCTCTTG 


CAGATGCTAG 


TTTGCAGGAC 


AATC AG C AG G 


ATGCTCCCTT 


GTTTGCTTTT 


1560 


TTGCAAAAGA 


CTTTGGAGTT 


GATGGAAGCA 


GGCTTGGATT 


ATCAGGTTTT 


GACCAATATT 


1620 


TTTGAAATTC 


AAATTTTGAC 


TCGATTTGGA 


ATCAGCCTCA 


ATTTTAATGA 


GTGTGTCTTC 


1680 


TGCCATCGGG 


TTGGTCAGGC 


TTTTGACTTT 


TCTTTCAAAT 


ATGGAGCCTG 


CCTCTGTCCA 


1740 


GAGCATTATC 


ATGAGGATAA 


GAGACGTTGT 


CATCTCAATC 


CCAATATCCC 


CTATCTGCTC 


1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1360 

AAGCAAGACC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 192 0 

AAATCAAAGA AATTTATTGA TTCC CTAGC A GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 204 0 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGT A 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 234 0 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 24 00 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 24 60 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520 

CAGAGACTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2580 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC CTTGCAGATG 264 0 

TTGTTGTGGC AG AT GGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 27 00 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TCGTCTTCGA GCGAAACTAG 27 60 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2 820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC ACATCCGTAC CATGCTAGAA ACAGACGTGG 2 9 40 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 312 0 

TCT GGAAG AT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCCAAAACG T 318 0 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 324 0 

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTCGT AAAATCTTGG CTGACAGTGG 33 00 

TTATCAAGGG CTCATGAAGA TATATCCTCA AG C ACAAA CT CCACGTAAAT CCAGCAAACT 33 60 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATCCG CTATCTAAGG AGATAAGCAA 34 20 

GGTTGAGAAT ATCTTTG CCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA C CT AT CG AAA 34 80 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3 54 0 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTACAAAAC ATCTTTTTTA 3 600 
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AAAATAGAGA TGATTTTGAA ACAAAAAACC TAATTCAAGA CGTTTCGATG CCAATTCAAG 3 660 

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTC 37 80 

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 384 0 

TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3900 

GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 3 960 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTC GTTTAGTAGT 4 020 

TGGTATATTT GGTCCTTATA TGGATGATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 4 080 

GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 4 14 0 

AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4 200 

TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTAGG AATAGATCAT ACCAGAGGTA 4 2 60 

GCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAACTGTTAC 4 320 

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTC CTTT 4 380 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 4 44 0 

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 4 500 

GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 4 560 

ACGTCGCCTT GCCGTATATA TGTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4 620 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4 680 

GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 4 74 0 

TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4 800 

TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 4 860 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT T AAGGC AG AT ATGACGCTTT 49 20 

TTCACTTCCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4 980 

TCCACTACTA TGTGGTGACT GGGCAGG AT A AGGATAGCAT TCATATTGCC GATCCAGATC 504 0 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 5100 

CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGGTCTGC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 5 2 20 

CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 52 80 

CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CT AG T C AT CG 5 34 0 
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TCTACATCTT 


CCAGCAAATC 


TTGTCTTACG 


CTCAGGAGTA 


TCTCTTGCTT . 


GTTTTGGGGC 


5400 


AACGCTTGTC 


GATTGACGTG 


AT TTTGTCC T 


AT ATCAAG C A 


TGTTTTTCAC 


CTCCCTATGT 


5460 


CCTTCTTTGC 


GACACGCAGG 


ACAGGGGAGA 


TCGTGTCTCG 


TTTTACAGAT 


GCTAACAGTA 


5S20 


TCATCGATGC 


GCTGG CTTCG 


ACCATCCTTT 


CGATTTTCCT 


AGATGTGTCA 


ACGGTTGTCA 


5580 


TTATTTCCCT 


TGTTCTATTT T C ACAAAAT A 


CCAATCTCTT 


TTTCATGACT 


TTATTGGCGC 


5640 


TTCCTATCTA 


CACAGTGATT 


ATCTTTGCCT 


TTATGAAGCC 


GTTTGAAAAG 


ATGAATCGGG 


5700 


ATACCATGGA 


AGCCAATGCG 


GTTCTGTCTT 


CTTCTATCAT 


TGAGGACATC 


AACGGTATTG 


5760 


AGACTATCAA 


GTCCTTGACC 


AGTGAAAGTC 


AGCGTTACCA 


AAAAATTGAC 


AAGGAATTTG 


5820 


TGGATTATCT 


GAAGAAATCC 


TTTACCTATA 


GTCGAGCAGA 


GAGTCAGCAA 


AAGGCTCTGA 


5880 


AAAAGGTTGC 


CCATCTCTTG 


CTTAATGTCG 


GCATTCTCTG 


GATGGGGGCT 


GTTCTGGTCA 


5940 


TGGATGGCAA 


GAT G AGTTTG 


GGGCAGTTGA 


TTACCTATAA 


TACCTTGCTG 


GTT T ACTTT A 


6000 


CTAATCCTTT 


GGAAAATATC 


ATCAATCTGC 


AAACCAAGCT 


TCAGACAGCG 


CAGGTTGCCA 


6060 


ATAACCGTCT 


AAATGAAGTG 


TATCTAGTAG 


CTTCTGAGTT 


TGAGGAGAAG 


AAAACAGTTG 


6120 


AGGATTTGAG 


CTTGATGAAG 


GGAGATATGA 


CCTTCAAGCA 


GGTTCATTAC 


AAGTATGGCT 


6180 


ATGGTCGAGA 


TGTCTT AT CG 


GATATCAATT 


TAACCGTTCC 


CCAAGGGTCT 


AAGGTGGCTT 


6240 


TTGTGGGGAT 


TTCAGGGTCA 


GGTAAGACGA 


CTTTGGCCAA 


GATGATGGTT 


AATTTTTACG 


6300 


ACCCAAGTCA 


AGGGGAGATT 


AGTCTGGGTA 


GTGTCAATCT 


CAATCAGATT 


GATAAAAAAG 


6360 


CCCTGCGCCA 


GTACATCAAC 


TATCTGTCTC 


AACAGCCCTA 


TGTCTTTAAC 


GGAACGATTT 


6420 


TGGAGAATCT 


TCTTTTGGGA 


GCCAAGGAGG 


GGACGACACA 


GGAAGATATC 


TTACGGGCGG 


64S0 


TCGAATTGGC 


AGAGATTCGA 


GAGGATATCG 


AGCGCATGCC 


ACTGAATTAC 


CAGACAGAAT 


6540 


TGACTTCGGA 


TGGGGCAGGG 


ATTT CAGGTG 


GTCAACGTCA 


GAGAATCGCT 


TTGGCGCGTG 


6600 


CTCTCTTGAC 


AGATGCGCCG 


GTCTTGATTT 


TGGATGAGGC 


GACTAGCAGT 


TTGGATATTT 


6660 


TG AC AG AG AA 


GCGGATTGTC 


GATAATCTCA 


TTGCTTTGG A 


CAAGACCTTG 


ATTTT CATTG 


6720 


CTCACCGCTT 


GACTATTGCT 


GAGCGGACAG 


AGAAGGTAGT 


TGTCTTGGAT 


CAGGGCAAGA 


6780 


TTGTCGAAGA 


AGGAAAGCAT 


GCTGATTTGC 


TTCCACAGGG 


TGGCTTTTAC 


CCCCATTTGG 


6840 


TCAATAGCTA 


GAAAGAGGAG 


AGGATGAAAC 


CAGAATTTTT 


AGAAAGTGCG 


GAGTTTTATA 


6900 


ATCGTCGTTA 


CCATAATTTT 


TCCAGTAGTG 


TGATTGTACC 


CATGGCCCTT 


CTGCTTGTGT 


6960 


TTTTACTTGG 


CTTTGCAACT 


GTTGCAGAGA 


AGGAGATGAG 


TTTGTCCACT 


AGAGCTACTG 


7020 


TCGAACCTAG 


TCGTATCCTT 


GCAAATATCC 


AGTCAACTAG 


CAACAATCGT 


ATTCTTGTCA 


7080 


ATCATTTGGA 


AGAAAATAAG 


CTGGTTAAGA 


AGGGGGATCT 


TTTGGTTCAA 


TACCAAGAAG 


7140 
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GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCACTT GGACATGCTA AAGGATCAAA 7 2 00 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGACG 7 2 60 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7 3 20 

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 7 3 80 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC GATTACCAGA 74 4 0 

CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7 500 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7 5 60 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 620 

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCAGTCAA TTGGAATCCC 7 68 0 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 77 4 0 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7 800 

CGAGTGAGGA TGGGGT GCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATGGTTG C AG 7 860 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 7 92 0 

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT GTTCGCTATA 7 9 80 

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC T ACT ATT AC A AGTATTGATG 804 0 

CG AC AG CT A C TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAG ACTAATCTAA 8100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGCTGGAAGG CCGCTTGCAC ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 82 2 0 

CTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 82 8 0 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTT GTGTTTT TTGGTTTTTT 8 34 0 

GTGGTAAAAT GTCCTCAAGT AATACGAAAG GCGAACTTTA AAATGTCAAA ACAATTGATC 8400 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 84 60 

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8 52 0 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8 5 80 

ACTCACTTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 87 00 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACT AC AAAAA TGATGATTTG 87 60 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 88 20 

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTGGTTTGCT 88 80 
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GAGATTGGCC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 894 0 

ATTATCTTGG CAGACGAATT TT CAC C AG AT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCT CTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA CAGAACTAGC 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTC CAGGGTTTAA AATAACAACC 9240 

TC AAGGCTG T TTGGG AAT AT TGCAAGAGCT GAAATAAAGG AATAAGAATT G ATGG AT AAA 9 300 

CGT ATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9 3 60 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 9 4 20 

GTATTTCACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAG C AG 94 30 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9 54 0 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCCTC ACAGGAAGCC 9 600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 9 660 

AAT AAAGAT A TTGATG CG AC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720 

G ATTCTC GTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 9780 

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCCCTAC 9840 

AAGGCCGAAC AAGGG AT GGC C ATGG AAGTG GATGATTTGC TCTTT ATC C A AGACTACTTT 9900 
AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGC TTTTG G AC AC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGACACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTCCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCT C AGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT T AAAAACGAA 102 60 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG T ATTGGTGG A J. 03 20 

GCTATTCGTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 103 80 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 1044 0 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT T C AT ATGGT A ACCACATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGT AG CT A AACGTATGGA ACTTGGTGCC 105 60 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10 620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGCTTCAAA CAGTTCAGTC TGTAGAGACT GCTGGTGCTG AGGTTCAAAA AGGAAATGCC 10740 

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACT TTGGGGC AGGCGGTGTC TGTGTGGCTA TCGGTGAATT GGCAGACGGT 10860 

CTTGAAATCG ACCTCAACAA GGTG CCTCTT AAATACCACC GCTTGAATGG TACAGAAATT 10920 

GCCATCTCTG AATC AC AAG A ACGGATGGCG GTCGTGGTTC GTCCTGAAGA TGTGGATGCG 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTG AC TTGG A GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGGTTCTA 11220 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11280 

CGCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC TCACACCAAC TGAGGCATCT 113 40 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCCGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTCCTTATGC GGTTATCGAA 114 60 

GCAACTGCTC GTTTGGTGGC TCCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAACA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TGGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 11760 

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TCTCAAATAC 11880 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG T GC AG AGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGGCTTTGT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GGACAAACGA AAGCAGACTT TACACTGACT 12060 

GTCAACGGTG TG AAG CT AG A TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 12120 

GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12180 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 122 40 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT T C G AAAAAG A AGGTGCAGAG 12 300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC AGTTGAAACT 123 60 

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 12 4 20 
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GATGAACCAG 


ATGGTTC AG C 


TAAGTTTATC 


GTCAATATCC 


TGCTTAATGA 


AAAAGTGCGT 


12480 


GTGGCT ATTG 


ATAGCTTTAT 


CGCCCGTGGT 


GGTTTG ATT A 


TCGGTATTTG 


TAATGGATTC 


12540 


CAAGCCTTAG 


TCAAATCGGG 


TCTCCTACCC 


TACGGAAACT 


TTGAAGCTGC 


TAACAGTACT 


12600 


AGCCCAACCC 


TCTTCTACAA 


TGATGCCAAC 


CAACACGTGG 


CCAAGATGGT 


GGAAACTCGC 


12660 


ATTGCCAATA 


CCAACTCACC 


ATGGTTGGTT 


GGTGTGCAAG 


TGGGCGATAT 


CCACGCTATT 


12720 


CCTGTTTCGC 


ACGGTGAAGG 


GAAGTTTGTC 


GTGACGGCTG 


AGGAATTTGC 


AGAGCTCCGT 


12780 


GACAATGGAC 


AAATTTTCAG 


CCAATACGTT 


GACTTTAACG 


GTAAACCAAG 


TATGGATTCT 


12840 


AAGTACAATC 


CGAATGGTTC 


TCTCCATGCC 


ATCGAAGGAA 


TTACCAGCAA 


GAATGGTCAA 


12900 


ATCATCGGTA 


AGATGGGCCA 


CTCAGAACGT 


TATGAGGATG 


GTCTTTTCCA 


AAATATCCCA 


12960 


GGCAATAAAG 


ACCAACACCT 


GTTCGCAT C A 


GCGGTTAAAC 


ATTTCACTGG 


AAAATAAGAC 


13020 


TTACAGATTT 


TCTAATAGAT 


AGTATCAGTA 


ATGTAAAAGT 


CATGTAAATC 


TAGCTCTTGA 


13080 


TG ATT AC AAA 


TGAAAATTAG 


GTATAAAAAA 


TGACATACGA 


AGTAAAATCT 


CTTAATGAAG 


13140 


AATGTGGTGT 


TTTCGGTATT 


TGGGGACATC 


CAGATGCTGC 


T AAGTTG ACC 


TATTTTGGAC 


13200 


TCCACAGTCT 


TCAACACCGT 


GGTCAGGAGG 


GGGCAGGAAT 


CCTCTCCAAT 


GATCAAGGAC 


13260 


AACTGAAGCG 


CCATCGTGAC 


ATGGGGCTTT 


TATCAGAAGT 


TTTCAGAAAT 


CCAGCTAATT 


13320 


TGCATAAATT 


GACAGGAGCT 


GGTGC GATTG 


GGCATGTGCG 


TTATGCGACT 


GCTGGCGAAG 


13380 


CTTCTGTAGA 


TAACATCCAG 


CCCTTCCTCT 


TCCGTTTTCA 


CGATATGCAG 


TTTGGTTTGG 


13440 


CTCATAATGG 


AAATCTGACC 


AATGCAGCCT 


CTCTCAAGAA 


AGAACTGGAA 


CAAAGAGGAG 


13S00 


CAATTTTCAG 


CGCGACTTCG 


GACTCTGAAA 


T CTTGGCTC A 


CCTCATTCGT 


CGCAGTCATA 


13 560 


ATCCT AGCCT 


GATGGGCAAA 


ATCAAGGAAG 


CGCTCAGCCT 


TGTCAAAGGT 


GGTTTTGCCT 


13 620 


ATATCTTGCT 


GTTTGAGGAC 


AAGTTGATTG 


CGGCTCTTGA 


CCCAAATGGA 


TTCCGACCGC 


13 680 


TTTCGATTGG 


TAAAATGGCT 


AATGGAGCAG 


TTGTTGTATC 


TTCTGAAACC 


TGTGCTTTTG 


137 40 


AGGTCATTGG 


TGCCGAGTGG 


ATTCGTGATT 


TGAAGCCAGG 


TGAGATTGTG 


ATCATTGATG 


13800 


ACGAGGGCAT 


TCAGTATGAC 


AGCTATACAG 


ATGATACCCA 


GTTGGCGGTT 


TGTTCTATGG 


13860 


AGTATATCTA 


CTTTGCTCGC 


CCTGATTCTA 


ATATCCACGG 


TGTCAATGTC 


CATACGGCAC 


13920 


GTAAGAGAAT 


GGGAGCGCAA 


TTGGCGCGAG 


AATTTAAGCA 


TGAGGCAGAT 


ATTGTAGTTG 


13980 


GTGTGCCCAA 


TTCTTCCCTA 


AGCGCGGCTA 


TGGGATTTGC 


GGAAGAATCA 


GGCTTACCAA 


14040 


ATGAAATGGG 


TCTCATCAAA 


AACCAATACA 


CCCAGCGAAC 


TTTTATCCAA 


CCGACTCAAG 


14100 


AATTGCGGGA 


GCAAGGAGTG 


CGGATGAAAC 


TGTCTGCTGT 


TTCGGGTGTT 


GTCAAAGGCA 


14160 


AACGTGTGGT 


CATGGTGGAT 


GATTCCATTG 


TACGTGGAAC 


AACCTCTCGT 


CGTATCGTTC 


14220 
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AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTCGAAGT CCTGCACTAG 142 80 

CGTATCCATG TTTCTACGGG ATTGATATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 143 40 

ATACGGTCGA AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 144 00 

ATGGCTTGAT TGAGTCGATT GGT AT CG AAA CAGATGCGCC GAACGGTGGT CTCTGTGTCC 14 4 60 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14 520 

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14580 

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14 640 

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGCATCAAAG CTAGAGCCCT GACTAGTATT 14 700 

TTTAGATAAA AAGATGGTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 147 60 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAAT CT GGTGTGGATG TTGAAGCGGG 148 20 

TTATGAAGTT GTTCAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14 880 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14 940 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTG GCT A TCAAGTACGA 15000 

CAAGCACGAT ACCATCGGGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 150 60 

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120 

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180 

CGGTGGGGAA ACGGCTGAAA TGCCGGG CAT GTACGGCGAA GACGACTATG ACTTGGCTGG 152 40 

TTTTGCCGTC GGTGTGGCTG AAAAATC TC A AATCATTGAC CCTTCAAACC TGGTAGAGGG 15 3 00 

AGATGTTCTT CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 15 3 60 

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTCGAAC GCAAGAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAC GCTGTCTTGC CGCTCATCAA 154 80 

AGAAGAGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT CGCTTTATCG AAAATGTCCC 15 540 

TCGTATGTTT GCAGATGACC T AG CTGCTG A AATTGATGAA AGTAAAGTTC C AGTGCTT CC 15 600 

AATTTTCAAA ACC CT TG AAA AAT ACGGTC A GATTAAACAC GAAGAAATGT TTGAAATCTT 15 660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGAGC GTGTAAAAGA 15720 

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT IS 7 80 

CATTATCAAA TGAAAAAAAT AGCGGTTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960 
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AAGGCAGACT ACCAAGCACC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAG GCTACATGAA AATCGTTGGA CCAACCTTAT TCTCGGCTTA TGAAGGTCGG 16 080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTCGGTGGA TTCGGGTGTG 16200 

GATACACGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAG CTG ATG A TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATGTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGG ATT C CAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 1662 0 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTAC AG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGGCT 16800 

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGG AG ATT A 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG T C AACCTTT A CCCATTTAAG GAAACTATCC 17 280 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 1734 0 

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT -.7400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATGAA ACTCGCCAAC 174 60 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GC AG AAT ACT 17 520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 1758 0 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17 640 

CT AC AG ACT A CTCCATTGCT TCAGCCAAAC AGCTCAACGC GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 17760 
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TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 
CTTGGGACTA CGC TT ATG AG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 
ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 
TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 
GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTGGAAGCA GAATACACAG 
GTGTAGTCGG TGG ACTTCT C GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 
GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 
GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 
TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAC 
CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 
ACGTGGAAGA AATCGCCAAA GC AGG AATT A AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 
GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 
TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 
CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TG AT AT AATG TTGGTAAATA 
ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTGAGCAT 
GCGATTGCTA AAAAGTTACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGGG 
AATGATGGGA TGACTCTGGA TCGTTTGGAA TTGGTAAATA TCTCTATTTC CGAACATTAT 
AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 18840 
GCCCTTGCTG CTGG T ATCGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTGGTCCG 18900 
ACTAGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGGAAAT CATGGTCAAA 
T AC GG CGTTC CG AC AG C AAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 
ATCGAAAAGC ATGGTGCGCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 
GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAACCCG CTCATGAGAT GCTTTTGGAC 
AATAAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AGGAGAGGAA 
TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 
CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACGG GT G GT ATGGG TGCCTATGCG 
CCAGTCCCAC ACTTACCACA GAGTGTAGTT GAT AC AG C GG TTGACACCAT TGTCAAGCCA 
GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 
ATCCTGACAG CTGATGCACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 



17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

L8600 

18660 

18720 

18780 
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ACTCAGATTA 


TCTTGCCTCG 


CTTGACCTCT 


GACTTTG CT C 


AAAATATCAC 


AGATATCCTG 


19S60 


GATAGCAAGG 


AGCCAAATAT 


CATGTGGACG 


GACAAGGGTG 


TGACTCTGGG 


TGTGGTTGTC 


19620 


GCATCCAAGG 


GCTACCCGCT 


AGACTATGAA 


AGGGGCGTTG 


AGTTGCCAGC 


CAAGACAGAA 


19680 


GGCGATGTCA 


TCACCTACTA 


TGCAGGGGCT 


AAGTTTGCGG 


AAAATAG CAG 


AGCACTGCTC 


19740 


TCAAACGGCG 


GACGAGTTTA 


TATGCTCGTT 


ACC AC AG CAG 


ATRCCGTCAA 


AGAAGCCCAA 


19800 


GCCAGCATAT 


ACCAAGAACT 


ATACCAACAA 


AAAATAGAAG 


GACTCTTCTA 


CCGAACAGAT 


19860 


ATCGG AAG C A 


AGGCAATTAA 


GTAAAGATAT 


AAGAATAACG 


CGCCGTAGTC 


GCCAAACACG 


19920 


ATAATGGTCG 


TCGTGGTGAA 


AAGACCAGAA 


CAGTGAATCT 


TCTGGTCAGG 


GGGAAACTTG 


19980 


GAGACCTTAG 


GCTCAAAGTT 


TAGGAATGAA 


ACCGAAGGTT 


TGCTTCCGCC 


TCCATCACCT 


20040 


AAGACCATTA 


TCAAAAAGAA 


AAATAAAAAT 


TCACAAAATA 


CGTTAATGAT 


CGTATGGTTT 


20100 


GCGAGCGTTA 


GCGAGCTAAT 


ATAGAACAAT 


CACCGCCGTT 


GTGAAAGAAC 


GATTGGATGA 


20160 


TAATCCAATC 


GTTCAGGGAA 


ATTGGAAGAC 


CTTGGGTTTC 


CAATTTAGGC 


ATGACACACC 


20220 


TTTGGTGGCT 


GCTGCCGTCC 


CTCACAAGCT 


AAGGTGATTG 


TTGAAAAAGA 


GGAAAAAGGA 


20280 


GAAGAAATGA 


AACCAGTAAT 


TTCCATCATC 


ATGGCCTCAA 


AATCCGACTG 


GGCAACCATG 


20340 


CAAAAAACAG 


CAGAAGTCCT 


AGACCGCTTC 


GGTGTAGCCT 


ACGAAAAGAA 


AGTTGTTTCC 


20400 


GCACACCGTA 


CACCAGACCT 


CATGTTCAAA 


CATGCAGAAG 


AAGCCCGTAG 


TCGTGGCATC 


20460 


AAGATCATCA 


TCGCAGCTGC 


TGGTGGCGCA 


GCGCATTTGC 


CAGGCATGGT 


AGCTGCCAAA 


20520 


ACAACCCTTC 


CAGTCATTGG 


TGTGCCAGTC 


AAGTCTCGTG 


CTCTT AGTGG 


AGTGGATTCA 


20580 


CTCTATTCTA 


TCGTTCAGAT 


GCCGGGTGGG 


GTGCCTGTTG 


CGACCATGGC 


TATCGGTGAA 


20640 


GCTGGAGCGA 


CTAACGCAGC 


TCTCTTTGCC 


CTCCGTCTCC 


TCTCTGTAGA 


AGATAAGTCC 


20700 


ATTGCGGATG 


CACTTGCCAA 


CTTTGCTGAA 


GAACAAGGAA 


AAATCGCAGA 


GGAGT CGTC A . 


20760 


AATGAGCTCA 


TCTAAAACAA 


TCGGAATTAT 


CGGTGGCGGT 


CAACTGGGTC 


AGATGATGGC 


20820 


CATTTCTGCT 


ATCTACATGG 


GC C AC AAGGT 


TATCGCGCTG 


GATCCTGCGG 


CGGATTGCCC 


20880 


GGTCTCTCGT 


GTGGCGGAAA 


TCATTGTGGC 


ACCTTATAAC 


GATGTAGACG 


CCCTCCGTCA 


20940 


GTTGGCAGAC 


CGTTGCGATG 


TCCTCACTTA 


TGAGTTTGAA 


AATGTCGACG 


CTGACGGTTT 


21000 


GGATGCCGTT 


ATCAAGGATG 


GACAACTCCC 


TCAAGGAACA 


GATCTGCTCC 


GCATTTCGCA 


21060 


AAATCGTATT 


TTTGAAAACG 


ACTTTTTGTC 


AAACAACGCT 


CAAGTCACTG 


TGGCACCCTA 


21120 


CAAGGTCGTG 


ACTTCTAGCC 


TAGACTTGGC 


AGATATCGAC 


TTGTCGAAAA 


ACTATGTCCT 


21180 


CAAGACTGCG 


ACTGGTGGCT 


ACGATGGTCA 


TGGACAAAAG 


GTTATTCGTT 


CAGAAGCAGA 


21240 


CTTGGAAGCA 


GCCTATGCGC 


TAGCAGACTC 


AGCAGACTGC 


GTCTTGGAAG 


AATTTGTCAA 


21300 
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CTTTGACCTT GAGATTTCTG 


; TCATCGTGTC 


AGGAAATGGC 


AAGGAGGTGA CGTTTTTCCC 


21360 


AGTTCAGGAA 


AATATCCACC 


GCAACAATAT 


CCTGTCTAAG 


ACCATCGTAC 


CAGCCCGCAT 


21420 


TTCTGAAAGT 


CTAGTAGACA 


AGGCTAAAGC 


TATGGCAGTG 


CGAATCGCAG 


i AAC AACTC AA 


21480 


CTTGTCTGGA 


ACTCTCTGTG 


TGGAAATGTT 


TGCGACAGCT 


GATGACATCA 


TTGTCAATGA 


21540 


AATCGCCCCA 


CGACCACATA 


ACTCTGGGCA 


CTATTCTATT 


GAAGCCTGTG 


ATTTCTCTCA 


21600 


GTTTGACACC 


CATATTCTGG 


GTGTTCTCGG 


AGCACCATTA 


CCAGTCATCA 


AACTCCATGC 


21660 


GCCAGCCGTT 


ATGCTTAATG 


TCCTCGGTCA 


GCATGTCGAG 


GCTGCTGAAA 


AATATGTCAC 


21720 


AGAAAATCCA 


AGCGCCCACC 


TCCACATGTA 


TGGTAAAATA 


GAAGCAAAGC 


ATAATCGTAA 


21780 


GATGGGACAT 


GTGACTTTGT 


TTAGTGATGT 


GCCGGATAGT 


GTGGAAGAGT 


TTGGGGAAGG 


21840 


GATTGATTTT 


TAGGACAAGT 


CTATGATACA 


AATTATCGTT 


AATACATTTA 


TTGAAAAGTA 


21900 


TAAGACTGGA 


GCAGTPGTTG 


AAGTGTTGTA 


TGCCAGTCCT 


GACCAAGATA 


AGGTACAAGC 


21960 


TAAATATGAA 


GAACTAGCTG 


CACAATACCC 


CGAAAATTAT 


TTAGCTATCT 


ATAATGTACC 


22020 


GCTGGATACG 


GATTTGAATA 


C ACT AG ATC A 


TTACCCGTCT 


CTCTTTATTG 


GAAAAGACGA 


22080 


GTTTGAGTAG 


AAATCTTGGT 


TTACCTAGAT 


AGCTTATTCC 


CAACAGCTTA 


AGAAGAAAGG 


22140 


AAAAATTAAC 


ACATGATCAA 


CCGTTACTCT 


CGCCCTGAGA 


TGGCGAATAT 


TTGGAGTGAA 


22200 


GAAAATAAAT 


ACCGTGCTTG 


GCTTGAGGTG 


GAAATCCTCT 


CTCACGAGGC 


ATGGGCTGAG 


22260 


TTGGGGGAAA 


TCCCTAAGGA 


AGATGTGGCT 


TTGATTCGCA 


AGAAGGCGGA 


CTTTGACATC 


22320 


GACCGTATTT 


TGGAAATTGA 


GCAGGAGACG 


CCCCACGATG 


TGGTGGCTTT 


CACGCGTGCG 


22380 


CTTTCTGAGA 


CTCTTGGTGA 


AGAGCGCAAG 


TGGGTTCACT 


ATGGGTTAAC 


TTCTACTGAC 


22440 


GTGGTGGATA 


CTGCTTATGG 


TTACCTCTAC 


AAGCAGGCCA 


ACGACATCAT 


CCGTCGTGAC 


22500 


CTTGAAAACT 


TCACTAATAT 


CATCGCTGAC 


AAGGCCAAGG 


AGCACAAGTT 


CACCATCATC 


22560 


ATGGGGCGTA 


CTCATCCTGT 


GCACGCTGAG 




TTGGTCTTAA 


ATTAGCAACT 


2 2 620 


TGGTACAGCG 


AAATGAAACG 


CAATATCGAG 


CGCTTCGAGC 


ATGCGGCTGC 


TGGTGTAGAA 


22660 


GCTGGTAAGA 


TTTCTGGTGC 


GGTTCGCAAC 


TTTGCCAATA 


TCCCACCATT 


TGTAGAGGAG 


22740 


TATGTCTGCG 


ATAAACTTGG 


CATCCGTGCC 


CAAGAAATCT 


CTACACAAGT 


CCTTCCTCGT 


22800 


GACCTTCACG 


CTGAGTACTT 


TGCGGTTCTT 


GCCAGCATTG 


CGACTTCAAT 


CGAACGTATG 


22860 


GCGACTGAGA 


TTCGTGGTCT 


ACAAAAATCT 


GAGCAACGCG 


AAGTAGAAGA 


GTTCTTTGCT 


22920 


AAAGGGCAAA 


AAGGCTCTTC 


AGCAATGCCT 


CACAAACGCA 


ACCCAATCGG 


TTCTGAAAAT 


22980 


ATGACTGGTC 


TGGCGCGTGT 


CATTCGTGGT 


CACATGATTA 


CCGCTTATGA 


AAACGTCGCT 


23040 
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CTCTGGC ATG 
ACCATTTTGA 
TTC C C AG AAA 
GCTATGTTGA 
CAAAAACAGC 
CAGAAGTAAC 
ACACCAAACG 
AACAGCGAGC 
TTAGTGAGTC 
TCGTGAGTTC 
AGAACTTTCC 
TTTGGTTTTT 
GTTCCGACTT 
TTCACTTCCT 
ACGCCCTTAG 
GTGGTAAATG 
GCAAGACCAC 
ATAACTTCTT 
AGTACAGAGA 
CCGGCTGGGA 
CTGATCAGTT 
GAATGAGTTA 
TCAAGCTCAG 
AGGGCCTCAA 
TCGCGGAGAG 
ATG AG TTC AA 
AGCATACTTC 
AATGGTGTTT 
TTTTGACCAA 
GCTAGGTATT 



AACGCGATAT 
TTGACTACAT 
ATATGATCCG 
CATTGATTGA 
CTACTCTTGG 
ATCACGTCTC 
AGTGGATGAT 
TTCAATCTCG 
CATAGGCTGC 
CTGTTTCAGG 
TTTCCTCAGC 
CTTCAGCAAT 
CGACTATTTG 
TACCATCGGC 
TAATGACTTG 
GAATTTCTTC 
TTTCATCACC 
TGGTTACCTG 
TGTAATGAGT 
GGTTAGGATT 
CTGGTCTGGT 
CAGCTGGTTT 
CTTGTTTATT 
GACTTTCTTT 
CATTATAATT 
AGATTTCCTC 
CGACTGTTGG 
TTCCAGTATT 
GAATCTAGTA 
GAGGTGATGC 



TTCTCACTCA 
GCTCAACCGT 
AAACATGAAC 
AAAAGGCATG 
GACAACCAAG 
ACACAAGAAG 
ATCTTTGAAC 
CTGTTTATTT 
TAGTGTGGAC 
AAGTTTTTTC 
AGG AG C AG TT 
AGCGGCTTGT 
AGTAACGGCT 
ACAACTGCTC 
TGTTTTTCCT 
TTCTTGGATA 
CTTGTGAGTT 
GCTATCAAGG 
TCGTTCACCT 
TTCTTTCTTG 
TTCAACATTG 
GAGGCCTTGA 
ACGGTTGAGG 
ACTATATCCT 
AGCACGAAAG 
TTCCTTGTAT 
AAGATCTACT 
CTC AATAGCT 
CCAGTCACCG 
GAGGTTATAT 
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TCAGCTGACC 

TTTGGAAATA 

TCGACTTTTG 

ACCCGTGAGC 

TAGACTTTAA 

AAATCGATGA 

GTCTTGGACT 

TTTATCGAAA 

ATGAGTCCTG 

TCTGTTACCA 

GATGGAGCTG 

CCGTTTTCAT 

TCCTGTGCTA 

ACAGAGTAGA 

TTGAGTAAGA 

TCCAGTCTAG 

ACAGGAGCGC 

ACTGTTTCTG 

TTGACTCCTC 

AT AACTT CAA 

GCAGCCACTT 

AG AGCGG CTT 

TTGTAATTTA 

TCTAAGTTTG 

TACTCTTTGT 

TCAGCGCTTG 

TCAGGATATT 

TTCTTGAGGA 

TTCGTATTCA 

CCCCACCAGC 



GTATCATCAC 
TCGTCAAGAA 
GTCTT ATCTT 
AAGCCTATGA 
AC C ACTTCTT 
AATCTTCAAC 
AGGTGATTAA 
AGACTTAGTC 
CGACTACTAG 
CAGGAGCTGG 
GTTGGCTTGG 
CGCCTACATG 
CGACACTATT 
AGTTGCT AC G 
GTGGATTTTC 
GTTTTACCTC 
C AACTT CAAC 
TTGTTTTTCC 
CTGTGATAAT 
ATGGAATTTC 
CATTTTCATC 
TTAGGTTGGC 
G AGCTG TTTT 
TAGGAATTTT 
TGTGGTCTGC 
GTCTATCTGC 
TGGTAGAAGC 
AACCACCACC 
AGAATTTATA 
CTTTAGACCA 



ACCAGATACG 
CTTGACAGTC 
TAGCCAACGG 
CTTGGTGCAA 
GAGGCAGATT 
CCAGTTTATT 
TTAAAAAATA 
TTCTTTTCTT 
TCCTGCAGAA 
ATCTTGAGGA 
G ATTTCT ACT 
TGTTACCATA 
TACAAGTGTT 
ATGTCCATTG 
ACAAGTCACT 
AGTAGTTGGT 
CACTTGGTTT 
ATTTTCAGTG 
ATTTTCCTGA 
TTCAGTTCTT 
TAGGCTTCCT 
TACAAGCGTG 
AGCTGCGTCA 
AGCTAATTCT 
AAAGGCAGTC 
CCAGATTGAA 
TAGTTGATTG 
ATCTTCTGGT 
GCCTTTGCTT 
GTAAGAAATC 



23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23530 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
.:4480 
24540 
24600 
24660 
24720 
24780 
24840 
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AAGACATCTT 
GCCATTGGTT 
AATTTGCCAT 
TTGCCGTATT 
TATTTACCGA 
GTTTTTGAAA 
ACCAGCATAG 
GATTTAGCGT 
TCGTCGTAGT 
GTTTTTCCGT 
AGAAGGAGAT 
TGGTTCAGAG 
TTTTCAACCT 
TTGACAGTTT 
AGGTTGCTAT 
GCAGTATAAC 
TCAGCTGCGA 
GATGGTGAAT 
GGGATATCAG 
AGCCCCTGAT 
CAAGCATCAT 
ACATCGTAGC 
CCAAAGCTTG 
TGCGATTTTA 
ACAGGGTAGC 
TCTGTCGCAT 
AAATAAGCAG 
AGATCGACAG 
AATTCTTTCA. 



TGTCAAACTG 
GAAGCCCTCT 
AGAGTTGATA 
CGTCAGTACC 
TGAGGGCTTT 
CTTTATCAAA 
CATCCATGTG 
ATTCAATTAG 
AACCTTTAGT 
TGGCAGTAAT 
GGACATCAGA 
TAAAGTATTT 
C ACGTTT AG C 
CTTGAAGTTT 
CGAGAGAGCG 
GGTTCAGGTC 
AGTATTCAGC 
AACGTGCAGA 
CTCCTTCTGT 
CGAGATTGTA 
TGGTATTAAG 
CTCCCCAACC 
TGTCGCTATT 
CAATACGAGC 
CTTCGTTTGG 
CATTGGCATA 
CATACTTGTC 
TACGGGCTGA 
TGG CATTG AG 



AACATCGTCC 
TTCTTTGGCC 
CCACTTGAGG 
AAAGTTGAAA 
TACAAAGTTC 
GTGGGCTTGA 
ACCTGGACTG 
CTCTGTTACT 
TCCTTCGATA 
GGTCATATCA 
ATATCCGAGC 
GCGTCCAGCA 
TTCTTCTTCT 
AGCAATGGCT 
AATAGCTTTT 
TTTTGGTACC 
GTTGGCATTT 
TGGAGTGTCA 
TTTTGGTACA 
CCAGCCTTGG 
GATTTGGTGA 
ACCAGTCCAC 
GTAGTAGATA 
GAGCTCATTG 
AT AGTATTT A 
TTCATCAAGT 
GATAAGGGCT 
TTTCTTCCCA 
AATCGCATCC 
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TTGTCTTCAT 
ATAGCTGCGA 
TAGTACCAGC 
ATCTTTGTTT 
ATCGCTTCTT 
GGATTTTTAA 
TTAATAGCTG 
TCTGCCTGTG 
AT AG CTTTTT 
TCGAGTAGAA 
TCACTGGCCT 
TCGATTGAGA 
TTTTGAGCTT 
TGATCAATCG 
TCAGCTTCTT 
TCGTTAAGTG 
G C AAAATG AC 
GCCCAAGCAG 
GAAGTGATTG 
CCATCAGCGT 
CCTTTTTCAG 
AT AG AAACG A 
CCGTCGTTAA 
GCGTAGGCAA 
TCAGCTTGAA 
CCGATGTTGA 
TTTGTAAAAG 
AAATAGCTAA 
ATGTGTC C AG 



AGTAGAAGCC 
GGGTGTTGGC 
CTTGGGCACT 
T AC CTG C AAA 
CGTTTTTCAA 
TACCTAATTT 
GGATGAGACC 
TTAGTGCAGT 
TAACGTCATC 
AGCGAAGTCC 
TGTCTACGAT 
TTACCTTGTT 
CAGGCGTGAG 
TATCTTCTTG 
TTACGGCCGT 
CTTGCTCTGC 
GCATGACTTT 
CTACCATACC 
GTGTGTTTTT 
TTCGTCCAAG 
CTAGTAGTTT 
TGATGTCTTT 
AAGCCATTGG 
TAAATTTTTC 
GCACACTCCA 
AGATTTCAGT 
CGACAGCTTG 
AGTTAGGGTT 
GACTATTTAC 



ATCGTTGAAG 
ATATTCGGCA 
AGTCGCATCG 
GAAGTCCATG 
GTCCATAGTT 
TTCCATGGCA 
GATGTCCTTA 
ACCGTTTGGA 
ACTAGCATAG 
GTCATTTCCT 
GCGTTTGAGC 
TTTGGCAAGT 
GGTCAAGTTG 
GGCACGGCTA 
GACGCTTTCT 
AGATTCATAA 
GAAGAGGCGT 
ACCGATGATT 
AATACCATTG 
AACGTAGTAC 
AGAAGAAGCG 
GTCAAAACTA 
TTTGAGACCG 
ATAGCCTTTT 
ACCTTTAGCA 
CTTTTTCGCG 
TTCGTTGTCA 
TTGGATTCCC 
TCTCGGAATG 



24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25330 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
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AGACCGATAC 


CTTTATCTTT 


GGCATAGTTA 


ATC AG ATCTG 


TCATTTGACT 


TTCTGTTAAG 


26640 


TGATTGCCGT 


TTGGATCGTT 


GTAATAATCA 


TTTGTACCTT 


TTTCAATGGC 


GCGTTTGACA 


26700 


TCGTCACTGG 


CATAGGTCTT 


GCCGTTAGCT 


GTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 


26760 


AGTCCATCAT 


TTCCGACTAA 


TAGCTGTAAA 


TCAGTGTAGC 


CATAATGTTT 


CGCTTTATCG 


26820 


ATGATTTCCT 


TGAGCTGTTC 


TGGTGAGAAA 


TATTTACGTC 


CAGCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT 


GCAGCACGTT 


CCTTTCCTGC 


CTCTGTTGCC 


26940 


GGTTTGTCAG 


CCTCTGCTTT 


CGCTTCATCT 


TTTTTAGCTG 


GTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT 


TTCGCTTCTT 


CCTTTTTAGG 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTACCAG7 


TTCTTTTTCA 


GCAGAAGTTG 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT 


GGTTTTTCTT 


CTGTTTTTGG 


AAGACTAGCT 


27180 


ACCTTATCAG 


TAGCTGGAGT 


TTCTGTTTCT 


ACAGTTTTTG 


GAGCTTCTGG 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCACT 


CCGATTTTCG 


GATGATTGAG 


GGGAATCACA 


AACCGTATGG 


27300 


ATGGTCGGTT 


GCTTTTCTGT 


AGTAGT AGG A 


GTAACTCCAT 


CGGCTGCAAC 


AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT 


GCTCCTACAG 


CGTATTTACG 


AATAGAAAAA 


27420 


CGC TGTTGTT 


TTTCATGTTT 


CAT TGC AAAA 


CCTCCTGATT 


GCATTGTTAT 


ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT 


CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG 


TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


GTCGGAAAAA 


ATACGTATAT 


ATATCTAGTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CCACAAATTA 


TAGAATTTTC 


CAAAAATAGG 


TAAGCGCTAC 


CTTTTTGGTG 


27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAAGC 


GATTAGCTCA 


GGTTTTCTTC 


TTAGTGATCA 


27780 


CGGTCACATG 


AGATAAATTT 


AATCTTGTAG 


TAATCAGATC 


GTTTGTAAGT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG 


GTGATTTTAG 


TTTGTAGGAC 


AGTAGGGAAT 


27900 


TGTTCATCGA 


CTCCGAGGAC 


TGAAGCTGCA 


TGTTCTGGAG 


TTGGAAAGAC 


TATTTCGTTG 


27960 


ATTTCTTCAA 


AGTGTTCATC 


ATTCATGTCA 


ATGTGGTAGT 


CTAACTTGAA 


A CG ATT AT AG 


28020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA 


TTTGCGTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT 


GATTCGCGGA 


TACGTTCAAT 


CTTGTAGTAG 


28140 


AATTGATCGC 


CGCGTAGACC 


CAATTTTTCC 


AAGTAAACAA 


GCTTGTTTCC 


CCGTTCAATT 


282C0 


GAAAGAACAG 


TTACCTTATC 


ATCTTTAGCA 


TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG 


GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


CAAGGTCGTT 


TAAGGCGCGA 


> ACAACTGTGA 


TAGAGCTGAC 


ATCGTACATT 


28380 
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GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC ACAGATCATT 28 4 40 

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 2 8473 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCCC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGCT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 2 40 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 3 00 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 3 60 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 4 20 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTG AAAGAAG CAGGGAATAT TATCCTCTTC TTTG ATG AAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTCCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 6 60 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 7 20 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 7 80 

CCACAATGTC ATC TTGCC AG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 9 60 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GC AG AATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TG AAAGTG A C 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCCTCGTA ACCGTGCTGG 12 60 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


TAAGACGGAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATGTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCGTTTAGAT 


ATGTCTGAAT 


ACAGTGACCG 


CACAGCTGTT 


TCTAAGCTAA 


TTGGTACAAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATG AC AAT AG 


CAATACCTTA 


ACAGAACGTG 


TTCGTCGCAA 


1500 


TCCATACTCT 


ATCATTCTCT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA 


GTTCTAGATG 


ATGGTCGTTT 


GACAGATGGT 


CAAGGAAATA 


CAGTAAACTT 


1620 


CAAGAACACT 


GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1680 


AGAAGATGCG 


G AT AAAC C AG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCTAA 


iaoo 


GATTGTAGAT 


TTGATGTTGG 


CTGAAGTTAA 


CCAAACCTTG 


GCTAAGAAAG 


ACATTGACTT 


I860 


GGTAGTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACGACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TGGTTGAACA 


AGAAATTCGT 


G AT AAGGTG A 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATGCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


TCGTGAGAAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGCAGCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CGTATAGTAG 


AAGTGTATTA 


TTCTAGTTTC 


2160 


AATATACTAT 


AGTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TATATCAAAA 


CCAGTCCTGG 


2220 


AAAAACGTGG 


ACTGGTTTCG 


TGTTTGGATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA 


ATGGTACGTA 


CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT 


TTTAAATCAC 


ATGACGTTCA 


AAGGCATCAT 


CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGGAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGAGCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTAGCACTG 


GTGTGACGTC 


CCCACATAAT 


CATGTGGAAG 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGCGT 


ACGGATGAGT 


2640 


TTGGCTAAGA 


GGTGCTCGAT 


AGTGTGAAGG 


CCGCCAGTAC 


GGATAGAGTC 


TTCCTTTGGT 


.7700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTTGACAATG 


GTGTGGTCAA 


GTTCAAAACT 


TTCGACAATA 


2620 


ACTTCTTTTG 


ACATGGTAAA 


TCCTTTCAGT 


TTTCTTCTCT 


CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGAGG 


CTGCAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTGGTAGT 


ATGGGT C AAA 


GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCACTCA 


AGATGATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 
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GACTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3130 

TTGG ATTGG C TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240 

TTTTTTGATG GC GTTGATC A AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3 300 

AGTGATGATT TCCCCATTTC GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3 3 60 

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3 4 20 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 34 80 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3 600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT C AC GGT TGT A 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGCAAGGGG TCAATCATGT TG A CAT CAT C 3 7 20 

AAGGTCAGCT GTGGCACCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3 7 80 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AG AT TTG ATG CCATTGATTT GGTCG TGGT A 3 840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3 900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3 960 

ATCCATATCC GTCGCATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTCTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTCT TGTCAGGTTC 4 2 60 

AT AACC AG G A AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4 3 20 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4 380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4 500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4 560 

AAC AG ATG C A AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4 620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4 680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTT AT G AC AGCCCTATGG 47 40 

ATGATAATGG CT ATG AT ATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG .4800 
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AGGACATGGA 


TCAGCTGATT 


GCAGAAGCTA 


AG AAGCG TG A 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


TCATACCTCA 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC 


GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


ATACTATCTC 


CACTTTTTCA 


5040 


GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


S100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACGAG 


AAGGTAGTCA 


ATAATGGTCC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCGACC 


TTTGGAGATA 


AGGATCTCTT 


GACAGTAGGG 


G AG ACTTGGG 


5280 


GACCAACTCC 


AG AG ATTG C C 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


5340 


TCTTCCAGTT 


TGAACATATC 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


ACAGAGTTAG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


GCAATCTTAC 


5580 


TTCATCTCAT 


GAGAGGAACT 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


AT CCGTT TG A 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTGTTCCG 


ATTGAAGAAA 


TCATGGACAG 


TATCCGTGTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCTATCAGA 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTAT AT AC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTAGA 


AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGGCT 


AAAGAAGTAC 


6120 


TTGAAAAACA 


GGTCTTGCCT 


CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CCACTATTAA 


GCAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


T.TCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6480 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


654 0 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 
AGAATAAAGT GCTGAAAACA ATTCGGAATA GG C AT AG AG A CTAGACAATT TGAGGAGCTG 6720 
CTTGCGTCCT GTTCG AAC AC ATTTTCCGG 6 749 



(2} INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 120 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 18 0 

ATATTGCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 24 0 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA G ACTTGG AT A AACT CGCTT A 300 

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 3 60 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 42 0 

TCAAAACAAT CTGAAGAATA GCTCATC AT C TCAATTAATT TGTCCTTTGT CATTTCAGAA 4 80 

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAC AAGTTGATTT 540 

CTTTGTCTAT TTTTACTTAG ATAGAGATCA ATCATGGGAG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA T AAATTTG CG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 72 0 

CCCTCTTTTC CCTCGGCTCG AT AG CCTTGT CCATATAAAA C AAAAAC GAG ATTTTGATGA 7 80 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGT CTTG AC TTTCAAGGAA TTCCATAACG 84 0 

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TAGTTATGTT TTTTATGGTA ATCATCTAAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 9 60 

C AAATTTG AA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 102 0 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 114 0 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 12 00 
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1260 
1320 
1380 
1440 
1500 
1560 



1680 
1740 
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TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 

GCTTTT AC C C AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CCTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT CTTAAAGTTT GGAAGTCACC 

TCCAGCTAGA TGTTTGAGAA AAAG AT AG AG ATTGTAGGCG ATACAGCTCA TCATCATACG 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 1842 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 
<8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 60 
CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 
TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC G ATTTTTTC C AGATTGAGCA 180 
CTGCATCGTG ACTCATCAAG ACTTG AC CAT ACTCTTGTAA G ACT G AGCG A CTGATATCAC 
TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGC CTTGG ATG AT 
TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 
TTTCCATTTC AAAAGTCACT CTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 
AAAGG ATGT A GTAGATGTAA GGTTCTTTCT TACTCTCACC TTCTTGTTCA G CCTG CTCTT 
GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAG AT TG AAT GGCTTCGATA TCATCCTTGG 
TTTTGTCTGC GATGCTTTTT TCCAGGCTTT TGATAAATTC AT CTGG AG AC ATTTGAGCCA 600 
ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC T AG ATTTTG G TCAATCTTTG 660 
ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 720 
CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 780 



240 
300 
360 
420 
480 
540 
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CTCTTTTTTC TTGAGCAACC AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 84 0 

CTAAAGATAT CGTGATTTTT AAAGTTGTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GAGACTCCAA AAGAAAATTT 102 0 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 114 0 

AGAATTTGAA GACTTTGCTC AAA1TTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 126 0 

TTTAAAGCTT CTTCACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATC 132 0 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTC CTAGGTTAGC AATTGTTGTA 13 80 

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 144 0 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTC AC A TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 162 0 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTACAC GGTCAATTTT 174 0 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTC 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC ATTCAAATTA i860 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCCTT 192 0 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 198C 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 2 04 0 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCCCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 2 22 0 

TGT AAAT AAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA AT AG ATT ATT 22 80 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2 34 0 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2 4 00 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 2 4 60 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA TCAATAGCTA GTTCCACACT 252 0 
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ATTGGTTGTA TGTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2 700 

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 2 7 60 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTT AAACC C 2 820 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA AT ACT ATT AG CTGC AAT AAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3 000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3 06C 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3 240 

GTAACGACCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3 3 00 

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3 3 60 

GGTCGGTCTT GAT AC CAG AA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3 4 20 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3 4 80 

AAGTAAGCTA AC ATT ATT AT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3 540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3 600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3 660 

AATATTTATG AATAGGGCCA AAGC ACT AAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATACG CTTCAAACTG 3 780 

TTTATACTGA TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3 84 0 

GAAAAAAGCG ACCTCTTTCA ACTCAGGATA GTTTTTC AC A ACTTCATTAT AAAACTTT TG 3 900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTCCTCGTAA TACTATCCCC 3 960 

TCTATGGAAA TAGTAATAAT AGCGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4 020 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 41 40 

GT AT TCCTTT AGAAATGTTT GAGAATCACA G AC AAAAT AG TCATCCTGAT TGGCTGACTG 4 200 

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 4 2 60 

TTGAACTAAT TG C T CAT AT A AG CTCTG AAT CATTTCTGGA T GG AT AT AAT CATCTGAGTC 4 320 
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AATAAAAATC AGATAATCCC CGTGAGCCTG CTTCATCCCA TCATTTCGTG CTTG CG AC AA 4 3 80 

TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 44 4 0 

CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA G ATTTTGAT A 4 500 

GGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTCG 4560 

CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4 62 0 

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4 680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 474 0 

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4 800 

AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4 860 

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 49 20 

AATCTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4 980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 

CTTCCACTAC ACCGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATACTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGG ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGG CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA T AAATTCT C A 54 60 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGG CG AT A ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA G ATTCAT ATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5 580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCACCGATGA CAGCATCATG ATGCGGTACC 57 60 

TCCTTGTCTG CTGGTGGAGG CGTC AT AT AA TCCCCAAAAG C AG TTCTG AG A TAG AC ATC A 58 20 

TAGCCGATTG GAATAGGCAT CTCTGTTCCT TCAAATGGCA AG AAAAG A TT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCACGACCTG AGCATAATTC TGTAATGCCA 59 40 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCC AG T AAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG 


GAATAGTTTG 


CGCACAAAAG 


AGTGAATAAA 


TCAAGGCCCA 


ACGAACCTGT 


6120 


TTTTTCCGCT 


CAGCTGGATT 


TTTCGGATAA 


TAATCCAAAG 


GCAAAACATC 


CAAGGCCAGA 


6180 


CCATGTCGCA 


AATCCAAATC 


CTGCTGATAA 


GGCTTGATAC 


AGGTGGTTTT 


CTTGTCACGA 


6240 


ATGGTAATAA 


AAAGATTACG 


ATCAACAAAA 


TCCTTGTGAC 


TCTTTGACAA 


GAAATAACGT 


6300 


TCATCTGCAT 


AACGAGGCCA 


TAATTCTGCT 


AATTTCT CAT 


AATCTTTACG 


AGGCATAAAA 


6360 


AAGTCTAGGT 


CGTCGTCCCA 


AGGAATAAAT 


CCCTTGTTTC 


GAAGGGCACC 


AATAGCGCCT 


6420 


CCGCCACAGA 


GATAACAGAG 


CAAATCATGT 


TCTTTACAAA 


AGG CC AC AAA 


ATATTCAGCC 


6480 


ATCTCCAGAC 


TACGAGCCTG 


AATTGCTTTT 


AAATCAGTCA 


TATTGTTCAT 


TATTCTTTCT 


6540 


ATCGTATCGT 


TTCATTATAC 


CACAAACAAG 


GGGTGAAAAT 


CTATTGCAGA 


CTGTAAAAAA 


6600 


TCAAAGCCTG 


ACTGCTATCC 


AAATAGCTAT 


CAAACTTTGA 


TTTTTCTGTC 


TTATACTCTT 


6660 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCA 


CCTTGCCGTA 


GGTATAGGTA 


ACTGACTTCG 


6720 


TCAGTCTTAT 


CTACAACCTC 


AAAACTGTGT 


TTTT AGC AG C 


C TGCGGCT AG 


CTTCCT AGTT 


6780 


TGCACTTTGA 


TTTTCATTGA 


GTATTATCTT 


ATCTTAAGCC 


CATTTGAGCG 


AGCTTGGTTT 


6840 


GATATTTGTT 


TTGATCAACC 


AGCAGGCCCA 


AGCCCCCATA 


AACATCATAG 


GCATCTACCC 


6900 


AGTCACCCAG 


TTCTGGAATC 


GTCAATTTTT 


CAATACCATT 


TTTTGCTCCA 


TCCAAAACAG 


6960 


ATAAACCGTT 


TGTTAGGAGG 


AAAGT AT AGG 


GTACGTTGGT 


TGAGGTCATA 


GCAAAAACCT 


7020 


TTCCAAGAGC 


TTCAGAACCA 


GTGAAAAGTT 


TAGTGGGATC 


TTTAATTTGC 


TCTAAAATTG 


7080 


CTGTTAAAAC 


TTGTTGCTGT 


CTTTTTGTAC 


GGCCGTAATC 


TGCCTCATCA 


TCATCACGGA 


7140 


AACGAGCATA 


ATTGAGCAGG 


GTCGAGCCAT 


TCATCTGCTG 


TTTT CCG ACT 


TTAATGGTTT 


7200 


GGGTTGGAGA 


CTCAGTCTCG 


GTAGCGTATA 


AATCATCTCC 


GACTGTAGCT 


TCTGTTAGGG 


7260 


GACGCCCATT 


CAATGTTGAA 


AATTG AGC AT 


CAATCGTCAC 


CCCATCAGGG 


AAAAGCGTGT 


7320 


CAATCGCTGT 


GGCAAAGGCC 


TGGAAATCAA 


CCAAGGCGTA 


GTACTTAATG 


TCCAAGTCAA 


7380 


AATTATCTTT 


CAAGACTTGG 


CGAACCATTT 


CTGCCCCTTT 


TTGCCCCTCT 


TGTTCTCCTA 


7440 


ACTCGTAGGC 


TACGTTTAAC 


TTGTTATCTG 


TCTG TTTTCT 


AC C ATTAATC 


ACTTG ACT AT 


7500 


AACCATCTAT 


ATAGACCAAA 


TTATCACGCA 


TGAAACTGAC 


TAGCTTCATT 


TTCTTATCTG 


7560 


AGCCCCCGAC 


ATTTAATACC 


ATAATAGAGT 


CAGTTCGTGT 


CTCAACACTG 


TTCTGGCCGA 


7620 


TTCGACCATC 


AGTACCCATG 


ATTAAAATAT 


TAACTCCATC 


TCTAGTGTCC 


TGACCATTAA 


7680 


AGACTTCTAC 


TTGAGCTGCC 


CGGGCATCAG 


CAGTTTTCTT 


TGCGCTAGCA 


TCTTGGTAAC 


7740 


CACGCAAAAA 


CATGAATACC 


ATGGCCAAAG 


CCACACAGAC 


CAAAAGTGAA 


AAAATC AC C A 


7800 


TAAAAATTCG 


TTTAAGACGG 


AGCTTCCGTC 


TTTTCTTTTT 


TGGAGGGAAA 


GAGAGTGCTT 


7860 
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CCTCCTCCAG 


TTGACACGAC 


TTGGTCTGTT 


TGTAGTAAAT 


CAGCTAGGAC 


TTCTGATTCT 


9240 


ACCTGACGAA 


AGGCTGTTTC 


TCCCTTTTCA 


GCGAAAAAAT 


TCGCAATGGA 


CATACCTAGG 


9300 


CGATTCTCAA 


TCAGAGCATC 


CAT ATC AAGG 


TAATTAGGGT 


CCAAGCCTCT 


TGCAATAGTC 


9360 


GATTTTCCAG 


CCCCCATAAA 


CCCTAATAAC 


ACCTTAGCCA 


TGAATCAAGC 


TCTCCAAATC 


9420 


ATCAAAGAAA 


CTAGGATAGC 


TGGTATTGAT 


GGCTTCTGCA 


CGGTCAAGCT 


CCACCTCTCC 


9480 


ATCTCCAACC 


AAGAGGGCTG 


CGATAGCTGT 


CATCATGCCG 
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9540 
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TCCTTTGATA 
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AGTAATATCT 


GCTCCCATAC 


TATTTAAGGC 


GTCTGCCACA 


ACCTGAATAC 


GGTCTGTTTC 


9660 


CTTGACCTTG 


AGCTCCTCAG 


CATCCTTGAT 


AACTGTTACA 


CCTTGGGCTT 


GGGTCGCAAG 


9720 


CAGGGCAATA 


ATGGGCAATT 


CATCAATCAA 


TCGTGGAATC 


AAAGCGCCAC 


CAATCTCTGT 


9780 


TCCTTTCAAG 


TCAGAAGACT 


CAACAATCAA 


CGTAGCAGAT 


TTACCGACTG 


GATCGATTTC 


9840 


AGTTATTTCC 


AATTTTCCAC 


CCATGGCACG 


AATGACATCA 


ATAATACCGG 


TGCGAGTTTC 


9900 


GTTGATCCCC 


ACATTCTGCA 


GCACTAGACG 


AGAATTTGGA 


GCAATCAAAC 


CTGCGACTAA 


9960 


CCAAAAGGCT 


GCACTGGAAA 


TATCTCCTGG 


TACGACCACC 


TTCTGTCCTG 


TCAATTTTTG 


10020 


TGGCCCCTGG 


ACTGTGATTT 


TCTTACCATC 


C AC ACTT AAA 


TGACCACCAA 


ATTGTTTCAA 


10080 


CATATCTTCA 


GTATGATTAC 


GGGTG7ACTC 


TTTTTCGATA 


ATAACTGACT 


CCCCCTTAGC 


10140 


TTGTAACCCT 


GCAAACATCA 


AGGCTGACTT 


GACTTGCCCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGGTTTT 


TCGTCCCTTT 


TAAGCGAAGG 


GGAGGCAAGT 


CTCGTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CAGTGGAAGG 


GTCACACGGT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGG AC A C C 


10380 


TGAAATCAGG 


CGAATCGAGG 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 


10500 


ACCAAGGTCA 


CGAAAAACCT 


GCATGGTCGA 


AAGAACGTCT 


TCACCTCGCA 


GAATATCATA 


10560 


AACCTTCGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGGTGGC 


TGATAGACTT 


10620 


GTCACCTGGG 


ACGCGGATAC 


TACCATGTAA 


ATGGCGAATG 


TTTGTTTTTA 


GTTTCATACT 


10680 


GGACCTCATA 


CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAAGCCAGAA 


ATTCCTTAAA 


10740 


AATTCCTGAC 


TTTAGGATCG 


TTCTTTTCTT 


ATTTCAGCAA 


TTCTGAAACT 


GGTTCAAAAA 


10800 


CAATTTTTTC 


AATATCAGAA 


AGGTAAATGG 


CCAATTGTTG 


TTGCTTGGTA 


AAGAATTCTG 


10860 


ACAAGAGGCT 


ATTTCCTTGA 


ATCTGTTT AC 


CAAAGCCTTC 


CATCTTAGCT 


TGGAAGGACG 


10920 


CATCTGGCAT 


TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAC 


GCAAGATAAT 


10980 


CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTGCAATCGC 


ATCTTTAGCT 


GCTTTAACAG 


11040 


CCTTGTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGCACTA 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACG 


ACTGTATAGT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCACCTC 


TTTCCAAGTC 


TTGAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACGAT 


TTTCCTCGTT 


G ATGTGG AT A 


TTAACCAAGG 


AAGTTCCACG 


TAGCAGTTCC 


11280 


AAAATCCGC A 


GGATGACATC 


TTCTTCATCA 


, GGAACGTCAA 


. CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


1 GTCTGGCGTT 


GTTCACGCGC 


TTGGTTAAAA 


11400 
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AAGTTCCAAA TTTGCTCTTC ATCTCC CTT A CTAATGGCCT GACCAATCGC TTCCAAACGT 114 60 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

C AC ATTCCT G GCTCGCTTTC CGCAATTCGG GTCATATCTC GAAAACC AC C TGCCGCAAAG 11580 

CGCCTTGCCA TCTCATGCTC T TG AG C AT AG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 117 60 

CTTGTC AGG C TTGAAGGTGT AAAGATATAA TAGGCATTTT CAAAAAGATT GACATCTGCC 11820 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCGA ACCAGCATCT 11940 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATCAA AGCAATAGTT 12000 

TGTTTGATTG GCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT AGCAAAATCA 12060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 1212 0 

TTATAACCTA AAATTTCATA ATCTGGATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12 24 0 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12 300 

AATTTTTCCA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTCCTTCCAT GACCATTCCT 12 360 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCGTGGGTT 1242 0 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGG TTT C AT GACCCCACGA 12 480 

ACAACGATGG GTTGCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12 54 0 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12 600 

TAACCAGCCT CAAAGCCAAG AC C AAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12 660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 12720 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC A CGTTTG ATT 127 BO 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TGACAATAGA AACTTCAGAC 12840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12 900 

CCACCAAAGA CCACGACATG CTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12 96 0 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 13020 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT G CTTTT AAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 1314 0 



WO 98/18931 



PCT/US97/19588 



690 

TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13 200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13 2 60 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 13 3 20 

GG AAGTCC AG CTGGAATTCC CTCAAT AAT A GCTGTTAGAC GGCGGCCGTG TGATTCTCCT 13 380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13 500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13 560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGCT AATGCCAGCT 13 620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTGTA CC ATT C CC AT GGCAACAGCC 13 680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT C C AAAAT ACT CTCAACAGAA 13 860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13 920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13 980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 14040 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC T AG AAATT C A 14 2 20 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTC TTC AC CTTCTAAGAA ATCAAAAACA 14 2 80 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14 340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14 400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TG AAT AT CAT AAGGATGGTG AGGAATATCG 14460 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 14 520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGCAGCTTGA 14 580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14 640 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14 7 60 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14 820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTC CAT AG AA 14 880 

ACGGAACGAA CAAAGACCGA AATCTG ACT G ACGCCATCCA AAATAGCCTG TGCCAAGATT 14 940 
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GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGCAAG CACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATCC CAAACATCTG GTAGCGACCA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA C AACTGC AG C TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 153 60 

GATTTTTTTC ATCCTAGCTA G AG AAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATCAAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GC AACCTTAA CAACTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTCGGAAGTC CAGCATTTCC TCAAAAACAT CCTTGTAGCT 15720 

ATAGTACTCA AAATCAATAT AGTCTGGTTG AT AGAGTTG C GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCC GACCCACTTC GTAG CGTGAA 15840 

AACCAACTCA CCGCCTGCCA ATTTTTCAAA AATGG CTGG A GCTACCTGCA AAATCCCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 159 60 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16020 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTT ATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 16140 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 162 00 

CCTCCCGGAT TTAAAATCTC AACACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 162 60 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CT AGGCGG AT CTAGGACAAT CACATCGTAG 163 20 

GTCAAGTCTT TCCGTTTGGC ATATTTGAAA TACTCAAAGA C AT C CATC AC TATAAAACGA 16 3 80 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16 620 
ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 16 680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TT AT AC CAAG AAAAGACGGC GTAGTCGCCA TAAAGCTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAG CCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTG CTG ACT 17160 

CTTATTCTA.T TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 17 2 20 

AC C CTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 17 280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCC TCTTGGAGGA GCATTTCTTT 17 340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CAGGTTAACT CTGAGATTAT 17 400 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT T AGTG AC AAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17 520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATCTTC TATCAAAAAT 17 640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 177 60 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17 820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17 880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGG C CATC AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGAC fA 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA i8120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TCACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GAT AT AG AG A AAGAGCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18 300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 183 60 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 184 80 
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GACCGATCAT CATCAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18 540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18 6 60 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18 720 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGCCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 189 60 

AATGCACTGC CACTACCATC ACTAGAGCCA C AG AG AC AAA TAATAAGGTC AAGGACAGTA 1902 0 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGT AG AGG AC C AG AAAAT 1908 0 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG G A C AG AC AGT CCTTTAGCAA 1914 0 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 192 60 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT AT ATT ACT AA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 19380 

TTACTTTTTT 19 3 90 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18436 base pairs 
(8) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AG AAATT C AA CATATAAAGG 60 

AAAAT ATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGC TTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTT AAAAT CTTGGGC AC A GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTG G A ATCAAAGTTT T AC C AG AT AC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 360 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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CTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC ACCAGACTCT ACTGACCACT 4 80 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGCGGACAAG 540 
ACTTGGAGAA AAAAGGTTTC CTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 
GGCAACCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 
CTAATTTGGT C TTTGT AG AG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 
AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 7 80 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 84 0 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TCCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 9 60 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTG AA GAAGCCTTCT CAAACCCTTC CCAAACCAAG AACACTTACG 10 80 

AACTTGAACT CAACTTGGAA GCTAATAGCC AG AG CG AG AT TGTCTTACTT GCTGATAAAG 114 0 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGC TCAAGTAACA GTGGATCGTA 12 00 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 132 0 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GCTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GG AACTT ACT ATGAATTAGA TTATGGTCGC AAAACTAACT 14 4 0 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCCGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTCAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 162 0 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 174 0 

GAATACATCG AAATGTTGGA AGCCAATC AG GTGGACGGCA TCATTTCTGG TAG TC AC AAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 18 60 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 192 0 

ACCTTGGTCA AG ACAGGTG C CCAGTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 19 80 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2 04 0 

AATGTTTCCA GTG A CTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGG C AT TTCTGTCCCA AAAG AG C TC A AGGTC ATCGG CTATGATGGG 2 2 2 0 
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GGCGATTTTA 


TCCCCAGCTT 


TCAAGCTATC 


TGTATTTTCC 


AAGAGAGACA 


AAAGTCCAAG 


4020 


GAAAAGTGAA 


CCTGTGTAGA 


TATTCCCCAC 


CTTTTGACTG 


TAGAGAATAG 


ACTGGTCAAA 


4080 


ATGCTTTTGT 


AAGAGGTCTT 


TTTTCTCTTG 


AGGCAGGCTC 


TTATCCATGA 


TTTTTTTCAA 


4140 


GCCTTTTAGC 


GCTAATTTAG 


GAT AAGGCAA 


GTGGAAACAA 


ACAGCCGCAA 


AATCATCCAA 


4200 


AGTAAGCTGG 


TAGCGTTTTT 


GATATTCAAG 


CCAAGTCGTT 


TTCAAACTAT 


CCAAGTATTG 


4260 


TTGGGTAGAA 


TAGACACCAT 


TTACATAAGG 


AGTTGTCGAG 


TAATTTGGTC 


GCCAGAAATC 


4320 


CATGATCTCA 


CGGGTCTGAG 


CTACATTGTC 


ATTATTAAAG 


GCCATCATGC 


GTGGATTTTG 


4380 


TGTAATCAAC 


ATAGCTACAC 


TTCCAGCACC 


TTGACTTCGT 


TCTCCTGGAG 


TTTCAATACC 


4440 


GTATTTGGCA 


ATATCACTGG 


CAATGACCAA 


GACCTTGGAC 


TCCGGAGAAT 


TTTCCACATG 


4500 


CAATTTGGCA 


TAATGGAGGG 


CAGCAGTCGC 


TCCGTAGCAG 


GCTTCTTTAA 


TCTCGAAACT 
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ACGAGCAAAG 


GGCTGGATGC 


CCAGCAAGCC 


ATGCACAAAG 


ACGGCCGCAG 


CCTTACTCTG 
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GTCAATTCCT 


GACTCGGTCG 


CCACAATGAC 


CATGTCAACT 


TCTTGTCTTT 


CTTGCTCAGT 


4680 


TAAAATAGAG 


TCACTAGCAC 


TCGCCGCCAA 


GGTCACGATA 


TCCTCAGTTA 


GGGGCGCAAT 
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ACTCAATTCC 


TTGAGTAAGA 


GTCCTTTACT 


TAATTTTTCA 


GGGTCAATTC 


CCCTCGCTTC 
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TGCTAAGTCT 


TCTAATTTCA 


AGACATATTG 


ACTGGTCGCA 


AAACCAATCT 


TATCAATACC 
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GATTGTCATA 


TTTACCTCTG 


TTTTATCATT 


CATGTAAAAA 


ATCGTTCTAT 


ACT ATTTT AT 
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CACAAATGGC 


AGTAAAAGAG 


AGAAAAAAGA 


CTTGATTCAC 


CAAATCAAGC 


CTCTTATTGG 


4980 


TCATCATTTT 


AAAGAATGAT 


TAGTTG CT AG 


AGAGTTCACC 


GATATAAGTA 


GCTTT AT AAG 


5040 


CTCCATTCAC 


AGTTATCAGC 


TCCTGGAGGA 


TCAAATTTCC 


TGAGTAAGTC 


CTTCCCATCT 


5100 


CATCTACAAA 


TTTTTGATAA 


AACTG ACT GG 


TCGGAATTTC 


TCTGACATCC 


TTATCAAATG 


5160 


TCTTATCAAG 


TGTTTTACTA 


ACCTTCTCAG 


CAATCAATTG 


ATGCTCTTGC 


CATCCACTTT 


5220 


GAAACTCTGA 


GCCCGAACTA 


GAAACCATGA 


CTGGGATAAA 


CAACAAGGTC 


AGTAGATTTA 


5280 


CAGACAATAA 


GGAAAGTAGT 


AGACTTCCTG 


CAAAACTAGA 


ATCCTAGTTC 


ATGATTGATA 


5340 


AT AC C AGCAA 


TCAAATTCAT 


TCGTAATCCG 
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GATGATTTCG 
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T ACTTTGG C A 
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CAACCTTGCT 


TCTCTCCTTA 
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ATATATCTTC 
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TAGTTCACAG 
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CGCTTGAGCC 
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TCATTCGCTA 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AGCTGACAGG AGTTCTTGAA 5820 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 58 80 

CTTTCGTGGA TACCAAAATC AGCCGCAATT T CTT CAT AAG TGCGGTATTC TCGC AC AT AT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GAGATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTGCCAAA ATGGCCAAGA TAAAGACCAC ACCAGCTGTC TTATAAAGAA TACTTAAAAT 6240 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGC 6 3 00 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 63 60 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTG ATA ATGGCAAGAA AACTTGCTAC 642 0 

TGCTTGTAAT AAAAACCTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 64 80 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

GCATGACCGC ACCAGCTATA ATCAAAT CTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 672 0 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 6 7 80 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 684 0 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTCCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TC AATCTGC C CCCCAACTAG CTGACGAGAA ATCTGGTTCA 696 0 

CATCGCAGAC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 714 0 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7 2 00 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 72 60 

AAAAAATCTT TGCCATGAAA TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 73 2 0 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7 3 80 

ATTGAGGTCT ACCTTTTCAC CTGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTCCACT 744 0 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 
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TTGAATCAAA TTCTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7 560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CATCATAAGT 7 620 

CGTTCCGCCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7 680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 77 40 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTKGCTTTTT TAGCTACTGT 7 800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7 9 80 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8 040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 82 80 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 83 4 0 

GTTTTTGATT CATTTGTCTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 84 00 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 84 60 

ACTTTCATTT CTACGATTGT AG G AC AAGCT ACTAGACAAG TCCACATACA TCTCTGAAAC 85 20 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8 5 80 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATCCTGGAC 8 6 40 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 87 00 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 87 60 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTCTCTCTGC 8 8 20 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8 8 80 

ATTTTGAGGT TTCAAGTCCC TCTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 894 0 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT. 9000 

AT AG CGTT TG ACGTCCAGTC CAGCCACATA CTCCATAGCT AGG T ACTGTT GACCGTCTTC 90 60 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 912 0 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 92 4 0 
TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC C G ATTTGG AT . 9 3 00 
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LA* ll» luvnl 




CATAGAAACA 


AGGGCAACCG 


TAATGTTGTC 


TAAACCTCCT 


9360 




CAAAAC*fiAAC 


AAGTGTCTCC 


GTTTT ATCTG 


CTAAAGGAAT 


ATCACTGGTT 


9420 






GC CTGAAATC 


ATGTTGGTCA 


AGCCGTCACT 


ATTGAGCAAG 


9480 


AGATACj I LAV- 


pTparTr a an 


f^ATAACTGTC 


CCAAAATCAG 


GCTGAATTTC 


ATCTTTTTGC 


9540 


ft iv T* A /"^ ft 

CLAAi AOAL 1 




A' PT'P^^TTCIC 

r\ 6 I 1 X X * i w*— . 


GGATGAGCTT 


CTGCCTCTTC 


TGGTGTCAAT 


9600 




L UAwCnn I I 


ATTAACCAAG 


GAATGATCGC 


TCGTCAACTG 


ATGGT ATTCT 


9660 


TCTCCACCjAA 


L LAAuv.LvjA 1 




CCAATATGAG 


CATAGATAGC 


CTGATTATCA 


9720 


AT AAT AGCAA 


UuAL J ll_L.A/\ 


AulAul 




AAGCTTCATC 


CTGACCAAGC 


9780 


TGGTG AA 1 1 


TTTGATTTTC 


AATTTCTAGG 


TAATGGGCGA 


ACCATTCACG 


CACTTCATTG 


9840 


ACTGTATCUA 


1 V, I VjVjVj I Ai\. 




AfACCCAGGT 


CTGTGACCGC 


CATTTCACTA 


9900 


#— * j— ■ ft/n ft 1 T'/**i^* 

GCGATATTCt- 


\_ 1 (j^uLun 1 vj 




CCATCAGCTA 


AAATAATCAT 


GGTACGTCCA 


9960 


GCTCT ATTG A 


t»A 1 Au 1 1 i 




TCCTTATTTG 


TTCGTTTCTG 


ACCAACATCT 


10020 


gttaataatg 


ft ft ft <|MI «(»/"»/"* ft *"P 

AAA1 ILUAi 






CG AT ATCTTG 


CGAAATTGAC 


10080 


fjf^ ft t*^* ft ft r & 
TGATQjAAC*AA 


TrriffarTT 

I Wt. A 1 LAL L 1 




CAGGTGTAAT 


GAGGATACAG 


CCGTCTTTCA 


10140 


TGATATCCTT 


ALA I 1 wA 1\» 1 


Tp , ra( I'P'fi'M'A 


CCTGCTCGAA 


CTCGGGATGA 


CTCTCTAAAA 


10200 


AGGCCTTAAv. 


UAL 1 A tj/Wvi 




AGACGATAGT 


GCAGGTGCTA 


TAAGTrATTA 


10260 


TACCACCTTT 


1 1 l\ I * 


TC! A rAAACAC 


TACCTAATAT 


TTCTAACTGA 


ATTTCCTGCA 


10320 


AGGALGCGAA 


AIL 1 ULUu i 1 


TP TTT ATTfZT 

1 V« I l 1 f\ 1 IVJ 1 


ATTTGATATC 


TGGTTTTCGG 


CGCAAAAGAC 


10380 


CGATTCCTCjA 


AL AAuuAOLn 


TPPArrAAAA 


TCTTATCAAA 


GGAATCCTGG 


TCAAAAAACT 


10440 






AAT TTTTG AG 


TTTGAACCCG 


ATCTTCAACT 


CCCAGACGTT 


10500 


{jLiOLAl 1 I 11. 




TC C AACTTGT 


GGTCGTACAA 


GTCCAGAGCA 


GTAACCTGAC 


10560 


CTGTCGTAAG 


ATAAGAGGCT 


ATATGGGCTG 


TTTTCCCACC 


TGGAGCCGCA 


CAGGCATCAA 


1 Pi C ~i (\ 


GCACTCGCTC 


ATCACCTTGT 


AAATCAAGCG 


TCGGAGCAAC 


CAGCTGACTG 


GACTCATCTT 


10680 


GGATGGTAAT 


GGCTCCATCC 


GCAAACAAAT 


TATGCCCTGC 


AAAATGCCCC 


TGCTCCTTAA 


10740 


CCAGACCAGT 


G GTTGCT AAA 


AGGGAATTAT 


TCGCCTCCAA 


CAAGGCTTGG 


ATTTCCTCTT 


10800 


TTCGACTTAG 


GTCTGTTACA 


CGAATACTGG 


CTTTGTTT CG 


CACTAACAGG 


CTTTCAAAGA 


10860 


TGCCTTTTCC 


TCTCTCCTCT 


CCGTATTCTT 


CCTTGAGTTT 


GGCAACTAGC 


CAAACTGGGA 


10920 


GAGAATAGGC 


AATGGAGTCA 


CGCTTGTTTT 


TTCGCTTGAT 


G C T AG C AAT A 


TCTGGCCAGC 


10980 


CTTCACGCAA 


GATACGGCGA 


AGGACAGCGT 


TGACCAATTT 


TTCACTCCCT 


TTTTTACGGA 


11040 
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GTTTGGCCAA TTCCACTCCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC 11100 

GGAGTTGGTA GGCACTCATG AGAAGAAG G A CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 11160 

CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 11220 

CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 11400 

ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 11460 

AGGGATAGAG CCCCTTCAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AGAGAGAATC 11520 

TCACCTGGAT TTCCCTCACC TTCTACTGGT AGGGCTTCAT AAATCTTAAA GCGGTCGCCC 11530 

TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTCA 11640 

CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT TT AT ATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 117 60 

TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 11880 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGAT AG G C 11940 

GCACCACCAC GGTGTCTAGG AAGGAGGGAG GCATGAACGT TGACAGCAAA GTCCATGCTA 12000 

TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12060 

AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGATAATT TTTCAGGTTG GT AG AT AG A T 1212 0 

AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGGTTTCTT GGATAACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 122 40 

CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA AG ATT AGTTT TGTCATATCT 12 300 

TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATGC TCACACGGAG CTCACTATTT 123 60 

TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 1242 0 

CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTGGG TACGGGCAAT CGGTTTTGGC '-2480 

GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12 600 

TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 12660 

TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 12720 

ACTTGACCTC CCTTTTCAGC CCGACCTGAT CGACCTGCCA CCTGAGTCAA GAGCTCGAAG 12780 

GTTCTCTCAG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG TATCCGCATT TAGAACTCCG 12840 
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ACTAGGGTAA 


CATTGGGAAA 


ATCCAAACCC 


TTTGCAATCA 


TCTGAGTACC 


AAG T AAAATA 




TCCGCTTCCC 


CTCGCCCAAA 


CTGGTCAAGC 


AAGGCTTGGT 


GACTGCCTTT 


CTTTCGAGTC 




GTATCCACAT 


CCATCCTCAG 


AATGCGAGCT 


TGGGGAAAGA 


GTTCTGCTAG 


CTC ATCAT AA 


13020 


GCCTTCTGAG 


TTCCCCTCCC 


ATAGTAACGA 


ATACTGCGGC 


TCTTACACTT 


AGGACAGACC 


13080 


TGAGGAATAT 


CCTTCGAGAA 


ACCACAATAA 


TGGCAGTTCA 


TAGTCTTGGT 


ATCCATATGC 


1 3 14 0 


AAGGTCAGAG 


AAATATCGCA 


GTTGGGACAA 


GT AT CC AC CG 


TCCCACACTC 


CCGACACATG 


13200 


ACAAAGCTAG 


AATAACCACG 


GCCATTGAGC 


AT GAG AAC C A 


CCTGCTCTTT 


TTTAAC C AG A 


13260 


CGGTCTTGGA 


TAGCCTCTAG 


CAAAGGAGGC 


GTAAAGTTTG 


ACGTCTCATT 


TTGTCCGATA 


13 3 2 0 


TAGTCTCGAA 


AGTCAATCAC 


TTGAACCTCA 


GGGATTGTAG 


CCAAAGGATT 


GGCACGTTGG 


13380 


GTTAGACCTA 


AGTGTTGATA 


GACGCCTTTG 


CCACCACGTG 


CCCGGCTCTC 


TAAGCTCGGC 


1344 0 


GTTGCAGATC 


CAAGTACCAG 


AGTTGCTTGA 


TTATACTGAG 


CCCCTAAAAT 


AGCT AC CT CT 


1 3 500 


CTGGCATGGT 


AACGGGGATT 


GCTGTCCTGC 


TTATAAGCCG 


CTTCATCCTC 


TTCATCAATA 


13560 


ATCATGACAC 


CCAGATTTTT 


CACAGGAGCA 


AAG AT AG C AG 


ATCTGGCACC 


AACAACAACT 


13 620 


TGGGCATCGC 


CACGCTCCAC 


CTTGCGCCAT 


TCATCATACT 


TTTCACCATT 


GG AT AATC C T 


13680 


GAGTGAAGAA 


TGGCTACCTT 


CTCCCCAAAA 


CGTGCTATAA 


AACGCTCGGT 


CATCTGAGGA 


1374 0 


GTCAAGGAAA 


TCTCAGCTAC 


CAGCAAAATA 


GCTGTCTTGC 


CCTTATCCAG 


GCCACCTTGC 


13800 


ATAATCTGCA 


AGTAAACCTC 


GGTCTTCCCA 


CTTCCTGTAA 


TCCCTTCAAG 


TAGAAAGGGA 


l Joou 


GGTTG AG AAC 


TGCCAATACA 


ACTCACAACC 


GCATCACGCG 


CCTGTCTTTG 


TTCTGGATTT 


13920 


AACTCCAAAG 


GTCTACTTGC 


TTCAATTCCT 


TCAAAATAAG 


CAGCCGAGCG 


TTGAACTTCC 


i toon 


TTTTGGACTA 


TGGTAACAGC 


ACCTTGATCC 


ACAAAGAAGT 


TGACTTGCTC 


TCGCGAGTAG 


1404 0 


GACTCTAACA 


AGCTAGCCAA 


GGAAGCGCTC 


TCTGCATGAG 


ACAGCAGATA 


ATCTCTCAGT 


LH 1 UU 


TCCAACTTTT 


TCTTGGCACG 


TGT AG AAATC 


TCAACACCTT 


CTAATTGAGC 


ATGGTCAACC 


14.160 


TCATACCAAG 


ACTGCCTCTT 


GACCTTCTTT 


TGATCCACTC 


CCTCATATTC 


CAGACCAACC 


14220 


AGGCCTTTTC 


TACT C AAA CG 


C ATC ATTTC A 


GCTTCCTTGG 


CAAGGTCTAG 


TCAAGAAAAC 


14230 


GCTAGCGAAT 


CTTCTGAACC 


AAACAGGCGC 


ACTCGTTCTT 


CCTGACTCAA 


GCCTTCCAGA 


14340 


GGATAGAGAA 


TCTTGTCATA 


GCTAGAATTC 


AGAAACCCTG 


GAAGCATCGC 


CTTGAGGATA 


14400 


G AG ATTTTGT 


AGGAGAAGAC 


AGATTTGCGT 


AACTCCTCAG 


CCAGCCAGAG 


TTCTTCTGGC 


14460 


GTGAGAACAG 


GAGAAAAATC 


CAGCACCTCT 


GCAATATCTT 


TTAAATCTTG 


CTCCATCTCT 


14520 


TCTCCATCTG 


ATTGGGACTT 


CAAACCAAGA 


ACAATCCCTT 


GAATCAGGCG 


ATTACCCTTA 


14580 
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CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14 640 

CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14 700 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 147 60 

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTCTTT AGCAATTTGC TCTTTGATTT 14 82 0 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 

GTTTTCCTTC TGCATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14 940 

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTCCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120 

TTGGACATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15 240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT CGTTGGGCAA 1S300 

TCACTTCTGC ACT ATCTG TT CCACGACCTA CCAAGCCATC TTGCAATTCA TCCAAATCTG 15 3 60 

GTGCTCTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 154 20 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGGTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AG CCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 1578 0 

T AATG GC AAA AAGCCAGATT ATCCTTT AC A GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACCCTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 1596 0 

TGTGAGCCAA GATTGTGACT TTCTCGTCCT TCATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTC AAG AG A CTCACTACGA GCACCTGGAC 16140 

GGGCTGCGCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG CGGTTCCTTG T ACTT ACGGG 16260 

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 163 20 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 16 380 
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TGATACCAGC 


CAACTTAGCA 


ACCTCAATCG 


AATGGCGCAA 


AACATTTTGT 


CCATATGAAG 


16440 


TACGGAACTG 


CAAACGTCCC 


AT AATCT TC A 


TCAAGTCTGG 


ATGAAGGTTT 


GGCGCACCAA 


16500 


TTTCATAGGC 


AGCAGCCTCA 


CCGTATTCAC 


GAATCTTATT 


GTCAATCTCT 


TGACGGTTTT 


16560 


TCTCAACCAA 


CTCTTCGATA 


CGAGCTGGAT 


GTATACGACC 


ATCTTTGAGC 


AACATTTCCA 


16620 


TAGTCATACG 


GGCAATCTCA 


CGACGAATCG 


GATCAAATCC 


TGACAAGGTC 


ACCACTTCTG 


16680 


GTGTATCGTC 


GATAATCACA 


TCGACCCCTG 


TCAAACTTTC 


AAAGGTACGA 


ATGTTACGAC 


16740 


CTTCACGACC 


AATAATGCGT 


CC CTTCAT AG 


TATCGTCTGG 


CAGATGAACT 


GTTGAGTT TG 


16800 


TTGACTC C C C 


T AC AT ATT C A 


CCAGCGATAC 


GTTGCATAGC 


TTGAACCAAG 


ATGTCCTTGG 


16860 




AGAACGTTCC 


TTGACCTCTT 


GCTCAGCTTC 


GCGAATGCGA 


C TGGC AATCT 


16920 




GTTTTCCTCT 


GTCTGAGCCA 


AGATAATATC 


TCGTGCTTCT 


GCCTGAGACA 


16980 


GCGCACCAAT 


ACGCTCTAGT 


TCTGCTTCTT 


TTTGTCTTTC 


GACTTCCTCT 


AATTGCTCTT 


17040 


CACGCGCATC 


AAGGTTTTT C 


GCTCTATCAG 


AAATACTTTG 


TTCTTTTTGT 


T C AAGTGTTT 


17100 


HTTrTTTACT 
u 1 1L 1 1 1 nv. 1 


CGTCAAATTG 


TCGTCCTTAC 


GGTCAAGGCT 


AGTAGCTCTC 


TCTGTCAAAC 


17160 


GACTTTCGAT 


TTG TTTG AGT 


TCTTGACGTT 


CTGATTTGAA 


TTCAGCGTCC 


ACTTCTTCAC 


17220 


fiGT A.TTTT CT 


GGCTTCTTCT 


TTGGCCTCCA 


ATAGTGCTTC 


TTTTTTAAGA 


GACTTGCTTT 


17280 




TTCATTAACA 


AGTAAATCCG 


CTTCACCCTC 


AGCTTGT C C A 


CGTAAATTAG 


17340 


TTGfTTC TTG 


T TC AGC ATTT 


AAAAGCATCA 


ACTCTGCAGC 


TTCCTGAGAT 


GATTTCATCT 


17400 


TAGCTGAGAT 


GCTGACATAT 


CCAATGACTA 


AACCAATGAT 


GACGGCAAAA 


ACAGCAATCG 


17460 


fAAGCGACAT 


G ATTT CC ATG 


TTTTTACCTC 


ATTTT ATTGT 


TATTCCGAAT 


GACATACATT 


17520 


CTTTTACATT 


CT AC CAT AAA 


AAAGTGATTT 


TC AC AAACCT 


AAAATAGAAT 


ATGTTTTGAC 


17580 


GAATTTGGAA 


C AC ATTT AC C 


AAAATAAACT 


TGTTGTTTAG 


AAATAGTAGT 


TTAGTAGAGA 


17640 


CTTGAGAAAA 


AGCCTACCTT 


T C AAT AG ACT 


T AGTAA * OA I 


l_ 1 » L /\Af\OVjrt 


fAAGAAAGC C 


17700 


ACCCTATCTC 


CATC CAT CAT 


ATAAATCAAG 


CGATTTTCTG 


CATC AAT A CG 


CCGTGACCAG 


17760 


GCTCCTTGGT 


AATCATATTT 


GAGTGGTTCT 


GGTTTACCTA 


TTCCTGTAAA 


GGGATCACGT 


17820 


TGAATATCCT 


TGATTAGTTT 


ATTGATTCTT 


TTTAACGTTT 


TCTTATCCTG 


ATTTTGCCAG 


17880 


TAGCAATAAT 


CTGCCCAGGC 


ATCTTCTGTA 


AACTTGAGCA 


GCATTTCTTA 


CTCCTCAATA 


17940 


AC ATGG AC CT 


GAGTACTTCC 


AGCACGAACT 


TGAGCCATTC 


CTCGCAAAAC 


CTTATCAGAA 


18000 


ACTTCCTTAT 


TTTGAGCAAT 


TCTCAGGGTT 


TCTTGGATAC 


TATCCCACTC 


ACTCTTTGAA 


18060 


AGGACTACAA 


TGTCCTCATC 


TGGATTTTTA 


TTGACCACCG 


TCAAAGGCTC 


AAATTC ATC A 


18120 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATC TTGAGTAAAG GACTGCTTCC 18180 

ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATCAAAATC AAAGAGCAAA 18300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG C G ACGTTG AC GCGATTTGAA 183 60 

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGCTAGACT 18420 

TAGTTTGTTT CTATTC 18436 
(2) INFORMATION FOR SEQ ID NO : 88: 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


88: 






ACGTAGAAAA 


ACT ATTTCTA 


TCACAGATAA 


TATTCCGTAT 


GTTGTTGGAG 


GTATTGAAAT 


60 


AAACGTCCTA 


GGTATCTTTC 


TCAGTCTATG 


TGACTTACAA 


GGGAAAACTC 


TTTTCGAG AC 


120 


AGAAATTTTG 


AATGAAGATT 


ATCCTATTTC 


AG AAA TC AAT 


TCCACCATTA 


CCAATATGAT 


180 


AAAAACAGCT 


ATAGAGTACG 


TCCCTTTGGA 


AACAAAATTA 


CTTGGATTTG 


GCTTATCAAT 


240 


ACCTGGACAT 


TATAACAAAG 


ACTCCGGAAG 


TATCATTACA 


AACAACCCCA 


TATGGGAATC 


300 


TTTTAATTTA 


TTAAATGTAA 


TTAAAAGATT 


CAATTTTCCT 


TTTATTGTAA 


AAAATAATAT 


360 


CGATTGTATG 


GCTATAGGAC 


AATACCTTTT 


TAATCCACAC 


AATACCCCCG 


ATAACTTTAT 


420 


TTTCCTACAC 


GCTGGATTAG 


GTATTTACAC 


TTCCTTTTTC 


ACAAAAGAAA 


AAATAGGAGC 


480 


CTCTAAAAAT 


CCTTATATCG 


GAGAAATTGG 


ACACACCATT 


GTCGAATTGA 


ATGGGCAATA 


540 


TTGTGAATGC 


GGAAAAAAAG 


GTTGTTTACA 


AACATATATT 


TCGGATGCTT 


GGTTAATCAA 


600 


ACACGCCCAA 


TTATTATTTA 


AAAATTCCCA 


ACTAACTGTA 


CTAAAAAGCC 


TTGTAAAGAC 


660 


TG AAAAAG AC 


ATTCATTTAG 


ACACCCTTTT 


AACGGCTTAT 


AATTTAGGCG 


ACTCCGCTTT 


720 


ACGTCAACAA 


ATTGATAAAG 


GAGTCAATTT 


ATTAGCCACT 


TCTATTGCAA 


ATCTCCTCCT 


780 


CATCAATCCT 


GCTGATAAAA 


TCTATATCAA 


CAGTCAATTG 


CTTAATTATC 


AACCTTTCAC 


840 


TCATGAAGTC 


AGGGATAAAA 


TCCAAGACCA 


GCTCCACTTC 


GTTCCCTTTA 


CTCGTAATAT 


900 


AGAAATTGAA 


ATTTTACCTT 


ACAACAAACA 


TCGTGGAAGT 


ATAGGACCTT 


GTGCATTAGC 


960 


TATCGTCGCT 


TTTTTCATAG 


AACATAGCAA 


TGTATTACAA 


GATATTATTT 


CACCTTAATA 


1020 


TATTAGAAAT 


CTATAGACCT 


GTTTAAATCA 


ACTATAACCT . 


GTAGTAGATA 


TCTCGTATTT 


1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCCCT ATGTTGAGTG 
ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA C ATT CATC AT 
AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 
T ATGAAATT C TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 
TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 
TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TC AACTT C AT ATGGCTCAAA 1440 
AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 
GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAACA GGCATCATAA CTCTAAAAAG 1560 
TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 
AAAAGAAACG TCAAATGCTC ATGAAACAAC GAAT AC AGGT ATCAAAACTA TGACAAAACA 
AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGT AAATGGT TGTTGTATAA 
TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 
TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 
GTGCTTTGG C ATTCCTAGCC TTCATATCAT TTAAAGAATT GTATAGACAA AATTTTTTCC 
AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CAT ATTT AGG AACAGGATAA 
GATACAAGAG TATTAATCCA TAGCTCAGTT CT AT A C GAAT CTAAGACAAA TAAGCTAAAA 
AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTT AAT AAA C 
AATACTAGTG TAACTATCCT TCCACTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 
AAAATTT ATC TCGTTAGGCA ATCAACCAAA AACTCGACGA TAGTACAAAC ATT ATC AT AC 
AGG ATT G ACT TCCTAAATTA TATACTTTAG TAACGTTTTC GGATAAGAAA AAAGGTTCAT 
TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 
AGCAACAAGA AG ATTTTC AG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 
TTTTG AAT AT AAA CT AC AAT AAT AT AAAC T AAATTTT AT A GGAGGAAGAC AATGG ATTGG 
TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 
TATTTGCGAA AAGT T ATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGT AG AAAAG 
TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 
GAGCATACTC CAACTCATAA AT AT GTG ATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2700 
GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 27 60 

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2 880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 294 0 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTC ATT AGAAAGAATC ATATGGAACA 3000 

ATT AC ATTTT AT C AC AAAAT TACT AG AC AT TAAAGACCCT AAT ATCC AG A TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3 240 

TCACTGTTCA AAAAT G ATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3 300 

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTCATTGAC AACATTTCTA TGACCGATAT 3 3 60 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 34 20 

TGAGCATGAT TTTTCGCGTC TTC CTG AG AT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 34 80 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3 600 

TAT GAT AG AG CCGTCCGATG TCGCGTCAAA ATT ATT ACTA TGGATATGTT T AGTCCTT A C 3 660 

TATGACTTAG CT AG ACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3 720 

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3 780 
CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3840 
CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3 900 

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3 9 60 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4 020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 
AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 
AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4 200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4 2 60 

CGACCTTACC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4 3 20 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4 380 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTCC 444 0 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4 500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTT ATT AAA 4 560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4 620 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC 
CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC 
GACATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTCAG 
AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT 
CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC 
AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC 
GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA 
CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT 
CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG 
TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC 
CTTTGACAAT C CGTCG ATT A CGATACACGC GATACAGCAA 
CTCCTACTAA TAACCACAAC CAGAATTGCC CACGCTCTCT 
CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA 
TATACGGTCT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC 
CAAAGTCATT CGGCTCCACC GTC ACT ATCT GATCCACTTG 
AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC 
CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT 
GCGAAGCCCC TTCTAACAGC CCTGCCCCTT TCTGAAGAAT 
ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA 
TT AG CAT ATT GGCATATTCT GAGACGCCTT TCTTTTTCTC 
GAATTTCAGA TGGTTTCAAG GTCGCATTG A AGGCTTGAGC 
GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC 
CAATACGATA ATAATAACGA GACACCAATG GATATATCGC 
AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT 
TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG 
TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCG AATAA 
AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC 
AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC 
GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT 



ATCTG AT ATT CTACAATTTC 4 6 80 

TGATCCAAAT GGCGGAAAAA 47 40 

CGAATCCCTG TACCATCCAG 4 800 

AAATGATGAT AATCTGCTCC 4 8 60 

AAGTAACCAT AGACTGCATC 4 9 20 

GCCAAAAAAG GATCTACAAT 4 9 80 

TGGTTCTGTT GTTCTTGGTA 5040 

TTCACCTGTC CAAGAGACTG 5100 

CCTACCAACA TCAACAAGTA 5160 

CTCCAATTGC TTTTCTAGTC 52 20 

GAGAAGGATG ACCGCCATCG 52 80 

CACCGCTCCA TTCCGCTCTG 5 3 40 

CAGACGATGA CTGTT AATC A 5400 

ATGTTGAATC AAGACAGGCT 54 60 

GTAGGCCAAC ACCTGATCTA 552 0 

CAATTGACTG AACAATTCTG 55 80 

ATTTTC AC CT CCAACAGGCA 5 64 0 

CTCCTCACTC CTTCCGACAT 57 00 

GCCAATCCGC TCATGGACCT 57 60 

TTGCTCTGTA AAAGGATCAA 582 0 

CAAGCGCCAA CGCTCCTCAA 58 80 

TTTAATAACC TCGTTTGACT 5 9 40 

AACGGCGAAT CCTACTAAGA 6000 

GCCTTTTTTT CGTGAACGTC 60 60 

CTTTCAGCGC CTTCAAAGCC 6120 

TCCATAAAAG AATCACAATC 6180 

TGAGTTTGTT TGCTGCAATA 62 40 

CCCGAACCAA TAGACGATCG 6 3 00 

AATCATGACC TGGTACAATC 63 60 
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AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 64 20 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 64 80 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATAOTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTCTGC CAATTTCATT 67 80 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG G AT ACGCC AT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6 960 



ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104X1 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



GAGGGAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAAGGTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTCTGA 


TACGATGCTT 


TGTTTAAGGT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTGCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTGTTATGAC 


CTGATCAAGT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGGC 


300 


GAGTCCTGAC 


TTGATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGA1 ?T 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA 


AGCTTTTAGC 


TCTTCTTGAG 


CCAAATCTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAAGC 


480 


ACTTTCTCCT 


GTAACTGGAA 


TAGGGGCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGCT 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGGTGTTTC 


600 


AATCTTGACC 


TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


C C ACATTC AT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTCT 


TTGCTATCTG 


TTGACGCATT 


GTAGCCCAGT 


TTTTTAAGAG 


TTTGATTTTT. 


780 
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TTGCTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGCT TGGACATAGG TTTCATCGAT 84 0 

AACTTTTTGA ACATCTGTTG CTGC ATGG AC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 
CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 960 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTCGTCTAG CCACATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT LO80 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGGG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 12 60 

CAAAATTATA GGCTTGAACA GCTGTCC AG A TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TC ACT AG ACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 14 4 0 

CAGTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTCA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AG C AAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 16 20 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAACATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACCA TGAGCTCAAT 17 40 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCG AC TGGCATCAAT 1800 

TTCATCAAAT TCAACACGTT CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 18 60 

AGCCGTCAGA TTGTTGTCAA AAAGGTCATT AGCCAATTTC TCACGTGACA ATTCATTGCT 19 20 

TTCTTCTACG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 19 80 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2 040 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2 100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTCCTACTC TGAAGATTGA CCACCAAGGC 2 160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 22 20 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2 2 80 

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTCGTCTTGA GATTTTCAGA 2 3 40 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2 4 00 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2 4 60 
ATACACATGT TCAATTTTTT 7ACGCAGGTA TTCTTCGATT TTTCGAGTAA TATTCAGAAA . 2 5 20 
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CTTATCTGCT AAGAACAGTT CGGTATCATC CGGACTGAAC TGGTCAATAA TGGCTTTCTT 2 580 

AATATAAATC T C CAT AAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GT C AATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TG AATTCTT A AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 27 60 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 2 820 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2 880 

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 294 0 

AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATG AG CT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3 300 

ATTTTGAAAA ATCGATAATT TGAGAATACG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3 360 

TTACTTCTTG GGCTTGTTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3 420 

CACTATAAGA AACAAAGTTG T AGGTTTG AC CAGAGAACCT AACAGGAGCC CCATGAGTCA 3 4 80 

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3 54 0 

TGTAAGCCGC ACAGTTACCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTT AGC AC 3 600 

CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT AC AT C AGTTC 3 660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 37 20 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 37 80 

GTTGGCGTTC TTCTTCTTTG GC AAT AG CAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 3840 

CAT C TT T GTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 3900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3 960 

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AG AATTT AAG 4 020 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4 0 80 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACT GCT AAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4 200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4 260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATCTC AC AT AT AG CT 4320 
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TTCACATCAG 


TTTCTCTTAA 


ATCTCTTAGC 


ATCTCATTCC 


TCCTCAAAAG 


AAATCTTTGG 


4380 


CAACCGAGCA 


AGAATATCTT 


CTCGCTTAAT 


GGCCCCTTGA 


CGTAAGATTT 


TC AC CTTGTC 


4440 


TCCCGACAAA 


TTCAAAATAG 


TTGAATCCTG 


TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4500 


CAGAACCTCT 


TGGTCAAAAT 


CCTCTAGAAT 


TTGATTAAAG 


GTCACTCCAC 


TCCCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC 


CAATCAAGGG 


ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACTA 


GCCATCCGAA 


ATCCAACAGT 


TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4680 


TCGGTCATTA 


GCTTCGAGAA 


TAATGGTCAA 


GGGACCTGGT 


AAAAAG AT CT 


CTACAAGTTT 


4740 


TTG AAG AT AA 


GTTGGCTGAT 


TCTTTGAAAA 


GTACAAGATG 


TCCTCTAAAG 


AGGCAACATT 


4800 


GAGATTGAGC 


GCCTTGTCTC 


TACGTCGACG 


TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4860 


TTCGTCTAGC 


GCCTTAGCAA 


AGAGACCGTA 


AACTGTCTCT 


GTAGGCAAAA 


CGACAGCTCC 


4920 


AC CAT TTTCC 


AACTCTTGTC 


TAATCCTGTC 


CATCATCAAC 


GACAACCATC 


CTATCTTGAC 


4980 


C AAATTGGT C 


CTTG AGTGT T 


CGTACTCGCT 


TTTCAGGAAG 


ATGTTTCCTA 


AAAACTTCAG 


5040 


OAACACTTTG 


ACCTTGCTTG 


TATCCAATTT 


CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 




ATCTTCCGCA 


ATTCT AC GGT 


AAATAGCTAG 


GCCATCCTCA 


TCTGCAAAGA 


5160 


G AGCT AG ATG 


AGGCTCCGAA 


T AC AAG AC AT 


TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AG AT AT AGGG 


TGGATTGGAA 


ACAATTATAT 


CAT ATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CAGATTTTTT 


TAAAAATATT 


TGAAGATTTT 


GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAG CATCTT G 


GGAAATATCT 


GCTGCCGTCA 


CTGACCAATC 


TGGTCTGTTT 


TTTCCTAGAC 


5400 


CGAG AG CAAT 


AGCTCCACTA 


CCTGTTCCGA 


TATCTAGGAC 


CATAAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG 


GATAAGCTCC 


ACCAACTCCT 


CTGTTTCTGG 


ACGAGGAATC 


AAAACCCGTT 


5520 


CATCCACCTT 


TAAATGCATT 


C C AT AAAAAT 


CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG 


TAAATATL I [ 


V_ 1 1 I'J 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCCTGCTG 


GAGGGCAAAA 


ATAAAGTCTG 


TAAAAGATAG 


ATTTTTCAGA 


CTACGATAGA 


5700 


C AAAAG AG AG 


GCTTTCCGCT 


TCCTCTCCTT 


GTCTTATCAA 


CTCTTCTTCA 


AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT 


ATTTGTTTAA 


TTCTTCTAGT 


TTTTGTGTTT 


GGTCATAAAG 


5820 


CACCAAGGCA 


TC C AC AACTT 


CGTCCAATTT 


ACC AG AC AAA 


ATCGTATCTA 


GTTTTTGG AG 


5880 


GGTCAAGCCG 


ATACGGTGGT 


CTGTGACACG 


GTTTTCTGGG 


AAGTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC 


CGATTGTCGA 


CTTACGCTCA 


GCGTCCTCCT 


CAT CTTG AGC 


6000 


AATCTGAGCA 


AAGTGGTCAG 


CAACACGGGC 


AC GG ATG ATT 


TTCATCCCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA ACTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6 180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 62 40 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 63 00 

CACACGGTGG GCACCTGATT CATACTTAAG CTT AG AG T AT ACAGACTGAC CTGAAACCAT 63 60 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 64 20 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6 4 80 

CGCTTCGTCT CCACCAGCTG CTC CACGG AT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGG TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT ACATTGCTAC 6900 

TACTTCATTC TCGCATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 702 0 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATACTAA ATAATTTCCA CT ACT AATT C CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGAGC 72 00 

TACAAAATCT CCATTTTTAG TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTT GGTTGC 7260 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATT TTC CAT 7 320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTG CTCCTT GTTAGAGGTT ATCATTGGCG 7 3 80 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTG'l'T 74 40 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7 500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7 560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GC GTTTG AG A AACTGGGCTT 7 680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAG AT ATCCG 77 4 0 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC AC AT AG CC AA 7 800 

CGCCGTCACG CGTATCCAGA CCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTT ACC AG A GTTCATGGTC CCATAACGAT 7 920 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7 98 0 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 804 0 

ACTACTGGAG GAAGCTATTA TGCCATTTCT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 822 0 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 8280 

C AAGT AG CAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8 3 40 

AATTTCCAAG AAAAG AGCT A TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 84 6 0 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8520 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

ACGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 864 0 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 87 60 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820 

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 8940 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGCTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAC TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 92 4 0 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 9300 

TAGTACTAAA CGCTAGTGAA GCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 93 60 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 94 2 0 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 94 80 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTCAG CAAGATGCTA 954 0 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTGTT AACCA AGGTCTTCAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 97 20 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 97 80 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CT AT AG AT AT TATGGAGCCT 984 0 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AAC AACT CAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9 960 

TAATATCCAG ATTTTAGACA TCATCAATAA GG AT AC AC AC AAGGAAATCA TCGCCAAACT 10020 

GGACTACCAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTCGTAT GCCCACTAGA ATTCTCCTTA 1014 0 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAAC C AAAA GATTGCTCAA AAGTTAATTG 102 60 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 1032 0 

GTAAGCTCAA TGACTTTCAC T TTAAAC ATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
{2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2393 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



GTTrTGGGTT 


CTGGAAATTA 


TCAGATGGTT 


GGAAAAGCCG 


TCCACATCAA 


GATAGTGTTC 


60 


GGAGATTTAA 


GTTTAAATTG 


AAGAAACTAA 


CACAGAGGAA 


ATGGAGTATA 


GACCTAACAA 


120 


GACCTATTCA 


GCAACTGAAT 


TTGTCTATTC 


GAGGATGGAT 


AAACTATTGC 


TCATTGGGAA 


180 


ATATGAAAAG 


TATAGTCGCC 


AGCATAGATG 


AGCGCTTGCG 


TACTCGCCTA 


CGAGTCATTA 


240 


TCTGGAAGCA 


ATGGAAGAAG 


AAATCGAGAC 


GATTATGGGG 


ATTGCTTAAG 


TTAGGAGTTC 


300 


CTAAATGGAT 


AG C AG AT AAG 


GTATCTGGCT 


GGGGCGACCA 


TTATCAATTA 


GTAGCTCAGA 


360 


AGTCGGTACT 


TAAACGTGCT 


AT AT C AAAAC 


CAGTCCTGGA 


AAAACGTGGA 


CTGGTTTCGT 


420 


CTTTCCATTA 


TTACCTTGAA 


CGACATGCGT 


TAAAAGTTAG 


TTGAACCGCC 


GTATGCCAAA 


480 


CGGCACGTAC 


GGTGGTGTGA 


GAGGGGCTAG 


AGATTATCCC 


CTACTCGATT 


AACTCCCCTG 


540 


AAATTTATTT 


TAATTATGCA 


AATTTCACGT 


ATTTTTGATG 


CTGAGACGAC 


GATCCTGCGA 


600 
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AL 1111 LALA 


l rt i 1 * i i * ivi 


APTATCTAAA 


TC T ATC ATT A 


G AAAAGCTT A 


GAGCGCCAAA 


660 


LL A 111 LAvjL 


Ul 1 1 * 1 > — I V-JA\ 


TTTTTAAGAC 


TTTTTCCAGT 


CTCTTTTTCG 


ATTGAAGATG 


720 


1 AA I 1A1 I L 1 


ap*pa arraac 

ML 1 AAL 1 rv/\L 


TAACTTCTTA 


GTACTAGCCA 


ACAACGATAA 


TCATAATTCC 


780 


ILL IAAAA1 1 


AuunA 1 AA 1 A 


AAGGCAATAG 




TTCATGTAAA 


AAAC CTC ACT 


040 


TTTGTTTTCT 


vjL Inl 1 1 I ft 1 


flPT AAA AT AT 
VjL l t\f\f\J\ i n. l 


TAAAAATCAA 


ATTTAATTCC 


AAAGTTTGTA 


900 


AL 1 AAAGLll 


(jAuCuL i ALA 




ATTTGTCAAG 


TTGTTAGTCT 


CT C AATT ATT 


960 


TGCAAATTTA 


LL AUA 1 A 1 1 1 


1L1 1 L nuAU 1 


A AC AATC ATT 


GCTAACATAT 


ACATTATTTC 


1020 


ft n ft ft n*f*> ft t""T* r 
AAMTCAUTA 


ftrfwryf f^ft f ft T" 


LAL 1 AL 1 ILL 


TATCTTAATA 


GGAATATCCT 


CTTTTGTTGC 


1080 


G AGTCTTTT A 


LI ILLul * LL 


t*p & p*p & A A Af^ 

1 L AL i A/\/1AVj 




AATAGGGTTT 


TATCTTTATC 


1140 


TCAAlI IGGA 


ft ft ^ ft f**^ ft fTt ft 

AAoAL I AlAi 


I rS. 1 1 LLLLA 1 


A PTfU":T Af^C A 


ATGTTTACCG 


TAATGCAATC 


1200 


LGTAGlGllT 


I iLO LTjALL 1 


ft *p(T & T TTni* 
rtl L A. .rV 1 1 1 u 1 


TfiTTGCAATT 


TCCATACTAG 


ATGGTTTTGC 


1260 


AGC ACCCGTT 


1 LL LAI L"L 1 A 


1 1 luLLALL 


V» 1 A> 1 VJV- VJH'W \a> 


GATTTGGGTA 


AGGCTAATTC 


1320 


AG C l TT AT C A 


ATGAL l LL 1 L 


AAW. IL»1 I LA 


aTTPATAHGT 
All LA 1 A\Lj\j 1 


TGCGGATTAG 


GTGGACTCTT 


1380 


/"il'IIIB^f ft ft /"^ ft 

GTT7GCAACA 


All LL ILigl 


m a rr*p a PP a P 

i. ALL 1 ftLLAL 




TTAGTCTTGT 


ATATCATTTC 


1440 


TAGCTTTCTG 


ATGTTATTTC 


1 1 LL i AALLiL 


Tfl A Af^TfUl AG 


GTGTTAGAGT 


CAGAAACTAA 


1500 


TCTTGAAATT 


IWIV* / Ml»/*> ft ft ft /"^ 

TTG LTLAAAL" 


L 1 1 vjuAaLi 1 1 


AfTTTPrTAnA 
AL 1 1 LL 1 rVtai/\ 


AATCCTAGAT 


TAAGACTTTT 


1560 


TGTATCAGCA 


AA 1 1 1 A i I v»L 


AAA 1111 1 1 L 


AAATACGATT 


TGGGTTTCTT 


CC ATT AT ACT 


1620 


TGTTTTTGTA 


ALuuAOl 1A1 


TAAATAAAAP 
1 AAA 1 nnnnL, 


nnAAAGTTAC 

VJVJ7WWJ * I 


TGGGGATATT 


CTAATACAGC 


1680 


ATACTCTATT 


ffrr\ » iTVTt ft m ft R 
LL 1 A I lAiAA 


1 1 AL 1 LLL 1 I 


/v\ 1 i LL i 1 l * 


AGGCTATCTG 


AAAAGTTCCT 


1740 


TLL rGLTAAA 


1 LOLAAL L L L 


AA 1 1A1 1 L AL 


CPPAAATCTA 


AAAACCATCC 


AGAATCCTTG 


1800 


LL 1 1 AUL 1 1 A 


LA 1 LL 1 LLA i 


VJ\J 1 I 1 W ^ £ A 


TTCACCCAAT 


GGGTGTTTTT 


TACTAGACAA 


1860 


AAAAGAGTTT 


CCCCTTTATG 


GTATAAGTGT 


AGAAAAAAAC 


ACAAAAAGAA 


AGGAAACTCA 


19 2 0 


CATGAACAGT 


TTACCAAATC 


ATCACTTCCA 


AAACAAGTCT 


TTTTACCAAC 


TATCTTTCGA 


1980 


TGGAGGTCAT 


TTAACCCAGT 


ATGGTGGTCT 


TATCTTTTTT 


CAGGAACTTT 


TTTCCCAGTT 


2040 


GAAACTAAAA 


GAGCGGATTT 


CTAAGTATTT 


AGTAACGAAT 


GACCAACGCC 


CCTACTGTCG 


2100 


TTATTCGGAT 


TCAGATATCC 


TTGTCCAGTT 


CCTCTTTCAA 


CTGTTAACAG 


GTTATGGAAC 


2160 


GGACTATGCT 


TGTAAAGAAT 


TGTCAGCTGA 


TGCCTACTTT 


CCAAAATTGT 


TGGAAGGAGG 


2220 


GCAGCTTGCT 


TCACAGCCAA 


CCTTATCCCG 


TTTTCTTTCC 


AGAACTGACG 


AGGAAACAGT 


2280 


CCATAGTTTG 


CGATGCCTCA 


ACCTTGAATT 


GGTCGAATTC 


TTTTTACAGT 


TTCACCAGCT 


2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 23 9 3 

(2) INFORMATION FOR SEQ ID NO: 91: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCCGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGCGCGT AGTGCTAGAA GAAGCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCCTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGGGTTTGTC TATCTTTTGT 300 

CTCTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 3 60 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGG AT GAT 42 0 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCGATTTTG TCCCCGTTGC 4 80 

ACT AT AC AGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA CAAAAACAAC 54 0 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 7 20 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 7 80 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 84 0 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC C CCTGGTTG A AAAATGCCTC AAT GATCTG A CTGAGCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 102 0 

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTCAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACGCT AATAAAG A C A GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGCGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 13 20 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTG ATTG AT TTTAAGGGTA TGTTGGAAAA 13 80 
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TGGTTTTAAG ATTGGAAATG CAGAGGTAGA GAGTCCCAAG TCTATCCACA CTGCGACACC 1440 
ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 
CCGTATCGAT GAAATTTTGG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA IS 60 

TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 
GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT T CACTT CAAA 1680 
TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCAGTCGTT TTGAACGAGA 17 40 

AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 1800 
TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGGAACTCC 18 60 

CAACTATGAC AT CAAGC AGT TGGCTCTACA GTGTGCAACC AAGCGGATGT ATCCAGACGT 192 0 

CTTCTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 1980 
TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 2040 
TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTCCTCTT GAGTCTGAAG GTGATATGAA 2100 
TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 
TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATCCTCCT ATTCTTTATC AGTACGGTGC 2 220 

TTTTGGC CAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CT CTTTAAG A ATCGTCGTGC 2280 

GACCGTTT CG CTGGGCTATA TCGGCTT GTA TGAAGTAGCG ACAG TTTTCT TTGGTAACAG 2 340 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AG AT AT AG AC AAGTTTGCCT CTATTCCTGA 2520 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2 580 

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCGGAA GCAGGTGCGT CAGGTGGTTT 2 64 0 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2 700 

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACTCCGATTG ACCGTTG CT A 27 60 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTG AG AG A GGGTTTG CTT GTCCAAACTG 2 820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTGGCT ACCTAGGTAA 2 880 

TCCTCAAGCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2 940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA C G AT AAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 3 060 

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 3120 



t 
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AACAAGAACA AGAAAAAATA 


AAAGTGAGAG 


; CCAGCTCTCG 


; CTTTTCTCAT 


AGTGGGAGGT 


3180 


AAIjUA TLjCjAA 


TTACGCAGAC 


CAAGATTAGC 


GGATAAGAAA 


GCTGTTTTAG 


ATATGATGAC 


3240 


AGAGTTTGAA 


AAATTTCAGT 


CGCCTCACGA 


CGGCGGTTTC 


TGGGATACAG 


AGAACTTTGT 


3300 


GTATGAAGAC 


TGG TT AG AAA 


GCAATCAGGA 


ACAGGAAATG 


GGGATTAATC 


TGCCTGAAGG 


3360 


ATGGGTTTCT 


GCAATTCAGT 


TAGTGGCTTT 


TTCTGAGAAA 


GGTCAAGCAG 


TTGGATTTCT 


3420 


TAATCTCCGG 


TTGCGCCTCA 


GTAACTTTCT 


ACTAGAAGAA 


GGTGGCCACA 


TTGGCTACTC 


3480 


CATTCGTCCA 


TCTGAAAGAG 


GCAAGGGTTA 


TGCAAAAGAG 


ACTCTCCGTC 


AGGGCTTGCA 


3540 


AGTTGCTAAG 


GAAAAGAACA 


TCAAGAAAGC 


TCTGGTGACC 


TGTAGTGTGA 


ATAATCCTGC 


3600 


T AGC AG AGC A 


GTCATTCTAG 


CAAATGGTGG 


AATATTTGAG 


GATGCTCGCA 


ATGGAGTCGA 


3660 


GCGTTATTGG 


AT AG AGGT AG 


CGAATGAATA 


ATCCAAAACC 


ACAAGAATGG 


AAAAGCGAGG 


3720 


ft ft J^>^fT> H a m 

AACTTAGTCA 


AGGTCGTATC 


ATTGACTACA 


ACGCCTTTAA 


CTTTGTGGAC 


GGCGAAGGCG 


3780 


TGCGCAACTC 


TCTCTATGTA 


TCAGGCTGTA 


TGTTTCACTG 


CGAGGGATGT 


TATAATGTTG 


3840 


CG ACTTGGTC 


TTTTAATGCT 


GGCATTCCCT 


ATACAGCACA 


ATTAGAAGAG 


CAGATTATGG 


3900 


CAGACCTTGC 


CCAACCCTAT 


GTTCAAGGCT 


TGACTTTGCT" 


GGGAGGGGAG 


CCTTTTCTCA 


3960 


ATACTGGGAT 


TCTCTTGCCA 


CTTGTTAAGC 


GGATTCGGAA GGAATTGCCA 


GACAAGGACA 


4020 


TCTGGTCCTG 


GACCGGCTAC 


ACTTGGGAAG 


AAATGATGTT 


GGAAACTCCA 


GATAAACTGG 


4080 


AATTCTTGTC 


ACTGATTGAC 


ATTCTTGTCG 


ATGCAAGATA 


TGATCGAACT 


AAGAGAAATC 


4140 


TT AT G CTCCA 


GTTTCGAGGT 


TCATCTAACC 


AACGAATTAT 


CGATGTGCAA 


AAATCGCTCA 


4200 


AAAO TuGGCA 


AGTAGTGATT 


TGGGAC AAG C 


TCAATGACGG 


AAAAGAAAGC 


TATGAACAGG 


4260 


i. GAAGAGAGA 


ATGAAGAAAA 


AGGACTTAGT 


AGACCAACTA 


GTCTCAGAGA 


TCGAGACGGG 


4320 


GAAAGTCAGG 


ACACTGGGAA 


TATA CGGTC A 


TGGAGCTTCA 


GGTAAATCAA 


CCTTTGCACA 


4380 


GGAATTGTAC 


CAAGCTTTAG 


ATTCTACTAC 


AGTAAATTTG 


CTAGAGACAG 


ATC CTT AT AT 


4440 


CACCTCAGGA 


CGCCATCTGG 


TAGTACCCAA 


GGACGCGCCG 


AATCAAAAGG 


TGACAGCCAG 


4500 


TCTGCCAGTG 


GCGCATGAAC 


TGGAGAGTTT 


GCAGAGAGAT 


ATCCTTGCTT 


GCAGGCGGGT 


4560 


ATGGATGTCT 


TGACAATTGA 


AGAACCTTGG 


AAGGCTAGTG 


AGGT CTTGTC 


TGGAGCCAAA 


4620 


CCAATTTTGA 


TTGTCGAAGG 


GATGTCTGTT 


GGCTTTCTAC 


CCAAGGAACT 


CTTTGAAAAA 


4680 


ACCATCTGTT 


TCTACACGGA 


TGAGGAGACC < 


GAATTAAAGC 


GACGCCTTGC 


TAGAGATACG 


4740 


ACTGTGAGAA 


ATCGCGATGC 


GG 








4762 


( 2 ) IN FORMA 


TION FOR SEQ ID NO: 92 











(i) SEQUENCE CHARACTERISTICS: 




« 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAACTTGG 780 

CAAAGACGGT GTCATCACCA TCCAAGAGTC ACGTGGTATG GAAACAGAGC TTGAACTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 

AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 9 60 

CCAAGAAATC TTGCCACTTT TCGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 10 20 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA GTAGCAGTGA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 12 00 

AGATGCGACA ATTGAAGCTC TTCGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 12 60 

GGTTATTGTA GAAGCTGCAG GAAATCCTGA AGCGATTTCT CACCCTCTTG CGGTTATCAA 13 20 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CCCTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT * CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CACCTCTTGC CAATCTGATT CCAGCTGTTC CTACCTTGGA 1560 

ATTGACACGA GATCAAGCAA CAGCACGTAA TATTGTTCTC CCTCCTTTGC AAGAACCCCT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA ACGATCTATC CTTATCGATC CTTTGAAAAA 1680 

TCCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCC AG TGCGTTAACA TGATTGATCA 1740 
AGGTATCATT GATCCAGTTA AAGTGAGTCG TTC AG C CCT A C AAAATG C AG CATCTGTAGC 
CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCACC 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 19 20 
TAT AAAAAAC ACAAAACGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTCTCAACT 
CTACTCCGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 
GTTCAAAGCG* AT AAAAAT C C CTTTTTTGAA CTTTTCAAAG TTTCGAAAAC CAAAGGCATT 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCC TTGATAATCT' TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2 220 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 2280 

ATTCTGAAAG TGAAACACCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2 340 

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2 4 00 

AT AAAATC G C TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGAGCTTCG AGTAGCGCTT 24 60 



1800 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC AT AATTTG AA CACGCACACG 2 520 

ACTCATAGCA CGGCTAAGAT G TTGT AC AAT GTGAAAGCGA TCCAACACGA TTTT AGC ATT 2 580 

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 264 0 

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

CGAACGGCTC TATCGTAGCG AAGAAAGTCA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2 7 60 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AG A CAT AAT C TTTGGAAGCC GAGAAAAATC ATCCTCAAAG 2880 

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 294 0 

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3 060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTC AT AT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGT GAT AAA ATGTAATTGT 3 300 

TCCATATGAA TCTTTCTAAT GAGTTGTTTT GTCGCTTTTC ATTATAGGTC AT ATGGG ACT 3 3 60 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 34 20 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3 4 80 

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3 54 0 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3 600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACC ATTAAATGAG 3 6 60 

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 37 20 

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 3 7 80 

T AT ACT AATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3 84 0 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT i'900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3 9 60 

TGTTT AC AC A ATTT ATTTT A TACTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020 

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATGTTTAC ATAGGCTTCT 4 080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATC TTCTTTTAAA 4L4 0 

ATATAGTCCA AGTC AT C AG A ATAATTCATT TTGCCTTTGT GACGCTCGTA CTCTTCAACG 4 200 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4 2 60 
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TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT TGCAATAGTA AGAAGTTCCA 4 320 

TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 4 3 80 

GGTTCTCGAC TGACCCAACG ACGACCATCA CTTTCGGTAA CAAGTGTATG ATCGTTGACA 44 4 0 

CCAATAATGC CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4 500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAG TCGC CTTCTTTTGG AAGCTTCATA 4 56 0 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATC ATT TACT ATTGTACCAT AAAATTACCC 4 62 0 

AAAAT CTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC T TGCT AT AT C 4 680 

CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTG AGTT AAA CGCTGCTTCA GTTCGTATCC 4 74 0 

TTCATACTTT CGGGC AT AC T TAGTATATTG CTTATCAAGT TCCTTGAAGA TGAGTTCCTG 4 800 

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GC AATTTT AG CATCAAAATA 4 8 60 

AGAGAAGCCC TTGTTAGTCA AG TTCTGG AT AATCTTATCT TGCAGGGCAC GAGCTGGAAG 4 9 20 

TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4 9 80 

AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5 04 0 

ACT C ACT AC A TAAGGTCCCT TGTCTCCTGA AAGTTGATTC GCATTGATGA TAGCATAAGC 5100 

GTACTGGCTA TCATTAATCC ACTTCTCTTC TTT^AGATTA GCAATGACTT GAGAAACGAT 5160 

CTTTTCATTA ATATCATATT TTTTCAGATA TTCTCTGACC TCTTTTTCAG TACGTGCTTT 52 2 0 

AAAGGATAAG TGGTAGAGGC CCAGATTCTT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 52 80 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT AACATAAAAC GAACAATTGT 5 3 40 

GTCTTCGGTG ATATAGCATT TGTCG 5 3 65 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 
{ B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 12 0 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATCCA TGACTTGTAA AATAATCAGA 2 40 



r 



WO 98/18931 PCTAJS97/19S88 



632 



GTCATTGTAG 


AAAAAATCAG 


GGTGAAGACA 


CCGAAGTTGC 


GGATAAAATA 


ACTAAAGTCA 


300 


TCCGCATACC 


ATGTTTTTTT 


AAGTT TACTG 


AACATCTTTT 


AAAAGATACC 


CAACACTACG 


360 


CAAAGTTTGC 


AAATTCTCTG 


CAAAAGTGGT 


TCCCTTTAAT 


TTCTTACGGA 


CTTTTGAAAC 


420 


ATAGACTTCG 


ACAACCGAAA 


TCGTTGTATC 


ACTATCAAAT 


CCCCATAGAC 


GGTCAAAAAT 


480 


CTGCGTCTTA 


GGCAAAATCA 


CATTTTGATT 


TTGAAGGAAA 


T AAACT AG T A 


AATCGAACTC 


540 


TTTCCCCAGC 


AATTCGACAG 


GAGTATCTTC 


AACTTTAACG 


GTATTGGTTG 


ATAAATTAAC 


600 


CACGATATTC 


CCATAAGTCA 


AGGTG'ITTTC 


ATTAAACTTC 


CCTGAACGTT 


TGAGAAGGGC 


660 


CTGAATCCGC 


ATTTTAAGTT 


CTTCTAGGTA 


GAAAGGTTTG 


GTCAGATAAT 


CATCCGCTCC 


720 


CAGTTCAAAT 


CCATGTCCCT 


TGTCATCCAA 


ACTTTCCTTG 


GCAGTCATAA 


TCAGAACTGG 


780 


TGTCGTAATT 


CCCTTTTCAC 


GCAATTCTTT 


TAAGACTTGG 


AAACCATTTT 


TTTCTGGCAA 


840 


CATCAAATCC 


AGCAAAATCA 


AGTCATAGAC 


ACCACTCTCA 


GCTTCGTAGA 


GACCTTCTTC 


900 


TCCATCAAAT 


ACCTGCATAA 


CATCCGCAAA 


ATCGTCTAAA 


AAGTCAAATA 


CTGAATTTGA 


960 


CAGACCTAGG 


TCATCCTCAA 


CCAATAAGAT 


TTTTATCATC 


AGAAACTCCT 


CCTTATTAAA 


1020 


ACTATTATAC 


CAAATTTGCC 


TTAAAAAAAA 


CTCAACTCTC 


TGCATTTTAC 


ATG AG AT AGC 


1080 


TGAGTTTTCT 


TTT TATTTTA 


GGCTTATTTA 


TGCATTTCCG 


TATTGAAGAA 


CAACTGCTTC 


1140 


GACTGCACCT 


TTTTCACGGC 


TAATCAAGTC 


AACACGCGCT 


GCAATTTCCT 


TGATTCCCAT 


1200 


ACCGATGTTA 


CGGCTAAGAG 


CAAGGTCAGA 


AAGTTGCGGT 


TCAAAGAACT 


CCTTGTATTC 


1260 


CGCCAAGCGT 


TGCTGAGTCT 


TAAATACATG 


AGCACGAAGG 


ATAACAAAGC 


T AT C AAAGCT 


1320 


CATATCTCCT 


CCAAGGGCTG 


CCTTAATCCA 


AGCCCAGTTT 


TCACGCGCCC 


AAGACCAAGC 


1380 


TGTTTTCTGA 


GTTGCTTGAT 


GAGCTAGCAA 


TTGGTAATAC 


CAAGCAGACA 


AGTCCTGTGG 


1440 


TTTGACCACA 


AATTTGTCCT 


TCCAAGAAGT 


AATCAGGTTT 


TGGATATTAT 


CCGCATCTGT 


1500 


ACTGTATGCA 


AGAGCTGCTG 


CCAACTGGCG 


TTTAAAGACA 


GCATCTGTTG 


CGTGAGTATA 


1560 


AGTATCAAGA 


TAAAGTGCTA 


ACAAGTCTTT 


AGTCTCATGA 


TGTTTCATCT 


CATTAATCAG 


1620 


AACTTGTGAG 


CGAATAGCTG 


CTGGGAGTCC 


TGCAAGATTC 


TCCTTGTGTG 


TTGCGAAGAT 


1680 


TTGGCTAGCG 


ACT TG ACT AG 


CTTCTGCATC 


ATTTGAGCGA 


ATCATCATCG 


AAACAGCCAG 


1740 


CTGACGAACC 


AATTCATCCT 


CATCTGATTC 


TCCGTCTTTA 


GCTTCAAAAC 


CAAGACGGTC 


1800 


ATAGTTATGA 


CGAGCCAATT 


TAGCAACCAG 


TCCTTTGAAG 


GCTGTTTCAG 


CATCCGTTCC 


1860 


TTCATCAATA 


AAGCGCTCAA 


GGGCTGAAAT 


CACTTGAGAA 


ACAGCTGAAA 


CCACCAGATA 


1920 


AGACTCTTCC 


TTAGCAAGTT 


TATCAAGAAC 


TGGAAGCAAG 


TCTGCATAAG 


AAATGTGCCC 


1980 


TGCCTCAGCC 


AACAAACGAC 


CTTCTTGAAC 


AATTTGCACT 


TTGCTTGTGT 


TATCAAGTGT 


2040 
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CTCTAGCTCA 


GCAAGAACAG 


CTGCTAACAA 


GTCTCCTTGA 


TAGTCGGTAA 


TATAGTGGGC 


2100 


AGTATTTTCA 


GTGTTGAGAC 


GAAGAGCTCC 


TT C ATTTTC A 


GCAAGAAGAG 


CTGCGTAGCC 


2160 


AC,ClClAT rr *rCG 


ATACTTTCAG 


TTTCGAGTGT 


ATCAGGCAAG 


CCTTTCCAGT 


TGCTATTGAG 


2220 


Vjv_» u v. v. r\ V- ^ 


CAGACACGGT 

\. A u I 


TCTTGTCTTC 


GTTCTCACCG 


ATGAAGAATT 


GTTTTTGTGA 


2280 


AATCTTCAAG 


AC AT C ATTTT 


CAACTTTAAC 


AGTAAGAACT 


GGGTAACCAG 


GCTGTTCCAA 


2340 


CCAAGAATCC 


ATGAAGGCTG 


CGACATCACG 


TCCTGACGCT 


TGACCAAGGG 


CATCCCAAAG 


2400 






TGTATTGGTG 


TTTTTCAAAG 


TAGGCGTGCA 


AACCTTTAGC 


2460 


AAA ATPAflPA 


TCTCCTAGCC 


AACGGCGAAG 


CATGTGCATG 


AGACGGCTTC 


CTTTGGCATA 


2520 




CCGTC AAAGA 


GTGTATTGAT 


TTCATCTGGA 


TGTTTAACTT 


CGACGTGGAC 


2580 




X. f*. 4> ^— #^ VJ 1 ** vJ 


CGTCACGTTC 


AAGAGCAAGA 


GGT ACTCC AC 


CTGTTTGCAA 


2640 


ATCTTCAAAG 


ATATTCCAGC 


TTGGTTCGAT 


GGTATCCACA 


CAGACGTATT 


CCATCATATT 


2700 


AnrnAAACT 01 

A\ \ v_ 1 * 


TCATTGAGCC 


AAAGGT CAT C 


CCACCATTTC 


ATAGTCACGA 


GGTTCCCAAA 


2760 






GGGCCACAAC 


AAGGGCAACT 


TGTTGACGCC 


TAGCAAATGT 


2820 


AHAHTTPTCA 


TCGACAACCA 


AGTAAACTTC 


ACGGTAGGTC 


ACAAGACCCC 


AGTTTTCCAT 


2880 


Ar:f*Af*CAftf*T 




GAAGGGCGAT 


GTGGAGAGAT 


TGAGGAATTG 


GGTACTTAAC 


2940 


TrraTftftTAA 


TPTTPGTAAA 


ACTCGATAGA 


GCGAACAGCG 


ATATCCAGTG 


AGAAATCAAG 


3000 


/\ 1 1 L un/wvj 1 




TGGTTGAGTA 


GACACCTACC 


AGGGTACCAT 


TTTTAGTTTT 


3060 




CCTTGCAAAT 


CACCAGCAAC 


AAAGGCCAAC 


AAGTAAGAAG 


ACATGCGAGG 


3120 


TCTTGTCTCA 


AACTTCCAGA 


TACCTGTTTC 


CTTACCGTTT 


TCAACATCCA 


TTTCTGGCAT 


3130 


GTTTGACAAG 


GCCAATTCAC 


CTTCTGCTTG 


GTCAAAGCGA 


AGAGAoAOtj I 


rial xc r v rr *Ci!^ 


3240 


TTTGGCTTCA 


GGCTCATCCA 


CACATGGGAA 


AGcTTCGCGC 


GCAAAATGGC 


TCTCGAACTG 


3300 


AGTAGACAAG 


ACCTCCTTCT 


TGACTCCATC 


AACTGTATAA 


TAAGAAGGGT 


AAATCCCTGT 


3360 


CATGTTGTCT 


GTAATTTTAC 


CAGAAAAGGC 


AAGAACCAAT 


TCAACTTGAC 


CAGCCTCAGC 


3420 


CAATTCGATA 


TGAAGGGCTT 


CATTGTCATG 


GTCAACTGTA 


AATGG ACG AG 


CTTGACCTGC 


3480 


AACTTCTACA 


GAGGTGATTT 


CCAAATCTTT 


TTGGTGGAGG 


GAGATGCCGT 


CACTCTGTGC 


3540 


TTCACCAGTG 


ATGGTCACTT 


TCCC AG AAAA 


AGTCTTGCTC 


TCACGACTCA 


AATCTAAAAA 


3600 


TAAATCATAA 


TGTTCAGGAA 


CAAATTGCTT 


AATGGG 






3636 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5066 base pairs 



WO 98/18931 



PCT/US97/19588 



634 

< B ) TYPE: nucleic acid 
(C ) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



ATAGCGTGTA 


ATAATCGATT 


TTAGAGGTAC 


CATAAGCCAC 


CTCCTACAAA 


TAGAAACCGA 


6C 


TATAAATCAA 


TGCCTTCCAC 


C CT T AG AC TT 


CCCTAGTTCC 


TGTCTCAAGC 


GAAACATTTC 


120 


TTTGAAACAG 


GAATAAGTTA 


ACCAATTCAT 


ACCAATAGCT 


AGCAGAATAA 


AAAGAAACCA 


180 


AATGCCCCAT 


AACTTCATAT 


CTGTCACATT 


TCTCAAGACG 


GTATTGAAAA 


ACAGAACTGA 


240 


AACAACTGTC 


CAAGCAAGGC 


TAAAAAGAGA 


AT AG AAGGG G 


ATGTAAAACC 


AGTAAAAATA 


3C0 


ATAAAAAATT 


GGAAAAAACT 


TACTATTTCT 


GTTGGCCTTT 


TCAATCCAGT 


TATCAAAATA 


360 


AAAGT AC GGT 


GCTAAAAGTA 


AGAATTTAAA 


CAAATGTTCC 


ATCACCGACA 


TCCCCCCTTC 


420 


TTTTGATAGC 


GTTTTCT ATT 


ATTTTATTAT 


ATCAAAAAAA 


TCCGGAACTG 


TCATTCCAGA 


480 


TTCTACTTTT 


TTATTTGCGT 


TTTCTTGCGA 


TGAGATGAAT 


CGGTGTTCCC 


TCAAAAACAA 


540 


AGGCCTTGCG 


GATTTGATTT 


TCCAAGAAAC 


GCAGGTAAGA 


AAAGTGCATG 


AGTTCTTCTT 


600 


CATTGACAAA 


GATGACAAAG 


GTTGGTGGTT 


TGGTTGCCAC 


TTGGCTCGCA 


TAGAAAATCT 


660 


TGAGACGTTT 


TCCTTTGTCT 


GTCGGTGTTG 


CGTTGATGGC 


AATGGCATCC 


ATGATGACAT 


720 


CGTTCAAGAC 


AGCTGATGGA 


ATACGTGTAT 


TTTGACTTTC 


GCTGATTTGC 


TTAATCATCT 


780 


CAGGAAGTTT 


GTGGAGACGT 


TGCTTGCTTA 


AAGCTGATAC 


AAAGATAATC 


GGTGCGTAAG 


840 


GCAGGTATTG 


GAACTGCTCA 


CGGATATCTT 


CTTCCCACTT 


TT T CAT AGTG 


TGGTTATCTT 


900 


TTTCAAGCGT 


ATCCCACTTG 


TTGACCACGA 


TAATCATCCC 


TTTACCAGCT 


TCATGGGCAA 


960 


AT C CTGCG AT 


ACGCTTGTCG 


TACTCACGAA 


TGCCTTCTTC 


CGCATTGATG 


ACCATCAAGA 


1020 


CCACATCTGA 


ACGGTCAATA 


GCACGCATGG 


CACGCATAAC 


AGAGTATTTC 


TCAGTATTTT 


1080 


CATAAACCTT 


ACCAGACTTA 


CGCATACCAG 


CCGTATCAAT 


CATGGTAAAC 


TCTTGACCAT 


1140 


CTGTATCTGT 


AAAGTG GGT A 


T C AATGG CAT 


CACGAGTTGT 


TCCAGCAACA 


GG ACT AGCAA 


120C 


TAACACGGTC 


TTCTCCCAAG 


ATAGCATTGA 


TCAAGCTTGA 


TTTTCCAACG 


TTAGGACGAC 


1260 


CAATCAAGCT 


AAACTTAATG 


ACATCTGGAT 


TTTCTTCCTC 


ATATTCATTT 


GGAAGATTTT 


1320 


CTACGATCGC 


ATCTAGCACA 


TCCCCTGTAC 


CGATTCCATG 


GACAGATGAG 


ATAGGCAATG 


1380 


GTTCACCCAA 


ACCGAGAGCA 


TAGAAATCAT 


ATATATCATT 


TCTCATCTCA 


GGGTTGTCCA 


1440 


CCTTGTTGAC 


TGCGAGGATA 


ACTGGTTTGT 


GGGTCTTATA 


AAGCTTACGA 


GCTACGTATT 


1500 


CGTCTGCATC 


AGTAATTCCT 


TCCTTACCAG 


AC ACG AC AAA 


. AACGATAACA 


i TCTGCTTCTT 


1560 
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CCATCGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA G CGATCCG AT 17 40 

TAAATAGGGT TGATTTCCCA ACATTGGCAC GTCCTACAAT GGCAATACTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCACC TTGTGGTCGC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2 04 0 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2 2 20 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGCCTCCCTT TTCTGCTCGG 2280 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 2 34 0 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2 4 00 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2 460 

GGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2 520 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 264 0 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 27 00 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACCCTTGAT TGCTCGGCGC 27 60 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820 

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2 880 

ATGAAACACC ACGTTTTTCT G AT AAAT C AT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2 94 0 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT ACCTGGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120 

CTTCTTTTTC AGCCGCGATT AG CTC AGG AG TTTCTTCATA TGAAACATCT GCATAACCAT 3 130 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 32 40 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3 3 00 
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GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGG A 3 3 60 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT CACGTTTCAG TACGTCTCCG ACACTTTCCA 3 420 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3 4 80 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACT C CATTT A GGAACATTGC 3 54 0 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3 600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3 660 

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCC AC CTT ATTAAAGGTC AATAAGAGAA 3 720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3340 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3 900 

GTTTTTGACC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTCATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4 020 

GGTTTAAGAC AT CGG ACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4 080 

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 414 0 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTGACTGA GACAGGTATT TCTTCACCAT 4 200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4 2 60 

TGCGATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGGAGCATGG AGGGCTAGCT 4 32 0 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AG AG ATT AAA AGCAGATAAG ATTTTCAATG 4 3 30 

ATTTTATCAG TCTATCCATC C C AAAGTTGA GATGGTTGAG AATGCTTGAA AAAAG AT ATT 444 0 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4 500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCG AC AC C CGATTATTCT 4 560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4 620 

GGCTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4 68 0 

TAG C AAAAC G TACATAGGTA ATCTCGTCCA ATTCACCCAA CTCCTCCATG ACGAGTGAAC 474 0 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4 800 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGG AT AATCC 4860 

CATTAAAGAT T TT ATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4 92 0 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4 98 0 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 504 0 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 





ATTTGAAACA 


GCTATGGAAA 


ACATCATGCC 


TGTACTTGAA 


GTACGTGCAC 


60 




T GGTT CT AAC 


TACCAAGTCC 


CAGTTGAAGT 


TCGTCCAGAA 


CGTCGTACAA 


120 


rACTTGGACT 


TCGTTGGTTG 


GTAACAATCG 


CTCGTCTTCG 


TGGTGAACAC 


ACAATGCAAG 


180 


/\ V* I— X_» X <w 1 1* VJ V* 


AAAAGAAATC 


TTGGATGCTG 


CTAACAACAC 


TGGTGCAGCA 


GTTAAGAAAC 


240 


GTGAAGATAC 


TCACCGTATG 


GCTGAAGCTA 


ACCCTGCATT 


CGCACACTTC 


CGTTGGTAAG 


300 


A^AGGATGCG 


AAACCGTTAA 


GAAAGTCC C A 


GAGAAAATAG 


GGAATCGAAG 


CAGGTTGCGG 


360 


TTCCAACCAA 


TG AG ATTC AT 


CTTTTTCTCC 


AGACTTTTAG 


CTTGAGCTCA 


ACT AAAT CAT 


420 


GATGCTACGA 


ACGGTAAGGA 


TGCAAGGTAA 


AAATAGGAAA 


CTGACGCAGT 


ATTCGACGAA 


480 


TACAAGGAGT 


TTTATCTTTT 


TCACGCAGCA 


TCCCGTTCCA 


GCTCACATCG 


GCTAACTAAC 


540 


TTTAGCCCGG 


GTTCAAATTA 


GCTAAATCGA 


TTAGTATTAlj 






600 


TCGTAACTTG 


AAACCAACAA 


TAGCATGAAA 


ACATTGACAA 


CGGGTACGTC 


CTGCCTATCC 


660 


GTTTTTATTA 


AAATCGTCTT 


ATAATAGAAT 


AGAAATCAAA 


AATAAATAGG 


AGAAACAAAC 


720 


CTCATGGCAC 


GCGAATTTTC 


ACTTGAAAAA 


ACTCGTAATA 


TCGGTATCAT 


GGCTCACGTC 


780 


GATGCCGGTA 


AAACAACAAC 


TACTGAGCGT 


ATTCTTTACT 


ACACTGGTAA 


AATC C AC AAA 


840 


ATCGGTGAAA 


CTCACGAAGG 


TCCCTCACAA 


ATGGACTGCA 


TGGAGCAAGA 


GCAAGAACGT 


900 


GGTATCACGA 


TCACATCTGC 


TGCGACGACA 


GCTCAATGGA 


ACAACCACCG 


CGTAAACATC 


960 


ATCGACACAC 


CAGGACACGT 


GGACTTCACA 


ATCGAAGTAC 


AACGTTCTCT 


TCGTGTATTG 


1020 


GATGGTGCGG 


TTACCGTTCT 


TGACTCACAA 


TCAGGTGTTG 


AGCCTCAAAC 


TGAAACAGTT 


1080 


TGGCGTCAAG 


CAACTGAGTA 


CGGAGTTCCA 


CGTATCGTAT 


TTGCCAACAA 


AATGG AC AAA 


1140 


AT CGGTGCTG 


ACTTCCTTTA 


CTCTGTAAGC 


ACACTTCACG 


ATCGTCTTCA 


AGCAAATGCA 


12C0 


CACCCAATCC 


AATTGCCAAT 


CGGTTCTGAA 


GATGACTTCC 


GTGGTATCAT 


TGACTTGATC 


1260 


AAG AT GAAAG 


CTGAAATCTA 


TACTAACGAC 


CTTGGTACGG 


ATATCCTTGA 


AGAAGACATC 


1320 


CCAGCTGAAT 


ACCTTGACCA 


AGCTCAAGAA 


TACCGTGAAA 


AATTGATTGA 


AGCAGTTGCT 


1330 
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G AAACT G ACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 14 40 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAC7 ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 15 60 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG AT AC AG ACGC TGAAGAAATT 16 20 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 16 80 

TTCCTAGCTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGG TTCAT AC i7 40 

GTATTCAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 18 60 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATCAACCTTC CAGAACCAGT TATCCAATTG ATGGTTG AGC CAAAATCTAA ACCTGACCAA 19 80 

GACAAGATGG GTATCGCCCT TCAAAAATTG CCTCAAGAAG ATCCAACATT CCGCGTTGAA 204 0 

ACAAACGTTG AAACTGG TG A AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TGAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAACTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 22 20 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCC AAACG A AG AAGGT AAA 2 2 80 

GGATTCCAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2 340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 24 00 

GACGTTAAAG CTAACCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTGCC 24 60 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2 520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTCG TGATGTTATG 2 580 

GGTCACGTAA CTGCTCCTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2 640 

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATCTTCGCTT ACGC AAC ACT TCTTCGTTCT 27 00 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2 7 60 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2 8 20 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTCT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2 880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2 940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3 000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGT ATTTT CTAATATTTT 3 060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTC CT ACT AT 


* * f^r* f **yi ryn 

aggga(j(jC i r 


I L 1 L I Lj/\>\L»Lj 


1 LLA I Aj i uV. 1 


Af^TCAAGCTT 


TTAAGTCCCT 


3240 


TT A I LAAAA 1 






T1ATCAGCTA 


TTTATTGAAG 


GATCTGAAGC 


3300 


G ATT GAT AAG 


GAAL An r I A 


AUAAlinA 1 LA 




TTACCCAAAC 


AACTTGAAAA 


3360 


GTTTGGTTTT 


C C AACTTTT A 


ALT\jTLAAt* I 






CCCAAGAGCA 


3420 


GGAAGAGGLC 


I I TLA 1 GL i Lf 


a a a ri^p a^p a 
AAAA 1 LvVjLA 


P & TTHTTCAA 
L»rt 1 Jul I LAn 


GCTGCCAATG 


AGGAAGCGCT 


3460 


CCGTGCTATG 


GAACAACTGG 


AGCAGATLGL 


ALL L A 


ULWnrtUAuA 


AACC AGC CTT 


3540 


TGATTTTCAA 


GCGAAAAAAG 


CTGCAGCTAA 


ALLLAAGL 1 L» 


raTAAGGmr. 

IjA I AAuuLuu 


AGATTACTCC 


3600 


TATGATCGAA 


GTGACGACAG 


AGGAAAATCG 


i L irtl l 1 




TTTTTGATGT 


3660 


GGAGCAAAAA 


GTG ACT AG AA 


CAGGTCGTGT 




TT^T A A A A Tf^ A 


CGGACTATAC 


3720 


TTCAAGTTTT 


TCTATGCAAA 


AGTGGGTTAA 


AAACGAGGAA 


a pnprnr a 

(jAuoLLLAOA 


AG TTTCACCT 


3780 


CAT C AAGAAG 


AATTCTTGGC 


TC CG AGTT LU 


AuuuAn lulu 


r.Ar.ATf.AATA 


ACTTCACACG 


3840 


CGATTTGACT 


ATGAACGTAC 


AGGATLTGLA 


oUAALi 1 1U1 1 




GG AAGG ATTT 


3900 


GATGCCAGAA 


GGTGAGCGTC 


GGGTTGAGTT 


TLA I LiL I LA 1 


a /^"T> a a r* a TnT 
AL 1 AALA lu 1 


PGACTATCGA 


3960 


TGCTTTGCCA 


G AGGTC G AAG 


AG A TTGTTGC 






ACAAGGCGGT 


4020 


TGCTATCACG 


G AC C ATG GG A 


ATGTCCAGTL 


(.11 1 LLALA I 


r:r*r"T a t a. a CI 


CGGCTAAGAA 


4080 


AGCGGGAATC 


CAGCTGATCT 


ATGGGATGGA 


A f-<f-/~ a 1T1TP 
AwLLAA 1 AIL 


r^Tnr: a nr; Apr* 

O 1 \juAU\)A\,u 


GTGTCCCTAT 


4140 


CGTCTATAAC 


GAAGTGGAGA 


ir*/"",/"* A <— *TT^-f /"* 

TGGACTTo FL 


AbAAuLAALL 


T A PPT'P.dTPT 


TT G ACGTGG A 


4200 


AACGACGGGA 


CTTTCAGCTA 


TCTATAATGA 


LI 1 GA I I LAO 


PTTPPPPPTT 


C T AAG AT GT A 


4260 


CAAGGGGAAT 


GTTATTGCTG 


AATTTGATGA 


A f A T/™* ft. A f 
A 1 1 I A 1 LAA I 


PPTP.P.P.PATP 


CCTTGTCAGC 


4320 


CTTTACTACA 


GAGTTAACTO 


/"» A A «T*T^ A r" lp t 


TP 1TP ITHTr 
L LiA 1 lu Iv. 


AAAAATGC c A 


AACCACTAGA 


4380 


ACAAGTTTTG 


L AAG AA I it-L 


a »r a a t» i T'T w * , P 
Anunn 1 1 1 »u 


LAnuun 1 rt^-vj 


nTCCTAGTTG 


CCCACAATGC 


4440 


TACCTTTGAC 


GTTGGCTTTA 


TGAATGCTAA 


TTATGAGCGG 


CATGATCTTC 


CAAAGATTAG 


4500 


TCAGCCAGTT 


ATTGATACGC 


TGGAGTTTGC 


TAGAAACCTC 


TATCCTGAGT 


ATAAACCCCA 


4560 


TGGTTTGGGG 


CCTTTGACCA 


AGCGTTTTGG 


TGTGGCCTTG 


CAACATCACC 


ACATGGCCAA 


4620 


CTACGATGCG 


GAAGCGACTG 


GTCGTCTGCT 


TTTCATCTTT 


ATCAAAGAGG 


TAGCAGAAAA 


4680 


ACATGGTGTG 


ACCGATTTAG 


CTAGACTCAA 


CATTGATCTA 


ATCAGTCCAG 


ATTCTTACAA 


4740 


AAAAGCTCGG 


ATCAAGCATG 


CGACCATCTA 


TGTCAAGAAT 


CAGGT AGGTC 


TAAAAAATAT 


4800 


CTTTAAGCTG 


GTTTC CTTGT 


CTAATACCAA 


GTATTTTGAA 


GGAGTGCCAC 


GGATTCCGAG 


4860 


AACGGTTCTA 


GATGCCCATC 


GAGAGGGCTT 


GATTTTACCT 


TCAGCCTGTT 


CAGAGGGTGA 


4920 
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AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4 980 

TCATTTTATC GAGGTCATGC CACCGGCTAT CTATCCACCC TTGATTGCCA AAG AGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG AT AGAGGTT G GAGACCGCCT 5100 

TGGCAAGCCT CTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 528 0 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGT T A TTGAAAACAC 53 40 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 54 60 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAAC ATC CAT 5520 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATC CTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCCGA TCTAGTTTCG TTGCGACCAT 564 0 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 57 00 

CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 57 60 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTC GGG AG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 594 0 

GGTTGGTACG GTAGCTGCCA AG ACT G CCT A TCCATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAACTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 612 0 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AG AC C AC T C A 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 624 0 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6 300 

GGATGACGAA GCCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6 3 60 

ACAAATTGGA ACGC CT AC GG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 6 4 20 

TGGAATGGTA GACGAAACCC AT CCG AC AAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 64 80 

GTCCCACGGT ACTGATCTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TCCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGG CT AAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6 72 0 
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GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780 

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTT ATT ACTA 684 0 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960 

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020 

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7 080 

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 72 00 

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 72 60 

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7 3 20 

TAAAAGAGGC T AACGT AT AT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7 3 80 

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GGATAGTGAG 74 40 

GTTAAGGAGC AGGCCCAACA GGTGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7 500 

AAT ATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTCATGT TAGACTAGAC 7 5 60 

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7 620 

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7 6 80 

ACAGTTTAAA AAAGATTTTA ACTTGGCTAT GAAGCGTGGT TTGAAGGCAG AATTATTAGA 774 0 

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC GTGATCATTC 7 8 00 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCGTGAATGC CATACCCAGC CAGATTGGCT 7 8 60 

TTTGGTTTAT AAAGTAGACA AGTCGGAATT G ATTTT AAA T TTGCTGAGGA CAGGCAGTCA 7 9 20 

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAACAAT ATTTG CGGAA 7 980 

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 804 0 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100 

GAAAAATAAA AATAGAAAAG AG AAAAC AG A ATGGT TTT AC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG TCACACTTTG 8 2 20 

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGC AC ATC TAATCGTGAA AGAAGCTTAT 82 80 

GCCTTGGGTG CGCATGAGGT CAT CGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 834 0 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTG C C AG AATACAAGAT TGCTGAGATG 84 00 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 84 60 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCT A AAGCTATCGG ACTTGCCATG 8520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCT AAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 864 0 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 87 60 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT C AGC TGGTGC TGTCAATGCA C AGGGC G AAG AATT CTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGCCATTA AGGTGACCTT TAAGGATGGA 9 000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TCTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9L2 0 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 924 0 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9 300 

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCC TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 94 20 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 954 0 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTACCTAT TTTTTGGAGA 9 600 

CGAGGAA 9 607 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 12 0 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CT AAG AG AAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAAC TTG AT 24 0 
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ACTATGCGTT 


TTATTGTGGG 


AAGGTTTACT 


CCATTTTCTC 


CTGAAATTGA 


GTTTTTGTCC 


300 


AGCCTCTGTT 


TTTAGGGTTG 


CTAAGAAAAT 


AATGTCATGT 


GGTGAATATT 


TGTAAATCAG 


360 


TCAGCAGACA 


G AACG AT ACT 


CTTCGAAAAT 


CTCTTCACAT 


CATGTCAGCT 


TCGTCTTTCC 


420 


GTATATATGT 


GACTGACTTC 


ATCAGTTCTA 


TCTACAACCT 


CAAAACAGTG 


TTTCGAGCTG 


480 


ACTTGATCAA 


TTTTCAAATC 


TGTACTTTGA 


GCAAGCTGAG 


ACTAGCTTCC 


TATTTGATTT 


540 


TCATTGAATA 


TCAGAAACCC 


ATTCTCCATC 


AAATAATTCG 


ACTGCGTCTA 


AT AATTTTT G 


600 


ATCTGGCACG 


GTGTCTGAAA 


TAAAGGTTGT 


GTATTTGGAG 


AGGGGATTAA 


TTTTAAAAAA 


660 


TCCACTCTTG 


TAAAATTTAG 


AACTATCAAT 


CAGTAAGATG 


GTTTCATGGG 


CTTTGTCAAT 


720 


AAT ATT CTTT 


TTTGAAATAG 


CTTGGCTGAG 


AGAAGCTTCA 


TAAACATATT 


GGTCATCAAT 


780 


ACCTCTTGCT 


GAACAAAATG 


CTAAATCGAT 


ATTAAAATGA 


TCTAATAAAG 


AATTTTCCTT 


840 


ATCATAGTTG 


ACCACGGAAC 


AGGATTGATG 


TTTGACCTCG 


CCAGATGTGA 


TAAAGATTTT 


900 


GGACCTATCT 


TTAACAGTTT 


CAGATAGGGT 


TTGTGCAGTA 


TGTAAACCAT 


TTGTAAAAAT 


960 


AATCAAATTA 


TCAAGTTCAG 


AAAGATAGGG 


ACAGACTTCG 


TAGACAGTAG 


TACTAGAATC 


1020 


TAGATAGATA 


C AC AT AC C AG 


ACCGAATAAA 


GTCTTTAGCG 


AGACTAGCGA 


TTAGTCTTTT 


1080 


TTGCCTAGTA 


CTTTCTCCTT 


CACGTATTTG 


ATGAGAAAGT 


TCAATTGTGT 


TCATAGAGGA 


1140 


CAGGGTCACG 


TATCCGTGCT 


TTCTTTTGAT 


AACACCTTGA 


TTTTCTAAGA 


AAATTAAATC 


1200 


ACGACGTAAG 


GTACTTGTGC 


TGGAGAAAGT 


GATTTCTGCC 


AGCTCTTTTA 


CCCCAATTCT 


1260 




TTGATAATTT 


CAATCAATTC 


AAGTACACGT 


TCATCTTTTA 


TCATAAGCTC 


1320 


CTCCTAATTT 


ATCATTTCAA 


CT AT ATT AT A 


GCACAAATTG 


GAGGAATTTG 


AATTATTTTT 


13 80 


ATG AAT ATTG 


GGTTAACATT 


TGAACATTAT 


TCAAGTAAGC 


GTTCACATAT 


TGAAAAAATA 


1440 


AAACGTGGGG 


ATT AT AAT AA 


AGTTAATCmA 


GGACGAAGAG 


AGAAGAAAAA 


TGGAAGCGGT 


1500 


TTTAGCAATA 


GATTTAGGTG 


CGACTTCTGG 


AAUAOt. AA 1 \_ 




TTTfTf.AAAA 


1560 


T AAACT ACTA 


ATGGAAGAAA 


TAAATCGCTT 


TTCTAATCTA 


CCTATTAGAG 


TAAAAGGGCA 


1620 


TTT AT C TTGG 


GATATTGACT 


TTCTACTAGC 


TAAAATTCTT 


GAAAGTATCC 


GCTTGGCTAA 


1680 


TACTAGTTAC 


AAGATTTTAT 


CT AT CGGT AT 


TGACACATGG 


GGAGTTGATT 


TTGGACTGAT 


1740 


TGATAATGAA 


GCTAAGCTGT 


T ATT AC AAC C 


TGTTCATTAT 


CGTGATGAAA 


GAACAAAGGG 


1800 


AGTGTTAAAG 


GAAATATCTG 


AAATGACTGA 


ATTAGAAAAA 


CTGTATTCAG 


AGACAGGAAA 


1860 


TC AG ATT ATG 


GAGATAAATA 


CCTTGTTTCA 


ACTCTTTAAG 


GCACGTCAAG 


AATCTCCTGA 


1920 


CTCTTTCTAT 


AAGACCAATA 


AGATTCTTTT 


AATGCCAGAT 


TTGTTTAATT 


ATCTCTTGAC 


1980 
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AGGTAAGTTT GCTACAGAAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2 040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAAGAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGGTCTTTG GTTGGAGTGG AACTTACTTC 22 80 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2 34 0 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGACGAAC TAAGACGTTC 24 00 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 24 60 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 2520 

GC ACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2 580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2 64 0 

ACTAGAAGAA CTAACTCATA ACGTTTATAA GAG GAT AT AT GTGATTGGAG GAGGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 27 60 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG G AC AATT AAA 2 82 0 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2 880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2 940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTCTTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3 0 60 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCCTGAAA AACAATTGAT 3120 

G GCAATT AC C CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTCTAGTGAT 3180 

GGATATTAAT GGAAATGTTC TAGAGGGAGA ACGCTTGCCA TCTAGCCAAT GGT ATATGC A 3 2 40 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3 3 00 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3 360 

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3 420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 34 80 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 354 0 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3 600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATC 3 660 

TTAAAACATA T AC C G AAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GCAAATGGGA 3 7 20 

CATGGAGATG AAA7AGTATT AGCTGACGCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3 780 
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CTAATTCGTT 


GTGATGGTGT 


AAATATTCCA 


GAATTATTAG 


ATTCCATTCT 


GTATTTAATG 

VJ x «* ill \ 1 Vj 


3 840 


CCATTAGATA 


GTTACGTCGA 


TAGTTCAATT 


CAGTTTATGA 


ACGTTGTTTC 


GGGTG ATCAT 

vjwvj i vjn i vj r\ i. 


3900 


ATTCCTAAGA 


TATGGGGTAC 


CTATAGACAG 


ATGATTGAAG 


GTCATGGTAC 


AC ATCTT AAA 


3960 


ACGATTACTT 


ATCTTAGAAG 


AGAAGACTTT 


TATGAACGTA 


GTAAGAAAGC 


TT AT GCT ATT 


4020 


GTTGCTACAG 


GAGAAACTTC 


ACTTTATGCT 


AATATTATCC 


TT AAG AAAGG 


AGTAGTTGTT 

rWJ £ r\w 1 1 vj 1 1 


4030 


GAAAGAGAAA 


ATGTTCAATA 


GAGGAATTTT 


AGTTGCCAGT 


CATGGTAATT 


T^G £**T AG CC G 


414 0 


AGCTCTCATG 


ACCG C AG AAA 


TGTTTGTTGG 


TGAGACAACA 


AATGATAGAG 


TTAGG AC ATT 


4 2C0 


AGGTTTGATG 


CCTGGAGAGA 


ATA TTGT AG A 


GTTTGAGCAT 


TATTTTAAAA 


ATCAAGTCGA 


4 2 60 


TGAACTGTTA 


GACTCAAATC 


AAGAGGTTAT 


CGTTTTGACT 


GACTTGATTG 


GAGGAAGTCC 


4 320 


TAATAATGTG 


GCTTTGTCAC 


GGTTTTTAAA 


TTTGG ATT C A 


GTTGATATTG 


1* AAC* accctt 


4 380 


TAATATCCCT 


CTCCTAGTGG 


AATTAATATC 


AAGTTATGAT 


TCAAAAATCA 


AT TT AG AAC A 


4440 


AATTGTTCAC 


AATGCTCAAA 


ATAGTTTGTT 


TAATGTTAAA 


CAACAACTTA 


ACGTAftAfVA 


1 j UU 


GGAAGAAGAT 


TTATGTCTAT 


AGAGTTTGTT 


CGTATTGATG 


ACCCTCTGCT 




4 5 60 


GTTGTCACTA 


CGTGGCTAAA 


AAAGTATGAT 


ATTGAGCAAG 


TTATrATTHT 






ATCTCAGAAG 


ATAAAACACG 


ACAATCTATT 


TTAAAGATTT 


CTGCACCGGT 


AGGTTTAAAA 


4 6 8 0 


ATTGTTTTCT 


TTAGTGTAAA 


ACGGTTTGTG 


GAAGTTTTAA 


ACTCTGTGCC 


AATAAAAAAG 


4 74 0 


AGAACAATGC 


TGATATATAC 


AAATCCAAAA 


GATGTGTATG 


ATTCTATTGA 


AGGAAATT^A 


4 800 


AAATTGGAGT 


ACCTCAATGT 


AGGACAGATG 


ACT AAAACGG 


AGGAAAATGA 


AAAGGTAACC 


4 860 


GGAGGTGTAG 


CTCTAGGTGA 


AGAAGACAAA 


TATTATTTTA 


AGAAAATAGT 


TGATAAGGGA 


4 920 


ACGAGAGTTG 


AAATTCAAAT 


GGTTCCTAAT 


GATAAAGTTA 


CAATGTTGGA 


AAAAT rTV TTTA 


4980 


TAAAAATAAT 


TTAAGGAGGT 


ACAGTATATG 


CTATTCACAC 


AAGCATTACT 


GGTGACATTA 


5040 


GTTCGGATTA 


TTGCCACTAT 


TGACTATAAT 


GGACCCTTAT 


TTATGATTCA 


CCGTCCGTTA 


5100 


GTTACAAGTG 


CAATGGTTGG 


CTTAGTATTA 


GG AG ATTTC A 


C CC AAGGTGT 


TCTTATTGGT 


5160 


TCAGCTCTTG 


AATTAACTTG 


GCTCGGTGTA 


ACAGGTATTG 


GAGGTTATAC 


TCCACCAGAT 


5220 


ACTATTTCAG 


GTGCG ATT AT 


TGGTACTGCA 


TTTGGTATTT 


TATCTGGTCA 


AGGAGAAACT 


5280 


GCTGGTATCG 


CTATAGCAGT 


TCCAATTGCA 


GTTGCT AC CC 


AACACTTGCA 


TGTTCTTGCA 


5340 


AAAACTTTAG 


ATGTTTATTT 


TGTGAAAAAA 


GCTGATAATG 


ATGCTAAAAA 


CGGAGATTAT 


5400 


TCAAAGATCG 


GTTTTTATCA 


TTATTCAAGT 


TTGGTTTTAA 


TCACGTTATT 


TAAAATTGTA 


5460 


CCAATTTTCC 


T AG CT ATT AT 


GCTTGGAGGG 


GAATATGTGG 


CAGACTTGTT 


TGCTAAGGTT 


5520 
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CCACCAATCG TTATGCAGGG ACTTAACTCT GCAGGTGCTT TACTACCTTC AATTGGTTTT 5580 

GGTATCCTTT TAAATATGAT GCTCAAGAAA AATATGTGGG TATTCTTGTT GATTGGATTC 5640 

ATTTGTTCTG TGTATGGAGG AATGTCAACC ATTGGGATCT CACTAGTTGG TATTGCGGTA 5700 

GCATACTTCT ACGATATGAT TGGAAGCAAA CCACAAGAAA CAACTTCAAG TAGTGATGTT 57 60 

GAGGAGGATC TTGATCTATG ATGAATAATA AAGTAACTAA AGTTGAACTT AAAAAAGTTT 5820 

TCAAACGAAG TTTT ATGT AT GGT TCTTC AT GGAACTATGA GAG AATG C AG AACCTAGGTT 5880 

TT CT AT AT AC AATTCTTCCA GTATTGAAAA AACT AT AC C C AG AC AAAG AT TCAGCTTCTC 594 0 

CTGCAATGAA ACGTCACCTT GAGTTTTTCA ATACTCATCA AACAGCGGCA CCATTTATTC 6000 

TTGGAGTTAC TTCCGCTATG GAAGAACAAG AAGGAAATGA AGGTGCAGCT TCAATTACTG 6060 

GTATTAAAGT TGGCTTG ATG GGGCCACTGG CTGGTCTAGG AGATAGTTTG TTCTGGCTGA 6120 

CACTAGTTCC T AT CTGTTTT AGTATTGGTG CGTCTTATTC TAAAGACGGC GGTGC TTT AG 6180 

GTATCTTTAT CGCCTTAATA TTGTTTAATA TTATTAATAT TCCTGTTAAA TATTTCGGTT 624 0 

TGAAATATGG GTATACTAAG GGTTCTAGTC TTATCCAAGA AAATAATACA AAAGGAACAT 6 3 00 

TGAATCGCGT TACGAGTATG GCGACAGCAT TAGGGCTAGT ACTAGTGGGT GGTTTGATTC 6 3 50 

CAT C AATGGT TGGTATTAAT TTT GGATT AG AATTTAAGCA GGGGGAACTT GTTATTTCTG 64 20 

TTCAAGAAAT GATTACAAAA TTAATTCCAG GATTTATCCC TATGCCTTTG ACTTTATTAA 64 80 

TGTCTAAATT AATTAGAAAA GGAAAGAATC CGGTTGTACT AATCTTTAGT GTTATGCCTA 6 54 0 

TTGGAGTTAT TCTAGTTGTT TTAGGAATTT TGAAGTAGTA GAAAGTGTGC AGGTCGTATT 6 600 

TGGGATATCA CCTCCATTTT GGAAGAGAGG TAAAGAGTGA AATTATGCTA TAAGAAAGCT 6 6 60 

GCCGCAAATT GGAATGAAGC CTTGCCGATT GGGAACGGTC ATTTACGTGG TATGATTTAT 67 20 

GGTTCAGCTA CAAAAGAATG TATTCAACTA AACGATGAGA CTATTTGGTA TAGAGGAAAG 6780 

TCAGATAGAA ATAATCCAGA CTCACTATTG CATCTTAAAA AAATTCGGGA ATATCTTTTA 684 0 

GATGGAGAAA TTCAGAAAGC CGAAGAATTG ATAAAGTTAA CAGTGTTTGC TACCCCAAGA 6900 

GATCAAAGCC ACTATGAATT ACTTGGGGAA CTTTACATTG AGCATATAGA TATTCAGTCT 5960 

TGT GCTCTTT CATTGTATGA AAGAGAGCTA GATTT AG AT A CAGCTATTTC TAATGTTGTG 7 020 

TTTGAGCCTA ATAGTTGTAA TTTACAAATA AAAAGAGAAT ATTTTACCAC TTTTAATAAG 7080 

AATATTTTAT GTTGCCGTAT AGTGTCATCA GTTC AAAAC A CATTAAATTT AAACATTAAT 7140 

TTGGGTAGAA ATAAACGGTT TAATGACGAA GTATCTAAAC TGGATTCAAG TACAATTTTA 7200 

ATGTCGGCCT CTGCTGGAGG TAGAAAAGGT GTTC AG TTT A AAGTAGTATG TCATTCTAAG 7 2 60 
GTTACGGATG GTGAAGTAAG TGTATTGGGA GAGACAATAG TTATTCGGAA TGCTACAGAG 7 3 20 
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GTATTTCTTT 


ATCTCAAATC 


AATGACGGAT 


TATTGGGGAA 


ATATAGATAT 


TTCTTCTCTT 


7380 


CAGGGAGAAT 


TTAGTAGTAT 


TGATTACTTT 


ACAGAAAAAG 


ATGAACATGT 


AAAAAAATAT 


7440 


CAGGAGCAAT 


TTAATAGAGT 


TGATTTTAAA 


CTAGACTATA 


GTAAAGGTTG 


TCTT AGC ATT 


7500 


CCAACGAATC 


TACTTCTTGA 


AAACACTAAA 


AAGTATAGTA 


ACTACTTGAC 


TAACTTGTTA 


7560 


TTTCATTATG 


GAAGATATCT 


GTTAATATCG 


TCTAGTCAAC 


CGAATGGTTT 


ACCTGCCAAT 


7620 


CTTCAAGGAA 


TATGGTGTGA 


TGAATTAAAT 


CCAATTTGGG 


GTTCTAAATA 


TACGATTAAT 


7680 


ATTAATACTC 


AAATG AATT A 


TTGGATGGTA 


GGTCCATGTG 


ATTTACCAGA 


AGTAGAATAT 


7740 


CCATTATTTG 


ATATGCTCGA 


AAGAATGAGA 


GAACCGGGAA 


GACTAACCGC 


TAAGAAAATG 


7800 


TATGGAGCTA 


GAGGTTTTAC 


AG C AC ATC AT 


AATACGGATG 


GTTTTGGCGA 


TACGGCTCCC 


7860 


CAATCTCATG 


CCATGGGGGC 


TGCAATTTGG 


GT ATTAACT A 


TTCCATGGTT 


ATGT ACT CAT 


7920 


ATTTGGGAAC 


ACTATTTATA 


TTTCCAAGAT 


GAGCGTATTC 


TTACGGAACA 


TTTTGAAATG 


7980 


ATAAAAGAAG 


CATTTCTTTT 


CTTTGAAGAT 


TATTTATTTG 


AGGTGGATGG 


CTACTTGATG 


8040 


ACAGGTCCAA 


GTGTCTCACC 


GGAAAATAAA 


TATCGCTTAA 


AAAATGGTAT 


TGAAGGAAAT 


8100 


GCTTGTCTAT 


CATCTACAAT 


TGATAATCAA 


ATTCTAAGAT 


ATTTTTGTGA 


TTCATGCATT 


8160 


GGCATTGCAA 


AACAATTAGG 


AGACAATTCG 


GATTTTATTA 


GTCGTGTGAA 


GGAGTTAAAA 


3220 


AAGAAACTAC 


CTAAAACAAA 


AATAGGTAGT 


AATGGGCAAA 


TCCAAGAATG 


GTTAGAAGAT 


8280 


TATGAAGAAG 


TAGAGCCTGG 


GCATAGACAC 


ATTTCACCTC 


TATTTGGGCT 


TTATCCTTAT 


8340 


AATGAGATTG 


ATATTCATAA 


AACTCCGGAA 


TTAGCAGAAG 


CAGCTAAAAT 


CACTATCAAT 


8400 


AGGAGATTAT 


CAAACGCTAA 


TTTTT T AT CT 


TCACAGGAGA 


GGGAGCAAGC 


GATTAATAAT 


8460 


TGGTTAGTAA 


GTGGTTTGCA 


TGCTAGTACA 


CAAACAGGTT 


GGAGTGCTGC 


ATGGCTGATT 


8520 


CATTTTTTTG 


CGAGACTATA 


TCAAGGTGAA 


CCTGCTTATA 


AC C AG ATT AA 


TGGTTTGTTA 


8580 


AATAATGCGA 


CTCTTGGCAA 


TTTATTTCTT 


GACCATCCAC 


CATTTCAAAT 


TGATGGTAAT 


8640 


TTAGGTTTGG 


TGAGTGGAAT 


TTGTGAATTA 


TTACTACAGA 


GCCATCATAA 


TTGGTTATCA 


8700 


CTAATTCCAG 


CTTTACCTTC 


TGCTTGGTCA 


GAAGCAGAAG 


TGAAAGGTTT 


CAGAGTAAGA 


8760 


GGAGGATATA 


AGGTATCGTT 


TGCTTGGAAA 


AATGGGGATA 


TAACATTCCT 


AAAATTGGAA 


8820 


GGAGGAAACA 


AAGATCAAAA 


AGTAAGAGTA 


AGAATATATG 


GCAAAAATAC 


TGATGTACAA 


9880 


AATATTGAAT 


TGGTATTTAA 


TTCAGAAAAA 


ATTATTGAGT 


TAAATTTTTA 


GGTATAAGTC 


8940 


ATGAATAAAG 


AAAAAATAAA 


AAGAAAATTA 


ATCACAATAT 


TGTTTGTATG 


TATTGGGATG 


9000 


TTATGTTTTG 


GATTGTTAGC 


AGGAGTTAAG 


GCTGATAATC 


GTGTTCAAAT 


GAGAACGACG 


9060 
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ATTAATAATG 
TTATGGTGGG 
TATGCAGCGA 
GATACGAAAG 
ATTCCAGTTT 
TTAGATGAAC 
TGGATTTACA 
TATGGAGCGC 
GATCCGACAT 
AATACGCCAA 
GGCTTATGTG 
ACAAACACAT 
TCAATGATTG 
GAATGTGCCG 
ATTATTCCTT 
AATAGAACAA 
TATCAAGGAC 
ATTCTTCCTC 
GCAAAAATTT 
CTTTATCCAA 
ATTTATAATA 
AATAATACAA 
AATCCAAATA 
GCATTATCAG 
AACTGGATAA 
CTTACATTAA 
CATTATACTT 
CATAATGGAA 
ACACCAGATA 
AGTTCTGTAG 



AATCGCCATT 
GGAACACATT 
TTGAACTTCA 
AATTGAGAGA 
TCTTGGTTAT 
AATTCCAAAA 
ATAACCAGTT 
ATTTTATCTG 
TCTTTGAAGC 
TAAGAGATGA 
ATAACTGGGG 
TTGAAACTGG 
CTATGGAAAT 
CGTATACATT 
TCTTTAGACA 
AAGCTGTATT 
TTTATTCGAA 
TAATACATGA 
TGACTAAAAA 
AACTTTATGA 
GTAATCCTAA 
AGTCGTTATC 
ATTTACATAT 
GAAATTTTGA 
GCAAAAATTA 
AAGGGCATAC 
ATACAGAAAA 
TGGTAGAGAT 
AATTTAATGA 
ATTACAATGG 



GTTGCTTTCT 
GAAGGGAGCA 
TCCTGCAAAA 
ATGGTATGTC 
TATGTCGGCT 
GTATAGTGTG 
AGCTCCGCAT 
GCATGATCAT 
GAGTCAAAAA 
TGCGGGTACA 
CTCATCAACA 
AAGAGCTAGG 
GATGAATGTA 
TATGACAAAT 
TGCTATACAA 
TTGGAATGGA 
TGATGAAACA 
GAAAATTGAT 
TAGTGAGGAA 
AGGAGATGGG 
TATCAATAAA 
GTTAGATTTG 
TTTATTGAAT 
TGCATCAAAA 
TTCCATCAAT 
TGGTCATAAA 
TTGGGATGAG 
GTCTATAAAT 
TGGTAATTTG 
AGACCCTAAT 



648 
CCGTTGTATG 

TGGGAACCTA 

GTCTGTAAAC 

AAGATGTTGG 

GGAGAGCGTA 

TTAAAAGGTG 

AGTGCTAAAT 

GAAAAATGGT 

TATCATAAAA 

GATTCTATCG 

GATACATGGA 

GATATGAGAT 

TATACTGGGG 

GATGTACCAA 

AATCCAGCTC 

GAAGGTAGGA 

ATGCCTTTAT 

AAGGAAAAGA 

TTCTCTAGTA 

TATGCTCAGC 

AATCAGCAAG 

ACGCCACATA 

AATTACAGGA 

AGTTGGAAGA 

CCTGTAGATA 

CCTCAGATAA 

AATACCCATG 

ACTGAGGGGA 

AATATAGCAT 

AGAGCTGTGG 



GCAATGATAA 
TTCCTGAAGA 
CAACAACTTG 
AGGAAGCTCA 
ATTCAGTTCC 
TTTTAAATAT 
ATTTGGAAGT 
TCTGGGAAAC 
ATTTGGTGTT 
TTAGTGGATT 
AATGGTGGGA 
CCTATGCATC 
GAGGCACAGT 
CTCCAGCATT 
CAAGTAAGGA 
TTAGTTCATT 
ATAATAATGG 
TTTCATCTAT 
AAGTCAACTA 
GTGTAGGTAA 
TAATGTTGCC 
CTTACGCTGT 
CAGATAAGAC 
AAGAAGAATT 
ATGACTTTAG 
ATATAAGTGG 
TTTATACCAT 
CAGGTCCAGT 
ATGCAAAACC 
ATGGTAACAG 



TGGTAACGGA 
TGTAAAGCCA 
TATTCCACGA 
AAGTCTAAAC 
TCCAGAGTGG 
TGAGAATTAT 
TTGTGCCAAA 
TATTATGAAT 
GGCAACTAAA 
TTGGTTGAGT 
AAAACATTAT 
GGAACCAGAA 
TTATAATTTC 
T ACT AAAGGT 
AGAAGTTGTA 
AAACGGATTT 
GAG AT AT CAT 
ATTCCCTAAT 
TTTAAACTCG 
TTCCTGGTAT 
TATGTATACT 
TGTTAAAGAA 
AGCTATGTGG 
AGAGTTAGCG 
GACAACAACA 
CGATAAAAAT 
TACCGTTAAT 
CTCTTTCCCA 
AACAACACAA 
AAATGGTAAT 



9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
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TTTAACTCTG GTTCGCTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CT AATTTTG A TGTGATTCT A TATGACAATA ATAGAAACGA ACTTGCTAAG 11040 

AAACATGTTA AT AATTT GTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG T AAGCAATC T GAAGAGGATA TAGATAAAAT AACAGAAGAT 112 20 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATCA GGGTCTAGCT 112 80 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 113 4 0 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 114 00 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCT AATTT TGATGTTATT 114 60 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGCT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AG CGG AAGTT GAACTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAA CT TTCT A ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 117 00 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TT CAGTG ACT 11760 

CAT AC C AAAG AAGATTCTCC TTCATGGTGG CAGATAGATT TAGCACAAAC CGAAG AATT A 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AG AG ATT AT C AAATTTTGAT 11880 

ATT ATT AT AT ATGATTCAAA TGATTATGAA G TTTTT AC AC AACATATTGA CAGTTTAGAA 119 40 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTT CTTTG 12000 

AGAAGCGCAG GAATTCCTTT AAGTTT AG C A GAGCTAGAGG TTTATACTTA TAAGT AATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 12120 

AAAAT AAG AG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CCACTATTGA 12130 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAC 12 240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 123 00 

CATTTCTCCA TCTACTATTG GCCGTGTACC AG AGGCTG C A GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTT ATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACCCCC 124 80 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC T AGTG AC AC A GATATTCCAG AAGATGTCAA 12 600 
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AGAAAAACTT 


' TTACGCTATG 


CGCGTGCAGC 


TCTTG C AACT 


GGCTTGATGA 


GAGACACTGC 


12660 


TTACCTATCA 


ATGGGTAGTG 


TTTCGATGGG 


GATTGGTGGT 


TCTATTGTAA 


ATCCGGATTT 


12720 


CTTCCAAGAA 


TACTTAGGAA 


TGCGAAATGA 


ATCGGT AG AT 


ATGACGGAGT 


TCACGCGCCG 


12780 


TATGGACCGT 


GGTATTTACG 


ACCCTGAAGA 


GTTCGAACGT 


GCGCTCAAAT 


GGGTGAAAGA 


12840 


AAACGTAAAA 


GAAGGATTCG 


ACCATAACCG 


TGAAGACCTT 


GTTTTAAGCC 


GTGAAGAAAA 


12900 


AGATAGACAA 


TGGGAATTTG 


TTATTAAGAT 


GTTCATGATT 


GGACGTGACT 


TAATGGTTGG 


12960 


TAACCCAAGA 


CTTGCTGAAC 


TTCCTTTTGA 


GG AAGAAGCG 


GTTGGTCACC 


ATGCTTTAGT 


13020 


AGCTGGTTTC 


CAAGGTCAAC 


GTCAGTGGAC 


AGACCATTTT 


CCAAATGGGG 


ACTTTATGGA 


13080 


AACTTTCCTC 


AATACTCACT 


TTGACTGGAA 


TGGTATTCGA 


AAA C CAT TTG 


TATTTGCGAC 


13140 


AGAGAATGAT 


TCACTAAATG 


GTGTGTCTAT 


GCTCTTTAAT 


TATCTATTAA 


CAAATACTCC 


13200 


ACAAATCTTT 


GCTGATGTGC 


GTACTTATTG 


GAGCCCAGAG 


GCTGTTAAAC 


GTGTAACGGG 


13260 


ACATACTTTA 


GAGGGTCGTG 


CTGCAGCTGG 


CTTCT T AC AT 


CTAATCAACT 


CTGGTTCTTG 


13320 


TACATTGGAT 


GGTACAGGTC 


AAGCTACTCG 


AGATGGCAAA 


CCTATTATGA 


AACCATTCTG 


13380 


GGAGTTGGAA 


GAAAGTGAAG 


TGCAGGCTAT 


GCTTGAAAAT 


ACAGACTTCC 


CACCAGCAAA 


13440 


C CGCG AAT AC 


TTCCGTGGAG 


GAGGATTCTC 


AACTCGTTTC 


TTGACGAAGG 


GGGATATGCC 


13500 


AGTAACAATG 


GTACGTCTCA 


ATCTTCTAAA 


AGGGGTTGGT 


CCAGTGCTAC 


AAATTGCAGA 


13560 


AGGTTACACA 


CTTGAACTTC 


CTGAAGATGT 


TCACCATACT 


TTAGATAATC 


GTACAGATCC 


13620 


AGGATGGCCA 


ACTACTTGGT 


TTGCTCCACG 


TTTGACAGGA 


AAAGGTGCTT 


TCAAGTCTGT 


13680 


CTATGACGTC 


ATGAATAATT 


GGGGAGCTAA 


TCACGGAGCC 


ATAACATATG 


GACACATTGG 


13740 


AGCAGACTTG 


ATT AC CTTGG 


CTTCTATGTT 


GAGAATTCCT 


GTCAATATGC 


ATAATGTACC 


13800 


TGAGGAAGAT 


ATCTTTAGAC 


CTAAAAATTG 


GTCCTTATTT 


GGAACAGAAG 


ATCTAGAATC 


13860 


AGCAGACTAT 


CGTGCATGTC 


AGTTGTTGGG 


GCCACTACAT 


AAATAAAACT 


TGTTTATATA 


13920 


GGAGGTGAAC 


TTACGTCCCT 


CCTATCCTTT 


TAAAAAGATT 


TGTTAAACAA 


TTCACAAATA 


13980 


ATTGAAAACG 


AATACAAAAA 


GT AAT AT AAT 


GATGTTAAAT 


AGATACCCCG 


GAGGCGCAGG 


. 3040 


AGGAAAATTA 


TATGGCTATA 


TTTTATGTTC 


CGGCAGTCAA 


CCTTATTGGA 


AAAGGTGTTG 


14100 


TAAATGAAGT 


GGGTCCTTAT 


ATCAAGGAAC 


TTGGCTATAA 


AAAGGCACTT 


TTGGTGACAG 


14160 


ATAAGTACAT 


CGAAGGCAGT 


GAT ATTTTAC 


CTAAGACTTT 


AAAACCACTG 


GAT AC AG AAG 


14220 


GAATCGAATA 


T 










14231 


(2) INFORMATION FOR SEQ ID NO: 8 2 











(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1699S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



AGTT CT CTTA 


ACTTTTTTAG 


GATGGCATTC 


TCCGCTCTCA 


GGTACTCATT 


TTCTGCTgAA 


60 


GACGTTCTAA 


TTCTGTCCTC 


TCTTCAGGTC 


TCGTTTTTGG 


CTTACGTCCC 


ATTTTAGGTA 


120 


X W X V- 4. V*. A 


T GTTTTCTC A 


ACAATAGTAT 


ACCCGTTTTT 


CCTGTATTGT 


GCTAGCCAGT 


180 


TAAGAAGTAT 


CGTACGACTT 


GGGAGACCGT 


ATTCAAGAGA 


AACTCTATCT 


TTAGTCCAGC 


240 


CTT C ATGTC A 


GACTTTATTA 


CTCATTTCTT 


GTTTTAAATC 


AGGAGAATAG 


TAACGATTTT 


300 


TTrrTTTTTT 


GACGAACTCT 


ATTCCGTAAC 


GATCAATCAA 


TTTAATCATG 


TACCTAATAT 


360 


TAGAATTGCT 


TATCCCAAAT 


TTATTTGAAA 


GCTTCTCTAA 


GCTATATCCT 


TGTTTTCTAA 


420 


GTTCATAGAT 


CTGAACTTTA 


TCATCATAAG 


TTAGTTTCAT 


AATAAAAACA 


CCCCAAAAGT 


480 


TAGATTTTTT 


CTGTCTAACT 


TTTGGGGTGT 


AGTTCATGTA 


CACCTGATAT 


GATGCGTTTT 


540 


& T A ATTTTT A 


AGCCTTTTTG 


CCCAGCCTCG 


TCAAAAGTAA 


TCTTTTGACA 


CAAAATCTGT 


600 


GACAAAACTT 


TAGTTTTAAA 


GGTTTTTAAC 


TTTGTATATA 


CTAGTTTTAA 


GAAAAGGAGG 


660 


ATG AT CTAAT 


GGAAGAAAAA 


GT AT CATTG A 


AAGTCAGGGT 


TCAAAAACTA 


GGGACATCGC 


720 


TTTCAAATAT 


GGTTATGCCC 


AATATTGGAG 


CATTTATTGC 


TTGGGGAGTA 


TTGACTGCCC 


780 


TCTTT AT CG C 


TG ATGGCT AT 




A ar ACT TAG C 


TACTGTTGTT 


GGTCCTATGT 


840 


TAACGTATTT 


ATTGCCAATC 


CTGATTGGTT 


ACACAGGTGG 


AT AT ATG AT C 


CATGGCCAAC 


900 


GTGGTGCCGT 


TGTAGGAGCT 


ATTGCT ACT G 


TTGGTGCAAT 


CACAGGTTCT 


AGTGTTCCTA 


960 


TGTTTATCGG 


AGCTATGGTA 


ATGGGCCCAC 


TGGGAGGATG 


G ACT AT C AAG 


AAATTTGATG 


1020 


AGAAGTTCCA 


GGAAAAAATT 


CGTCCCGGAT 


T TG AAATGTT 


AGTTAATAAC 


TTCTCAGCTG 


1080 


GTCTCGTTGG 


TTTTGCATTA 


TTCCTTTTCG 


CTTTCTACGC 


AATCGGTCCA 


GTCCTATCGA 


1140 


CTCTTACTGG 


AGCTGTTGGG 


AATGGTGTTG 


AGGCTATTGT 


CAATGCTCGC 


CTCCTTCCTA 


1200 


TGGCTAATAT 


TATCATCGAA 


CCGGCTAAAG 


TCCTTTTCCT 


CAATAATGCC 


CTCAATCATG 


1260 


GCATTTTTAC 


TCCTCTGGGA 


GTAGAACAGG 


TAGCTCAAGC 


TGGTAAGTCA 


ATTCTCTTCC 


1320 


TATTGGAAGC 


TAATCCTGGA 


CCAGGTCTGG 


GAATTCTATT 


AG CTT ATG CT 


GTATTCGGTA 


1380 


AAGGTTCTGC 


TAAATCTTCT 


TCTTGGGGGG 


CAATGGTTAT 


TCATTTCTTC 


GGAGGGATTC 


1440 


ATGAAATTTA 


CTTTCCTTAT 


GTTATGATGA 


AGCCTACTCT 


ATTTTT AG CT 


GCTATGGCAG 


1500 



WO 98/18931 



PCT/US97/19588 



652 

GAGCTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCC AG CTT 1560 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTCTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTCTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT A CCC AAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2 040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCT AT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TCATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 24 00 

CACAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2 4 60 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2 520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2 580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCCAA ATAACGAAGC ATGGGAAATA 2 640 

CTATATTTTG ACTGGAGAGT TGGATGATTT CCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2 760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2 820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCTGAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2 940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3 000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA 7AAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTT AT AG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3 2 40 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCCCTC TTTACGGAGA AATTTGATGG 3 3 00 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAGATTGA 3 3 60 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCCGC TCAGTTTAGG 342 0 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 34 80 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3 54 0 

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3 600 

TCCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3 660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 372 0 

TCTAGTAGAT TACCACAGTA TTGATCTCAC TCAGTATGAT T AT ATTTT AT CTACCAAGCC 3 78 0 

G CTG ACT AAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTGA 3 84 0 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3 900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3 9 60 

TC AATTCG AG TTGGTTCAAT T GG AG AAT AA TCAATCATTT GAGCACACGG TAGAACAAAT 4 02 0 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4 080 

CTTCCAGAAT ACTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTGAACTC AAACTACCTA TCTCCGCATT 4200 

GTC AATG AAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TGTCTAAAGA 4 2 60 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 4 32 0 

TCTTTATACA GAGATTTACA AG ACGGGAAA TCAATCCATT ATTTATCAGA TGCT AAAT AC 4 380 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4 4 40 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4 500 

TTTATGAGGC T GG AT ATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAG 4 560 

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4 620 

ATGATGTCCT CAAGTCTGGT ATT AC AG T CG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4 680 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT CGTAATGAAC 4 740 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCACA TGTAGATAAT GTTCTT AAAC 4 800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4 860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GG AAAAGG AG AAATTGAATG 492 0 

AAAC ATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4 9 80 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA AT C AG AT AAT TCATGCTCTG 504 0 
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AATGAAAAGG 


GCAAGTATGA 


AATTGAAATT 


GCACAGAAAG 


GACAGTCTCG 


TATAGAAGTA 


5100 


ACTAATGTGG 


CTGGCATTAA 


TAGCAAAGAA 


CATCCTGAGC 


AAGTCATTGA 


AGCGATTCAA 


5160 


AAGACGGATA 


TTATTACTAC 


TGCAATCGGA 


CCTAATATAC 


TCCCTTTTAT 


CGCCGAACTT 


5220 


CTAGCCAAAG 


GAATCGAAGC 


TCGCCGACTT 


GCAGG AAAT A 


CACAGGCATT 


GGATGTTATG 


5280 


GCCTGTGAAA 


ATATGATTGG 


CGGGTCTCAA 


TTTCTTTATC 


AAGAAGTCAA 


GAAATATTTA 


5340 


ACTCCGGAAG 


GTTTG ACAT T 


TGCTGATAAC 


TACATAGGTT 


TTCCAAATGC 


TGCAGTAGAC 


5400 


ACGATTGTTC 


CAGCACAAAG 


TCACGAAGAT 


TCCCTTTTTG 


TTGTGGTCGA 


GCCCTTTAAT 


5460 


GAATGGGTCG 


TGGAAACCAA 


GCGTCTTAAA 


AATCCAGATT 


T ACGTCT AAA 


AGATGTGCAT 


5520 


TATGAAGAAG 


ATTTAGAACC 


CTTTATTGAC 


CGAAAACTTT 


TTTCAGTCAA 


TTCTGGACAT 


5580 


GCAACTTCAG 


CTTACATTGG 


TGCGCATTAT 


GGTGCCAAGA 


CAATTTTGCA 


AGCTCTTCAA 


5640 


AATCCTAATA 


TTAAATCTCG 


GATTGAATCT 


GTATTAGCTG 


AAATTCGGAG 


TCTCTTGATT 


5700 


GCCAAATGGA 


ACTTTGATAA 


AAAAGAATTG 


GAGAATTATC 


ACAAAGTCAT 


TATAGAACGA 


5760 


CTTGAAAACC 


CTTT CAT AGT 


GGACGAGGTT 


AGTCGCGTAG 


CTCGTACTCC 


AATCCGAAAA 


5820 


TTAGGCTATA 


ATGAACGATT 


CATCCGGCCG 


ATACGTGAAT 


TGAAAGAACT 


CAGTTTGTCA 


5880 


TATAAAAACC 


TACTTAAAAC 


AGTTGGCTAT 


GTCTTTGACT 


ATCGCGATGT 


AAATGATGAA 


5940 


GAAAGTATTC 


GATTAGGTGA 


ATTGTTGGCT 


AAACAATCAG 


TCAAAGATGT 


TGTTATACAA 


6000 


GTTACAGGTT 


TAGACGACCA 


AGAATTGATT 


GAGCAAATTG 


TAGAGTATAT 


TTAATCTTTT 


6060 


TCGAAAATCT 


CTTCAAATCA 


GGTTAGCATC 


CCTTTGTCTT 


AGGCATATGT 


TGTTCTATCT 


6120 


ACAACCTCAA 


AGCAGTGCTT 


TGAGCTGACT 


CCGTCAGTCT 


TATCTGCAAT 


CTCAAAACAC 


6180 


TGTTTGAGTT 


ATCTGCGGTA 


AT CTTTCT AG 


CTTGTCTTTG 


ATTTTTGTTC 


TTATTTATAA 


6240 


GGTAAAAGAA 


GCTGGACAAA 


AAGTCTTCAA 


AATCGGGAAA 


AGGCAGCCTA 


TCGGGTGTTC 


6300 


AAAAATCTTG 


ATAGGATGTC 


CTTT ATT ATG 


GAAAGCCTTA 


TTGGATTTTC 


TCCTCAGATT 


6360 


GAGTTTTTGA 


TCAGCTTTAT 


GAGATAGGTC 


TTGCTAGAGA 


TGTAGCCCAT 


CAT GTT ATTT 


6420 


TTATGGACAG 


TGGGAAAATT 


GTTGAAAAAA 


AT AATGCC C A 


TCAATTCTTT 


AGTCGTCCAA 


6480 


GAGAAGAACG 


AACCAAGCAA 


TTTTGGAACG 


AATTCTTTCG 


AATGCGATCT 


AT AT AGT AAA 


6540 


ATGAAACAAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


AAATGTTTTA 


6600 


GAAGTAGAGG 


TGTACTATTC 


TAGTTTCAAT 


CTACTATATA 


ACTGAAAAAT 


TAGATAAATT 


6660 


AGTTTTGGAA 


AATGACTAAC 


CAAAAGATAT 


CCAAAGTAGT 


CTAAAATTGT 


CTATACTTTA 


6720 


TGAGTGTTTT 


AGTTAGGAAA 


AAGGCTTGTT 


GTCTATAATT 


GT CTGCATT A 


GTCTAGATTT 


6780 


TATTTATACA 


AAATGTTATA 


ATAGACTGTA 


, TTTAAAAAAT 


TTTAAGGAGA 


AATGACAGAA 


6840 



WO 98/18931 



PCT/US97/19588 



655 

TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6 900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTG AAG AAA TCTCTTAATG 6 9 60 

TTCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CG AC C AAAAA TTTGGTGAAG 7020 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTCCCAAA CGCTTATGAA GCAGCTGTAA 7080 

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 72 00 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 72 60 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA CCTGCTCCTG 7 3 20 

AAAACGGCGA CACTGTTGTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7 3 80 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TC AATTC AT C CCTGGTTTCG 74 40 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA AC ATTCC C AG 7 500 

AAG ACT ACC A AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTGACA ACTATCCACG 7560 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC ATTGATGAAG 7 620 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AAT AC AG CAA AGAATTGGCT GCTGCTAAAG 7 680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 774 0 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7 800 

TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7 860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACT AAG ACT A 79 20 

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7 980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGC A CAA GTTCAAAACT 804 0 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTG AT C A 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 822 0 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280 

TGACTGTTTC AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8 34 0 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 84 00 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 84 60 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8520 

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
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GGGGGTAAGA 
TGTATTTACC 
CAGCATCGTT 
CAAAACTGAG 
CATCAATCCC 
TACCATTTCT 
CAATTTCAAA 
GTCGAATCTG 
CCAAGCGAGT 
TGACAGAAGG 
TGGAGAAGAG 
AGTCAGCATC 
GGCGATGTAT 
CAGTTGGAGC 
AAAGGGCATA 
CTGTCAGGAT 
CATACTCAAT 
CATGACTCTT 
CAGCGAAAAA 
CGATCAGGTA 
ACTCAAGGAG 
TTTCCATACA 
GACTTTCGAT 
TATCCTCAAC 
AGTCTTGAAT 
TCATCTCCGT 
AGAGTCCTCC 
CGCCATAGGT 
TTGAAAAGTA 
GCAGTTGTTC 



GACTTCATTA 
AGGAATCAGG 
AACCAAATGA 
TTGATCTTTT 
TGCCGTCCCT 
GAGGGCGGCA 
GTAGGAACGG 
ACTAGCTAGG 
CTGAGGAATC 
TAG CTG ATC A 
TTGATTGGCC 
TAGGTAATCT 
GGTCGCGCTA 
AGCAAGAAGA 
AACAGCAATG 
AAAG ACCTT A 
TCCCAGTTCT 
CTGTTTTCCT 
GAGGCTGTTG 
GGAGAGCTCT 
AGTAAGGGTT 
GGCCTGAAAA 
GCCTAGTTCC 
CAGCTGGTAG 
CTGAAAGGCT 
TCCGTAGTTG 
GATGCCTTCT 
ATCCACGATT 
CTTGACCAAG 
TCCATACTTG 



CCATCAAAAT 
TCTGTGATAG 
ATGACCCTGT 
TGTGGGGGAT 
CGGTACATAG 
CCTAAGATTT 
TCAGCTTTTT 
GTGACGATGG 
AAAGGAATAT 
CTGTCACCTA 
AGCCAAGTAT 
TCCAGATGAC 
GGCAAACCTG 
ATGGGCAGAT 
ATTCCATTGA 
TTCTGG AT AG 
TGCTCGACAG 
TTTTCAAGGA 
TCAAAGATCT 
TGGGCAACTT 
TGTTCCAGCA 
AGACTGTGAA 
TCAGCTAGTT 
GGATAATTTT 
AGTTTGTTGG 
AG ACGG AG AG 
AACTTTTCTG 
TTCTGAGCTG 
CCCTTACTAG 
GAGTGCTGGA 



656 
CAATGACAAT 



CTCCTAAATC 
CTCTCTTACG 
TGAGCAGGTC 
GAGCCAGAAC 
TTTCAATGGT 
TTTGGGTGAA 
TTGATT CTTT 
GAAGTAGATC 
CGATGAGGAT 
CT AC CAT AG A 
TGGTATCATC 
TCAATTCATT 
TGCTTTTCTT 
TAACAGTTGT 
CATCACAGAT 
TAGTGATATG 
TACGAACCAA 
TGGTATCAAT 
GGCTGGGGTC 
AATCCCGTGC 
CTAGACCGGC 
GGTCAGCAAT 
CAACCACATC 
GAATGCCGTA 
TGGAGACGAA 
GGTGTTGCAA 
TCTTGAGACC 
TTGGTTTTG C 
CAATTTGCCC 



CTCGTCTTGT 
CCCATTAAAG 
ATAGTGACAC 
TTGCATGAGC 
TTGGATATCA 
GGCAGGAATA 
ATCAGCTGGC 
GCTTTGTCGA 
CGCTAGAACC 
CTTACTGTTA 
GAATTCATCC 
GTCACCTGTC 
CATGCGACGA 
CCTGAAGTCA 
CTTACCAGTA 
AGCCTGTTTT 
TTTTTGAATG 
GTGACTGCGG 
CTGCTGAACC 
TAGTTCCACG 
TTCAACATAG 
GCGGAAGCGT 
GCTAAAGCCC 
AAGGGTTTCT 
GTTGGCTAGT 
AGCCTCGCGA 
AATTTCGTCA 
AATCCCCTTG 
GCGATCATAA 
CCAAAAAGTA 



TTCGATTCGG 
ACATTGATTT 
TGAGGAGCTT 
TGATTGATAG 
CGGGCGGGAA 
TGG C C ACT AG 
AAGATGCCCT 
TAAATTTTTT 
TGTCCAGGAC 
GAAGAGATAT 
ACGATGATAA 
ATTCCCAAGT 
GCAGCTCGAC 
AGTCCTTCTA 
CCAGGCCCAC 
TGAATGTTAT 
GTTTCTAAAT 
ATGCCTTCCT 
TTGTCTTCTT 
GGACGGGAAG 
GTGTCCCCTG 
TCAGGAGCCT 
AAACCCTTGA 
TCCTTGTAAA 
TTGGCCAAAA 
TTTTTGGCAG 
ATGGTATTTT 
AAATGGCTAC 
CGACTGATTT 
TAGTCTTCGC 



8640 

8700 

B760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 
10140 
10200 
10260 
10320 
10380 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATCATTTC AAAATCATCA AAATC CTCTG 10440 

CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTC CAT AAA ATTCCTTTGC ATGAATAGGT 105 60 

GAG AGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10 620 

CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10 680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT CTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 10860 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTCTCAT CCTTGAAGCG 11100 

TTTG AT AT AG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 112 20 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 11280 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT C C ACTTGGG A GCTGATAACC 115 80 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 117 00 

ACGATTGGCC TCTTGTGCCA AGTACTTCTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 117 60 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT CATTATGGAG 118 20 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTCCCTGCT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 11940 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 12 000 

GTCAGGTGTG ACAAACGCAC CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120 
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AAAGAAACTG 


GCGTATTTTT 


CAGCCCCTTC 


ACCCTGAAGC 


AAGATTTTTC 


CAGAAGTATA 


12180 


GATAGAAACC 


GTT GCTTG AG 


GTAGTTTCAA 


AAAGTAGCGG 


ATATAGGGAT 


TCTTGCTAGG 


- 12240 


AGCCAGACTG 


GTTTGATAGT 


GTTCAAGAAA 


AGCCTGAATA 


TCCTTTTCGC 


TTGGTGTGAC 


12300 


TGTGATACTT 


GCCATAGTTT 


CTATTGTACC 


ACAAAAGCAG 


TAAAATTTGT 


AAAAACTGAC 


12360 


AAAATTAGCG 


AATTTTGGTA 


TAATATCGTG 


AGGTGAATTT 


TATGGCAAAT 


CT AAATCG AT 


12420 


TCAAATTTAC 


ATTCGGGAAA 


AAATCGTTAA 


CCTTGACAAG 


CGAACATGAC 


AACCTTTTTA 


12480 


TGGAGGAAAT 


CGCTAAGGTT 


GCGACAGAAA 


AATACCAAGC 


AATTAAAGAA 


CAAATGCCTA 


12540 


GCGCAGATGA 


TGAAACAATC 


GCTCTTTTGT 


TGGCAGTCAA 


CTGTTT AT C A 


ACTCAGCTCA 


12600 


GCCGTGAGAT 


TGAATTTGAC 


GATAAGGAGC 


AAGAGCTAGA 


AGAACTCCCT 


CACAAGCTTG 


12660 


TGACTTGTAA 


GCAAGAACAG 


AGCAAGATTG 


AGGATTCCTT 


ATGATTTCAT 


TCCTTCTTCT 


12720 


ATTGGTCTTG 


GTTTGGGGAT 


TTTATATCGG 


CTATCGGAGA 


GGCCTGCTCT 


TACAGGTTTA 


12780 


TTACCTGATT 


TCAGCCATGG 


CATCGGCTTT 


TATGGCTGGC 


CAGTTTTATA 


AGGGGCTTGG 


12840 


AG AGCAAT TC 


CATTTATTGC 


TCCCTTATGC 


AAATTCGCAG 


GAAGGTCAGG 


GGACTTTCTT 


12900 


TTTCCCATCG 


GATCAACTCT 


TTCAGCTGGA 


TAAGGTCTTT 


TATGCAGGTA 


TCGGCTACTT 


12960 


GCTTGTATTT 


GGGATTGTCT 


AT AG C ATTGG 


TCGTTTACTT 


GGTCTTCTCT 


TACACTTGAT 


13020 


TCCTAGCAAA 


AAACTGGGTG 


GTAAGTTGTT 


CCAAGTTTCA 


GCAGGTATCT 


TGTCCATGTT 


13080 


GGTGACCTTA 


TTTGTCTTGC 


AAATGCCCTT 


GACAATCTTG 


GCGACCATCC 


CCATGGCAGT 


13140 


TAT AC AAAAT 


CCTCTTGAAA 


AGAGTATCGT 


CGCAAAAC AC 


ATCATCCAGA 


GCATACCGGT 


13200 


AACAACCAGT 


TGGCTCAAAC 


AAATCTGGGT 


G AC AAATT T A 


ATCGG AT AAA 


AAGGGCAGGA 


13260 


GTTTTCCTAG 


CCCTTTGTTT 


ACAGATTTGA 


CTCGAATCTA 


TCAGAATGTA 


AAAAGCTACC 


13320 


ACACCTAGAC 


ATTCAAAGAC 


AAGGAAATAA 


AGATGAATAA 


G AAAAT ATT A 


GAAACATTAC 


13380 


AGTTCGATAA 


GGTCAAGGCC 


TTGTTTGAGC 


CTCATTTGTT 


GACCGAGCAG 


GGCTTGGAGC 


13440 


AATTGAGACA 


ACTGGCTCCG 


ACTGCCAAAG 


CAGAT AAAAT 


CAAACAGGCT 


TTTGCTGAGA 


13500 


TGAAGGAAAT 


GCAGGCTCTT 


TTCGTCGAGC 


AACCGCATTT 


TACTATTCTC 


TCAACTAAGG 


i.3560 


AAATTGC AGG 


AGTCTGCAAG 


AGGTTGGAGA 


TGGGAGCGGA 


TCTCAATATC 


GAGGACTTCC 


13620 


TACTCTTGAA 


ACGCGTGCTT 


CTTGCCAGCC 


GAGAACTTCA 


AAATTTTTAC 


ACCAATCTGG 


13680 


AAAATGTCAG 


CTTGGAAGAA 


TTAGCCCTTT 


GGTTTGAGAA 


ATT ACAT GAT 


TTTCCGCAAT 


13740 


TACAAGGAAA 


TCTTCAGGCC 


TTTAATGATG 


CGGGTTTCAT 


TGAAAATTTT 


GCCAGTGAAG 


13800 


AATTGGCGCG 


AATCCGTCGA 


AAAAT ACATG 


ATAGCGAGAG 


TCAGGTACGC 


GATGTTTTAC 


13860 


AACACTTCCT 


CAAGCAAAAA 


GCGCAGCTGT 


TGACGGAAGG 


AATTGTTGCT 


AGCAGAAATG 


13920 
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GCCGTCAGGT TTTACCACTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13 980 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTGAGCG 1404 0 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC CCTATGAAAT GCTTCCCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCCCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 1422 0 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14 340 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 144 00 

C AGG ATT G C C GATTTTAGCA GACAAGGGAA CTCGTGTTGG TATTTTTGAA GAAATCTTTG 144 60 

CTGATATTGG AG ATG AG C AG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AG TC AT ATG A 14 520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14 580 

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14 640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACC 14 700 

GTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 147 60 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14 880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACGTTTGG 14 940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTCCTG 15060 

AGATTGTGGA TATGGCCCTA ACTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 1512 0 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGCC CAAGAAAAAA CGAGCTCCAA 15240 

AGGTGGCAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15 300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15 360 

AGTTTGATCT TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG GT C AATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 154 80 

AAGAAGCCAT GAATGAGCTA G AT ACCTT C A TCGACCAAGC CTTGCTTAAC AATATGGCTC 15 540 

AACTTGATAT CATCCATGGT ATCGG AAC AG GAGTCATCCG TGAAGGAGTT AC C AAAT ACT 15600 

T G CAAAG AAA CAAACATGTC AAGAGTTTCG GCTATGCCCC AC AAAATG CT GGAGGCAGTG .15660 
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GTGCGACTAT 


TGTCACTTTT 


AAAGGATAGC 


AGTATTCTGG 


ACTTTATAAA 


GTAAAAACTG 


15720 


TTGAACTAAT 


TTTTACTAAT 


AAACACATTG 


ACAAAAGCCA 


ACATTTTTTG 


TAAAATTAGA 


15780 


ATCAATTAAA 


TACCAACACC 


GAATGAAGTT 


TAATAGAAGT 


GGGGAATCGT 


TTGATTTTCC 


15840 


ATGACTGTAA 


ATGGACGGAA 


CTCTGGAGAC 


ACCGTAAAGG 


CACCGAAGGG 


CAAGGCAGGC 


15900 


AACTGCTCAA 


ACTCTCAGGT 


AAAAGGACAG 


AGCTAGGATA 


GACCGCTTTT 


TAGCATTTAT 


15960 


CTAAGCATTC 


C AGAGT AC AT 


GTATCTTGCA 


TGTGCTCTTT 


CTTTTGGGGT 


TGAAACGATA 


16020 


GGAGAAGGAA 


ATGTTAGAAT 


TGCTTAAATC 


AATCGATGCT 


TTTGCTTGGG 


GACCGCCCCT 


16080 


CTTGATTTTA 


TTGGTCGGAA 


CAGGGATTTA 


CCTAACTATT 


CGGCTAGGAC 


TCTTGCAGGT 


16140 


TTTGCGTCTA 


CCCAAGCCCT 


TTCAGCTTAT 


TTTTATCCAG 


GATAAGGGAC 


ATGGTGATGT 


16200 


ATCCAGTTTT 


GCAGCTCTGT 


GTACAGCCTT 


GGCATCAACT 


GTTGG AAC AG 


GAAATATCAT 


16260 


AGGAGTTGCG 


ACGGCTATCA 


AGGTTGGTGG 


ACCAGGAGCT 


CTATTTTGGA 


TGTGGATGGC 


16320 


GGCTTTCTTT 


GGAATGGCTA 


CCAAGTATGC 


GGAAGGACTC 


TTGGCCATCA 


AATACCGCAC 


16380 


CAAGGACGAC 


CATGGTGCAG 


TAGCGGGAGG 


TCCCATGCAT 


TATATCCTTC 


TAGGGATGGG 


16440 


AGAAAAGTGG 


CGACCACTTG 


CTGTTTTGTT 


TGCAGTAGCA 


GGAGTATTGG 


TTGCTCTCTT 


16500 


GGGAATCGGA 


ACCTTCACCC 


AAGTCAACTC 


GATTGCAGAA 


TCTATCCAAA 


ATACAACGAC 


16560 


GATTTCGCCA 


GCCATCACAG 


CTCTCGTCTT 


GTCTGTCTTT 


GTAGCGATTG 


CAGTCTTTGG 


16620 


TGGACTCAAG 


TCTATTTCTA 


AGGTTTCAAC 


TACTGTTGTT 


CCTTTTATGG 


CCATCATTTA 


16680 


TATCTTAGGA 


ACTCTTACAG 


TTATTTTCTT 


TAATATCGGA 


AAAATCCCTG 


GCACAATCGC 


16740 


TTTAGTCTTT 


ACCTCAGCTT 


TTAGTCCCCT 


TGCTGCGGTA 


GGTGGATTTG 


CTGCTGCTAG 


16800 


CGTTCGGATG 


GCTATTCAAA 


ATGGTGTGGC 


GCGTGGTGTG 


TTCTCAAACG 


AATCTGGTCT 


16860 


GGGTTCTGCT 


CCTATTGCAG 


CTGCAGCTGC 


CAAGACAAAT 


GAACCAGTAG 


AGCAAGGTTT 


16920 


GATTTCCATG 


ACAGGAACCT 


TTATTGATAC 


CCTCATCATT 


TGTACTCTAA 


CTCGTTTGAC 


16980 


CATCTTGGTA 


ACTGG 










16995 


(2) INFORMATION FOR SEQ ID NO : 83: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 
(D> TOPOLOGY: linear 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 
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ACTAAGCGCT TTAATAAACA GTTAGATAAA 
CAGGCTATTT CGGAGATTCC TGGGGTCTTG 
ACGCCAGACC ATGTCAAGGA GGCGGGCAAG 
ACAGGGATGA GTGGTCTGCT GACTCTACGT 
TACCAACTGG ACTATGCTCC TGAAAATGAA 
TTATCTGCGA CTTTGACGGC TATTTTGGAA 
GCTTATCCAG GCTATGAACC GATTGTTAAC 
ACGACTGAAA ATGGTTTTGT CTTGACTCCT 
GGTGATAAGC TCAAGGCGGT TATTCTCAAC 
AGTCGAGAGC AGTTAGAGGC CTTGGCAGCT 
TGTGATGAGG TTTACTCAGA ATTGACCTAC 
TGTTGAGAGA CCAGGCTATT ATTATCAATG 
GGCGTTTGGG GCTGATTTTC GCTCCTGCGA 
AGTACTTGGT CACTGCCGCA AATACCATGG 
CTGGTAAAAA CGATGCGGAC CCATGAAGAA 
CGAAAAAATG ACTGCTCTTG G TTTTG AG AT 
TGCTAAAATT CCAGCGGGCT ACAATCAAGA 
GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC 
CCGCCTATCT TATGCAGCCA GCATGGAGAC 
GTACATGAGA GAAGCATGAT TCAGTCTATC 
AATTTTCGTG AGGATGACAA GCTCGTCAAA 
TTTTTTGTCA AACACGCTGG TCAGTCTAAG 
CCACGATTTC TCTTGCGAAT CAATGATGAC 
GTCATGACTT TTCCCAAGAT TAATAGTGAC 
GCAGCTCTTG CAGATGCTAG TTTGCAGGAC 
TTGCAAAAGA CTTTGCAGTT GATGGAAGCA 
TTTGAAATTC AAATTTTGAC TCGATTTGGA 
TGCCATCGGG TTGGTCAGGC TTTTG ACTTT 
GAGCATTATC ATGAGGATAA GACACGTTGT 



661 

ATTCAAGTTT CGTTGATTCG 
CGTTTGACCT TGGGGGAACC 
CGAGCGATTG ATCAGAACCA 
CAGGCAGCCA GTGACTTTGT 
ATCTTGGTTA CAATTGGGGC 
GAGGGAGACA AGGTACTTTT 
TTAGTTGGGG CAGAAATTGT 
GAGATGTTGG AGAAGGCCAT 
TATCCAGCCA ATCCGACAGG 
GTTTTACGCA AGTACGAAAT 
ACAGGCGAAG CCATGTGTCT 
GTTTGTCTAA ATCGCATGCC 
CCTTCACAGC CCAGTTAATC 
CGCAACATGC TGCGGTAGAA 
GGAATATATC CAACCTCCCC 
TATCAAACCA GACGGTGCCT 
CTCCTTTGCT TTTCTGAAGC 
AGCCTTTGGA CGTTACGGGG 
TATCAAAGAA GCCATGAAAC 
ACGAGTCAAG GCTTGGTGCT 
ATTTTTACAG AGCAGGTTGG 
CTGGCGCCTG TTATTCAGCC 
GGACTCAGTT ACATCGAAGA 
CTCTTTGTC A TGGCCTATGC 
AATCAGC AG G ATGCTCCCTT 
GGCTTGGATT ATCAGGTTTT 
ATCAGCCTCA ATTTTAATCA 
TCTTTCAAAT ATGGAGCCTG 
CATCTCAATC CCAATATCCC 



TCAGTTTGAC 120 

TGATTTTACA 180 

ATCCTACTAT 24 0 

TAAGG AAAAG 300 

GACAGAGGCT 360 

GCCAGCTCCT 4 20 

TGAGATTGAT 4 80 

TTTGGAGCAG 54 0 

AATTACCTAC 600 

TTTTGTTGTC 660 

CTAGGAACGA 720 

ATGACAGGTT 7 80 

AAGAGTCACC 84 0 

GCCTTGACGG 900 

ACTATATCAT 960 

TCTATATTTT 10 2 0 

ATTTTGCTCA 10 80 

AAGGCTACGT 114 0 

GACTTGAGCA 12 00 

TTACAATCGC 12 60 

CAAACGCATG 13 20 

CTTGGTG CTG 13 80 

CTATCATGAG 14 4 0 

GACCTATGTG 1500 

GTTTGCTTTT 15 60 

G AC CAATATT 16 20 

GTGTGTCTTC 16 80 

CCTCTGTCCA 17 4 0 

CTATCTGCTC 1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 18 60 

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG G ATT CACCT A 1920 

AAAT CAAAG A AATTT ATTG A TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GAT GCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 204 0 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AG AAG ATTG A TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGT A 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2 280 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 234 0 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCT AG AC C 24 00 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 24 60 

TCT ATGCT AA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA . 25 20 

CAGACACTAG CAAGGGCGAC CCGCTTCGTA AGCAAAC7TA TCAATTACTC GCGGCTGATG 2580 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2640 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 27 00 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TCGTCTTCGA GCGAAACTAG 27 60 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTCAAAAA ACAGCTCAAT TATTCAGATG 2820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 29 40 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AAT AAAAG AG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCT GGAAG AT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTG AC C AA C TCCAAAACG ? 3180 

AGGAGATGTG GTT AAAAT C A TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 124 0 

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGG 3 300 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 33 60 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 34 20 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATC TTTTCAACAA CCT AT CG AAA 3480 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 354 0 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3 600 
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AAAATAGAGA TGATTTTGAA ACAAAAAAGC TAATTCAAGA CGTTTCGATG CCAATTCAAG 3 660 

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACG AT G TCACAATTTG 3 7 80 

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 38 40 

TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3 900 

GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC T AT ACT TGG A GGTGTGGCCT 3 9 60 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTT AAAAGT TTTATTATTG GTTTAGTAGT 4 020 

TGGTATATTT GGTCCTTATA TGGATGATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 4 080 

GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 4140 

AAATTATTTT GCGCATCTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4 200 

TGAATCGTAA TTTAGAACGG TCTTATCTAT TCTGACTAGG AATAGATCAT ACCAGAGGTA 4 2 60 

GCTTAGAAAT AGCAGAGACA TTACAAATTC AAGTAATAAA TAGGATGTCG TAAGTGTTAC 43 20 

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTCCTTT 4 3 80 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTG CCT ATTTTTTTAT CCAAAAAGTG 4 440 

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 4 500 

GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 4 560 

ACGTCGCCTT GCCCTATATA TCTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4 62 0 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4 680 

GGGAAAAGGA GATGAATATG AAATTTG GG A AACGTCATTA TCGTC CGCAG GTGGATCAGA 474 0 

TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4 800 

TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 4 8 60 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT T AAGGC AG AT ATCACGCTTT 4 92 0 

TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4 980 

TCCACTACTA TGTGGTGACT GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 504 0 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG AC AGG AGTG A 5100 

CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGGTCTGC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 5 220 

CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 5 2 80 

CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 5 34 0 
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TCTACATCTT 


CCAGCAAATC 


TTGTCTTACG 


CTCAGGAGTA 


TCTCTTGCTT 


GTTTTGGGGC 


5400 


AACCCTTCTC 


GATTGACGTG 


ATTTTGTCCT 


ATATCAAGCA 


TGTTTTTC AC 


CTCCCTATGT 


5460 


CCTTCTTTGC 


GACACGCAGG 


ACAGGGGAGA 


TCGTGTCTCG 


TTTTACAGAT 


GCTAACAGTA 


5520 


TCATCGATGC 


GCTGGCTTCG 


ACCATCCTTT 


CGATTTTCCT 


AGATGTCTCA 


ACGGTTGTCA 


5580 


TTATTTCCCT 


TGTTCTATTT 


T C ACAAAAT A 


CCAATCTCTT 


TTTCATGACT 


TTATTGGCGC 


5640 


TTCCTATCTA 


CACAGTGATT 


ATCTTTGCCT 


TTATGAAGCC 


GTTTGAAAAG 


ATGAATCGGG 


5700 


ATACCATGCA 


AGCCAATGCG 


GTTCTGTCTT 


CTTCTATCAT 


TGAGGACATC 


AACGGTATTG 


5760 


AGACTATCAA 


GTCCTTGACC 


AGTGAAAGTC 


AGCGTTACCA 


AAAAATT G AC 


AAGGAATTTG 


5820 


TGGATTATCT 


GAAGAAATCC 


TTTACCTATA 


GTCGAGCAGA 


GAGTCAGCAA 


AAGG CTCTG A 


5880 


AAAA GGTTG C 


CCATCTCTTG 


CTTAATGTCG 


GCATTCTCTG 


GATGGGGGCT 


GTTCTGGTCA 


5940 


TGGATGGCAA 


GATGAGTTTG 


GGGCAGTTGA 


TTACCTATAA 


TACCTTGCTG 


GTTTACTTTA 


6000 


CTAATCCTTT 


GGAAAATATC 


AT C AATCTG C 


AAACCAAGCT 


TCAGACAGCG 


CAGGTTGCCA 


6060 


ATAACCGTCT 


AAATGAAGTG 


TATCTAGTAG 


CTTCTGAGTT 


TGAGGAGAAG 


AAAACAGTTG 


6120 


AGGATTTGAG 


CTTGATGAAG 


GGAGATATGA 


CCTTCAAGCA 


GGTTCATTAC 


AAGTATGGCT 


6180 


ATGGTCGAGA 


TGTCTTATCG 


GATATCAATT 


TAACCGTTCC 


CCAAGGGTCT 


AAGGTGGCTT 


6240 


TTGTGGGGAT 


TTCAGGGTCA 


GGTAAGACGA 


CTTTGGCCAA 


GATGATGGTT 


AATTTTTACG 


6300 


ACCCAAGTCA 


AGGGGAGATT 


AGTCTGGGTA 


GTGTCAATCT 


CAATCAGATT 


GATAAAAAAG 


6360 


CCCTGCGCCA 


GTACATCAAC 


TATCTGTCTC 


AACAGCCCTA 


TGTCTTTAAC 


GGAACGATTT 


6420 


TGGAGAATCT 


TCTTTTGGGA 


GCCAAGGAGG 


GGACGACACA 


CGAAGATA7C 


TTACGGCCGG 


6480 


TCGAATTGGC 


AGAGATTCGA 


GAGGATATCG 


AGCGCATGCC 


ACTGAATTAC 


CAGACAGAAT 


6540 


TGACTTCGGA 


TGGGGCAGGG 


ATTTCAGGTG 


GTCAACGTCA 


GAGAATCGCT 


TTGGCGCGTG 


6600 


CTCTCTTGAC 


AGATGCGCCG 


GTCTTGATTT 


TCGATGAGCC 


GACTAGCAGT 


TTGGATATTT 


6660 


TGACAGAGAA 


GCGGATTGTC 


GATAATCTCA 


TTGCTTTGGA 


CAAGACCTTG 


ATTTTCATTG 


6720 


CTCACCGCTT 


GACTATTGCT 


GAGCGGACAG 


AGAAGGTAGT 


TGTCTTGGAT 


CAGGGCAAGA 


6780 


TTGTCGAAGA 


AGGAAAGCAT 


GCTGATTTGC 


TTGCACAGGG 


TGGCTTTTAC 


GCCCATTTGG 


6840 


TCAATAGCTA 


GAAAGAGGAG 


AGGATGAAAC 


CAGAATTTTT 


AGAAAGTGCG 


GAGTTTTATA 


6900 


ATCGTCGTTA 


C C AT AATTTT 


TCCAGTAGTG 


TG ATT GT ACC 


CATGGCCCTT 


CTGCTTGTGT 


6960 


TTTTACTTGG 


CTTTGCAACT 


GTTGCACAGA 


AGGAGATGAG 


TTTGTCCACT 


AGAGCTACTG 


7020 


TCGAACCTAG 


TCGTATCCT7 


GCAAATATCC 


AGTCAACTAG 


CAACAATCGT 


ATTCTTGTCA 


7080 


ATCATTTGGA 


AGAAAATAAG 


CTGGTTAAGA 


AGGGGGATCT 


TTTGGTTCAA 


TACCAAGAAG 


7140 
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GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT GGACATGCTA AAGGATCAAA 7 200 

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGACG 7 260 

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7 3 20 

GGCCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 73 80 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC GATTACCAGA 7 4 40 

CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7 500 

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCACTTG 7 5 60 

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7 6 20 

ATGCACGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCAGTCAA TTGGAATCCC 7 6 80 

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 774 0 

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7 800 

CGAGTCAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATCGTTGCAG 7 8 60 

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AG AAGGG AAA GCCAAACTCA 7 920 

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGCTCATTCT GTTCGCTATA 7 980 

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC TACT ATT AC A AGTATTGATG 804 0 

CG AC AG CT AC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGCCGGAG ACTAATCTAA 8 LOO 

CTTCGGAGCA GGCTGAAAAA CTTAGCTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCCT 8 2 20 

GTTTTT AG AG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 8 280 

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8 340 

GTGGT AAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATGTCAAA ACAATTGATC 8 4 00 

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAAT CTT AT TATTTCAACT 8 4 60 

TACAAGGACC AGGCGACTGC TTTCAACCGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8 52 0 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8 580 

ACTCACTTTC TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640 

CCTTTGGAAG TCCTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 87 00 

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 87 60 

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 88 20 

ATTGC CT ACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTGGTTTGCT 888 0 
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GAGATTCGCC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 89 4 0 

ATTATCTTCG C AGACGAATT TTCACCAGAT AACTGCC G CT TGTGGGACGC TCATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 912 0 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGCGTTTAA AATAACAACC 92 40 

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 9300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9 3 60 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 94 20 

GT ATTTG ACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC AT GTTTT AG A TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 954 0 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCCTC ACAGGAAGCC 9 600 

TTGCTTTTCT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 9 660 

AATAAAGATA TTGATG CG AC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720 

GATTCTCGTT T C AAGG AT AT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 97 80 

AAGACCATTC C C AAATTG AC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9 840 

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9 960 

GACCACTGCC GTCATACGAC TTTTG AG AC A GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT G AC AAGT AT A TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 1014 0 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 10200 

G AAATTG AAG TGG ACGTTG A T GGTGTC AAG GAACCTTGGC TCCTCATGTT T AAAAACG AA 102 60 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGG AG CGGCTACCTG TATTGGTGGA J.0320 

GCTATTCCTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC G ATTTCGG AA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT T C AT ATGGT A ACCAGATTGG GCTTGCAACA 10500 

AC CT AC GTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACGTATGGA ACTTGGTGCC 105 60 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTG AAA AACCTGAAGC AGGTGATGTG 10 620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT ' 10680 
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AAGGTTCAAA CAGTTGAGTC TGTAGAGACT GCTGGTGCTG AGGTTCAAAA AGGAAATGCC 1074 0 

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGGC AGGCGGTGTC TGTCTGGCTA TCGGTGAATT GGCAGACGGT 10860 

CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT 10920 

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCGTGGTTC GTCCTGAAGA TGTGGATGCG 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTGACTTGGA GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGGTTCTA 11220 

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11280 

CGCTCAACGG TTAATCACCC ACTTGCTGGT CGTTACCAAC TCACACCAAC TGAGGCATCT 11340 

GTGCAGAAAT TCCCAGTTCA ACAC GGTGTG ACTCATACTG CGTCGGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTGCTTATGC GGTTATCGAA 114 60 

GCAACTGCTC GTTTGGTGGC TCCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520 

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CT AGGTTCT A TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TGGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 117 60 

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TGTCAAATAC 11880 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGC7 ACCTTTGGAA ACTATATTGG TGCAGAGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGG CTTTGT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AG AG AAGGT T GGACAAACGA AAGCAGACTT TACACTGACT 12 060 

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG C ATTTC AAGG GACATTGGAA 12120 

GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12190 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTG AAAAAC CTGTGGTTTA CATCCCAGTC 122 40 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGCCCT TCGAAAAAGA AGGTGCAGAG 12 300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC AGTTGAAACT 12 360 

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 124 20 
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GATGAACCAG 


ATGGTTCAGC 


TAAGTTTATC 


GTCAATATCC 


TGCTTAATGA 


AAAAGTGCGT 


12 480 


GTGGCTATTG 


ATAGCTTTAT 


CGCCCGTGGT 


GGTTTG ATT A 


TCGGTATTTG 


TAATGGATTC 


1254 0 


CAAGCCTTAG 


TCAAATCGGG 


TCTCCTACCC 


TACGGAAACT 


TTGAAGCTGC 


TAACAGTACT 


12600 


AGCCCAACCC 


TCTTCTACAA 


TGATGCCAAC 


CAACACGTGG 


CCAAGATCGT 


GGAAACTCGC 


12660 


ATTGCCAATA 


CCAACTCACC 


ATGGTTGGTT 


GGTGTGCAAG 


TGGGCGATAT 


C C ACGCT ATT 


12720 


CCTGTTTCGC 


ACGGTGAAGG 


GAAGTTTGTC 


GTGACGGCTG 


AGGAATTTGC 


AGAGCTCCGT 


12780 


G AC AATG G AC 


AAATTTTCAG 


CCAATACGTT 


GACTTTAACG 


GTAAACCAAG 


TATGGATTCT 


12840 


AAGTACAATC 


CGAATGGTTC 


TGTCCATGCC 


ATCGAAGGAA 


TTACCAGCAA 


GAATGGTCAA 


12900 


ATCATCGGTA 


AGATGGGCCA 


CTCAGAACGT 


TATGAGGATG 


GTCTTTTCCA 


AAATATCCCA 


12960 


GGCAATAAAG 


ACCAACACCT 


GTTCGCATCA 


GCGGTTAAAC 


ATTTCACTGG 


AAAATAAGAC 


1 3020 


TTACAGATTT 


TCTAATAGAT 


AGTATCAGTA 


ATGTAAAAGT 


CATGTAAATC 


TAGCTCTTGA 


13080 


TG ATT AC AAA 


TGAAAATTAG 


GTATAAAAAA 


TGACATACGA 


AGTAAAATCT 


CTTAATGAAG 


13140 


AATGTGGTGT 


T TTCGGT AT T 


TGGGGACATC 


CAGATGCTGC 


TAAGTTGACC 


TATTTTGGAC 


13200 


TCCACAGTCT 


TCAACACCGT 


GGTCAGGAGG 


GGGCAGGAAT 


CCTCTCCAAT 


GATCAAGGAC 


13260 


AACTGAAGCG 


CCATCGTGAC 


ATGGCGCTTT 


TATCAGAAGT 


TTTCAGAAAT 


CCAGCTAATT 


13 320 


TGGATAAATT 


GACAGGAGCT 


GGTGCGATTG 


GGCATGTGCG 


TTATGCGACT 


GCTGGCGAAG 


13 380 


C TTCTGT AG A 


TAACATCCAC 


CCCTTCCTCT 


TCCGTTTTCA 


CGATATGCAG 


TTTGGTTTGG 


13440 


CTCATAATGG 


AAATCTGACC 


AATGCAGCCT 


CTCTCAAGAA 


AGAACTGGAA 


CAAAGAGGAG 


13500 


CAATTTTCAG 


CGCGACTTCG 


G ACT CTG AAA 


TCTTGGCTCA 


CCTCATTCCT 


CGCAGTCATA 


13560 


ATCCTAGCCT 


GATGGGCAAA 


ATCAAGGAAG 


CGCTCAGCCT 


TGTCAAAGGT 


GGTTTTGCCT 


13620 


ATATCTTGCT 


GTTTGAGGAC 


AAGTTGATTG 


CGGCTCTTGA 


CCCAAATGGA 


TTCCGACCGC 


13 680 


TTTCGATTGG 


TAAAATGGCT 


AATGGAGCAG 


TTGTTGT ATC 


TT CTG AAA CC 


TG TGCTTTTG 


13 7 40 


AGGTCATTGG 


TGCCGAGTGG 


ATTCGTGATT 


TGAAGCCAGG 


TGAGATTGTG 


AT C ATTG ATG 


13 800 


ACGAGGGCAT 


TCAGTATGAC 


AGCTATACAG 


ATGATACCCA 


GTTGGCGGTT 


TGTTCTATGG 


13860 


AGTATATCTA 


CTTTGCTCGC 


CCTGATTCTA 


ATATCCACGG 


TGTCAATGTC 


CATACGGCAC 


13920 


GTAAGAGAAT 


GGGAGCGCAA 


TTGGCGCGAG 


AATTTAAGCA 


TG AGGCAG AT 


ATTGT AGTTG 


13980 


GTGTGCCCAA 


TTCTTCCCTA 


AGCGCGGCTA 


TGGGATTTGC 


GGAAGAATCA 


GGCTTACCAA 


14040 


ATGAAATGGG 


TCTGATCAAA 


AACCAATACA 


CCCAGCGAAC 


TTTTATCCAA 


CCGACTCAAG 


14 100 


AATTGCGGGA 


GCAAGGAGTG 


CGGATGAAAC 


TGTCTGCTGT 


TTCGGGTGTT 


GTCAAAGGCA 


14160 


AACGTGTGGT 


CATGGTGGAT 


GATTCCATTG 


TACGTGGAAC 


AAC CTCTCGT 


CGTATCGTTC 


14220 
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ACCTCTTGAA ACAAGCGGGT GCGACTGAGG TTCACGTTGC CATTCGAACT CCTGCACTAG 142 80 

CGTATCCATG TTTCTACGGG ATTG AT ATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 14340 

ATACGGTCGA AG AAACTCG C CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400 

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 14 4 60 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTT ATG ACT A CGAAGAAGAC TATCGTAGAA 14520 

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14580 

AAAAATAAAT GACAAATAAA AATGC AT ATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14640 

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGGATCAAAG CTAGAGCCCT GACTAGTATT 14 700 

TTT AG AT AAA AAGATGGTTT AT CT AAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 147 60 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATC TTGAAGCGGG 14 82 0 

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14 880 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAG ACT GGGG TTAAAGAACC 14 940 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTG GCT A TCAAGTACGA 15 000 

CAAGCACGAT ACCATCGCGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060 

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120 

GCTAGAACAA GTGGTTGCTG CTGTGGC AG A AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180 

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 152 40 

TTTTGCGGTC GGTGTGCCTG AAAAATCTCA AATCATTGAC GGTTCAAAGG TGGTAGAGGG 15300 

AG ATGTT CT T CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 15 3 60 

TCCTCTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTGGAAG GCAACAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAG GCTGTCTTGC CGCTCATCAA 15480 

AG AAG AGTTG GTCAACGGCA TTGC CC AC AT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15 540 

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTGATGAA AGTAAAGTTC CAGTGCTTCC 15 600 

AATTTTCAAA ACC CTTG AAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGACC GTGTAAAAGA 15720 

ATT GTTGG AT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 157 80 

C ATT AT C AAA TGAAAAAAAT AGCGGTTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTC C AGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960 
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AAGGCAGACT ACCAAGCAGC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 1602 0 

TGCCTAGCAC GCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGCCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

CCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAG CTG ATG A TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGCCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16 380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATGTC TATCTGATGT TATTAACTTG GGCAATCTTA TTTAAGTTTG 16 500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGG ATT C CAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16620 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 166 80 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTG CTTAC AG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGGCT 16900 

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 16 980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTCCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTC CCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTT GTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17 280 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 17 34 0 

TGCTTCGTTC AGC AG CGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT .7400 

ACGCTGTGGT TTTGGATGAA TTGGC AG C AA ACGGCGAAAC CTCTTATCAA ACTCGCCAAC 174 60 

GTTTAGCAGC CAAAGTATTT CGTCAC AC AG CGGCTTATGA CGCCTTGATT GCAGAATACT 17 520 

TCACAGCTCA AGTG GGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17 580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17 640 

CT AC AG ACT A CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 177 60 
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TTGTGGCT CT CAAACACATG AATCCATGTG G AATTGGTC A AGCTGATGAC ATCGAGACTG 17320 
CTTGGGACTA CGCTTATGAG TCTGACCCAG TCTCTATCTT TGGTGGGATT GTCGTCCTCA 17880 
ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 17940 
TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 
GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCCA AGTGGAAGCA G AAT AC AC AG 
GTGTAGTCGG TGG ACT TCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 
GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTCCTCTT GAGTTCGCTT 
GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 
TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAG 
CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 
ACGTGGAAGA AATCGCCAAA GC AGGAATT A AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 18420 
GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 18480 
TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 
CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATG TTGGTAAATA 
ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTGAGCAT 
GCGATTGCTA AAAAGTTACT TGAATCAAAA GACCTGGAAA AAGTCTTTGT AGCTCCTGGG 
AATGATGGGA TGACTCTGGA TGGTTTGGAA TTGGTAAATA TCTCTATTTC CGAACATTAT 
AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 
GCCCTTGCTG CTGGTATCGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTCGTCCC 
ACTAGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGCAAAT CATCCTCAAA 
TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 
ATCGAAAAGC ATGGTGCGCC TATCGTAGTC AAGGCCGATC GCTTGGCACT TGGGAAGGGT 
GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAAGCCG CTCATGAGAT GCTTTTGGAC 
AAT AAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AGGAGAGGAA 
TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 
CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACGG GTGGTATGGG TGCCTATGCG 
CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCGC TTGACACCAT TGTCAAGCCA 
GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG G AGTTCTTT A CGCAGGGCTT 
ATCCTGACAG CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 



18000 
18060 
18120 
18180 
18240 
18300 
18360 



18540 
18600 
18660 
18720 
18780 
19840 
18900 
19960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
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ACTCAGATTA TCTTGCCTCC CTTGACCTCT GACTTTGCTC AAAAT AT C AC AGATATCCTG 19560 

GATAGCAAGG AGCCAAATAT CATGTGGACG GACAAGGGTG TGACTCTGGG TGTGGTTGTC 19620 

GCATCCAAGG GCTACCCGCT AGACTATGAA AGGGGC GTT G AGTTGCCAGC CAAGACAGAA 19680 

GGCGATGTCA TCACCTACTA TGCAGGGGCT AAGTTTGCGG AAAATAG CAG AGCACTGCTC 19740 

TCAAACGGCG GACGAGTTTA TATGCTCGTT ACCACAGCAG ATACCGTCAA AGAAGCCCAA 19800 

GCCAGCATAT ACCAAGAACT ATACCAACAA AAAAT AGAAG GAC TCTTCT A CCGAACAGAT 19860 

ATCGGAAGCA AGGCAATTAA GTAAAGATAT AAGAATAACG CGCCGTAGTC GCCAAACACG 19920 

ATAATGGTCG TCGTGGTGAA AAGACCAGAA CAGTGAATGT TCTGGTCAGG GGGAAACTTG 19980 

GAGACCTTAG GCTCAAAGTT TAGGAATGAA ACCGAAGGTT TGCTTCCGCC TCCATCACCT 2 0040 

AAGACCATTA TCAAAAAGAA AAATAAAAAT TCACAAAATA CCTTAATGAT CGTATGGTTT 20100 

GCGAGCGTTA GCGAGCTAAT ATAGAACAAT CACCGCCGTT GTGAAAGAAC GATTGGATGA 20160 

TAATCCAATC GTTCAGGGAA ATTGGAAGAC CTTGGGTTTC CAATTTAGGC ATGAGACACC 20220 

TTTGGTGGCT GCTGCCGTCC CTCACAAGCT AAGGTGATTG TTGAAAAAGA GGAAAAAGGA 20280 

GAAGAAATGA AACCAGTAAT TTCCATCATC ATGGGCTCAA AATCCGACTG GGCAACCATG 203 40 

CAAAAAACAG C AGAAG TCCT AGACCGCTTC GGTCTAGCCT ACGAAAAGAA AGTTGTTTCC 20400 

GCACACCGTA CACCAGACCT CATGTTCAAA CATGCAGAAG AAGCCCGTAG TCGTGGCATC 20460 

AAGATCATCA TCGCAGGTGC TGGTGGCGCA GCGCATTTGC CAGGCATGGT AGCTGCCAAA 2 0520 

ACAACCCTTC CAGTCATTGG TGTGCCAGTC AAGTCTCGTG CTCTTAGTGG AGTGGATTCA 20580 

CTCTATTCTA TCGTTCAGAT GCCGGGTGGG GTGCCTGTTG CGACCATGGC TATCGGTGAA 20640 

GCTGGAGCGA CTAACGCAGC TCTCTTTGCC CTCCGTCTCC TCTCTGTAGA AGATAAGTCC 20700 

ATTGCGGATG CACTTGCCAA CTTTGCTGAA GAACAAGGAA AAATCGCAGA GGAGTCGTCA 207 60 

AATGAGCTCA TCTAAAACAA TCGGAATTAT CGGTGGCGGT CAACTGGGTC AGATGATGGC 20820 

CATTTCTGCT AT CTACATGG GCCACAAGGT TATCGCGCTG G ATCCTG CGG CGGATTGCCC 20880 

GGTCTCTCGT GTGGCGGAAA TCATTGTGGC ACCTTATAAC GATGTAGACG CCCTCCGTCA 20940 

GTTGGCACAC CGTTGCGATG TCCTCACTTA TGAGTTTGAA AATGTCGACG CTGACGGTTT 21000 

GGATGCCGTT ATCAAGGATG GACAACTCCC TCAAGGAACA GATCTGCTCC GCATTTCGCA 21060 

AAATCGTATT TTTGAAAAGG ACTTTTTGTC AAACAAGGCT CAAGTCACTG TGGCACCCTA 21120 

CAAGGTCGTG ACTTCTAGCC TAGACTTGGC AGATATCGAC TTGTCGAAAA ACTATGTCCT 21180 

CAAGACTGCG ACTGGTGGCT ACGATGGT C A TGGACAAAAG GTTATTCGTT CAGAAGCAGA 212 40 

CTTGGAAGCA GCCTATGCGC TAGCAGACTC AGCAG ACT GC GTCTTCGAAG AATTTGTCAA 21300 
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CTTTGACCTT 


' GAGATTTCTG 


TCATCGTGTC 


AGGAAATGGC 


AAGGAGGTGA 


CGTTTTTCCC 


21360 


AGTTCAGGAA 


AATATCCACC 


GCAACAATAT 


CCTGTCTAAG 


ACCATCGTAC 


CAGCCCGCAT 


21420 


TTCTGAAAGT 


CTAGTAGACA 


AGGCTAAAGC 


TATCGCAGTG 


CGAATCGCAG 


AACAACTCAA 


21480 


CTTGTCTGGA 


ACTCTCTGTG 


TGGAAATGTT 


TGCGACAGCT 


GATGACATCA 


TTGTCAATGA 


21540 


AATCGCCCCA 


CGACCACATA 


ACTCTGGGCA 


CTATTCTATT 


GAAGCCTGTG 


ATTTCTCTCA 


21600 


GTTTGACACC 


CAT ATTCTG G 


GTGTTCTCGG 


AGC AC C ATT A 


C C AGT CATC A 


AACTCCATGC 


21660 


GCCAGCCGTT 


ATGCTTAATG 


TCCTCGGTCA 


GCATGTCGAG 


GCTGCTGAAA 


AATATGTCAC 


21720 


AG AAAATC C A 


AGCGCCCACC 


TCCACATGTA 


TGGTAAAATA 


GAAGCAAAGC 


ATAATCGTAA 


21780 


GATGGGACAT 


GTGACTTTGT 


TTAGTGATGT 


GCCGGATAGT 


GTGGAAGAGT 


TTCGGGAACC 


21840 


GATTGATTTT 


TAGGACAAGT 


CTATGATACA 


AATTATCGTT 


AATACATTTA 


TTGAAAAGTA 


21900 


TAAGACTGGA 


GCAGTTGTTG 


AAGTGT TGT A 


TGCCAGTGCT 


GACCAAGATA 


AGGTACAAGC 


21960 


TAAATATGAA 


GAACTAGCTG 


CACAATACCC 


CGAAAATT AT 


TTAGCTATCT 


AT AATGT A CC 


22020 


GCTGGATACG 


GATTTGAATA 


CACTAGATCA 


TTACCCGTCT 


GTGTTTATTG 


GAAAAGAGGA 


22090 


GTTTGAGTAG 


AAATCTTGGT 


TTACCTAGAT 


AGCTTATTCC 


CAACAGCTTA 


AGAAGAAAGG 


22140 


AAAAATTAAC 


ACATGATCAA 


CCGTTACTCT 


CGCCCTGAGA 


TGGCGAATAT 


TTGGAGTGAA 


22200 


GAAAATAAAT 


ACCGTGCTTG 


GCTTGAGGTG 


GAAATCCTCT 


CTGACGAGGC 


ATGGGCTCAG 


22260 


TTGGGGGAAA 


TCCCTAAGGA 


AG ATGTGG CT 


TTGATTCGCA 


AGAAGGCGGA 


CTTTGACATC 


22320 


GACCGTATTT 


TGGAAATTGA 


GCAGGAGACG 


CCCCACGATG 


TGGTGGCTTT 


CACGCGTGCG 


22380 


GTTTCTGAGA 


CTCTTGGTGA 


AGAGCGCAAG 


TGGGTTCACT 


ATGGGTTAAC 


TTCTACTCAC 


22440 


CTGGTGGATA 


CTGCTTATGG 


TTACCTCTAC 


AAGCAGGCCA 


ACGACATCAT 


CCCTCGTGAC 


22500 


CTTGAAAACT 


TCACTAATAT 


CATCGCTGAC 


AAGGCCAAGG 


AGCACAAGTT 


CACCATCATC 


22560 


ATGGGGCGTA 


CTCATGGTGT 


GCACGCTGAG 


C C G AC AACPT 








TGGTACAGCG 


AAATGAAACG 


CAATATCGAG 


CGCTTCGAGC 


ATGCGGCTGC 


TGGTGTAGAA 


22680 


GCTGGTAAGA 


TTTCTGGTGC 


GGTTGGGAAC 


TTTCCCAATA 


TCCCACCATT 


TGT AG AGG A G 


22740 


TATGTCTGCG 


AT AAACTTG G 


CATCCGTGCC 


CAAGAAATCT 


CTACACAAGT 


CCTTCCTCGT 


22800 


GACCTTCACG 


CTGAGTACTT 


TGCGGTTCTT 


GCCAGCATTG 


CGACTTCAAT 


CGAACGTATG 


22860 


GCGACTGAGA 


TTCGTGGTCT 


ACAAAAATCT 


GAGCAACGCG 


AAGTAGAAGA 


GTTCTTTGCT 


22920 


AAAGGGCAAA 


AAGGGTCTTC 


AGCAATGCCT 


CACAAACGCA 


ACCCAATCGG 


TTCTGAAAAT 


22980 


ATGACTGGTC 


TGGCGCGTGT 


CATTCGTGGT 


CACATGATTA 


CGGCTTATGA 


AAACGTCGCT 


23040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC GTATCATCAC ACCAGATACG 2 3100 

AC C ATTTTG A TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 2 3 220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23 280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23 340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 2 3400 

ACACCAAACG AGTGG AT GAT ATCTTTGAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 23 4 60 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AG AC TT AG TC TTCTTTTCTT 2 3520 

TTAGTGAGTC C A T AGGCTGC TAGTGTGGAC ATGAGTCCTG CG ACT AC TAG TCCTGCAGAA 23 530 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 2 3 640 

AGAACTTTGC TTTCCTCAGC AGG AG C AG TT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 2 3700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 2 3760 

GTTCCGACTT CGACTATTTG AGTAACGCCT TCCTGTGCTA CG AC ACT ATT T AC AAGTGTT 23820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 2 3 880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGG ATTTTC ACAAGTCACT 23940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 2 4 060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGCAATTTC TTCAGTTCTT 24 2 40 

GTGATCAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGCCTTCCT 24300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 24 3 60 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAG CTG TTTT AGCTGCGTCA 24 4 20 

AGGGCCTCAA GACTTTCTTT ACT AT AT C CT TCTAAGTTTG TAGGAATTTT AGCTAATTCT .;4 4 80 

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGG C ACT C 2 454 0 

ATCAGTTCAA AGATTTCCTC TTCCTTGTAT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 2 4 600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 2 4 660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 2 472 0 

TTTTGACCAA CAATGTAGTA CCAGTCACCG TTCGTATTCA AG AATTT AT A GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC. CTTTAGACCA GTAAGAAATC 2 484 0 
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AAGACATCTT TGTCAAACTG AACATCCTCC 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC 

AATTTGCCAT AGAGTTGATA CCACTTCAGG 

TTGGCGTATT CGTCAGTACC AAAGTTGAAA 

TATTTACCGA TGAGGGCTTT TACAAAGTTC 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA 

ACCAGCATAG CATCCATGTG ACCTGGACTG 

GATTTAGCGT AT TC AATT AG CTCTGTTACT 

TCGTCGTAGT AACCTTTACT TCCTTCGATA 

GTTTTTCCGT TGGCAGTAAT GGTCATATCA 

AGAAGGAGAT GGACATCAGA ATATCCGAGC 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA 

TTTTCAACCT C ACGTTT AG C TTCTTCTTCT 

TTGACAGTTT CTTGAAGTTT AGCAATGGCT 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT 

GCAGTATAAC GGTTCAGGTC TTTTGGTACC 

TCAGCTGCGA AGTATTCACC GTTGGCATTT 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA 

GGGATATCAG CTCCTTCTGT TTTTGGTACA 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG 

CAAGCATCAT TGGTATTAAG GATTTGGTGA 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA 

TGCGATTTTA CAATACGAGC GAGGTCATTG 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA 

TCTGTCGCAT CATTGGCATA TTCATCAAGT 

AAATAAGCAG CATACTTGTC GATAAGGGCT 

AGATCGACAG TACGGGCTGA TTTCTTCCCA 

AATTCTTTCA TGGCATTGAG AATCGCATCC 



675 

TTGTCTTCAT AGTAGAAGCC 
ATAGCTGCGA GGGTGTTGGC 
TAGTACCAGC CTTCGGCACT 
AT CTTTGTTT TACCTGCAAA 
ATCGCTTCTT CGTTTTTCAA 
GGATTTTTAA TACCTAATTT 
TTAATAGCTG GGATGAGACC 
TCTGCCTGTG TTAGTGCAGT 
ATAGCTTTTT TAACGTCATC 
TCGAGTAGAA AGCGAAGTCC 
TCACTGGCCT TGTCTACGAT 
TCGATTGAGA TTACCTTCTT 
TTTTGAGCTT CAGGCGTGAG 
TGATCAATCG TATCTTGTTG 
TCAGCTTCTT TTACGGCCGT 
TCCTTAAGTG CTTGCTCTGC 
GCAAAATGAC GCATGAGTTT 
GCCCAAGCAG CTACCATACC 
GAAGTGATTG GTGTGTTTTT 
CCATCAGCGT TTCGTCCAAG 
CCTTTTTCAC CTAGTAGTTT 
ATAGAAACGA TGATGTCTTT 
CCGTCGTTAA AACCCATTCG 
GCGTAGGCAA TAAATTTTTC 
TCAGCTTGAA GCACACTCCA 
CCGATGTTGA AG ATT TC AGT 
TTTGTAAAAG CGACAGCTTG 
AAATAGCTAA AGTTAGGGTT 
ATGTGTCCAG GACTATTTAC 



ATCGTTGAAG 24 900 

ATATTCGGCA 2 4 960 

AGTCGCATCG 2 5020 

GAAGTCCATG 2 5080 

GTCCATAGTT 25140 

TTCCATGGC A 2 5200 

GATGTCCTTA 2 5260 

ACCGTTTGGA 25 320 

ACT AGCATAG 2 S 3 8 0 

GTCATTTCCT 2 5440 

GCCTTTGAGC 25500 

TTTGGCAACT 2 5560 

GGTCAACTTG 2 5620 

GGCACGGCTA 2 5680 

GACCCTTTCT 257 40 

AG ATTCAT AA 25800 

GAAGAGGCGT 2 58 60 

ACCGATGATT 2 5 920 

AATACCATTG 25980 

AACGTAGTAC 2 6040 

AGAAG AAGCG 26100 

GTCAAAACTA 26160 

TTTG AGACCG 262 20 

ATAGCCTTTT 262 80 

ACCTTTAGCA 263 40 

CTTTTTCGCG 2 6400 

TTCGTTGTCA 264 60 

TTGGATTCCC 26520 

TGTCGGAATC 2 6580 
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AGACCGATAC 


CTTTATCTTT 


GGCATAGTTA 


ATCAGATCTG 


TCATTTGACT 


TTCTGTTAAG 


26640 


TGATTGCCGT 


TTGGATCGTT 


GTAATAATCA TTTGTACCTT TTTCAATGGC 


GCGTTTGACA 


26700 


TCGTCACTGG 


CATAGGTCTT 


GCCGTTAGCT 


GTGATGCTCA 


TATCGTCCAA 


CATGAAACGG 


26760 


AGTCCATCAT 


TTCCGACTAA 


TAGGTGTAAA 


TCAGTGTAGC 


CATAATGTTT 


CGCTTTATCG 


26820 


ATCATTTCCT 


TGAGCTGTTC 


TGGTGAGAAA 


TATTTACGTC 


CAGCATCAAT 


AGAAACAATT 


26880 


TTCTTTTTCG 


CTAGTTTTTC 


ATTTACAGTT 


GCAGCACGTT 


CCTTTCCTGC 


CTCTGTTGCC 


26940 


GGTTTGTCAG 


CCTCTGCTTT 


CGCTTCATCT 


TTTTTAGCTG 


GTTTATCCTT 


GTCAGTCTTG 


27000 


TCTGTATTTG 


ACTCTTTAGA 


ATCAACCTCT 


TTCGCTTCTT 


CCTTTTTAGG 


GCTAGCTTCT 


27060 


TCTGCCTTTT 


TATTAGCAGT 


TTCTTTTTCA 


GCAGAAGTTG 


GAGTTACCAC 


TTCTGCTTTA 


27120 


TCACTAGGAG 


TTGAACTAAC 


TTCCTCTTGT 


GGTTTTTCTT 


CTGTTTTTGG 


AAGACTAGCT 


27180 


ACCTTATCAG 


TAGCTGGAGT 


TTC TGTTTCT 


ACAGTTTTTG 


GAGCTTCTGG 


TTGAAGCACT 


27240 


GCTTTAGGTG 


TTTCCTCAGT 


CCGATTTTCG 


GATGATTGAG 


GGGAATCACA 


AACCGTATGG 


27300 


ATGGTCGGTT 


GCTTTTCTGT 


AGTAGTAGGA 


GT AACTCC AT 


CGGCTGCAAC 


AGTCTGTGCT 


27360 


TGGAAGGCAA 


ATCCAATTAG 


AACAGAAGCT 


CCTCCTACAG 


CGTATTTACG 


AATAGAAAAA 


27420 


CGC TGTTGTT 


TTTCATGTTT 


CATTGCAAAA 


CCTCCTGATT 


GCATTGTTAT 


ATTGATAGCG 


27480 


ATTATATAAA 


TCAACGCCTT 


TATTTTATTT 


CTTATATTAA 


TTTCTTATAT 


TAACGAGAGT 


27540 


CAAGAGGAGA 


TGACAAAAAA 


CTATAATAAG 


TATAAAAAAA 


TATAAAATTT 


AAACTTAAGA 


27600 


TTTCAGATTG 


GTCGGAAAAA 


ATA CGT AT AT 


ATATCTAGTA 


TAATTTTTGG 


TTCTATTTCT 


27660 


ATAAAATATT 


CC AC AAATT A 


T AG AATTTT C 


CAAAAATAGG 


TAAGCGCTAC 


CTTTTTGGTG 


27720 


TAGTATAATA 


AGCATAGAAA 


AAGCCCAAGC 


GATTAGCTCA 


GGTTTTCTTC 


TTAGTGATCA 


27780 


CGCTCACATG 


AGATAAATTT 


AATCTTGTAG 


TAATCAGATC 


GTTTGTAAGT 


TTCACTGTAT 


27840 


TCTAAAACTT 


GGCCAGTTGA 


TTCGAGTTTG 


GTGATTTTAG 


TTTGTAGGAC 


AGTAGGGAAT 


27900 


TGTTCATCGA 


CTCCGAGGAC 


TGAAGCTGCA 


TGTTCTGGAC 


TTGG AAAG AC 


TATTTCCTTG 


27960 


ATTTCTTCAA 


AGTGTT CATC 


ATTCATGTGA 


ATGTGGTAGT 


CTAACTTGAA 


ACGATTATAC 


28020 


ATAGAACTAT 


AGTATTCAAG 


GTTTGGATAA 


TTTGCGTTGA 


TATATTGTTC 


TGGGATGTAG 


28080 


GATGTATGGT 


AGATATAAAC 


GACACCGTTT 


GATTCGCGGA 


TACGTTCAAT 


CTTGTAGTAG 


28140 


AATTG ATCGC 


CGCGTAGACC 


CAATTTTTCC 


AAGTAAACAA 


GCTTGTTTCC 


CCGTTCAATT 


28200 


GAAAGAACAG 


TTACCTTATC 


ATCTTTAGCA 


TTGAAGAGTT 


CAATATCTGA 


AAACTCTACA 


28260 


AGCTTGTGTT 


TGCGTGCACG 


TGAAACGAAG 


, GTTCCTTTTC 


CTTGTTGGCG 


GACAATATAG 


28320 


CCATCTTTGG 


i CAAGGTCGTT 


TAAGGCGCGA ACAACTGTGA 


, TAGAGCTGAC 


! ATCGTACATT 


28380 
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GAAATGACTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 284 40 

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 2 847 3 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCCC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AG AAACTG AT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 24 0 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG CAAATTATTT CT ATTG AT AT 4 20 

CTCAGGTCTT GACCCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 4 80 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTCATGAAA TTCACCAAAT 54 0 

TCTTGGTGCT GGTAGCACTC GTGGAGACAG TGGTTCTAAA GCACTTCCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 72 0 

TCCTTCGGCA GAGAATACTT TTAAAATTCT T C AAGG AATT CGTGACCTCT ATCAACAACA 7 80 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCACCG GTGGATTATT CTGTTCAATA 84 0 

CATTCCTCAA CGTAGCTTGC C AG AT AAGG C TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCCGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 9 60 

AAAAG AC AAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 102 0 

AACACGCATT GC AG AATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 12 60 
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TTTTGATGAA 


GGAAATCGCC 


CAATCGGCAA 


CTTCCTCTTT 


GTAGGGTCTA 


CTGGGGTTGG 


1320 


TAAGACGCAG 


CTTGCTAAGC 


AATTGGCACT 


CGATATGTTT 


GGAACCCAGG 


ATGCGATTAT 


1380 


CCGTTTAGAT 


ATGTCTGAAT 


ACAGTGACCG 


CACAGCTGTT 


TCTAAGCTAA 


TTGGT AC AAC 


1440 


AGCAGGCTAT 


GTGGGTTATG 


ATGACAATAG 


CAATACCTTA 


ACAGAACGTG 


TTCGTCGCAA 


1500 


TCCATACTCT 


ATCATTCT CT 


TGGATGAAAT 


TGAAAAGGCT 


GACCCTCAAG 


TTATTACCCT 


1560 


TCTCCTCCAA 


GTTCTAGATG 


ATGGTCGTTT 


GACAGATGGT 


CAAGGAAATA 


CAGTAAACTT 


1620 


CAAGAACACT 


GTCATTATTG 


CGACCTCAAA 


TGCTGGATTT 


GGCTATGAAG 


CCAACTTGAC 


1680 


AGAAGATGCG 


GATAAACCAG 


AATTGATGGA 


CCGTTTGAAA 


CCCTTCTTCC 


GTCCAGAATT 


1740 


CCTCAACCGC 


TTTAATGCAG 


TCATCGAGTT 


CTCACACTTG 


ACTAAGGAAG 


ACCTTTCTAA 


1800 


GATTGTAGAT 


TTGATGTTGG 


CTGAAGTTAA 


CCAAACCTTG 


GCTAAGAAAG 


ACATTGACTT 


1360 


GGTAGTCAGT 


CAAGCGGCTA 


AAGATTATAT 


CACAGAAGAA 


GGTTACGACG 


AAGTCATGGG 


1920 


GGTTCGTCCT 


CTCCGTCGCG 


TGGTTGAACA 


AGAAATTCGT 


GATAAGGTGA 


CAGACTTCCA 


1980 


CTTGGATCAT 


TTAGATGCTA 


AACATCTGGA 


AGCAGATATG 


GAAGATGGCG 


TTTTGGTTAT 


2040 


T CGTG AG AAA 


GTCTAAGACA 


GAATTTTGAG 


GATAAAAAAG 


AAGGAGCCAG 


CTGAAAAAAA 


2100 


CTGGTTCCTT 


TTTAGGTACG 


ACAGGCATGT 


CCTATAGTAG 


AAGTGTATTA 


TTCT AGTTTC 


2160 


AATATACTAT 


AGTAGCTCAG 


AAGTCGGTAC 


TTAAACGTGC 


TAT ATCAAAA 


CCAGTCCTCG 


2220 


AAAAACGTGG 


ACTGCTTTCG 


TGTTT GG ATT 


ATTACCTTGA 


ACGACATGCG 


TTAAAAGTTA 


2280 


GTTGAACCGC 


CGTATGCCGA 


ATGGT ACGT A 


CGGTGGTGTG 


AGAGGGGCTA 


GAGATTATCC 


2340 


CCTACTCGAT 


TTT AAATC AC 


ATGACGTTCA 


AAGGCATCAT 


CTGAAATCCC 


TTGTTCCAAG 


2400 


ATGAGTTTTG 


CCCATTCTTT 


AGCAGAGAAG 


AGGCTGTGGT 


CCTTGTAGTT 


TCCGCAAGAT 


2460 


TCGATGGTTG 


TCCCTGGGAC 


ATCTTCCCAA 


GTAGTAGTTT 


CAGCGATTTC 


CTTGACCGAA 


2520 


TCCTTGATAA 


CAGCTGCGAT 


TTTAGCACTG 


GTGTGACGTC 


CCCACATAAT 


CATGTGGAAG 


2580 


CCTGTGCGGC 


AACCAAATGG 


TGAACAGTCA 


ATCATGCCGT 


CAATGCGGGT 


ACGGATGAGT 


2640 


TTGGCTAAGA 


GGTGCTCGAT 


AGTGTGAAGG 


CCGGCAGTAG 


GGATAGAGTC 


TTCGTTTGGT 


?700 


TGCACCAAGC 


GAATATCATA 


ATTGGAGATG 


ATGTCTCCTT 


TTGGTCCTGT 


TTCTTCCCCA 


2760 


ATCAAGCGAA 


CATAGGGTGC 


TTTGACAATG 


GTGTGGTCAA 


GTTCAAAACT 


TTCGACAATA 


2820 


ACTTCTTTTG 


ACATGGTAAA 


TCCTTTCAGT 


TTTCTTCTCT 


CATTATATCA 


TAAAGGTTGC 


2880 


TCCTGAGACA 


GAGAGAAAAC 


CTCTCCGAGG 


CTGGAGAGGT 


TGAAATCTTT 


ACTTACGATA 


2940 


TAAGCGGTCG 


TATTGCTAGT 


ATGGGTCAAA 


GGTTACGTTG 


ATACCCAGTT 


TACGAAGGAC 


3000 


ATTCTTGTCT 


TCATCAGTCA 


AG AT G ATGGT 


TGAGTGGGCT 


TCGCTTCCTT 


TGAGGTTGCC 


3060 



WO 98/18931 



PCT/US97/19588 



679 

GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3 120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT T AGTTTT ACT TCTTTAGCGA TGTCAGCTGA 3 24 0 

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3 300 

ACT G ATG ATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 33 60 

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3 4 20 

GAGCAACTCA ATTTTCTTGA CCGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3 4 80 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3 540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3 600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3 7 20 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3 7 80 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AG ATTTG ATG CCATTGATTT GGTCGTGGTA 3 840 

CAT ATT GG AC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3 900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCC C ATG CCTTCTGGGG AAATGATGTC 3 9 60 

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATCAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4 2 60 

ATAACCAGGA AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4 3 20 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4 3 80 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4 500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4 560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4 620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4 680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CT ATG AT ATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA 


TC AG C TG ATT 


GCAGAAGCTA 


AG AAGCG TG A 


CATTCGTATC 


ATCATGGACT 


4860 


TGGTGGTCAA 


T CAT AC CTC A 


GATGAACATG 


CTTGGTTTGT 


CGAAGCCTGT 


GAAAATACTG 


4920 


ACAGCCCTGA 


GCGAGACTAC 


TATATCTGGC 


GCGATGAACC 


CAATGACCTA 


GATTCTATCT 


4980 


TTAGTGGGTC 


TGCTTGGGAA 


TACGATGAAA 


AGTCAGGTCA 


AT ACT AT CTC 


CACTTTTTCA 


5040 


GCAAGAAACA 


GCCGGATCTC 


AACTGGGAAA 


ATGAAAAACT 


TCGCCAGAAA 


ATTTATGAGA 


5100 


TGATGAACTT 


CTGGATTGAT 


AAAGGTATTG 


GTGGTTTCCG 


TATGGATGTT 


ATTGACATGA 


5160 


TTGGCAAAAT 


TCCTGACGAG 


AAGGTAGTCA 


AT AATGGT CC 


TATGCTCCAT 


CCCTATCTCA 


5220 


AGGAAATGAA 


TCAGGCGACC 


T TTGGAG AT A 


AGGATCTCTT 


GACAGTAGGG 


GAGACTTGGG 


5280 


GAGCAACTCC 


AGAGATTGCC 


AAGTTCTACT 


CTGATCCAAA 


GGGGCAAGAA 


TTGTCTATGG 


5340 


TCTTCCAGTT 


TG AAC AT AT C 


GGTCTTCAGT 


ATCAGGAAGG 


TCAGCCTAAA 


TGGCACTATC 


5400 


AAAAAGAGCT 


GAATATCGCT 


AAGTTAAAAG 


AAATCTTCAA 


CAAATGGCAG 


AC AG AGTT AG 


5460 


GAGTTGAGGA 


CGGCTGGAAT 


TCCCTCTTCT 


GGAACAACCA 


TGACCTCCCT 


CGTATTGTCT 


5520 


CAATCTGGGG 


AAATGACCAA 


GAATACCGCG 


AAAAATCTGC 


CAAAGCCTTT 


CCAATCTTAC 


5580 


TTCATCTCAT 


G AG AGG AAC T 


CCTTATATCT 


ACCAAGGTGA 


GGAGATTGGG 


ATGACCAACT 


5640 


AT CCGTT TG A 


AACACTGGAT 


CAAGTAGAAG 


ATATTGAATC 


TCTCAACTAT 


GCGCGTGAGG 


5700 


CTCTTGAAAA 


AGGTCTTCCG 


ATTGAAGAAA 


TCATGGACAG 


TATCCGTGTT 


ATTGGACGTG 


5760 


ACAATGCCCG 


TACCCCTATG 


CAATGGGACG 


AGAGCAAAAA 


CGCTGGTTTC 


TCAACAGGTC 


5820 


AACCTTGGTT 


GGCGGTTAAT 


CCAAATTACG 


AGATGATCAA 


TGTCCAAGAA 


GCGCTGGCAA 


5880 


ATCCAGATTC 


TATTTTCTAT 


ACCT AT C AG A 


AACTGGTCCA 


AATTCGCAAG 


GAGAATAGCT 


5940 


GGCTAGTTCG 


AGCTGACTTT 


GAATTGCTTG 


ATACGGCTGA 


TAAGGTCTTT 


GCTTATATAC 


6000 


GTAAGGATGG 


CGACCGTCGC 


TTCCTAGTTG 


TGGCTAACTT 


GTCCAATGAA 


GAGCAAGACT 


6060 


TGACAGTAGA 


AGGAAAAGTC 


AAATCTGTCT 


TGATTGAAAA 


CACTGCGGCT 


AAAGAAGTAC 


6120 


TTG AAAAAC A 


GGTCTTGGCT 


CCATGGGATG 


CTTTCTGTGT 


GGAATTACTA 


TAAATATTTT 


6180 


TTGCAGAAAA 


ATTTAAAATT 


GAAATCGTAT 


AAAAACAAGG 


GAGGACTGTA 


TAAAAGACAG 


6240 


AAATCCTTTG 


TTTTTTATAA 


CCAAAGTTTA 


TAAACTTTCA 


TTCTTGAAAT 


TCAATTAACT 


6300 


TTACAAATTC 


CC ACT AT T AA 


GGAGAAAGAA 


GATGAACATA 


AAGAAGCGTG 


TCCTTAGTGC 


6360 


AGGCCTGACT 


TTTGCATCTG 


CTTTGCTTTT 


ACCCAAATCA 


TTCATACCTC 


TCTCAACTAG 


6420 


ATGTAACTTA 


CAAAACCCCT 


GACCTCATGA 


GCCACTTTCT 


TCCTCCTCAT 


GAGGTCAGTT 


6480 


TTACTTTCTG 


CTGTTCCAGT 


ATCGTTTTTC 


CTCGCTAGAT 


TTCCTCAAAA 


GGGCAGACTC 


6540 


CTCCCTTGGT 


GCGTCACACG 


ATTTTTTCAT 


CTCGACTGTT 


CTTTAATGCA 


TCATTAACGA 


6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGC AT AG AG A CTAGACAATT TGAGGAGCTG 672 0 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 67 4 9 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 120 

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 180 

ATATTGCTTC TC CTTT AGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 240 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCC AAATTC A G ACTTGG AT A AACTCGCTTA 300 

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 3 60 

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 42 0 

TCAAAACAAT CTGAAGAATA GCTCATCATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 480 

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 540 

CTTTCTCTAT TTTTACTTAG ATAGAGATCA ATCATGGGAC ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCGGCTCG ATAGCCTTGT CCATATAAAA CAAAAACGAG A TTTTG ATG A 7 80 

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 84 0 

TTTTGAAGAT AGGATTC GT A AAATAGTGGG TACTTATGTT TTTTATGGTA ATCATCTAAA 900 

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 960 

CAAATTTGAA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 102 0 

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATT C TT AT AC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 114 0 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 1200 
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TTTCCAAATT CATCAAATCG CATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 1260 

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 132 0 

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 1380 

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 14 40 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1S00 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC 16 80 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG 17 4 0 

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 
(8) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY; linear 



1560 
1620 



1800 
1842 



60 
120 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
TCATCTTTAT CTCCTCGAAA TTTTGTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 
CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC AGATTGAGCA 180 

CTGCATCGTG ACTCATCAAG ACTTG AC C AT ACTCTTGTAA GACTGAGCGA CTG AT ATC AC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 3 00 

TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 3 60 

TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 4 20 

AAAGG ATGT A GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 4 80 

GCTCTTCTTC CTTGACTTCA ACTTG CT CAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 54 0 

TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 
ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 
CTGTATCCAG AAAGCTATCA GGCATCTCTA GCT CATC CAA GATAGCATAA AAGAACTCTT 



720 
780 
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CTGTTTTTTC TTGAGGAACG AG AAAGT C AG CAATCTCCAT TCCACGATCC ATCAAATCCT 84 0 

CTAAAGATAT CGTGATTTTT AAAGTTGTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GAG ACT C C AA AAGAAAATTT 102 0 

CTGCAGACAG AT AG AAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 114 0 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 12 60 

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATG 132 0 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAAC AT CTG CTAGGTTACC AATTGTTGTA 13 80 

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 144 0 

TTAAAGAGAT AAACTAACAC ATT TCC CATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 162 0 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 174 0 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTC 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC AT TC AAATT A 18 60 

CCATCAATCA ATTTCCC AT A AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 19 80 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 204 0 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGC AAAC AAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AG AATTG CAT 222 0 

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGC AT AAAG A ATAGATTATT 22 80 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTG C AG CAT 2 3 40 

GGCTATAAAT AAT ACGCCG A GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2 4 00 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 24 60 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACT ATT ACTA TCAATAGCTA GTTCCACACT 2 52 0 



WO 98/18931 



PCT/US97/19588 



684 

ATTGGTTGTA TCTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2 580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2 640 

AAATAGAAAA CCTCGAGCCA GAACACC AAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2700 

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 27 60 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820 

AATCCATTTT CCTTTTTC AG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA AT ACT ATT AG CTGCAATAAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT AC AT AC AC AT 3000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 30 60 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCCTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3 240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3 300 

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3 3 60 

GGTCGGTCTT GATACCAGAA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3 4 20 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3 4 80 

AAGTAAGCTA ACATTATTAT T ATT AAAC AG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT- GGTCACCCCC 3 600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3 660 

AATATTTATG AATAGGGCCA AAGC ACT AAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AG AAAT AG C A AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 37 80 

TTT AT ACT G A TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3 84 0 

GAAAAAAGCG ACCTCTTTCA AGTCAGG AT A GTTTTTCACA ACTTCATTAT AAAACTT T TG 3 900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3 9 60 

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 402 0 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AG AAATG TT T GAGAATCACA G AC AAAAT AG TCATCCTGAT TGGCTGACTG 4 200 

TGGGCTTTC A TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 42 60 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4 320 
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AATAAAAATC AGATAATCCC CGTGAGCCTG CTTCATCCCA TCATTTCGTG CTTGCGACAA 4 3 80 

TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 44 4 0 

CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4 500 

CGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGG 45 60 

C AC AAT C AC A CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4620 
CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4 680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 474 0 

TTTGTCCAAA TCCTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4 800 

AAG GGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4 860 

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920 
AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4980 
CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 
CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5 LOO 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATAGTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGG AT AAACTTC C TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGG CTTCTATGTG 52 80 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 534 0 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 54 00 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA TAAATTCTCA 54 60 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 552 0 

TGCGT ACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT T C AAG AAT AT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCGGTACC 57 60 

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CACTTCTGAG A T AG AC ATC A 5820 

TAGCCGATTG GAATAGGCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCACGACCTG AGCATAATTC TGTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGG AG AG ATT TTGGAGTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG 


GAATAGTTTG 


CGCACAAAAG 


AGTGAATAAA 


TCAAGGCCCA 


ACGAACCTGT 


6120 


TTTTTCCGCT 


CAGCTGGATT 


TTTCGGATAA 


TAATCCAAAG 


GCAAAACATC 


CAAGGCCAGA 


6180 


CCATGTGGCA 


AATCCAAATC 


CTGCTGATAA 


GGCTTGATAC 


AGGTGGTTTT 


CTTGTCACGA 


6240 


ATGGTAATAA 


AAAGATTACG 


ATCAACAAAA 


TCCTTGTGAC 


TCTTTGACAA 


GAAATAACGT 


6300 


TCATCTCCAT 


AACGAGGCCA 


TAATTCTGCT 


AATTTCTCAT 


AATCTTTACG 


AGGCATAAAA 


6360 


AAGTCTAGGT 


CGTCGTCCCA 


AGGAATAAAT 


CCCTTGTTTC 


GAAGGGCACC 


AATAGCGCCT 


6420 


CCGCCACAGA 


GATAACAGAG 


CAAATCATGT 


TCTTTACAAA 


AGGCCACAAA 


ATATTCAGCC 


6480 


ATCTCCAGAC 


TACGAGCCTG 


AATTGCTTTT 


AAATCAGTCA 


TATTGTTCAT 


TATTCTTTCT 


6540 


ATCGTATCGT 


TTCATTATAC 


CACAAACAAG 


GGGTGAAAAT 


CTATTGCAGA 


CTGTAAAAAA 


6600 


TCAAAGCCTG 


ACTGCTATCC 


AAATAGCTAT 


CAAACTTTGA 


TTTTTCTGTC 


TTATACTCTT 


6660 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCTTCA 


CCTTGCCGTA 


GCTATAGGTA 


ACTGACTTCG 


6720 


TCAGTCTTAT 


CTACAACCTC 


AAAACTGTGT 


TTTTAGCAGC 


CTGCGGCTAG 


CTTCCTAGTT 


6780 


TGCACTTTGA 


TTTTCATTGA 


GTATTATCTT 


ATCTTAAGCC 


CATTTGAGCG 


AGCTTGGTTT 


6840 


GATATTTGTT 


TTGATCAACC 


AGCAGGCCCA 


AGCCCCCATA 


AACATCATAG 


GCATCTACCC 


6900 


AGTCACCCAG 


TTCTGGAATC 


GTCAATTTTT 


CAATACCATT 


TTTTGCTCCA 


TCCAAAACAG 


6960 


ATAAACCGTT 


TGTTAGGAGG 


AAAGTATAGG 


GTACGTTGGT 


TGAGGTCATA 


GCAAAAACCT 


7020 


TTCCAAGAGC 


TTCAGAACCA 


GTGAAAAGTT 


TAGTGGGATC 


TTTAATTTGC 


TCTAAAATTG 


7080 


CTGTTAAAAC 


TTGTTGCTGT 


CTTTTTGTAC 


GGCCGTAATC 


TGCCTCATCA 


TCATCACGGA 


7140 


AACGAGCATA 


ATTGAGCAGG 


GTCGAGCCAT 


TCATCTGCTG 


TTTTCCGACT 


TTAATGGTTT 


7200 


GGGTTGGAGA 


CTCAGTCTCG 


GT AGCGT AT A 


AATCATCTCC 


GACTGTAGCT 


TCTGTTAGGG 


7260 


GACGCCCATT 


CAATGTTGAA 


AATTGAGCAT 


CAATCGTCAC 


CCCATCAGGG 


AAAAGCGTGT 


7320 


CAATCGCTGT 


GGCAAAGGCC 


TGGAAATCAA 


CCAAGGCGTA 


GTACTTAATG 


TCCAAGTCAA 


7380 


AATTATCTTT 


CAAGACTTGG 


CGAACCATTT 


CTGCCCCTTT 


TTGCCCCTCT 


TGTTCTCCTA 


7440 


ACTCGTAGGC 


TACGTTTAAC 


TTGTTATCTG 


TCTGTTTTCT 


ACCATTAATC 


ACTTGACTAT 


7500 


AACCATCTAT 


ATAGACCAAA 


TTATCACGCA 


TGAAACTGAC 


TAGCTTCATT 


TTCTTATCTG 


7560 


AGCCCCCGAC 


ATTTAATACC 


ATAATAGAGT 


CAGTTCGTGT 


CTCAACACTG 


TTCTGGCCGA 


7620 


TTCGACCATC 


AGTACCCATG 


ATTAAAATAT 


TAACTCCATC 


TCTAGTGTCC 


TG AC C ATT AA 


7680 


AGACTTCTAC 


TTGAGCTGCC 


CGGGCATCAG 


CAGTTTTCTT 


TGCGCTAGCA 


TCTTGGTAAC 


7740 


CACGCAAAAA 


CATGAATACC 


ATGGCCAAAG 


CCACACAGAC 


CAAAAGTGAA 


AAAATCACCA 


7800 


TAAAAATTCG 


TTTAAGACGG 


AGCTTCCGTC 


TTTTCTTTTT 


TGGAGGGAAA 


GAGAGTGCTT 


7860 
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GTGATTTGGA 


TTGTGAGCGA 


CTCCGGTTCC 


CATAGCTTGG 


TAAGTCAACC 


TGCTCTTCTC 


7920 


TTTCTTG TT C 


CAAGCTAGAG 


CTACTATTTC 


CCCTAGCAAG 


AGTTACCTTT 


TCTTCCAAAT 


7980 


AGGCAAACTC 


ATTTTTTTCT 


CTCTCATTGA 


GATAGTGAAT 


ATTTTTTAGC 


AAATAATCAT 


3040 


AACGCAACTG 


CTCATGATGA 


CTT AAGGG AT 


TTTCTTTACT 


CATCTTCTCT 


CCTTTCCATG 


8100 


GTCTGATATT 


GGATAAATAG 


GATAGGCACC 


CAGAATTTTA 


TACTGGATTC 


CAATCGCTTC 


8160 


T AATTCTT TT 


TGGGCAAAGT 


GGACCAAGTC 


CTTATCGGTA 


TAATCCACAT 


CGATAATGAA 


8220 


AAAGT ATT C A 


CCCAGTGCTG 


TCTTG AGTGG 


AC AACTTT C A 


ATTTTTCTCA 


AGTCAATTCC 


8280 


TCGCCAAGCA 


AAGGTCGACA 


GGGCCTTATA 


AAGTGC AC CT 


GGAAGGTTGT 


CAGGTAATGT 


8340 


CAAGGCCAAA 


CTCATCTTTT 


CAGTTTGTGC 


TTGCAAGGGA 


ATACTAGGCT 


TTTCAGCTCC 


8400 


TAGAACCCAG 


AAACGTGTGA 


AATTGGCTTC 


CATTTCCTGA 


ATATCCTCGG 


CAATCAGTTC 


8460 


CAATCCATAT 


TCTTCAGCAG 


AACTTCTAGG 


TGCAACTGCT 


GCAAAGGGCT 


GGTCTGGATG 


8520 


TTCGGAAATA 


AAACGGGCCG 


CATAAGCTGT 


ACTAGCTGTT 


ACCTCGATTT 


GAGCCTCTGG 


8580 


ATATTGTTCA 


TCGATGAATT 


TCTTTCCTTG 


AGCCAAGGCC 


TGTGGATGTG 


AAAAAATCTT 


8640 


TTCAATCTTA 


GTATGGCCTG 


GAACCACCAT 


CAACTGCTGA 


TGAATAGCCT 


GAACGATTTC 


8700 


TGCTACTGCT 


TGGATGTGAG 


CCTGATGAAA 


AAGATACTCC 


AAGCTTTCAT 


GAACACTACC 


8760 


CTCAATAGAA 


TTTTCAACTG 


GCACCACAGA 


ATAGTTCACT 


AATCCTTGCT 


CATAAGCCTT 


8820 


GATGACATCT 


GTAATGTTGG 


CAAAAGCCTG 


CAATTCCTCA 


TGACGAAAAG 


CTGTCTGCAC 


8880 


AACGTGGTGT 


GAAAATGATC 


CCTTGGGACC 


TAGATAAGCA 


ATTTTCATCT 


TAGTTCCTCT 


8940 


ATAATTTCCT 


CTGGGCTTAG 


CTTGGTCACA 


TCCAAAACCC 


GACTAGCCAC 


TTCCTCATAC 


9000 


CAAGCCTGTC 


TTTCTTGGAA 


AATAGCTACT 


AGTTCTTCCT 


TGCTATTATT 


T AG AAAAAGC 


9060 


GGTCGCTGAT 


TGTCCTTATC 


AGCTGCGATA 


CGTTGGTAGA 


GGGTTTCAAA 


ATCTGCTCTC 


9120 


AGGTAGATGT 


TATCTGTATT 






TTCTCTGAGA 


AATAACCACT 


9180 


CCTCCTCCAG 


TTGACACGAC 


TTGGTCTGTT 


TGTAGTAAAT 


CAGCTAGGAC 


TTCTGATTCT 


9240 


ACCTGACGAA 


AGGCTGTTTC 


TCCCTTTTCA 


GCGAAAAAAT 


TCGCAATGGA 


CATACCTAGG 


9300 


CGATTCTCAA 


TCAGAGCATC 


CATATCAAGG 


TAATTAGGGT 


CCAAGCCTCT 


TGCAATAGTC 


9360 


GATTTTCCAG 


CCCCCATAAA 


CCCTAATAAC 


ACCTTAGCCA 


TGAATCAAGC 


TCTCCAAATC 


9420 


ATCAAAGAAA 


CTAGGATAGC 


TGGTATTGAT 


GGCTTCTGCA 


CGGTCAAGCT 


CCACCTCTCC 


9480 


ATCTGCAACC 


AAGAGGGCTG 


C GAT AG CTGT 


CATCATGCCG 


ATACGGTGGT 


CACCAAACGT 


9540 


AT TG ACTCT A 


GCACCGTGAA 


GAGCTGATTT 


TCCTTTGATA 


ATCATCCCAT 


CTGCCGTAGG 


9600 
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AGTAATATCT 


GCTCCCATAC 


TATTTAAGGC 


GTCTGCCACA 


ACCTGAATAC 


GGTCTGTTTC 


9660 


CTTGACCTTG 


AGCTCCTCAG 


CATCCTTGAT 


AACTGTTACA 


CCTTGGGCTT 


GGGTCGCAAG 


9720 


CAGGGCAATA 


ATGGGCAATT 


CATCAATCAA 


TCGTGGAATC 


AAAGCGCCAC 


CAATCTCTGT 


9780 


TCCTTTCAAG 


TCAGAAGACT 


CAACAATCAA 


GGTAGCAGAT 


TTAGCGACTG 


GATCGATTTC 


9840 


AGTTATTTCC 


AATTTTCCAC 


CCATGGCACG 


AATGACATCA 


ATAATACCGG 


TGCGAGTTTC 


9900 


GTTGATCCCC 


ACATTCTGCA 


GCACTAGACG 


AGAATTTGGA 


GCAATCAAAC 


CTGCGACTAA 


9960 


CCAAAAGGCT 


GC ACT GG AAA 


TATCTCCTGG 


TACGACCACC 


TTCTGTCCTG 


TCAATTTTTG 


10020 


TGGCCCCTGG 


ACTGTGATTT 


TCTTACCATC 


C AC ACTT AAA 


TGACCACCAA 


ATTGTTTCAA 


10080 


CAT AT CTT C A 


CTATGATTAC 


GGGTGTACTC 


TTTTTCGATA 


ATAACTGACT 


CCCCCTTAGC 


10140 


TTGTAAGGCT 


GCAAACATCA 


AGGCTGACTT 


GACTTGGGCA 


GAGGCAATTG 


GCAACTCATA 


10200 


ATGAATAGGT 


CTTAGGTTTT 


TCGTCC CTTT 


TAAGCGAAGG 


GGAGGCAAGT 


CTCGTTCAGT 


10260 


TTGCCCTGAA 


ATGCTGACGC 


CCATTTTTTT 


CAGTGGAAGG 


GTCACACGGT 


CCATAGGACG 


10320 


TTTGGAAAGA 


CTATCATCTC 


CAAACATCTC 


TACTTCGAAA 


TCTGCACCAG 


CAAGGACACC 


10380 


TGAAATCAGG 


CGAATCGAGG 


TGCCAGAATT 


TCCCATATTA 


AGGGCATTTT 


GTGGCGCTTT 


10440 


TAAGCCAGCC 


ATGCCTACAC 


CTTGAATGGT 


AATAACCCCA 


TCTTTATCCT 


CAATTTCAAC 


10500 


ACCAAGGTCA 


CGAAAAACCT 


G CATGGTCG A 


AAGAACGTCT 


TCACCTCGCA 


G AATAT CAT A 


10560 


AACCTTCGTC 


TCACCCTCAG 


CCAAACTTCC 


AAAGATAATG 


GAACGGTGGC 


TGATAGACTT 


10620 


GTCACCTGGG 


ACGCCGATAC 


TACCATGTAA 


ATGGCGAATG 


TTTGTTTTTA 


GTTTC AT ACT 


10680 


GGACCTCATA 


CTTGCAATAC 


TTTTACCTAT 


TTTATCATAA 


AAAGCCAGAA 


ATTCCTT AAA 


10740 


AATTCCTGAC 


TTTAGGATCG 


TTCTTTTCTT 


ATTTCAGCAA 


TTCTGAAACT 


GGTTCAAAAA 


10800 


CAATTTTTTC 


AATATCAGAA 


AGGTAAATGG 


CCAATTGTTG 


TTGCTTGGTA 


AAGAATTCTG 


10860 


ACAAGAGGCT 


ATTTCCTTGA 


ATCTGTTTAC 


CAAAGCCTTC 


CATCTTAGCT 


TGGAAGGACG 


10920 


C ATCTGG CAT 


TTGACCTGTC 


TGTGCTAGTT 


TTTGAATTTC 


CTCTTGAAAG 


GCAAGATAAT 


10980 


CTGTAAAGAT 


TTTGCTTGCC 


TCAGCATCTG 


CTGCAATCGC 


ATCTTTAGCT 


G CTTT AAC AG 


11040 


CCTTGTATTC 


TGGTAATCCG 


CGTAGACCGC 


GACTGAGTTC 


GTTTGC ACT A 


TCGTAAATAT 


11100 


TTGACATGTT 


CTTCTCCTTA 


TTTGATGACG 


ACTGT AT ACT 


CAGTATTTTC 


TGTTATGAGA 


11160 


TGCTCAGCTC 


TTTCCAAGTC 


TTGAGCATTT 


TTAAATGAAA 


TTTGTAGGAT 


TCCGTGAATA 


11220 


TCCTCACCAT 


TTTCCTCGTT 


GATGTGGATA 


TTAACCAAGG 


AAGTTCCACG 


TAGCAGTTCC 


11280 


AAAATCCGCA 


GGATGACATC 


TTCTTCATCA 


GGAACGTCAA 


CATAGAGGTC 


GTAAGAGCTA 


11340 


TCCACACCAC 


CACGCTTATG 


GATTTCCATG 


GTCTGGCGTT 


GTTCACGCGC 


TTGGTTAAAA 


11400 
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AAGTTCCAAA TTTGCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCCC TTCCAAACGT 11460 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGC GTCATATCTC GAAAACCACC TGCCGCAAAG 11580 

CGCCTTGCCA TCTCATGCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG AC CTGAAAG C AGATCCTTCA TTTCCTTAAC CGTGTCCTGA 117 60 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGCCATTTT CAAAAAGATT GACATCTGCC 1182 0 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCGA ACCAGCATCT 1194 0 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGC C AACT CCTTAATGAA AGCAATAGTT 12000 

TCTTTGATTG GCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT ACCAAAATCA 12060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 12120 

TTATAACCTA AAATTT CAT A ATCTGGATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12240 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT GACCATTCCT 12360 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCCTGGGTT 12420 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGGTTTCAT GACCCCACGA 12 4 8 0 

ACAACGATGG GTTGCCCATT AG T CAT AC C A CCTTCAAAAC CACCTAGATT ATTGGTACGG 12 54 0 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12 600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 12720 ■ 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC ACGTTTGATT 12780 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TG AC AAT AG A AACTT C AG AC 12840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12900 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12 960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 13020 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 13140 
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TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13 200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13 260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC ACCTGTCAAT 13 3 20 

GG AAGTCC AG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TGATTCTCCT 13 380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTC C AG A 13 440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13 500 

CCACGCCCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13 560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC AT CTCTGT G A TAGATTGGGT AATGCCAGCT 13 620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTCTA CCATTCCCAT GGCAACAGCC 13 680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCT G AC AAGAATGTTC AATCAAGGTC TCTGCATGTT C C AAAAT ACT CTCAACAGAA 13 8 60 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTCGATCCT CAATCAAGCC ATACTTGATA 13 920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13 9 80 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTT AG CAAA TGGTGTATTA 14 04 0 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG CCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14 2 20 

T AG ACTTTC T GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14 2 80 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAG AG AGGCT 14 340 

ACATGGTTAT CTGTCACAAT CACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14 4 00 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 144 60 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 145 20 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGC AGCTTG A 14 580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCA TTTC AAA 14 64 0 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14 760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14 8 20 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTACGTAAG GTCTTGTTTT TTCCATAGAA 14 8 80 

ACGGAACGAA CAAAGACCCA AATCTGACTG ACGCCATCCA AAAT AG CCTG TGCCAAGATT 14 94 0- 
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GATTT AGCCG 


CACCACCTGC 


ACCCAGCAGG 


GTCATCTTTT 


TACCTGAAAT 


TGTAAAAGAA 


15000 


GGCAAGCACT 


TAAAAAATCC 


CTTGCCATCT 


GTATTATATC 


CAATTAAATT 


CCCATTCTCA 


15060 


TT G AC AAC C G 


TATTAACCGC 


ACCAATCAAG 


CGCGCTTCAT 


CGCTCAGCTT 


ATCCAAATAA 


15120 


GGAATCACCT 


GCTCCTTATA 


GGGCATGGAC 


AGATTGATGC 


CAAACATCTG 


GTAGCGACGA 


15180 


ATATTGGCCA 


CTGTTTCTAC 


CAAGTCACTC 


GCTTCAATCT 


CCCAAGCCAC 


ATAAGCACCG 


15240 


TTf.GTAGCTG 


TCGCCTCAAA 


GGCTCTATTG 


TGGATGAAGG 


GAGAAATAGA 


ATGCTTAATA 


15300 


G GAT TGGC AA 


CAACTGCAGC 


T AAACGTGT A 


TAGCCATCAA 


GCTTCATCCA 


AAATCTCCCT 


15360 


G ATT TTTTT C 


ATGCTAGCTA 


GAGAAATCTG 


CCCAGGCGCA 


CTAACCTCAT 


CCAGACTGGC 


15420 


AAAAGACCAA 


CTCGAACCAG 


TCACATCCGC 


AGTGATACGA 


GAGACCTTGC 


CCACCTTACC 


15480 




GTCACATATT 


CCTGTTCAGG 


ATTGAGGGTT 


TTAAAGCCTC 


GTGTATAGTT 


15540 


CATCAAGTCT 


AAGACATCCT 


GCTCCGTGTG 


AGCCATCACC 


GCAACCTTAA 


CAAGTTTTGG 


15600 




GTCAACTCTG 


ACAAGATTTC 


CATCATGTTC 


TCAGGTGTTT 


CTTGGAAATT 


15660 




AAAACAAGAT 


TTGGGAAGTC 


CACCATTTCC 


TCAAAAACAT 


CCTTGT AG CT 


15720 


ATAflTArTCA 


AAATCAATAT 


AGTCTGGTTG 


ATAGAGTTCC 


GCAACTTCCT 


TGATTAGATG 


15780 




TCTGGAGAAA 


GGTCGATTTC 


TCCACCTTCG 


GAGCCAGTTC 


GT AG CGTGAA 


15840 


AACCAACTCA 


CGGCCTGCGA 


ATTTTTCAAA 


AATGGCTGGA 


GCTACCTGCA 


AAATCCCTTC 


15900 


TTTAGGCAGA 


TAGTCGGCAC 


GCCATTCAAT 


GATGTCGGCA 


TCCAGGTACC 


TCGTGGCATC 


15960 




GCCTCCTCTA 


AACTTCTTGG 


CATTACTGAA 


ACGATTAATT 


TCATTTACTA 


16020 


APfTTCATAC 


TAATCACCTT 


GAGGTAATTA 


CTACTTTCAT 


CTTTTTTATT 


ATAGGCAAAA 


16080 


TCTGCTGGAA 


GACCATATTT 


GTTTAAAATC 


TGGTAACTTC 


TTCCTGCAAA 


ACCTTTATCA 


16140 


ATTTGTTCTG 


TAAATTTCTG 


ACGGGAAACA 


TTGGCAGCAT 


TGGTACTGGC 


AATGATAATC 


16200 


CCTCCCGGAT 


TTAAAATCTC 


AAGACTCTGG 


GAAATCAACT 


TGTGATAATL 




16 2 6 0 


GAGAAAGTTT 


GTTTTTTATT 


CCGAGCAAAG 


CTACGCGGAT 


CTAGGACAAT 


CACATCGTAG 


16320 


GTCAAGTCTT 


TGCGTTTGGC 


ATATTTGAAA 


TACTCAAAGA 


CATCCATGAC 


TATAAAACGA 


16380 


TGCTCGTCTG 


TGCTGAGCCC 


ATTTGCCTGA 


AAATGCGCTT 


GAGACAATTC 


TCGTGAACGT 


16440 


TTGGCTAGAT 


CAACAGAAGT 


TGTATGGCTA 


GCTCCTCCCA 


TGGCCGCAGC 


TACTGAAAAA 


16500 


GCCGCTGTGT 


AGGAAAACAT 


ATTGAGTAAG 


GATTTACCCA 


TAGCCAAGCC 


GTCAACTAAA 


16560 


CTACCGCGAA 


CCTCATGCTG 


GTCTAGGAAA 


ATTCCTGTCA 


TCAAGCCATC 


ATTCATAAAG 


16620 


ACTTG AT AC A 


GGACACCATT 


TTCTAAAACA 


TTGAAAAAGT 


CAGGTGCTTC 


TTGACCATAA 


16680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTCATTGAA GGCCACCTTG 169 80 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCrr CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 17 2 20 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGCTATACT AGACTTCCTG CAAAACTAGG 17 280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCG T CTTCG AGG A GCATTTCTTT 17 34 0 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CACGTTAACT GTGAGATTAT 17 400 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT T AGTG AC AAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 175 80 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATGTTC TATCAAAAAT 17 640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TC C AAAT AAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 177 60 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17 820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17 880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACCATAA 17 940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGG CCATC AGGATTGCAC TAT AAAT AAA CAATAAAACT ATGGCGAC f A 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AG ATTTTT AA ^8120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGACCAAGC AAAGTCAGAA 1824 0 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 13300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 183 60 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAACG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGCAA CTATCTCAGA ATCTCGGATT TTGGAAATCA 184 80 
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GACCGATCAT CATGAGATAA GGAAGGAAAG C ACTTG T AAA AAGCACTGTA ATCACGCCAG 18 540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 1360 0 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAACC AGGAAGAGAG 18720 

TTAGTCCTAT CCAAGCGAGC AG AC CC ACT A AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18 840 

TACTTCCAAA AGTCGACTGA AGAAAGAACA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 189 60 

AATGCACTGC CACTACCATC ACTAGAGCCA C AG AG AC AAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACC AG AAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 1914 0 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AAT C ACT AGT AAAAT AGCAA 192 60 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT AT ATT ACT AA 19 3 20 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGT AAG AACT TAAGACTCTA GCGGAAACAT 193 80 

TTACTTTTTT 193 90 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18436 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGC 60 

AAAAT ATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACACGACTTC 12 0 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGC AAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTT AAAAT CTTGGGC AC A GCTAGAAAGT GATGATTTCA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT .ACAGGAAATG 360 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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GTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC ACCAGACTCT ACTGACCACT 4 80 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGCGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC GTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 7 80 

ACCCTAAAAA TGCCAAAATG GTAGATGTCT CTCAACTTCA AAACATGGAT TACGGTTTCG 84 0 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TGCTCTAGCA CTTAGCTGGC 900 

TTCCTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 9 60 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 10 80 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 1200 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 132 0 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTC AGTC CT ACTACCGTTT CTC GGGTT AT CAATAAAAAA 1500 

GGGTATCTAT CTGAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 15 60 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 162 0 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTCATTG AT AAATTGG A ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 174 0 

GAATACATCG AAATGTTGGA AGCCAATCAC GTGGACGGCA TCATTTCTGG TAG TC AC AAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 18 60 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTCGGGTTCT TGCTGCCCAA 1920 

ACCTTGGTCA AGACAGGTGC CCACTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 1980 

TCGCGAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2 04 0 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 2 22 0 
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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCT ACT A 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCAGT 


TACGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC 


TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGGC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGGATGC 


2580 


CCGTACTTAC 


CAAGCCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


GATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT 


GGGCAGGCTC 


ATCTCACCCA 


27 60 


CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


AC CTTG AT AG 


CGTCCATCTC 


2 82 0 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGGCACGCCA 


GTCATTACCA 


GTGGCAATCA 


2880 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


294 0 


CCTGCGCAAA 


CTGACTAGCC 


AACG C AATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TGATCCTTTT 


3 000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


AGCTTGCACT 


CACCAGAGAA 


TCGGTCGCGT 


3 060 


AGTTGGACAG 


GATTCCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTG GTTT C A 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


CAGAAAGTCT 


GGTTCGCCTT 


TTATCTGCTC 


GACATGCAGA 


TCACGCGCCC 


3 240 


CACCTCCACG 


TTTAACGATA 


GAAGGATAGG 


CTTG ATTGG C 


AAGCTCCAAG 


AGCTCCGCTT 


3 300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTGATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTGACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTGC 


TCTGCCACAG 


TCACGCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT 


CT CAGG ACT A 


AGGAGCGCCT 


GAGCTTTTAA 


CAGCTCGAGG 


CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAACG 


3720 


GCGTTGGTGG 


TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTCTGCAAC 


3840 


G G AT AGGGC A 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3960 
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GGCGATTTTA 


TCCCCAGCTT 


TCAAGCTATC 


TGTATTTTCC 


AAGAGAGACA 


AAAGTCCAAG 


4020 


GAAAAGTGAA 


CCTGTGTAGA 


TATTCCCCAC 


CTTTTCACTC 


TAGAGAATAG 


ACTGGTCAAA 


4080 


ATGCTTTT GT 


AAGAGGTCTT 


TTTTCTCTTG 


AGGCAGGCTC 


TTATCCATGA 


TTTTTTTCAA 


4140 


GCCTTTTAGC 


GCTAATTTAG 


GATAAGGCAA 


GTGGAAACAA 


ACAGCCGCAA 


AATCATCCAA 


4200 


AGTAAGCTGG 


TAGCGTTTTT 


GATATTCAAG 


CCAAGTCGTT 


TTC AAACT AT 


CCAAGTATTG 


4260 


TTGGGTAGAA 


TAGACACCAT 


TTACATAAGG 


AGTTGTCGAG 


TAATTTGGTC 


GCCAGAAATC 


4320 


CATGATGTCA 


CGGGTCTGAG 


CTACATTGTC 


ATT AT T AAAG 


GCCATCATGC 


GTGGATTTTG 


4380 


TGTAATCAAC 


AT AGC T AC AC 


TTCCAGCACC 


TTGAGTTGGT 


TCTCCTGGAG 


TTTCAATACC 


4440 


GTATTTGGCA 


ATATCACT GG 


CAATGACCAA 


GACCTTGGAC 


TCCGGAGAAT 


TTTCCACATG 


4500 


CAATTTGGCA 


TAATGGAGGG 


CAGCAGTCGC 


TCCGTAGCAG 


GCTTCTTTAA 


TCTCGAAACT 


4560 


ACGAGCAAAG 


GGCTGGATGC 


CCAGCAAGCC 


ATGC AC AAAG 


ACGGCCGCAG 


C CTT ACTCTG 


4620 


GTCAATTCCT 


GACTCGGTCG 


CCACAATGAC 


CATGTCAACT 


TCTTGTCTTT 


CTTGCTCAGT 


4680 


TAAAATAGAG 


TCACTAGCAC 


TGGCCGCCAA 


GGTCACGATA 


TCCTCAGTTA 


GGGGCGCAAT 


4740 


ACTCAATTCC 


TTGAGTAAGA 


GTCCTTTACT 


TAATTTTTCA 


GGGTCAATTC 


CCCTCGCTTC 


4800 


TGCTAAGTCT 


TGTAATTTCA 


AGACATATTG 


ACTGGTCGCA 


AAACCAATCT 


TATCAATACC 


4860 


GATTGTCATA 


TTTACCTCTG 


TTTT AT C ATT 


CATGTAAAAA 


ATCGTTCTAT 


ACT A TTTT AT 


4920 


CACAAATGGC 


AGT AAAAG AG 


AGAAAAAAGA 


CTTGATTCAC 


caaAtcaagc 


CTCTT ATT GG 


4980 


TCATCATTTT 


AAAGAATGAT 


TAGTTG CT AG 


AGAGTTCACC 


GAT AT AAGT A 


GCTTTATAAG 


5040 


CTCCATTCAC 


AGTTATCAGC 


TCCTGGAGGA 


TCAAATTTCC 


TGAGTAAGTC 


CTTCCCATCT 


5100 


CATCTACAAA 


TTTTTGATAA 


AACTG ACT GG 


TCGGAATTTC 


TCTGACATCC 


TTATCAAATG 


5160 


TCTTATCAAC 


TGTT TT ACT A 


ACCTTCTCAG 


CAATCAATTG 


ATGCTCTTGC 


CATCCACTTT 


5220 


G AAACT CTG A 


GCCCGAACTA 


GAAACCATGA 


CTG GG AT AAA 


CAACAAGGTC 


AGTAGATTTA 


5280 


CAGACAATAA 


GGAAAGTAGT 


AG ACTTC CTG 


CAAAACTAGA 


ATCCTAGTTC 


ATGATTGATA 


5340 


AT AC C AG CAA 


TCAAATTCAT 


TCGTAATCCG 


AAGCGTTTAC 


GATGATTTCG 


ATAGGTTGTT 


5400 


GAAAACATTT 


TAAACGTTTT 


T ACTTTGGC A 


AAGATGTTCT 


CAACCTTGCT 


TCTCTCCTTA 


5460 


GAT AG CGC AT 


GGTTACAGGC 


TTTATCTTCA 


GCTGTTAGCG 


GCTTGAGTTT 


GCTGGATTTA 


5520 


CGTGGAGTTT 


GTGCTTGAGG 


ATATATCTTC 


ATGAGCCCTT 


GATAATCACT 


GTCAGCCAAG 


5580 


ATTTTACCAG 


CTTGTCCGAT 


ATTTCTGCAA 


CTCATTTTGA 


ACAACTTCAT 


ATCATGACTA 


5640 


TAGTTCACAG 


CGATATCCAA 


AGAAACAATT 


CTCCCTTGAC 


TTGTGACAAT 


CGCTTGAGCC 


5700 


TTCATAGCGT 


GAAATTTCTT 


TTTACCAGAA 


TCATTCGCTA 


ATTGTTTTTT 


AGGGCGATTG 


5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA ACCTGAGAGG AGTTCTTGAA 58 20 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG CATTAAGTTC 5 880 

CTTTCGTGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGC AC AT AT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GAGATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTGCCAAA ATGGCCAAGA TAAAGACCAC AGCAGGTGTC TTATAAAGAA TACTTAAAAT 6 2 40 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGC 6 3 00 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 6 3 60 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 6420 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 64 80 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GCCAATACCT GGTACAAACG 6600 

GCATGACCGC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 672 0 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 67 80 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 6840 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTCCC CCCCAACTAG CTGACGAGAA ATCTGGTTCA 6960 

CATCGC AG A C TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 714 0 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TAT C AC AAAT CCGGACTCAA 7 260 

AAAAAATCTT TGCCATGAAA TCATGACAAA GAT TG ATT AC TCATTTTGAT TATCCATCTG 73 20 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7 3 80 

ATTGAGGTCT ACCTTTTCAC CTGCTCTAGG ACTTTG TTC A ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7 500 
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TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7 56 0 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTG AG? A GGTTT ACT C A CATCATAAGT 7 62 0 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTSGCTTTTT TAGCTACTGT 7 800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC T CTCATTCG A CTCTTCTTCC TCAATTTTAA TCAAATTATC 7 920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7 980 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTC CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGG AT AGT AT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 8 34 0 

GTTTTTGATT GATTTGTGTT CTGTTTG CGC TTGAACCTTA GGAATAGATG TCAAGGTACT 84 00 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TCCCTTGCTC GTTTCATCAA AGATTAACTT 84 60 

ACTTTCATTT CTACGATTGT AGGACAACCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8 580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATCCTGGAG 8 640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATGGC AT AG A TATCACTCTG CACAGTCCCC TTCGAACCAC GCGCCTGCTC 87 60 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTGTCTCTGC 8 820 

AAAGGCTACA GCAATCCCAA AGTCTGTG AC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8 880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT. 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGG T ACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC CAACGATATG AGG ATGGT CT AGATCTGCCA TAGCTCTCGC 9120 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 924 0 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CG ATTTGG AT 9 300 
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CATTCTGCAT CCTCCTCGTT CATAGAAACA AGGGCAACCG TAATGTTGTC TAAACCTCCT 9 360 

GCATTCTTAG CAAAACGAAC AAGTGTCTCC GTTTTATCTG CTAAAGGAAT ATCACTCGTT 9 420 

ACAATATCAC GAATCTCACT CCCTGAAATC ATGTTGGTCA AGCCGTCACT ATTGAGCAAG 9 4 80 

AGATAGTCAC CTGACTCAAG GATAACTGTC CCAAAATCAG GCTGAATTTC ATCTTTTTGC 9 540 

CCAATAGACT GGGTGATAAT ATTTTTTTGC GGATGAGCTT CTGCCTCTTC TGGTGTCAAT 9 600 

TGACCAGCCT TGAGCAATTC ATTAACCAAG GAATGATCGC TCGTCAACTG ATGGTATTCT 9660 

TCTCCACGAA TCAAGCCGAT ACGCGAATCA CCAATATGAG C AT AG AT AG C CTGATTATCA 97 20 

ATAATAGCAA GGACTTCCAA AGTAGTTCCC ATGCCTCTGT AAGCTTCATC CTGACCAAGC 97 90 

TGGTGAATCT TTTGATTTTC AATTTCTAGG TAATGGGCCA ACCATTCACG CACTTCATTG 9840 

ACTGTATCGA TCTGGGTATC AACCCAAGCT ACACCCAGGT CTCTGACCGC CATTTCACTA 9900 

GCGATATTCC CTGCCCGATG ACCTCCCATC CCATCAGCTA AAATAATCAT GGTACGTCCA 9960 

GCTCTATTGA CATAGTGCTT GACATAGTCT TGGTTATTTG TTCGTTTCTG ACCAACATCT 10020 

GTTAATAATG AAATTTC CAT GTGTCAGTTC CTTCCTAATC CGATATCTTG CGAAATTGAC 10080 

TGATGAAGAA TCCATCACTT CCATACAATT CAGGTGTAAT GAGGATACAG CC GTCTTTC A 10140 

TGATATCCTT ACATTCATGT T CTAGTTTT A CCTGCTCGAA CTCGGGATGA CTCTCTAAAA 10200 

AGGCCTTAAC GACTTGAAAA TTCTCCTCTG AGACGATAGT GCAGGTGCTA TAAGTTATTA 102 60 

TACCACCTTT GCCTAGTATT TGACAAACAC TACCTAATAT TTCTAACTGA ATTTCCTGCA 10 3 20 

AGGACGCGAA ATCTGCCGTT TCTTTATTGT ATTTGATATC TGGTTTTCGG CGCAAAAGAC 10 3 80 

CGATTCCTGA ACAAGGAGCA TCCACCAAAA TCTT ATC AAA GGAATCCTGG TCAAAAAACT 10440 

CATGCACCTT TCTGGCATCC AAT TTTTG AG TTTGAACCCG ATCTTCAACT CCCAGACGTT 10500 

GGGCATTTTC TTGAATTAAA TCCAACTTGT GGTCGTACAA GTCCAGAGCA GTAACCTGAC 10560 

CTGTCGTAAG ATAAGAGGCT ATATGGGCTG TTTTCCCACC TGGAGCCGCA CAGGCATCAA 10620 

GCACTCGCTC ATCACCTTGT AAATCAAGCG TCGGAGCAAC CAGCTGACTG GACTCATCTT 10680 

GGATGGTAAT GGCTCCATCC GCAAACAAAT TATGCCCTGC AAAATGCCCC TGCTCCTTAA 10740 

CCAGACCAGT GGTTGCTAAA AGGGAATTAT TCGCCTCCAA CAAGGCTTGG ATTTCCTCTT 10800 

TTCGACTTAG GTCTGTTACA CG AAT ACT GG CTTTGTTTCG CACTAACAGG CTTTCAAAGA 108 60 

TGGCTTTTGC TCTCTCCTCT CCGTATTCTT CCTTGAGTTT GGCAACTAGC CAAACTGGGA 109 20 

GAGAATAGGC AATGGAGTCA CGCTTGTTTT TTCGCTTGAT GCTAGCAATA TCTGCCCAGC 10980 
CTTCACGCAA GATACGGCGA AGGACAGCGT TGACCAATTT TTCACTGCCT TTTTTACGGA . 11040 
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GTTTGGCCAA TTCCACTCCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC 11100 

GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGCTCTCTGT 11160 

CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 11220 

CCAGCTCGGT CACTAAGCCC TTGTOTGCTG CCAAAAGTTG ACTTCCCTTT AG ATGCTT AT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 11400 

ACTCCGTTGA CGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 114 60 

AGGG AT AG AG CCCCTTCAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AG AGAGAATC 1152 0 

TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGG CTT CAT AAATCTTAAA GCGGTCGCCC 11580 

TTAAGGAAAG TATCGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 11640 

CGATTGGTTT TGTTCCAGTC C AGTTTTT CT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 117 60 

TCCAAAAGCA AATCACGACC AACTAGCGCC AATTTTTCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 11880 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAACGCATA ATGGATAGGC 1194 0 

GCACCACCAC GGTGTCTAGG AAGG AGGG AG GCATGAACCT TGACAGCAAA GTCCATGCTA 12000 

TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12060 

AGCTTCATAA GATCTTCCAT CT CTGGACTT CCACATAATT TTTCAGGTTG GTACATAGAT 1212 0 

AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGGTTTCTT GG AT AACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 12240 

CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA AG ATT AGTTT TGTCATATCT 12300 

TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATCC TG AG AC GG AG CTCACTATTT 123 60 

TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 124 2 0 

CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTG GG TACGGGCAAT CGGTTTTGGC '.2 4 80 

GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12 540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12 600 

TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCC 12660 

TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAC 12720 

ACTTGACCTG CCTTTTCAGC CCGACCTGAT CGACCTGCCA CCTGAGTCAA GAGCTGGAAG 127 80 

GTTCT CTC AG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG TATCCGCATT T AG AACTCCG " 12840 
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CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14 64 0 

CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14 700 

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 147 60 

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTCTTT AGCAATTTGC TCTTTGATTT 1482 0 

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14 880 

GTTTTCCTTC TGCATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14 940 

GAAGAGTTGA TTTTTCAG AC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060 

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT T AT CGGGT AG 15120 

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCC TTGGC CTTTTCGATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA C CAAGCG AT C TTGCAATTCA TCCAAATCTG 153 60 

GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTG ACCTG A AGAGCACCCT 154 20 

CAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGCTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15 720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 1584 0 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACGGTTAC CTTG A TAT TT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960 

TGTGAGCCAA GATTGTGACT TTGTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCC AAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 1614 0 

GGGCTGCGCT C AAGGC AT CT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTT ACGGG 16260 

CCAATTCCAT AC CG ATTTC A ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 163 20 

TATCGTGAAG GAATCCAGCA CGACGCGCAA . GAGCCGC ATT TTCACCAAGT TCGCTCGCCA 16 380 
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TGATACCAGC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 164 40 

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT GGCGCACCAA 16 500 

TTTCATAGGC AGCAGCGTCA CCGTATTCAC GAATCTTATT GTCAATCTCT TGACGCTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16620 

TAGTCATACG GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC AC C ACTTCTG 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGGTACGA ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTG 16800 

TTGACTCCGC T AC AT ATT C A CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860 

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCAGCTTC GCGAATGCGA CTGGCAATCT 16920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AGATAATATC TCGTGCTTCT GCCTGAGACA 169 80 

GCGCACCAAT ACGCTCTAGT TCTGCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGGTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGGCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 172 20 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 172 80 

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTC AGCTTGTCCA CGTAAATTAG 17 340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 17400 

TAGCTGAGAT GCTGACATAT CCAATGACTA AACCAATGAT CACGGCAAAA ACAGCAATCG 174 60 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT GACATACATT 17 520 

CTTTTACATT CT AC CAT AAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAC 17 580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17 64 0 

CTTGAGAAAA AGCCTACCTT TCAATAGACT TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17 700 

ACGCTATCTC CATCCATCAT ATAAATCAAG CGATTTTCTG CATCAATACG CCGTGACCAG 177 60 

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 178 2 0 

TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG 17 880 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTG AG CA GC ATTTCTT A CTCCTCAATA 17 9 40 

ACATGGACCT GAGTACTTCC ACCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18000 

AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTCGATAC TATCCCACTC A CTCTTTG AA 18060 

AGG A CT AC AA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 18120 
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TTTACCTTCT TCATCTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG G ACTG CTTCC 18180 

ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 18 300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 18 3 60 

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGCTAGACT 18420 

TAGTTTCTTT CTATTC 184 3 6 
(2) INFORMATION FOR SEQ ID MO: 88: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7001 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG GTATTGAAAT 60 

AAACG TCCT A GGTATCTTTC TCACTCTATC TGACTTACAA GGGAAAACTC TTTTCGAGAC 120 

ACAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 180 

AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 240 

A CCTGGAC AT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 300 

TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAAT AAT AT 3 60 

CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 4 20 

TTTCCTACAC GCTGGATTAG G TATTT AC AC TTCCTTTTTC ACAAAAGAAA AAAT AG G AGC 480 

CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 540 

TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGCATGCTT CGTTAATCAA 600 

ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTGTAAAGAC 6 60 

TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 720 

ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 780 

CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 8 40 

T CAT G AAGT C AGGGATAAAA TCCAAGACCA GCTCCACTTC CTTCCCTTTA CTCGTAATAT 9 00 

AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGACCTT GTGCATTAGC 9 60 

TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 102 0 

T ATT AG AAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 114 0 

ATTTATATTA AAATAACTTT TCTTCTAGCT GC ATTTT ATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 1260 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGC TC AAA 1440 

AACAATCAAT TG ATGCT AAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAG 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATC CCT AAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 174 0 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 
TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 
GTG CTTTGG C ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CAT ATTT AGG AACAGGATAA 19 80 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTT AAT AAA C 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TACT AC AAAC ATTATCATAC 2 220 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTC AT 2280 

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2 34 0 
AGCAACAAGA AGATTTTCAC TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 
TTTTGAATAT AAACTACAAT AAT AT AAAC T AAATTTTATA GGAGGAAGAC AATGGATTGG 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 252 0 

TATTTGCGAA AAGTT ATTTT TG AAG ACT AT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2580 

TTGCTAGACT CCAAAGAACT AAC CCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640 

GAGCATACTC CAACTCATAA AT AT GTG ATT T C ATT AAA T A AACCTGCTAA GTTAACCAAT 27 00 

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 27 60 

AAATAAGGAA TT AAT TC AG G AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTT ATGT 2820 



2400 
2460 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTT GAT GCTTTTATGG CTCTATAATA 2 880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 294 0 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3 2 40 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3 300 

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TG AC CG AT AT 3 3 60 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 34 20 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTCG GACGTTGAAA CAGTCCGGCG 3 4 80 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3 540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3 600 

TAT GAT AG AG CCGTCCGATG TCGCGTCAAA ATTATTACTA TGGATATGTT TAGTCCTTAC 3 660 

TATCACTTAG CTAGACAACT TTTC CCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3 72 0 

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3 7 80 

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3 840 

CGTAAACTCA GCG AT AAAC A TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3 900 

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3 960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4 020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4 080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4 2 60 

CGACCTTACC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4 3 20 

TAG CTTTCCT TTCATTTCTT TTTATTTCCC TCGT AGT AAA CGTGCTAGCT T C C AC AAAAC 43 80 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4 4 40 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4 500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC G TTT ATT AAA 4 56 0 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4 620 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 4 680 

CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTT AG C TGATCCAAAT GGCGGAAAAA 4740 

G AC ATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTGAG CGAATCCCTG TACCATCCAG 4 800 

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4 860 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 4 920 

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4 980 

GCGATTTTGC GAAGCCAAGC GTTGATTCTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATCCATT TCACAAATGT AGCATGACCT TTCACCTGTC CAACAGACTG 5100 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 52 20 

CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 5 2 80 

CTCCTAGTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTC 5 340 

CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 5 400 

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 54 60 

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 5 52 0 

AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAATTCTG 5 5 80 

CCGTTGGCAA TCCTCT GTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 5 64 0 

GCGAAGCCCC TTCTAACAGC CCTGCCCCTT TCTGAAGAAT CTCCTCACTC GTTCCGACAT 5700 

ACATCGGAAT TTCCTG ATC A ATCG CAGGAA TTTCCACATA GCCAATCCGC TCATGGACCT 5 7 60 

TT AG CAT ATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTG CTCTGT A AAAGGATCAA 5820 

GAATTTCAGA TGGTTTCAAG GTCG CATTG A AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5880 

GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5 9 40 

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000 

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACGTC 6060 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 6120 

TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTG CCACATAAAA CAGGTAGCGA T AG AG ATGAC TGAGTTTGTT TGCTGCAATA 62 40 

AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATCG 6 3 00 

GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 63 60 
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AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 64 20 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 64 80 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 65 4 0 

CCGA7CGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAAC GGGG A TTTCCACATG CCCCATCCGC 6660 

TCATCGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTC CG AC 67 20 

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 67 80 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA AT ACAAGCG A GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6 9 60 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GAGGGAGCTT AAGAAGTTAC CACCGTCCTC TAGCGCCTTA TCCGCATCAA AGTTAAGGTT 60 

GATATTTTTA AAACTGTCGC CAGCTTGTGA TACGATGCTT TGTTTAAGGT CATTTAGGGT 120 

TTTAGTGAAA TCTGCATTGC TGAGGATATC ACTCTTTGAG AGATTCAAGG CAAAATTGAT 180 

GAT GAT ATT G ATCTGGTTTC CTGTTATGAC CTGATCAACT TTGTAATTTT TTAAGGTATC 2 40 

TTCAACAATC TTGCGGATAT CTTCTTCTGT CAGATTTCCC TTACTTTCTT TAGCTTTGGC 3 00 

GAGTCCTGAC TTGATATCAG CTAGGGCAAC GTTTAATTTA TTAGCATCAT AGCCTGATTT 3 60 

GTCCTTGTTT TCAGCATTGA TATCTGACAA AGCTTTTAGC TCTTCTTGAG CCAAATCTTT 420 

ATTAGCTTGT GGCACCTTGG CTCCATTAGC CTCTAGCGAA TAGTAAATCC CTGCTAAAGC 4 80 

ACTTTCTCCT GTAACTGGAA TAGGGGCTGC TACAGTGATT TTGGCATGTT CCATACCCAG 540 

CGTTACTGCT GCGTTTCGGT ACATATCCTG AGTCACCTTA GTGATATTTT CTGGTGTTTC 600 

AATCTTGACC TCAAGTGGCG ATTTGTCACC TAGCTTTTGA ATCTTGGCTG ATGAATACAA 660 

CTGTAAGCTA GAGTCATTGG C CACATTCAT GATTTTAGAA TAAACATCAG GTGTCATGGT 720 

CTTGAGTTCT TTGGTATCTG TTGAGGCATT GTAGCCCAGT TTTTTAAGAG TTTCATTTTT . "7 80 
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TTGCTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGCT TCGACATAGG TTTCATCCAT 84 0 

AACTTTTTGA AC ATCTG TTG CTGCATGGAC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 9 60 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTCGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT L080 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGCG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AG TCTC ACG A GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT 7TTGGGCGAT AAAATCGATA TAGGCAGGTC 12 60 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 13 20 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TC ACT AG ACT GGATAACATC GCCTTCTTTT CCTTTTGTTT 144 0 

CAGTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

T TTCTCGC AC CATGGGTTGA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG IS 60 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 162 0 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAAGATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 17 40 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACAGGTT CTGGTACGGC TTCTTTCACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTGTTGT C AA AAAGGTCATT AGCCAATTTC TCAGGTGACA ATTCATTGCT 19 20 

TTCTTCTACG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 19 80 

GTT AAAAG AT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2 04 0 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100* 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTGCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2 220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2 2 80 

TTCTACACCT TCTTTAGAAA ATTG C AC AAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2 3 40 

AATGCTAAAC TCCTCTTTCC AG AG ATT AG C CAGCGTTACT GATGTCTCCA ACAAATCGTC 2 4 00 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2 4 60 
ATACACATGT TCAATTTTTT 7ACGCAGGTA TTCTTCGATT TTTGGAGTAA T ATT GAG AAA . 2 520 
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CTTATCTGCT AAGAACACTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2 580 

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640 

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 27 00 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 27 60 

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA G CTTTCGTAA GGGATTGAGT 2 820 

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2 880 

CAACTTTAGT C AC ATC AAC A AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940 

AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060 

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTC GTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 3240 

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3 300 

ATTTTGAAAA ATCGATAATT TGAGAATACG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3 3 60 

TTACTTCTTG GGCTTGTTTC AAG AT AGC AT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3 420 

CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3 480 

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA AT C AAGG AC A 3 54 0 

TGTAAGCCGC ACAGTTAGCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3 600 

CG AAAATTT C TTTTGCGCGT TCAATAGCAA G AGTCTC T AC AACGTCTACT ACATCAGTTC 3 660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 3 7 20 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 37 80 

GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGC AT CAT AT GCTTTAAAAT 38 4 0 

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 3900 

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTC TGC AT TTTATCACAA 3 960 

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AG AATTT AAG 4 02 0 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4 0 30 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACT GCT AAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4 2 00 

GC ATC CTC AT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4 2 60 

GCCGTTTCCT CTGGACTAAA AGT AT AAC CC AAAGCCTCTT GGTTGATCTC ACATATAGCT 4 3 20 
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TTCACATCAG 


TTTCTCTTAA 


ATCTCTTAGC 


ATCTCATTCC 


TCCTC AAAAG 


AAATCTTTGG 


4380 


CAACCGAGCA 


AGAATATCTT 


CTCGCTTAAT 


GGCCCCTTGA 


CGTAAGATTT 


TCACCTTGTC 


4440 


TCCCGACAAA 


TTCAAAATAG 


TTGAATCCTG 


TCCAGTTAGA 


AAAGCATCGT 


CTTCCAGACC 


4500 


CAGAACCTCT 


TGGTCAAAAT 


CCTCTAGAAT 


TTGATTAAAG 


GTCACTCCAC 


TCGCCTGACC 


4560 


TGAGATATTG 


GCAGACGGCC 


CAATCAAGGG 


ACCTGTCTCT 


CGAATCAAAT 


CAAGGGTAAT 


4620 


GGGATGACTA 


GGCATCCGAA 


ATCCAACAGT 


TGCAAGGCCA 


GAATTGACCC 


AATAGGGAAC 


4630 


TCGGTCATTA 


GCTTCGAGAA 


TAATGGTCAA 


GGG ACCTGGT 


AAAAACATCT 


CTACAAGTTT 


4740 


TTGAAGATAA 


GTTGGCTGAT 


TCTTTGAAAA 


GTACAAGATG 


TCCTCTAAAG 


AGGCAACATT 


4800 


GAGATTGAGC 


GCCTTGTCTC 


TACGTCGACG 


TTTAAGCTGG 


TAAACATGGT 


CAACTGCTTT 


4S60 


TTCGTCTAGC 


GCCTTAGCAA 


AGAGACCGTA 


AACTGTCTCT 


GTAGGCAAAA 


CGACAGCTCC 


4920 


ACCATTTTCC 


AACTCTTGTC 


TAATCCTGTC 


CATCATCAAC 


GACAACCATC 


CTATCTTGAC 


4980 


CAAATTGGTC 


CTTGAGTGTT 


CGTACTCGCT 


TTTCAGGAAG 


ATCTTTCCTA 


AAAAGTTCAG 


5040 


GAACACTTTG 


ACCTTGCTTG 


TATCCAATTT 


CAAGGTAAAT 


CTTACCACCA 


TCTTTGAGAT 


5100 


AGTCTTTTGC 


ATCTTCCGCA 


ATTCTACGGT 


AAATAGCTAG 


GCCATCCTCA 


TCTGC AAAC A 


5160 


GAGCTAGATG 


AGGCTCCGAA 


TACAAGACAT 


TCAAGCCTAC 


CTCTGACTCA 


TCTTCACGAG 


5220 


AGATATAGGG 


TGGATTGGAA 


ACAATTATAT 


CATATTTTTC 


AGAAATTTCT 


GTAAAACAGT 


5280 


CAGATTTTTT 


TAAAAATATT 


TGAAGATTTT 


GATTTTTAGC 


ATTTTCGCTA 


GCTACATCTA 


5340 


AAGCATCTTG 


GGAAATATCT 


GCTGCCGTCA 


CTGACCAATC 


TGGTCTGTTT 


TTTGCTAGAG 


5400 


CGAGAGCAAT 


AGCTCCACTA 


CCTGTTCCGA 


TATCTAGGAC 


CAT AAGATTT 


TTCACAGGAT 


5460 


TTTCAGCCAG 


GATAAGCTCC 


ACCAACTCCT 


CTGTTTCTGG 


ACGAGGAATC 


AAAACCCGTT 


5520 


CATCCACCTT 


TAAATGCATT 


C C AT AAAAAT 


CTGCCTGTCC 


AATGATGTAC 


TGAGCTGGCT 


5580 


TGTGAGCTGC 


TAGTTGCTGG 


TAAAIAILI I 


\, I f\^.t\J\J\ 1 lu 


TTTTTCTTCC 


TCTGTTGTCA 


5640 


CCTCCTGCTG 


GAGGGCAAAA 


ATAAAGTCTG 


TAAAAGATAG 


ATTTTTCACA 


CTACGATAGA 


5700 


C AAAAG AG AG 


GCTTTCCGCT 


TCCTCTCCTT 


GTCTTATCAA 


CTCTTCTTCA 


AAATTTGAAA 


5760 


ATAATTGAGC 


TAATTTCATT 


ATTTGTTTAA 


TTCTTCTAGT 


TTTTGTGTTT 


GGTCATAAAG 


5820 


CACCAAGGCA 


TCCACAACTT 


CGTCCAATTT 


ACC AG AC AAA 


ATCGTATCTA 


GTTTTTGGAG 


5880 


GGTCAAGCCG 


ATACGGTGGT 


CTGTGACACG 


GTTTTCTCCG 


AACTTATAAG 


TTCGGATCCG 


5940 


TTCTGAACGG 


TCACCAGTAC 


CGATTGTCGA 


CTTACGCTCA 


GCGTCCTGCT 


CAT CTTG AGC 


6000 


AATCTGAGCA 


AAGTGGT C AG 


CAACACGGGC 


ACGGATGATT 


TTCATCCCCT 


TCTCACGGTT 


6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTG ATA TTGGTTGGCA ACTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 618C 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGG CAT AAC 62 40 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 63 00 

CACACGGTGG GCACCTGATT CAT ACTT AAG CTTAGAGTAT AC AG ACTG AC CTGAAACCAT 6 3 60 

AG C AACC ACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6 4 80 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG AT ATTC TTGT CATCGTTTGG 6 540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGCC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6 660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 67 20 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGG TGTCTGAAAC 67 80 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT ACATTCCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 69 60 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGC AT C ATT A TCATAAACTG CCAATGGAGC 7 2 00 

TACAAAATCT CCATTTTTAC TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTTGGTTGC 72 60 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7 3 20 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT AT C ATTGGCG 7 3 80 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTGVT 7 4 40 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7 500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7 560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAACCC AAAAGCCATG ACGGGTATGT 7 620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7 680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 7 74 0 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC AC AT AG C C AA 7 800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7 860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 79 20 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG T AAAAT AG AA GAAATCAAAA 804 0 

ACTAGTGGAG GAAGCT ATT A TGCCATTTGT ACG CATC GAT TTATTTGAAC GACGCACGCT 8X00 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG T CAT C ATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 822 0 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 82 80 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8 3 40 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 84 60 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 852 0 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AG ACCTT C AC TCGTGAGGAT ATTATCGAGA 8 64 0 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCCTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTCC TTTTTTAAAC GGTCGCGTAG 87 60 

AC TTG AC AC A GGCAGAGGCT GTGATGCATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 88 2 0 

ACATTCCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGACC TCAATATCGA CTATCCTGAG TATGACGATG 8 9 40 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGCTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTG ATT GAT A CACCCGGTAT TCGTGAAACG CATGATATCG 924 0 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 93 0 0 

TAGTACTAAA C G CT AG TG AA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 9 3 60 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 94 2 0 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACG CCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTCTTAACCA AGGTCTTGAA CTAGGGATGC CACTTGACTT CCTTCAAGTT GACTTGACCC 9 660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 97 20 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT C ATT CGGTC A 97 80 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 984 0 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9 900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9 9 60 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA G ATTGCTC AA AAGTTAATTG 102 60 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10 320 

GTAAGCTCAA TGACTTTCAC T TTAAAC ATG ATTTTTCTTG T CTTCCTG AG ATTATGTCTT 103 80 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2393 base pairs 

(B) TYPE: nucleic acid 
(Ci STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



GTTTTGGGTT 


CTGGAAATTA 


TCAGATGGTT 


GGAAAAGCCG 


TCCACATCAA 


GATAGTGTTC 


60 


GGAGATTTAA 


GTTTAAATTG 


AAGAAACTAA 


CACAGAGGAA 


ATGGAGTATA 


GACCTAACAA 


120 


GACGTATTGA 


GCAACTGAAT 


TTGTCTATTC 


GAGGATGGAT 


AAACTATTGC 


T CAT TGGG AA 


180 


ATATGAAAAG 


TATAGTCGCC 


AGCATAGATG 


AGCGCTTGCG 


TACTCGCCTA 


CG AGTC ATT A 


240 


TCTGGAAGCA 


ATGGAAGAAG 


AAATCGAGAC 


GATTATGGGG 


ATTGCTTAAG 


TTAGGAGTTC 


300 


CTAAATGGAT 


AG GAG AT AAG 


GTATCTGGCT 


GGGGCGACCA 


TTATCAATTA 


GTAGCTCAGA 


360 


AGTCGGTACT 


TAAACGTGCT 


ATATCAAAAC 


CAGTCCTGGA 


AAAACGTGGA 


CTGGTTTCGT 


420 


GTTTGGATTA 


TTACCTTGAA 


CGACATGCGT 


TAAAAGTTAG 


TTGAACCGCC 


GTATGCCAAA 


480 


CGGCACGTAC 


GGTGGTGTGA 


GAGGGGCTAG 


AGATTATCCC 


CTACTCGATT 


AACTCCCCTG 


540 


AAATTTATTT 


TAATTATGCA 


AATTTCACGT 


ATTTTTG ATG 


CTGAGACGAC 


GATCCTGGGA 


600 
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ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCGCCAAA 660 
GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATC 720 
TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTACCCA ACAACGATAA TCATAATTCC 780 
TCCTAAAATT AGGAATAATA AAGGCAATAG TTTTTGTTTT TTCATGTAAA AAAC CTC ACT 840 
TTTGT TTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900 
ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960 

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTGC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTGGTAGGA ATCTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TGTTGCAATT TCCATACTAG ATGGTTTTGC 12 60 

AGCACCCGTT TCCTATCCTA TTGTGCCACG CTATGCCACC GATTTGGGTA AGGCTAATTC 13 20 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGGT TGGGGATTAG GTGGACTCTT 13 80 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 14 40 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCT AG A AATCCTAGAT TAAGACTTTT 1560 

TGTATCAGCA AAT TT ATTGG AAATTTTTTC AAAT AC G ATT TGGGTTTCTT CC ATT AT ACT 162 0 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GG AAAGTT A C TGGGGATATT CTAATACAGC 1680 

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAAGTTCCT 174 0 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC ACAATCCTTC 1800 

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGGTGTTTTT TACTAGACAA I860 

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 192 0 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 1980 

TGGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2 040 

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTGTCCAGTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATGCT TGTAAAGAAT TGTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280 

CCATAGTTTG CGATGCCTCA ACCTTCAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2 34 0 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 239 3 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 12 0 

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTT CCT ACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAACCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGCCTTTCTC TATCTTTTGT 300 

CTGTTC CTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 3 60 

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 42 0 

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCGATTTTG TCCCCGTTGC 4 80 

ACTATACAGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA AC ACTCTT AA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 6 60 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 7 20 

TTTTTCTTTT TAGAGTGCTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AG AGC ATGC A 7 80 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 84 0 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

C AAG GTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGACCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020 

CGTAGAACAT CAACTCCTTG AAGCCAAAGA AT ATGCG CTG GCTCAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAG AC C AG A CAGTTGTCAA TGAAAACGCT AATAAAGACA GTGATGTCTT 1200 

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCT AAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 13 80 
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TGGTTTTAAG ATTGGAAATG CAGAGGTAGA CAGTCCCAAG TCTATCCACA CTGCGACACC 14 40 

ACAGATTTCT CAAATCATTG C C AACGTTG C TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 
C CG T AT CG AT GAAATTTTGG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA 15 60 

TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC CCTTGGAAGA AAGCGCAAAA 1620 
GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1680 
TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCAGTCGTT TTG AACG AG A 17 40 

AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 1800 
TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGG AACTC C I8 60 

CAACTATGAC ATCAAGCAGT TGGCTCTACA GTGTGCAACC AAGCGGATGT ATCCAGACGT 19 20 

CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 19 80 

TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 2040 
TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 2100 

TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2 220 

TTTTGGCCAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCGTCGTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACAGTTTTCT TTGGTAACAG 2 3 40 

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2 400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 24 60 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AG AT AT AG AC AAGTTTGCCT CTATTCCTGA 25 20 

TATCACAGAC AACGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2 5 80 

ACCGTTTGAA AAATTGGACT TT G AG AAAGT CTATCCGGAA GCAGGTGCGT C AG GTGGTTT 2 64 0 

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2 7 00 

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACT CCG ATTG ACCGTTGCTA 27 60 

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGGTTTGCTT GTCCAAACTC 2 820 

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTGGC T ACCTAGGTAA 2 880 

TCCTCAACCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2 940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCCTTA TCACGAGAAA 3060 

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA G CTTC AG AG A ACAATTTTTA 3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180 
AAGGATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTCTTTTAG AT ATGATG AC 3240 
AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTCT 3300 
GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3 360 

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 
TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTCGCTACTC 
CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 
AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 
TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 
GCGTTATTGG AT AG AGGT AG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TCTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GG C ATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3 900 

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG C CTTTTCTC A 3960 

AT AC TGGG AT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020 

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 4200 

AAAGT GGGC A AGTACTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 4 2 60 

TGAAGAGAGA ATGAAGAAAA ACGACTTACT AG AC C AACT A GTCTCAGAGA TCGAGACGGG 4 3 20 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4 380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4 440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGG TGACAGCCAC 4 500 

TCTGCCAGTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4 560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGG AG CCAAA 4 620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4 680 

ACCATCTGTT TCTACACGGA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4 74 0 
ACTGTGAGAA ATCGCGATGC GG 
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 



3420 
3480 
3540 
3600 
3660 
3720 



4762 
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(A) LENGTH: 3832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



V 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



GATGCAGGT T 




TATTfCAGAA 


AATTACTTTA 


AAGATGATGA 


TGTTAATCAG 


60 


GTACCTTGTC 


1 1 IUI LOVj 1 *- 




GCCCTCTTTT 


TCAGTAATTG 


GGTAGACCAT 


120 


GCGGTCTATC 


AG G AO ACLj L. L 


1111 un l l "oU 


AGAAAGATAG 


AAGATGATGC 


ATCTG CAT AT 


180 


GGGTATTTAT 


» ft /"^ i/^/^ A ft 'Tup 

AAGAGUAA I l 


I vJ/\L-/\ In! 1 


TAGACGCTTT 


TAAATCAGGT 


ACCTTGGTTT 


240 


TACCGAGTGC 


ClTTGCTL. * TO 


f* m rr | *T > T v T , & A f^TI 

1 1 i 1 /VfWjVj 




TTCTAGCGAC 


GATTTTCTGG 


300 


TTTGGCAATT 


TTTCTA1 I LO 




rAHCCTTAGA 


AGAAATGTCG 


CCAAGCCAGA 


360 


TTGCTGAAAG 


G ATTOOv, AAO 


0/WV\ A. 1 1 Vb-V^VJ 


ATGTCAACCA 


GTCCATTTCT 


AATCTGACCG 


420 


AAAGGGGACT 


GCTCC AOT A I 


prTArriTPf; 

v_\j L 1 rl 1 


AATTAAATGG 


CGAAATTGAA 


TTGCTCTTTG 


480 


ATGCT AGTTT 


T 

GGC CTTGGAA 


Lb 111 i lj 




AGCAGTTCAT 


TCAAGTTCAG 


540 


ACCAGCTAAC 


ALL ILnAAAL 




ATTTGGTGGA 


AACCTTCCAG 


CAGGAGTTGG 


600 


GACGATTGTT 


GACGCCTTTT 


OAO/t 1 1 v».r\VjO 


ATTTGACCAA 


GACACTAAAG 


GAAGATGGAA 


660 


CCAGTGCTGA 


L I l uA 1 I AAvj 




CTG AAG CTGT 


TTTGAATGGA 


AAACCAAACT 


720 


GGAAGTACAT 


I LAuVjLuA 1 1 




GGCGCCATGA 


AGGAATCAAG 


ACTGTGCCTC 


780 


AAATTGAGGC 


CAAGAGAGCA 


GAAAGAGAAG 


CAAGCAATCC 


TCAGTTGACA 


CAGGTATL \ O 


84 0 


CAGATTTCAT 


AAATGCCATG 


GATCTCTGGA 


AGGATTAATC 


CATGCAAGTA 


GGCTTGAAAT 


900 


CCGAGTAAGA 


TTTGCAAGCT 


GTGTATAATT 


GTGATAGAAT 


AAATAGAAAA 


TAAATTGAAA 


960 


AAAGAGGTAT 


GTGAAATGTC 


ACCTAAACCA 


TTTATCCCTG 


GTAACTGGAA 


AATGAACAAA 


1020 


AATCCAGAAG 


AAGCTAAAGC 


ATTCGTTGAA 


GCAGTTGCAT 


CAAAACTTCC 


TTCATCAGAT 


1080 


CTTGTTGAAG 


CAGGTATCGC 


TGCTCCAGCT 


CTTG ATTTG A 


CAACTGTTCT 


TGCTGTTGCA 


1140 


AAAGGCTCAA 


ACCTTAAAGT 


TGCTGCTCAA 


AACTGCTACT 


TTGAAAATGC 


AGGTGCTTTC 


1200 


ACTGGTGAAA 


CTAGCCCACA 


AGTTTTGAAA 


GAAATCGGTA 


CTGACTACGT 


TGTTATCGGT 


1260 


CACTCAGAAC 


GCCGTGACTA 


CTTCCATGAA 


ACTGATGAAG 


ATATCAACAA 


AAAAGCAAAA 


1320 


GCAATCTTTG 


CGAACGGTAT 


GCTTCCAATC 


ATCTGTTGTG 


GTGAATCACT 


TGAAACTTAC 


1380 


GAAGCTGGTA 


AAGCTGCTGA 


ATTCGTAGGT 


GCTCAAGTAT 


CTGCTGCATT 


GGCTGGATTG 


1440 


ACTGCTGAAC 


AAGTTGCTGC 


CTCAGTTATC 


GCTTATGACC 


CAATCTGGGC 


TATCGGTACT 


1500 
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120 

GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGCTTCTGTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 16 80 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATT C A GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGCC AACTAGTCAT 18 60 

GTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT AC AG ACT AAT 1980 

GCTGCTGCCC ACTGGGATGG GGATTATTAT GTAAAGGATG ATGCTT CTAA AGCTCAAAGT 2 04 0 

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 2220 

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 24 60 

TATATGGCTC GGAATGAGTG GC AAGGCAAC TACTATTTGA CTCGAAGTCG TGCCATGGCG 2520 

ACTGACGAAG TGATTATGGA TGGTACTCGC T AT AT CTTT G CGCCCTCTGG TG AGCT C AAA 2580 

GAAAAAAAAG ATTTGAATGT CGCCTGGGTT CACAGAGATG GTAAGCGCTA TTT CTTT AAT 2 64 0 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TCATTGATAT TAGTGAGCAC 2700 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGT CATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820 

AACCGTCTGG GAATTCCTTA TGGTGT CT AT CTCTATACCT ATGCTGAAAA TGAGACCCAT 2880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AAT AC AAT AT GAACCTGTCT 294 0 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTG AT AC AG GCACTTGGGT TAAAAT CATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CT AT AG CT AT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3 240. 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATCTG 3 300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3 3 60 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAAC AT TTT A GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3 540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3 6 60 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3 720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3 7 80 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3832 
(2) INFORMATION FOR SEQ ID NO: 93: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

TGAAAAAATC CTCATGAACC TGGCGCCAAT AGACAACTGT CTTGTTTCCC TCACCTTCCT 60 

TATAGGCATG GTCAGCTGAC ACTCGATTGA AGGGTTTAAC AGAAACCTTT GTAATTTCGA 12 0 

CAATGCAGAC AGCCTGATTT TGACTATCTA AAATGACATC GAAGGTCCCT ACTTGGGGAA 180 

GTGGTTCGTC TTCTAGCACA TAGAGGTCAT AGGCTGATGC TGTTGCTGTC TTTTCTCCTT 24 0 

TAAACACCAA ATCCGCTAAA AGGTCTGGTT CAACTCCAAA AGCCCAGGCA TCGATTTCAT 300 

CTCCGATCAA AGGATTGATT TGCTTGTATT TATTCCACAT TTCTTGCGGT ATCATGGGTG 3 60 

CTCCTTTGTA ATTTTTTACT TTCTTCTTTT ATGTGTTTAA GATGATCTGG ATGGTCAATC 420 

TCTAAATCAA AAATCTCTGG AATAGAACTG TAGTGGATAA TGCACTTGAT ACCCAACTGA 4 80 

TTCATTTTTT GTATGAAAGA AGTATTCAGA TAGCCTGCTA CAGCAAAATC AATCTTGTTC 54 0 

TTTCTTGCTT TATCCTGCAT ATCTCTTAGC ATATCTAACA TTATTGCACT TTCCATATCA -600 

TGCCATTGAC TGTTTCTCAT AGTCGCAAAA ACAAAGGAAG TCAAATCATT CATTCCAACT 660 

ACAATCTTTG AAATGCCCGT TTCCAGTATA CTAGATAAGT CAAAATACGC TGACGGTAAT 720 

TCAATCATCG TTCCGACTTT CCCAGTAAAA CCCTGCTGAC GC AATACTCT AATAGCTTGT 780 

TTTAATTGGT CGG CATC ATT GACAAAAGGA AACATAACAC ATAGATTGGG GTTGGTTTGA 840 



WO 98/18931 



PCT/US97/19S88 



722 



TAAACTTCTG 


TAACGACATG 


TGCTTCAGCC 


TGAAATTCAT 


CCAAACACGC 


CAGTAAACGC 


900 


CTAGTTCCTC 


TATAGCCAAA 


CAAGGGATGC 


CCTTCGTCAA 


AAAACTCTTT 


AGTCCCCACT 


960 


AAACAATTGG 


CTTCTGTATT 


CGTTAATTCA 


GTAAAACGAT 


ACCAAACTTC 


CTTACCTAAG 


1020 


TAAAAGGAGC 


AAATAGTATC 


AAGATAATCT 


TTCACAAATT 


CCTGACAACT 


TTGTAATAGT 


1080 


ATATTTTGAT 


TGAGCTCTCT 


CAATAAGTAT 


TCCCCACGAA-P€ATGCCGAC 


GTGGTGAAAT 


1140 


AGTTGAGGAT 


AAATTTTTTC 


AAGAATTTTT 


TCGCCACTAA 


GGGCAAGTTG 


ATTTCTCATC 


1200 


ATTCACCTTC 


CAATTCATGT 


AAGAAGTCTT 


GTCCAGTTCT 


GGAAATCCTA 


ATAATTCAGA 


1260 


CTTAACCTTC 


AAGACTAATG 


GCGATGCATT 


TTCTTCTGTA 


ATCTCTTGAA 


TATCCATCCA 


1320 


AATATATCCA 


AGTGAATCAT 


TCGCACCATC 


AGACACAGCT 


TCCGAAATCG 


TAACTTGAGG 


1380 


TGCACTCTCA 


TTCATTTCAA 


CATCATACAA 


GGCTATGACA 


TGGTGAACCA 


TAAAATTTTT 


1440 


TAACTCTTCC 


CTGACGAAAA 


CATCGTAGAT 


TCGAGGATTA 


GAGTAGCTTC 


TAACAGTAAA 


1500 


TCCCGTCTCT 


TCCATAACTT 


CTCTAGTCAC 


CGTTTCCGTC 


AGTCCTTCAC 


CAAGTTGCTG 


1560 


ACTGCCTCCA 


GGTAGATCAT 


ACCGATGTTG 


ATAAGGGCCT 


CTCGTTTTTT 


CAATGCAAAG 


1620 


TAACTTTCCA 


TTTTCAAAGC 


AAACACAGTA 


GACCCCAAAG 


TGATTTTTGA 


TTTCCATCCA 


1680 


ACTCCTCCTA 


CTTCAAAGAC 


CAGCCACCAT 


CTATTGTCAA 


GATTTGTCCT 


TGCATGGCGC 


1740 


TCGCTTTTCC 


ACTTGCTAAA 


AAAAGACTAA 


GCTCTGCTAT 


TTCCTCTGGC 


TCAATCC AG C 


1800 


GCTTGATTGG 


GGTTTCACTA 


GCCACCCAGT 


CAGCCAAACC 


ACCTGGTTCA 


AAATCCGCAG 


1860 


CGGTCATAGC 


TGTCTTGACT 


G CTCCTGG AG 


CGATACCAAA 


GACCTGAATC 


CCAGCTTCAG 


1920 


CATAGTCTAG 


AGCCAACTGC 


TTGGTGAAGC 


CAGCCAAGGC 


ATGCT TGG AT 


GAAGTATAGG 


1980 


CGTGACCACC 


TCCACCTGCT 


AGCCTAGAAG 


CAATGGAACA 


CATATTGATG 


ATGATTCCCT 


2040 


TTTTATTTTC 


CAGCATTTGT 


GTCAAATAAT 


ACCGAGTCAA 


CTCTACTGGA 


ATAATGTAGT 


2100 


TGATTTCAAA 


AATCTCTTGA 


ATGTCCTGCG 


CCGTTTGTTC 


CAACAGTGGT 


TTGTAATCAT 


2160 


CCAAAACTCC 


AGCAGTATTA 


CACAAAACAT 


CCACCTGAGG 


GCACCAGTCA 


AAAATAGGTT 


2220 


CCAAGTCCAA 


GGTCAAATCT 


CTCTGTAAAA 


AGCGAAAATC 


ACCCTCTAAG 


AGTGGCTTTT 


2280 


CACCTTGGTC 


AACTCCATAA 


ACTTG AT AG C 


CCTTCTCTAA 


AAAGAGGCGA 


GCTTGAGCCA 


2340 


ATCCGATCCC 


TGAACTCACT 


CCTGTAATGA 


GTACACGTTT 


AGTCATGCAC 


TTCTACCCAA 


2400 


TCCGTTCCCA 


AAACATCACA 


AACTGTCGGG 


CTCCACATGG 


AAAAACCTTC 


TCCTTCGCCA 


2460 


GAAACGTTGA TTAGGAAATA 


AGGTGTCATT 


TCAAGTGCAA 


GCCCATTTTG 


CTCGATGGTA 


2520 


TCAAAGAGTT 


GGACATAGTT 


TTCCGCACCT 


CCCCAACCAG 


TTCGTACATA 


TTTTCTCTTA 


2580 


GCCTTTAACC 


CAGGCAGGAT 


CTCTTCAAAT 


GTCATGTTTT 


TCTCCTTTAA 


TTCTACATTC 


2640 
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TTCATTTAAT T AT AGC AAAA AACCCCTTTA TACGGCTTTT TGAATGTGAG TTATTCAAAC 2700 

CTGCTACTAC TTACGGCAAA TTATTCCCTG CAGCAAGATA AATTTCATAC CATTCTTTTC 27 60 

TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 2820 

CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA AG AAATGGC A ATAGTTGAAG 2880 

AGGTTACTCC ATATTTAATA GCTAAACGAT CAAGTACTTG ATTTAAAGCT TGAAATTTCT 29 40 

CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA GACAGACCAT CCTTGAATGA 3000 

CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC TCGCATAGCT GCTTGACTAT 3060 

CTTCCATATT AACATGAAAA GCTGATTCAA ATCCTGGAGT AAAAGCCGCA CTCAATTGTA 3120 

GCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG CAACTCCATC ATCATAGGAT 3180 

TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 3240 

CTGCTACTTG GTCAGATTCC ATCAAAG CAT CTGGTCGATG AAGGAGCAAG CTATCTAGAT 3 3 00 

GATCAATCTT CAATCTTTCC AAAATACCGT CTACTGATTT TATAATATAG T C CTT AG AAA 3 3 60 

AATC AAAAT A GGTAAATTCT TCAATGCGAA TGCCACATTT GGACTGAATC C AC ATCTTTT 3 4 20 

CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG TTCTTCACAA CGACCACGAC 34 80 

CAT AAAT ATC AGCCAAGTCG AAG GCATTG A TTCCAACAGA AAGTGCTGTT TCTACAAGCT 354 0 

CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 3 600 

ATTCTTTGTC AT CTTG ACCA AGAGTTATGT ATCTCATCAA ATTTTTCTCC TTTAATTTCT 3 660 

AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA CGACTTTTTG ACTATACTTC 3 7 20 

ACTCCATTTT ATCTTCTTAA ACCCACGGAA CAAGACAAAG ATTCCAATAA AGAGGACAGC 37 80 

TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC ATCCACTCAT AATAACGGAG 3840 

CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA AATGGACGAA AGACCCCTTC 3900 

TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG ACAGACAAAC TATCCCAGGG 3 960 

AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT ATCCTAAAAC 4 020 

AACTAGAAGA ACTATAGTCC CAACAACAAT GTAAGTGCCA ATTTTCATTT TAG GAG AATC 4 080 

TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA AATCCAATCA GAAAAATAAG 4140 

AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA G G AC CTTT AG AAAAATCACT 4 200 

TAGTAGTTGA TAATAACGTA AT ACCGC GAG GACAAGAATT GGCGTCAAAA GGGACTCTTT 4 260 

GATAGAACTG CGAGGTGCTC CCTTGAGAAT CTCTTTCATT ATTTTTTTAG GATTCTTACC 4 320 

TAGATAATCC TCTGCACTCA TGCCATCTCG TTCTGCTTCT GAGAAATCTA GCATCATCAA 4 3 80 
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ATAGATCTGC TCTCTGAGAT AGTCTTCATC ATAGAGAAAT CCAGCAAGAT TAAAACTTTC 4 440 

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTGT 4 500 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCCCCA TTCTTAATCC CTTCTACTTT 4 5 60 

TTCACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 4 620 

TAATGAAAAA TACTTCCGAT CTGGACCATC TGGCGACGGG CGCATGTCGC CTCTTATCCA 4 680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 474 0 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4 800 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTCA GTTTCTTTCA TCACTTCTCC 4860 

CTTCTAATCT ATTTTCTAAT AC CT AC T AG T GACTTCACCT ATAGTATATC ACTTCTACAC 4 920 

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4980 

CCCCTACCGT ATCTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTC ATCT AC A ACCTCAAAAC AGTG TTTTG A GCTGACTTCG 5100 

TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTC ATCT AC A 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 5220 

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAATAA 5280 

AAAAACAGAA CT AGCCTGAA CTAGTCCTGT CT ACTTTT A C CCAATCACAC TTCCATTTGC 5340 

TACAGCTGGA TCAACTGTGA GAACGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 5400 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTAGCAA CGATTTGAAC 54 60 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5520 

ACGATCTTCT CCATCACCAG CATCCAAG CG GAATTGAAGC AACTTATCTG AACCTTCTAC 5 580 

TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 5 64 0 

GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 5700 

TGGTTTATTG CCTTCCATTT GTTCCTTGAT AT AGG CG ATT TCTTCTTCCA TATTTAGACG 57 60 

TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCAC AT C T GCTCGGAAGT CAGCCAAACT 5820 

CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACGACTAGT 5880 

TTCCATCATA AATGGTTCAA TCAAGTGAGC AACT AC ACG A ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT CACGAAGAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG T ACG AG AG AT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCAGC 6X20 

AAGAACATGA TC AT ATT C AG TCACACCTTC T AC AT AGG C A GGGATTTGTC CATCAAAGTA 6180 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 6 24 0 

GTTGATACGG CCGACATAGT CTTCAGCAGT AAAGGTTCCG TCTGAACCAA CTGGAAGGTT 6300 

ACGCATGAGG TAGTAACGAA CTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 6360 

AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 6420 

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 6480 

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGG AAG ACT GTTCCATTCC AGAACTTCTC 654 0 

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 6 600 

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6 660 

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 672 0 

AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 6780 

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA Ac CCATTCAA CCTCATGACC 684 0 

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 6900 

GCTTTCT GT A AAGAATTCTT CGTCTCATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 69 60 

GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 7020 

ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080 

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 7140 

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 7200 

GCGTTTGTAA CGTGCTAGGA CATCACATGC G AT AGTTGT G TAGGCAGAAC CGATATGAAG 7 26 0 

TTTCCCAGAT GG AT AGT AAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 732 0 

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 7 3 80 

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 74 40 

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTCTTGCCTG GCAAGACTCC 7500 

AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 7 560 

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7 62 0 

ACTTG CAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 7 6 80 

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACCAAGAC CAATCTCACC 77 40 

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7 800 

AGAAATCCCT GCGATGCAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTG AATTT C 7 860 

TGCAAATACA ACGGACTCAT CTGCCACCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7920 
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TGTCATTTTT 


CGTGCCAGTT 


CTGGACCCAG 


AGTTGGGGTT 


AAACTGGTAT 


CATTCACTCG 


7980 


AAAGACAATA 


TTAGCTATTT 


CATCTACATT 


AAAATTCGCA 


GCAAGGGAGA 


TATTGGTAGT 


8040 


AATAGGCAAC 


CCACCAAACC 


CATATATTTT 


TGATCTTTTA 


GCCTCCGGAC 


TAGTATAAAC 


8100 


CCCAATGACC 


CGGTAACTAA 


ATCCATTGAC 


TTCTACAACC 


TTGTT AATAG 


CCTCTTGAGG 


8160 


AGATTCAAAT 


AAACTAATGG 


ACAATTCCTC 


ATCT AG C AAA 


ATGACACTTG 


CAAACTCTTT 


8220 


GAAATCTTGC 


TCTCTCAGAC 


TACGACCTGC 


AATAATTTCA 


TTCTTAACAG 


CGTCCATGTA 


8280 


AGTTCTGTTT 


CCACCTGTCA 


AATTAGCATT 


CTCAACCTTT 


TTATCTTGAT 


AGGTCAAGAT 


8340 


GGCATTCGTT 


GAATTGGTTA 


CATAGTAACT 


ATCCACTCCC 


TTCAGTTTAG 


CTGCCTCTTG 


8400 


GACCCAGGAT 


TCTTGCGGTT 


TTGGCGGTTC 


AACAGGAACT 


TCCTCTTCCT 


TTCCAGAAAC 


8460 


CGTAAAAGCT 


GATTGTTTCT 


GAGTAAAAGA 


CCCG TCTTTA 


CTTTTTTTAG 


GAGAGAAAAA 


8520 


GACGCTAATA 


TTTTT CTGAG 


ATTTAGTCAT 


ATCTTTATTG 


ACTTGACGAG 


ATAGGGAATC 


8580 


ACCCAAAGCC 


ATAATCACAA 


CAACTGATGA 


AACACCGATA 


ATAATCCCAA 


TCATAGTAAG 


8640 


CAAAGAACGC 


ATCTTGTGAG 


CCATGATAGA 


TGAAAAGGCA 


AATTT C AG AT 


TCT GCATCTT 


8700 


AGTTTTCCTC 


CTTTCCTAAC 


TGAGCACTGT 


CAGACGAAAT 


GACCCCATCC 


CGAATGACAA 


8760 


TCTGACGTTT 


GGCATAGGCA 


GCAATCTCAG 


GCTCATGCGT 


TACCATGATA 


ATGGTTTTTC 


8820 


CTTCTTTATT 


CAAATCAACC 


AATAATTGCA 


TAATTTGGTT 


ACCTGTTTTG 


GTATCCAAGG 


8880 


CTCCTCTCGC 


TTCATCCGCT 


AGG AT AATAG 


AAGGATTGTT 


TACCAAGGCA 


CGCGCAATGG 


8940 


CTACACGTTG 


CTTTTGACCA 


CCAGATAATT 


CTCAAGGTAA 


ATGGTGACTA 


CGTTCTG TC A 


9000 


ATTCAACCTT 


GTCTAAATAT 


TCCTCAGCCA 


ACTTGCGACG 


TTTTGAAGAC 


GAAACTCCTG 


9060 


CGTAAATCAA 


GGGCAATTCT 


ACATTTTGCA 


GAGCATTGAG 


CTTCGATAGA 


AGAAAGAACT 


9120 


GCTGAAAGAC 


AAAACCGATT 


TGTTGGTTAC 


GGACCTTAGC 


TAGTTGTTTT 


TCACCAAGCC 


9180 


CAGCCACTTC 


TTGACCTTCA 


AG AT AAT ATT 


CTCCACTGGT 


TGGTGTATCC 


AACATGCCAA 


9240 


TCGTATTCAT 


CAGAGTGGAC 


TTACCAGACC 


CAGATGGTCC 


CATGATGGCT 


ACAAATTCAC 


9300 


CCTCATTCAC 


TTCTAGATTG 


ATATTTTTGA 


GAACCTGCAG 


TTCTTGGTCA 


CCATTACGGT 


9360 


AACTTCTGAA 


GATATTTTTT 


AGACTAATTA 


GTTGCTTCAT 


CAGCCTTCAC 


CTCTTTTCCT 


9420 


TCTTCCAAGG 


AAGATGTTGG 


ATTACTGATG 


ACCTTAGCAC 


CGTTCGTTAA 


ACCAGAAGTG 


9480 


ATTTCTTGAT 


TTTCTGCGTC 


AGCATTTCCC 


AATGAAACCT 


CAACT TTTTT 




9540 


TGTTCATCCA 


CAATCCAGAC 


ATAATTTTTA 


CTATCATCCA 


TTACTAGACT 


GCTAACAGGA 


9600 


AC AAG AATAG 


CCTTAGTTTT 


GCTTTTAACC 


TCAATGTTGA 


CAGAAAAACC 


TTGTT TCAAA 


9660 


TCACCAACCT 


CGCCTGTCAC 


ATCAATAGTA 


TAAGGGTATT 


TAGAACCTGT 


ATTATTCCCG 


9720 
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GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 97 80 

CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTC ACCTACAGAA 9 840 

AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900 
ATATGAACCA T AACTTG ACT CGCCCCTGTT GGAGATTTAG AAACATTGCT AT TG ACTTCG 9 960 

ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTG C AT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGACTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 1014 0 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 102 00 

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 102 60 

GCTTCTGAAC TACTGTACTT G ACT AAAG C C TGCCCTTCGC TGACCTTATC GCCCACAGAA 103 20 

ACAAGGATTT CAT CT AAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 103 80 

GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440 

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGACAAGGTT GT CT AAAG AG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

CCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620 

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10680 

TGCTTTTCGG 10690 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
< D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT CGTATCTTTA 120 

TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA T TTTTGTG AT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TT ACGGTT C C TGGCCTTTTG 24 0 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 3 00 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 3 60 
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AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC 4 20 

GATTCATTAA TGCAGCTGGC AC G ACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA 4 80 

CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 5 40 

GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA ACAGCTATGA 600 
CaTGATTACG AATTCGAGCT CGGTACCCGG AAAATCCAGA AAATGCTTGA AAAAAATCCT 6 60 

AGAAGATGGT ATAATACTAA ATTGTAAGGG TTATCACATA TAACTCAAAA AAAGAAAGAA 7 20 

CAAAAGGAGA GTCAAACTAT GGCTTCTAAA GATTTCCACG TAGTGGCAGA AACAGGTATT 7 80 

CACGCACGTC CAGCAACATT GTTGGTACAA ACTGCTAGCA AATTTGCTTC AG AT ATC AC T 840 
CTTGAGTACA AAGGTAAATC AGTTAACCTT AAATCAATTA TGGGTGTTAT GAGTCTTGGT 900 
GTTGGCC AAG GTGCTGACGT AACTATCTCA GCTGAAGGTG CAGATGCAGA TGACGCTATC 960 

GCTGCAATCT CAGAAACAAT GGAAAAAGAA GGATTGGCAT AAGGGAAATG AC AG AAATG C 1020 

TTAAAGGAAT CGCAGCATCT GACGGTGTTG CAGTTGCAAA AGCATATCTA CTCGTTCAGC 1080 

CGGATTTGTC ATTTGAGACT ATTACAGTCG AAGATACAAA CGCAGAAGAA GCTCGCCTTG 114 0 

ATGCCGCTCT ACAGGCATCA CAAGACGAGC TTTCTGTTAT T CGCG AG AAA GCAGTAGGTA 1200 

CGCTCGGTGA AGAAGCAGCT CAAGTTTTTG ATGCTCACTT AATGGTTCTT GCTGACCCAG 1260 

AAATG AT C AG CCAAATCAAG GAAACTATCC GTGCGAAGAA AGTGAATGCA GAAGCAGGTC 13 20 

TGAAAGAAGT TACAGATATG TTTATCACTA TCTTTGAAGC CATGGAAGAC AACCCATACA 1380 

TGCAAGAACG CGCAGcGGAT wTCCGCGACG TCACAAAACG TGTATTGGCA AACCTTCTTG 14 4 0 

GTAAAAAATT GCCAAACCCA GCTTCTATCA ATGAAGAAGT GATTGTGATT GCGCATGACT 1500 

TGACTCCTTC AGATACAGCT CAATTGGACA AAAACTTTGT AAAAGCTTTT GTAACCAACA 1560 

TTGGTGGACG TACAAGCCAC TCAGCTATCA TCGCACGTAC ACTTGAAATT GCTGCTGTAT 1620 

TAGGTACAAA TAACATCACT GAAATCGTTA AAGACGGTGA CATC CTTG CT GTTAACGGGA 1680 

TCACTGGAGA AGTGATTATC AACCCAACAG ATGAACAAGC GGCAGAATTT AAAGCAGCTG 174 0 

GTGAAGCCTA TGCGAAACAA AAAGCTGAAT GGGCACTTTT GAAAGATGCT CAAACAGTGA 1800 

CTGCTGACGG TAAACACTTC GAGTTGGCTG CTAATATCGG TACTCCAAAA GACGTTGAAG 1860 

GTGTTAACAA CAACGGTGCA GAAGCTGTTG GACTTTACCG TACAGAGTTC TTGTACATGG 1920 

ATTCTCAAGA CTTCCCAACT GAAGATGAGC AGTATGAAGC ATACAACGCT GTTCTTGAAG 1980 

GAATGAACGG TAAACCTGTT GTCGTTCGTA CAATGGATAT CGGTGGAGAT AAGGAACTTC 204 0 

CTTACTTCGA TATGCCTCAC CAAATGAACC CATTCCTTGG ATTCCGTGCT CTTCGTATCT 2100 

CTATCTCTGA GACTGGAGAT GCTATGTTCC GCACACAAAT CCGTGCTCTT CTTCGTGCGT 2160 
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CTGTTCACGG 


TCAATTGCGT 


ATCATGTTCC 


CAATGGTTGC 


GCTCTTGAAA 


GAATTCCGTG 


2220 


CAGCGAAAGC 


AGTCTTTGAT 


GAAGAAAAAG 


CAAACCTTCT 


TGCTGAAGGT 


GTTGCAGTTG 


2280 


CGGATAACAT 


CCAAGTTGGT 


ATCATGATCG 


AGATTCCTGC 


AGCGGCTATG 


CTTGCAGACC 


2340 


AATTTGCTAA 


AGAAGTTGAC 


TTCTTCTCAA 


TTGGTACAAA 


CGACTTGATC 


CAATATACAA 


2400 


TGGC AG C AG A 


CCGTATGAAC 


GAACAAGTTT 


C AT ACCTTT A 


CCAACCATAC 


AACCCATCAA 


2460 


TCCTACGCTT 


GATTAACAAT 


GTGATCAAAG 


CAGCTCACGC 


TGAAGGTAAA 


TGGGCTGGTA 


2520 


TGTGTGGTGA 


GATGGCTGGT 


GACCAACAAG 


CTGTTCCACT 


TCTTGTCGGA 


ATGGG CTTGG 


2580 


ATGAGTTCTC 


TATGTCAGCA 


ACATCTGTAC 


TTCGTACACG 


CAGCTTGATG 


AAGAAACTCG 


2640 


ACACAGCTAA 


GATGGAAGAG 


TACGCAAACC 


GTGCCCTTAC 


AG AATG CTC A 


ACAATCGAAG 


2700 


AAGTTCTTGA 


ACTTCAAAAA 


GAATACGTTA 


ATTTTGATTA 


ATCGAAAAGT 


CCCTGCAACT 


2760 


CAGTTACAGG 


GATTTTTTTG 


ATATTTTAAA 


AAGAATTTTC 


AAGAAAATCT 


TTCTTATAGA 


2820 


AAGTCCAACC 


TTGAAAAAGT 


AGTGGTCAGA 


ACAAAAAATA 


CTTAAATGGT 


TCATAAAATT 


2880 


CTTGACAAGT 


TGGATATTTA 


GGAGTAAACT 


ATTAACCAGT 


TAAGTAATAG 


AGACGAGTTT 


2940 


CTGCAATTTA 


GAAATGAATT 


GCAACTAGAA 


ATATCAAATA 


GAAAGAGAGT 


TTCGATGAAA 


3000 


ATTAATAAGA 


AATACCTTGT 


TGGTTCTGCG 


GCACTTTGAT 


TTTAAGTGTT 


TGTTCTTACG 


3060 


AGTTGGGACT 


GTATCAAGCT 


AGAACGGTTA 


AGGAAAATAA 


TCGTGTTTCC 


TATATAGATG 


3120 


GAAAACAAGC 


GACGCAAAAA 


ACGGAGAATT 


TGACTCCTGA 


TGAGGTTAGC 


AAGCGTGAAG 


3180 


GAATCAATGC 


TGAGCAAATC 


GTCATCAAGA 


TAACAGACCA 


AGGCTATGTC 


ACTTCACATG 


3240 


G CG AC C ACT A 


TCATTATTAC 


AATGGT AAGG 


TTCCTTATGA 


CGCTATCATC 


AGTGAAGAAT 


3300 


TACTCATGAA 


AGATCCAAAC 


TATAAGCTAA 


AAGATGAGGA 


TATTGTTAAT 


GAGGTCAAGG 


3360 


GTGGATATGT 


TATCAAGGTA 


GATGGAAAAT 


ACTATGTTTA 


CCTTAAGGAT 


GCTGCCCACG 


3420 


CGG AT AACGT 


CCGTACAAAA 


GAGGAAATCA 


Alt- UALnAftn 


t\\— t\r\\jf\\j\-t\ i 


AGTCAACATC 


3480 


GTGAAGGTGG 


AACTCCAAGA 


AACGATGGTG 


CTGTTGCCTT 


GGCACGTTCG 


CAAGGACGCT 


3540 


ATACTACAGA 


TGATGGTTAT 


ATCTTTAATG 


CTTCTGATAT 


CATAGAGGAT 


ACTGGTGATG 


3600 


CTTATATCGT 


TCCTCATGGA 


GATCATTACC 


ATTACATTCC 


TAAGAATGAG 


TTATCAGCTA 


3660 


GCGAGTTGGC 


TGCTGCAGAA 


GCCTTCCTAT 


CTGGTCGAGG 


AAATCTGT C A 


AATTCAAGAA 


3720 


CCTATCGCCG 


ACAAAATAGC 


GATAACACTT 


CAAGAACAAA 


CTGGGT AC CT 


TCTGTAAGCA 


3780 


ATCCAGGAAC 


TACAAATACT 


AACACAAGCA 


ACAACAGCAA 


CACTAACAGT 


CAAGCAAGTC 


3840 


AAAGTAATGA 


CATTGATAGT 


CTCTTGAAAC 


AGCTCTACAA 


ACTGCCTTTG 


AGTCAACGAC 


3 900 
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ATGTACAATC TGATGGCCTT GTCTTTGATC C AG C AC AAAT CACAAGTCGA ACAGCTAGAG 3 9 60 

GTGTTGCAGT GCCACACGGA GAT C ATT AC C ACTTCATCCC TTACTCTCAA ATGTCTGAAT 4020 

TGG AAGAACG AATCGCTCGT ATTATTCCCC TTCGTTATCG TTCAAACCAT TGGGTACCAG 4 080 

ATTCAAGGCC AGAACAACCA AGTCCACAAC CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 4140 

CTGC AC C AAA TCTTAAAATA GACTCAAATT CTTCTTTCCT TAGTCAGCTG GTACGAAAAG 4 2 00 

TTGGGGAAGG ATATGTATTC GAAGAAAAGG GCATCTCTCG TT ATGT CTTT GCGAAAGATT 4 260 

TACCATCTGA AACTGTTAAA AATCTTGAAA GCAAGTTATC AAAACAAGAG AGTGTTTCAC 4 320 

ACACTTTAAC TGCTAAAAAA GAAAATGTTG CTCCTCGTGA CCAAGAATTT TATGATAAAG 4 3 80 

CATATAATCT GTTAACTGAG GCTCATAAAG CCTTGTTTG A AAATAAGGGT CGTAATTCTG 44 40 

ATTTCCAACC CTTAGACAAA TT ATT AG AAC GCTTGAATGA TGAATCGACT AATAAAGAAA 4 500 

AATTGGTAGA TGATTTATTG GCATTCCTAG CACCAATTAC CCATCCAGAG CGACTTGGCA 4 560 

AACCAAATTC TCAAATTGAG TATACTGAAG ACGAAGTTCG TATTGCTCAA TTAGCTGATA 4620 

AGTATACAAC GTCAGATGGT TACATTTTTG ATGAACATGA TATAATCACT GATGAAGGAG 4 680 

ATGCATATGT AACGCCTCAT ATGGGCCATA GTCACTGGAT TGGAAAAGAT AGCCTTTCTG 47 40 

ATAAGGAAAA AGTTGCAGCT CAAGCCTATA CTAAAGAAAA AGGTATCCTA CCTCCATCTC 4800 

CAGACGCAGA TGTTAAAGCA AATCCAACTG GAGATAGTGC AGCAGCTATT TACAATCGTG 4860 

TGAAAGGGGA AAAACGAATT CCACTCGTTC GACTTCCATA T AT GGTTG AG C AT AC AGTTG 4 92 0 

AGGTTAAAAA CGGTAATTTG ATTATTCCTC ATAAGGATCA TTAC CAT AAT ATTAAATTTG 4 9 80 

CTTGGTTTGA T G AT C AC AC A TACAAAGCTC CAAATGCCTA TACCTTGGAA GATTTGTTTG 5040 

CGACGATTAA GTACTACGTA GAACACCCTG ACGAACGTCC AC ATT CT AAT GATGGATGCG 5100 

GCAATGCCAG TGAGCATGTG TTAGGCAAGA AAGACCACAG TGAAGATCCA AATAAGAACT 5160 
TCAAAGCGGA TGAAGAGCCA GT AGAGG AAA CACCTGCTGA GCCAGAAGTC CCTCAAGTAG 5220 
AGACTGAAAA AGTAGAAGCC CAACTCAAAG AAGCAGAAGT TTT GCTTG CG AAAGTAACGG 5280 
ATTCTAGTCT GAAAGCCAAT GCAACAGAAA CTCTAGCTGG TTT AC G AAAT AATTTGACTC 534 0 

TTCAAATTAT GGATAACAAT AGTATCATGG CAGAAGCAGA AAAATTACTT GCGTTGTTAA 5400 
AAGGAAGTAA TCCTTCATCT GTAAGTAAGG AAAAAATAAA CTAATGAAAA ATGAAAGTCT 54 60 

CG AT AAAG AG GCTTTCATTT TT ATT ATGT A TAT ATGT AAA ATTCTTGACA AGCAATATTA 5520 
AAAAGAGTAA ACTATTAACT AGTTAATTAA CCGGTTTATT ACTTTATAGT GAATCAAATA 55 80 

TACTTAAGAA AAGAGGAAAG AATGAAAATT AATAAAAAAT ATCTAGCAGG TTCAGTGGCA 5640 
GTCCTTGCCC TAAGTGTTTG TTCCTATGAA CTTGGTCGTC ACCAAGCTGC TCAGGTTAAG 5 7 00 
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AAAGAGTCTA ATCCAGTTkC TTATATAGAT GGTGATCAGG CTGGTCAAAA GGCAGAAAAC 57 60 

TT G AC AC C AG ATGAACTCAG TAACAGGGAG GGGATCAACG CCGAACAAAT CGTCATCAAG 5820 

ATTACGCATC AAGGTTATGT G AC CTCTC AT GGAGACCATT ATCATTACTA TAATGGCAAG 5880 

GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA AAGATCCGAA TTATCAGTTG 594 0 

AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG TT AT CAAGGT AGATGGAAAA 6000 

TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATA TTCGGACAAA AGAAGAGATT 6060 

AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 6120 

GTTGCAGCCA GAGCCCAAGG ACG CT AT AC A ACGGATGATG GTTATATCTT CAATGCATCT 6180 

GATATCATTG AGG AC ACGGG TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC 624 0 

ATTCCTAAGA ATGAGTTATC AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG 63 00 

AAGCAGGGAT CTCGTCCTTC TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA 6 3 60 

TTGTCAGAGA ACCACAATCT GACTGTCACT CCAACTTATC AT C AAAATC A AGGGGAAAAC 64 20 

ATTTCAAGCC TTTTACGTGA ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT 64 80 

GATGGCCTTA TTTTCGACCC AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC 6 540 

CCTCATGGTA ACCATTACCA CTTTATC CCT TATGAACAAA TGTCTGAATT GGAAAAACGA 6600 

ATTGCTCGTA TTATTCCCCT TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA 6 660 

GAACAACCAA GTCCACAATC GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGC AC CAAAT 6720 

CCTCAACCAG CTCCAAGCAA TCCAATTGAT GAGAAATTGG TCAAAGAACC TGTTCGAAAA 67 80 

GTAGGCGATG GTTATCTCTT TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT 6 84 0 

CTTTCAGCAG AAACAGCAGC AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT 6900 

CATAAGCTAG GAGCTAAGAA AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG 6960 

GCTTATGACT TACTAGCAAG AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT 7020 

GATTTTGAGG CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TG AT AAAGTC 7080 

AAGTTAGTGG ATGATATTCT TGCCTTCTTA GCTCCGATTC GTCATCCACA ACGTTTAGGA 714 0 

AAACCAAATG CGCAAATTAC CT AC ACT GAT GATGAGATTC AAGTAGCCAA GTTGG CAGGC 7 200 

AAGTACACAA CAGAAGACGG TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG 7 2 60 

GATGCCTATG TAACTCCACA TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT 7 3 20 

GAAGCTGAGA GAGCGGCAGC CCACGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 7 3 80 

ACAGACCATC AGGATTCAGG AAATACTGAG GCAAAAGCAC CAGAAGCTAT CTACAACCGC 7 4 40 
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GTGAAAGCAG CTAACAAGGT GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7500 

GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC AT T ACC AT AA CAT C AAATTT 7 560 

GAGTGGTTTG AC G AAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7 620 

GCGACTGTCA AGTACTATGT CGAACATCCA AACCAACGTC CGCATTCAGA TAATGGTTTT 768 0 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 77 4 0 

GATAAGGAAC ATGATGAAGT AAGTGAGC C A ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800 

CACGCTGGTT TAAATCCTTC AG C AG AT AAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 78 60 

ACAGAGGAAG AAGCT G AAG A TACCACAGAT GAGGCTGAAA TTCCTCAAGT AG AG AATT CT 79 2 0 

GTTATTAACG CTAAGATAGC AGATGCGGAG CCCTTCCTAG AAAAAGTAAC AG ATCCT AGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGG CTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTT ATGG AG A AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA G AAAAACG T A ACAGA 8195 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

AT AGGT AT GG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 12 0 

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT TCACCTACAT 180 

AGAAGGTTAC GATTTCACTG T CTTCATC C A ACATATGTTT CAAGGTTTCA GTCAATGT rT 240 

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC C AT ACCT AAA TT AT CGTTTT 300 

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTCTGACG CTTCCGCTAA 3 60 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 4 20 

CAAAGGCAAG AACACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 4 80 

GTTGCTCCAA AACTTCTGCC G C AG ATTG AG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 54 0 

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACCCC TTGAGAACAG AAGATATCTG 660 
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CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720 

TAACTTG AGT AGCTTCTTTC TCAACCTGTG C TTCGTGTTG GTTACGCATA TTGTCAACTT 78 0 

TTACCTTGAC CAAGCTACCA TATTTGAGAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 84 0 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CATGATTTCA GTACAGTAAC CAAACCTGAT CTCCTCAGTC GCTACGTGAC 1020 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGCCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 1200 

CTGGTGTTTT AGCTAGAGCT GTTTT AG C AC CTTCCAAGGC TGCGCGCATG ACT T C AAC AG 1260 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320 

AAATCGTTCC TTCAACAGGT TT CATC ACT G CCTTATAGGC AACTTCCACA CCTGATTGGA 13 80 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAG ACGT A ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCT GG CTTGTCTGCA ACTTCTTTAG 1560 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 162 0 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTG AT ATT 174 0 

TTGAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860 

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 192 0 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2 004 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i). SEQUENCE CHARACTERISTICS: 

<A> LENGTH : 11915 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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C CGGGTTGGG CTGTTCGCCC ATTAAAGCGG CACCACAGCT GGGTTCAGAA CGTCGTGAGA 60 

CAGTTCGGTC CCTATCCGTC GCGGGCGTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 120 

GAGGACCAGA GTGGACTTAC CGCTGGTGTA CCAGTTGTCT TGCCAAAGGC ATCGCTGGGT 180 

AGCTATGTAC GGAAGG G AT A AACGCTGAAA GCATCTAAGT GTGAAACCCA CCTCAAGATG 2 40 

AGATTTCCCA TG ATT AT ATA TCAGTAAGAG CCCTGAGAGA TG ATCAGGT A GATAGGTTAG 300 

AAGTGGAAGT GTGGCGACAC ATGTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 3 60 

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTG AATAGATATT CAATTTTGAG 4 20 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGGAGAT ACACCTGTAC CCATGCCGAA 4 80 

CACAGAAGTT AAGCCCTAGA ACGCCGGAAG TAGTTGGGGG TTGCCCCCTG TGAGATAGGG 540 

AAG TCGCTT A GCTCTAGGGA GTTTAGCTCA GCTGGGAGAG CATCTGCCTT AC AAGC AG AG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTT AGCGG GT GTAGTTTAGT GGTAAAACTA 660 

CAGCCTTCCA AGCTGTTGTC GCGAGTTCGA TTCTCGTCAC CCGCTTTGAA CTTTGTTCTT 720 

TGTACCAAGT TTTTGACTTG GGCGCGTAGC TCAGGTGGTT AGAGCGCACG CCTGATAAGC 7 80 

GTGAGGTCGG TGGTTCGAGT CCACTCGTGC CC AT AGTGTT TAGTCCATTA CTAGCGGATT 84 0 

GGAATATTAT CTGTTCACTA AGAGGACACG GGCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAC TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

GTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 102 0 

CGTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGGT 1080 

AGCTCAGTTG GTAGAGCAAT GGATTGAAGC TCCATGTGTC GGCCGTTCGA TTCCGTCTCG 1140 

CGCCATTTAT ATATTTTGGA AGGGTAGCGA AGAGGCTAAA CGCGGCGGAC TGTAAATCCG 1200 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACGGGCAT AGTTTAAAGG 12 60 

T AG AACT AAG GTCTCCAAAA CCTTCAGTGT GGGTTCAATT CCTACTGCCC GTGTTAATAG 13 20 

AATTATGGCG GGTGTGGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 1380 

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGT AT AG CCAACCGGTA AGGCAAGGGA 14 40 

CTTTGACTCC CTCATGCGTT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

GCGG AATTGG C AG ACGCGCT GGACTCAAAA TCCAGTGTCC GCAAGGACGT GCCGGTTCGA 1560 

CCCCGGCCGC CCGTATACTA TAGTGTTAGG AACGTTGTTA TTCTTCGTTC CTTTTTTATA 1620 

TTATTTTTGG TATAATTATA GTT ATT C AAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 1680 

TATGTCTTCT TCTATCGATT TATTAAAACA TCGGTATTTG AAAAATATTA AAG AAAATC C 17 40 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTT AG AAG GGGATGCTAC 1800 
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AGATGTTGAA 


GTTATGAAGG 


ATCTATTTCA 


TTATTTAGTT 


TCTACTTTGG 


ATCTCAPPHT 


18 60 


AGCAAAGGTA 


GATGATTTTG 


GCAATCTGAT 


CCAGTTAGTA 


GATCCGATAA 


GTC AGG ATG C 


1920 


TATTTTATTT 


GAAGTTTCCT 


ATACAACGAT 


TGAGTTTGCA 


TTTGGTAAGG 


CTCAAAfHAT 


i qpn 


TCAAGAGGTC 


GAAAATCGTT 


TCAATAATTA 


TATGAATGTA 


ATTCAGAGAA 


AGTTAfirTr.A 




ATCAAATCAT 


G CT ATTGTTG 


GCTGTGGTAT 


C C ATCC CAA C 


TGGGATAAAA 


ATGAfJAATTP 


"> 1 Art 


TCCAGTGGCT 


TATCCACGCT 


AT CAG ATGTT 


G ATGG ATT AT 


TTGAATTTO A 


v> * AUAAA 1A1 




TATTAAATCA 


GATTTACATC 


ATTTCCCTGA 


ATATGGTACT 


TTTATPTdTd 
i i i nik. iu 1 u 






T C AGCTGG AT 


ATTTCAAAAA 


CCAACTACTT 


ACGGGTGATT 


ft/\ 1 1 1 lift 




O *5 Q f\ 

a\ £ a v 


AGCGGCTAAG 


G CTT ATTT AT 


TTGCAAACTC 


TGAATTTTCC 




fyjrnm A^f A a 




AATTTCAAGG 


GATATTTTCT 


GGGAAGAATC 


i n i vjv_n i uu i 




nunn i Vj I i 


J4UU 


GGTCAATGCT 


AGACTCCTTA 


ATG AT G AAAC 


4 VJA 4 A 1 1 4 4 1 




A TP A TTPTW 




GATTTTTACT 


GCGGAACGTG 


ATGGGCAGAC 


w 4 / • * a n 4 I 4 1 




irnrrfwv A 


£ D 41 U 


CTATTTGGCT 


ACG TCCG AAA 


TCCAAGCATT 


TGCTCTGAAT 


GGGG ATH AHC^ 


TTATTl TTT1 
1 ini inl 1 Ln 




CCCCCAAGAG 


AAGGATTTTG 


AAACTCATCG 


T AGTT AC CAG 


TACCAAflATT 


TAArciArTrr 




AGGAACAGTT 


GAGTTTCGTA 


GTGTGTGTAC 


ACAGCCACTT 


A T A flfT A PTT 


1 4 V>V. * 1 l_ i^jL. 




AGCTTTTCAC 


TTGGGATTAT 


TGGTTAATTT 


AGACAAGTTA 


GAAGCTTACT 


TAItAAAPAPP 


0 7 fin 


ACCTTTCTTT 


AAAG T ATTTG 


A 1 A 4 4 «V 


wnnw i v. i i i n 




TTTPT A if"! A & 
1 1 1L1 /VMVj An 


4Q^U 


AAATCTTACA 


GATGAGGAAG 


AAACTACGAT 


TATTGAATTT 


TCCAAAHAPT 


t ArTrrr a pt 


2 880 


AGCTGAGGAG 


GGACTAGTGG 


TGAGAAATAA 


GGAAGAAATG 


ACCTATTTAC 


AGC CTTTflAf: 


2 94 0 


AGAAGAATTG 


AGCCTATAAT 


TTCTCTTATA 


AAGGGAGAAT 


TTTCTGAAAA 


AT CATC AT A m 




AATGGACGAG 


ACTATAGATA 


AAGGATAGAG 


AGTAATGACA 


A^IT'PT' A TP 
1 i nvj 1 1 1 r\ 1 w 


AATTAArnrr 


JvOU 


TGATGCCAAC 


AATACAGTAA 


CTGCCAGCCA 


AGCAATTTTG 


CAAGGTTTGG 


CGACGGACGG 


3120 


CGGTTTGTTT 


ACACCGGATA 


CTTATCCAAA 


GGTAGATTTG 


AACTTTGACA 


AATTGAAAGA 


3180 


TGCTTCTTAC 


CAGGAAGTTG 


CTAAGCTAGT 


TTTGTCACCA 


TTTTTAGATG 


ACTTTACAGT 


3240 


TGAGGAGTTG 


GACTACTGTA 


TCAACAATGC 


CT AC GAT AG C 


AAATTTGATA 


CTCCAGCTAT 


3300 


TCCACCATTA 


GTGAAATTAG 


ATGGGCAATA 


CAATTTGGAA 


CTTTTCCATG 


GTTCAACGAT 


3360 


TGCCTTTAAG 


GATATGGCCT 


TGTCT ATTTT 


GCCATACTTT 


ATGACGACTG 


CTCCTAAGAA 


3420 


AC ATG GTTTG 


GAGAACAAGA 


TTGTTATCTT 


GACAGCGACA 


TCTGGTGACA 


CGGGGAAAGC 


3480 


TGCTATCGCG 


GGGTTTGCGA 


ATGTGCCTGG 


TACTGAGATT 


ATCGTCTTTT 


ATCCAAAGGA 


3540 
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TGGTGTCAGC AAGATTCAAG ACTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 3600 

TATTGCTATT GATGGTAACT TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 3 660 

CGTGGCTCTT CGTGAAAAAT TGACTACCAA CAACTTGCAA TTTTCATCAG CTAACTCTAT 3720 

GAACATTGGT CGTCTGGTGC CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 3780 

TAAGACTGGT GAAATTGTAG CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 3 840 

TGGAAATATC TTGGCTGCCT TTTATGCCAA ACAAATTGGT TTGCCAGTTG GTAAATTAAT 3 900 

CTGTG CTTC A AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 3 960 

AAAACGTGAG TTTAAGGTAA CAACCAGCCC ATCTATGGAT ATCTTGGTAT CTTCAAACTT 4 020 

GGAGCGCTTG ATTTTCCATC TTTTGGGAAA TAATGCTGAA AAGACAACTG AACTTATGAA 4 080 

TGCCTTGAAC ACGCAAGGAC AATATAAGTT GACAGACTTT GATGCAGAGA TTTTGGACCT 4140 

CTTTGCAGCT GAATATGCGA CTGAGGAAGA AACGGCAGCA GAGATCAAGC GTGTTTGTGA 4 200 

GTTAGATTCT TATATCGAGG ACCCTCATAC AGCTGTTGCT TCAGCAGTTT ATAAAAAATA 42 60 

CCAATCGGCC ACTGGAGATG TAACTAAGAC AGTGATTGCT TCAACAGCTA GTCCATACAA 4 320 

GTTCCCAGTA GTTGCAGTAG AAGCTGTAAC TGCAAAAGCA GGTTTAACAG ACTTTCAACC 4 3 80 

CTTGGCTCAA TTACATGAAA TCTCAGGCGT TGCAGTGCCA CCAGCAGTTG ATGGGCTTGA 4 4 40 

AATAGCTCCA ATTCGTCACA AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 4 500 

GGCTTATTTA GGACTTTAAG ACAGAGGGAG CAAACTCGGT TGGGAAACCA ACTGAGTTTC 4 560 

TTTTCATCAG GAGGAGAGAT TGTTTAAGAA AAATAAAGAC ATTCTTAATA TTGCATTGCC 4 620 

AGCTATGGGT GAAAACTTTT TGCAGATGCT AATGGGAATC GTGGACAGTT ATTTGGTTGC 4 6 80 

TCATTTAGGA TTGATAGCTA TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 4 740 

TCAGGCGATT TTCATCGCTC TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 4 800 

GCAGAAAGAC CAGTCGAAGT TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTACT 4 8 60 

ATTAACTTTC CTTTTAGGAT TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TAGGACTTYT 4 920 

GGGGACGGAG AGGGATGTAG CTGAGAGTGG TGGACTGTAT CTATCTTTGG TAGGCGGATC 4 9 80 

GATTGTTCTC TTAGGTTTAA TGACTAGTCT AGGAG CCTTG ATTCGTCCAA CGCATAATCC S040 

ACGTCTGCCT CTCTATGTTA GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 5 LOO 

AGCTATTTTT GTTCTGGATA TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 5160 

TTTGGTTGCT CTTGTGATTT TGTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 52 20 

TGGTTTAGAT AAGGAACTGT TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 5280 

GAGGGCTGGA GATGTAGTGA TCATTGCCTT GCTCGTTTCT TTTGGGACGG AGGCAGTTGC 5 340 
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TGGCAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCGC 54 00 

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG GATGATTGGA AAAGAGTTGC 54 60 

TAGTTTGAGT AAAC AAACC T TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 5520 

TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAG CGGTGG A 5580 

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTCGG AC C CCTATGACGA CAGGAACAGT 564 0 

CATCTATACG GC AG TCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 5700 

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 5760 

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5 820 

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5 880 

TAT TTGGG AT TT AG AC GGG A CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA G AT AGAAAT C TGGATGTTGA 6060 

GGTGCTAAAT CAGGTGCCTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTGAT 6120 

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACTCATAAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGGTGG AATCCTATTT 6 24 0 

TACAGAGATT TTAACCAGTC AGAGTGGCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 6 300 

CTATCTGCTA GATAAGTATC AGTTGAATTC TCATAATACT TATTATATAG GGGATCGGAC 6 3 60 

TCTGGATGTG GAATTTGCCC AGAATAGTGG GATTCAAAGC AT C AACTTTT T AC AGTCT AC 64 20 

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGCAGATATT TCCCGTATTT TTGAGACTAA 6480 

GTGATAAAAA GATTGTGTCA GTTTTGTGAC AGAGACCTAA CAAACT AT TT CAAGTAACCT 6 54 0 

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600 

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGGAA TTTGAAAGAA TCAAGAAAAG 6660 

AATGTT ATT A GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTC AAGC AG A 672 0 

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AACGATTTGA CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG CAGAAGCCTT 68 4 0 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900 

CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGT TG AAA T 6960 

CCAAACACCT CAAGC AG A CT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 702 0 

TAATCAAGTG ACCGTTGATG ATCAAACTGT TCAGGTTGCA G ACCTTT CTC AACCAATTGC 7080 
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AGAAGTTACA AAGACAGTGA TTGCTT CTG A AGAAGTGGCA CCATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7 260 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7 320 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7 3 80 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCAGGTCTTC AACCACAAAC 744 0 

AGCTGC CTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCCTGGCAT 7 620 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7 680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 77 40 

TCACGTTT C A ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7 800 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860 

CTTCAAACCA CGT C AGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920 

ACCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7 980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC CCAATTGTTT GAAAAAAGCC 8160 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 822 0 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGCAAG 8 340 

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 8 5 20 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8 820 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA AT ACACTTG A 8 880 
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TCTTTC C C AG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC CTTTGATTAA 10680 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AG AAATC AT C ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTC AGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 10860 

TGGAGATGAA CTCATTGCAC GCGCTGCTGG TCTCCACAAG TTCATGAATT GGG ACCAG CC 10920 

TATCTTGACA G AT AGTGGT G GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACCT TAAG AAATC G ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCACCATGGA 11280 

TTTCT C AGGC TACTCTATCC GTGGTTTGGC AGT GGG AG AA ACCCATGAAG AGATGAATGC 113 40 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTCT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCCTGGG GTGGATATGT TTCACTGTCT 11460 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520 

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTCCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACT AGCT AC C ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAACTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTCG AAAAATATGG 117 60 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTG AT AG AATAAAGTGT ACAATGAAAG GAAGAATAAA ' 11880 

CTCGT ATGCG CATTAAATGG TTTTCCTCCA TTAGG 11915 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTG C AG AT AT TCAAGAATCG 
GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 



60 
120 
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GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACT CT ACCTT GCTTGCTATC AAGGAGACCT 180 
TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 24 0 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 
CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 3 60 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 4 20 

CTGACTGAAA ATAAATTGAG TTC AT AG ATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 4 80 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 540 
GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 
TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 
TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AG AAAGAGGA 7 20 

TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 7 80 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 84 0 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900 
TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960 

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATGA CCGTG CT TTG GCCTTGCAAG ACCAAATTAT 1080 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 12 00 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGCTC TGAAACAAAT 12 60 

CGAGGGTATC AAACATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 13 20 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 13 80 

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT C AAAAC ACT G 14 4 0 

TTTTGAAGTT AT GG AT AG AA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGCTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGG AT AG AA CTGACGAAGT 15 60 

CAGCCCAAAA CACTGTTTTG AG G TTGTGG A TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620 

CGGCAAGGCC ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT . 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740 

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGG C A AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 18 60 
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ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160 

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2 220 - 

AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 234 0 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AAC AAAT ACT TCATAACCGC AGGGACACCA 2 4 00 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 24 60 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TCCGGCATGG 2520 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GT AG AAC CAC CGAGAGCCAT CGTTACAGTG 2580 

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG AT AT CTG ATG GTTTGAGACC AAGTTCCAAC 2640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG AT ATCTTCTT TCTTATCAGC TGATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2 820 

TTCACGTCCT CAGCTGTCAT GTC AC CGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA 2 880 

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TACTTCCACC ATAGGCGAAA 2 940 

AT AG CTG GG A TATCCATATT AGCAATAGCA ATC AT AGATC CAGGCATGTT CTTGTCACAG 3000 

CCACCGATAG CGACGAAGGC AT C CACGTTG TG ACC ACT C A TAGCCGCCTC GATGGAGTCC 3 060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CG CAT ACC AG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATG C AAGTG A ATGTTACATG GTCTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCACGAAGC 3 300 

ATAGCACGGT TAGGTGATTT AAC C ATG CTG TCATAAATGC TACTGCGGTG AC GTTT ATCT 3360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTC T AT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TT AG TGC ACT TT CTG ATG AC CAGAATATGC TTTTTAATCC 354 0 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3 600 

CATCTATCTT GC ATT AT AAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660 
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ATACTCATTA AGAAGATAAT CCA TTTT CCC TACTTCACCG AATCTTTCTT GAACACCCAT 3 7 20 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 37 80 

ACCTATCTGA TTGTGGTTTT CT AC ACT AAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3 340 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3 900 

AGACAGT TCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACCATTAC 39 60 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4 020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT ACTCCTTTTA AGTCTAATGT 4 080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 414 0 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4 2 60 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT 4 320 

AAACCCCTGA AT AG AC AAGC CTGCTCCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4 3 80 

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4 44 0 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4 500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4 560 

AATCTT TCT A CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4 620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4 680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4 740 

AT AT CTG AAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4 800 

TC AC CTGG AT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4 8 60 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4 920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4 980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 504 0 

ATATCTATGC CTCGCGTT AG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCCTGC TGCAAATATT 5220 

T CTGG AGTC A TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 52 80 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 53 40 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 54 60 

AGACCAATCA CAATGCCTCC TTGTG AG CCA AATTG ATT T A ACATTCCTAA AATAATTCCT 5 520 

GAT AG AC C AA AATCTGCATC TGAGAAAGTT GATC CTTGG A AACCAAGTCC TCCCAAAACT 5580 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAAT ACCTT GTAAAAATGC TCCAATAGTG 5 64 0 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 57 60 

ACTAAACCAC CAACAAAACT AG AG AT AAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGC AGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5 8 80 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC C TG CT AAAAT AAC AAAT A CC 5 940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA G AC C ACTTG T ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTC AT TTCCACTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 62 60 

AT AT AT AG AG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGAGCGATT 64 20 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG T AGCTG AG CC ATATGTTCTT 64 80 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 654 0 

TCAAACATCG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGC AATT AAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 67 2 0 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAiIT 6780 

TT ACT AATT A ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC T AGCTG AATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 69 60 

AAT AC AG ATT CAATATTCAT CTGTACCATA AAACTTGAAC CT AATCCTG A ACCACAAGCT 7 020 

GTACCAATTT TT AAC AT T AT CTAATCCTCC TGTTT AATT A TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA C AATCTT ACC 7200 
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TCTTCTTCTG GATTATCCAA TAAATAAATC GGTTCTTCCA AAACTAACAT TGACATTCCT 7 2 60 

ATTTCATTCA CACCTTCATC TGGCCGAGCG TGAGCAATTG CTACTCCCTT CCCTAAATTA 7 320 

AT AAAAGGT C CAAACTCTTC TACTTTTTGA ATCATTGCCT CAGGGTAGTT CTCAGTTATC 7 3 80 

TTATCTTGAT CCAAAAGCGG TTTAGCTGCT AAACGAATCG CCTCCTTCCA TCCTAATTTT 7 44 0 

TGCGAACTAA CCTGATAGGT TTCTTTGGTA ATAAGTTGTT CTAGCACTGG TACAATTTCC 7 500 

TTTCTATCAT TTTTTTGGTA AAGATAATTC TTTAACGCCA ATCTTAATTC CAATTCTTGT 7 560 

GTAATAATTC CATATCTTTT GACAATATTC AGGATTTCTT CAATCTCAAA ATCTCCATAC 7 62 0 

TCTAAATTCG G AAAAT CTTT TAACACTAGT TCTACTAGTT GTATTGCTTG CTCTTCAGTC 7 6 80 

ATCATAACCG AAACTAGATA ATTTGGCTTT TCTGTCTCCA CCTTTATGGT AGAAAAAACC 77 4 0 

ATATCATAGT CACTACTAGC TTTCACCTGT AAATCATCAA TCTTTGAGGT TCCTATAAAC 7 800 

TCAATTTGAG GAAATAATGC TAATAGATTC TCTTTTAACA TCAATGAAGA ACT AAC AC C A 7 860 

TTAGGACAAA TGATTGCTGC TTTATACCAT TTTTGAGGCA AAGTATCTGC TTT CTTT AAA 7 9 20 

TAACCTCCGA AATGGATAAC AAAATATGCT GTTTCACTAT CAGGTATGGG ATTCTCAATA 7 9 80 

GCGTCCATCA AGGGCATCAA AGAATCTTTG ACTAATTCAA ATAAATCAGG ATAATGTTCT 8040 

TTAACATGCA ATACATATTC ATTTGAACTA GGTAGGCCGA ACTTTAATCT ATAGTAAGCC 8100 

GGTATAAGGT GGCGGCGAAG ATTTTCTCTC AATCCTTCCC TTTGTTTAAA ATGTAACAAA 8160 

GAAATATCTT CCATTCTACT TATAATAGCC TCTGTTAATT GATTAAAGTA AACCGGAGCA 82 20 

ACATCTACTT C AC CTTC AAA GCAACTTGAT AATAAAACGG TGATATAGCG ATAATCATCC 8 2 80 

TCAGAAAACA CCGTATCTAT AATTCCCAAA TCAACCACTG TATCCAATAA AATAGTGGTT 8 34 0 

ATATCTTGAA TAACAGGAGA TACTAATGTC TCTGAAAGAC AT ACT CTTT C AACATC CCTT 84 00 

TGATACCTAC ACAGAATGAA TACTAAACCG AAAAGGT AAA CTTTTAATTG ATTAACAATA 84 60 

GGT ACT AG CT GTAGCTTCTC ATAATAATCT TTAACTACCT GATCAATCAA ATCATAAGTT 85 20 

AATGAATACC CCCAACTGGA TAAAACATAA TCCAAACCCC AAAT CCCT AT GGAGGATTCC 8580 

AGCAACTCAC TAACCATTTG AAAAGCTAAG CGCTCCTTAT TCCACTCTGA ACCGTGTAAA 8 64 0 

GTATAACCTT TTGCTCTACT GTACCCTAGC TCC AAAT CAT TATCTAACAT AATCTTTCTT 87 00 

AATGATTGAA TAT C AG AT AA GGTTGTATTC TTACTTACTT TCAAAAAGTC TTGGTAATGA 87 60 

CT ATTCG AT A TAAAATCTAA TCGGCAAAAA GTGTAAAGAT AGATTAAAGC TAAGCGAGTC 88 20 

GACTTTGGTA AAACCAATTC ATCCGACTTA ATAATATCTG TCAAAGACTG CTTCGTACGA 8880 

TTTG AT AAA C TATAGCGACC TTGCTTTTTA TCCAGCACTA TCCCTTTATT AGCTAGATAA 8 94 0 
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GGCACTAAAT AATCT ATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 

AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9060 

TAATCAAAC 9069 



(2) INFORMATION FOR SEQ ID NO: 98: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs 

( B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 

TTCTTCCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240 

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTCAGCCTCC 300 

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCACTCGTAG GACCTGCGAC 3 60 

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

A C CAAGTGCG AC TC AAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480 

CCAAGATAGC TTCCAAGCAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 54 0 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATCCT AAAGGGATTG CAAAATCTTT 600 

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660 

CTTCCAAGCA GCCCTTACAA AAATGAAAGG GAAAGACTTT GATCCTATCG TTGTTCCTGG 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCGT GGCATGGGAA TTGACAAACC 780 

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGG TGCT A AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATG C A GACCACAACA CAGTCAAAAC TGCTTACATG ATGACCATGA ACAATGGTAA 114 0 

AGTTGAAGCA GCAGAAGTTG TAAAACCATA AT AG AAAAAT GTTGAAATAG GGAATGAGCC 1200 

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 12 60 
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ACGTCGCCTT GGATTATATA TGTG ACTG AC TTCGTCAGTC TTATCTACAA CCTCAAACCA 1320 

GTGCTTTGAG CAACCTGCGG CTAGTTTCCT ACTTTGCTCT TTGATTTTCA TTGAGTATAA 13 80 

GAACCTATCA AAAAGTGAGC GAAAACCCTC GGAATTATAA ATAGAAAGAG TGAATCTTAT 14 4 0 

GCTCCAACAA CTCGTAAATG GTTTGATTCT AGGTAGTGTT TACGCGCTGT TAGCCCTAGG 1500 

ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC GCCCATGGTG ATATTTATAT 1560 

GATGGGAGCC TTTATCCCTT ATTTCTTGAT CAATTCTTTC CAAATGAATT TCTTTGTAGC 162 0 

CCTTATTGTA GCTATGCTAG CGACAGCTAT TCTTGGTGTC GTCATTCAGT TTCTTGCTTA 1680 

CCGACCTTTG CCCCACTCTA CTCGTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 17 4 0 

CCTATTGGAG TATGGAATGG TCTATCTGGT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 1800 

GATTCAAACA GTTCCATATG ATTTGGG AC C AATTACCTTA AC AAATGTG C AGTT AATG AT i8 60 

TTTGGCCATT TCCTTGATTT TGATCATTTT GTTACAAGTC ATTGTCCAAA AGACTAAGAT 19 20 

GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACGCG GCCCAATTGA TGGGGATCAA 19 80 

TGTAAACCGT ACCATTACCT TTACCTTCGC TTTGCCTTCT CCTCTTCCGC GT GCGGCTGG 204 0 

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGCCGTTA CTCCAGGTCT 2100 

TAAATCTTTC GTTG CCGCAG TACTTGGTGG TATCGGAATT ATTCCTGGTG CGG CTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT CAGATTTCCG 2 22 0 

TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTGATT GTCCGCCCAC CTGGTATCCT 2 2 80 

TGGTAAGAAT GTGAAAGAGA AGGTGTAAAC GATGAAGGAA AATTTAAAAG TTAATATTCT 2 3 40 

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTGCTTT CAGTCGGAGT 2400 

ACTTAATCTA TTCTATCTAC AGATTTTACA ACAAATTGGA ATTAATATTA TTTTGGCTGT 2 4 60 

TGGTCTCAAC TTAATCGTTG GTTTTTCAGG AC AATTTTC A CTTGGTCATC CTGGTTTCAT 25 2 0 

GGCGATTGGT GCCTATGCAG CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2 5 80 

CTTTGGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA GTTCCCTTAC TTGTCGGCAT 2 640 

TCCAACCTTG CGCTTGAAGG GGGACTATCT TGCGGTAGCA ACTCTGGCTG TTTCTGAAAT 2 7 00 

TATCCGTATC TTTATCATCA ATGGTGGAAG CCTTACAAAT GGTGCGGCAG GTATCTTAGG 2 7 60 

CATTCCTAAC TTTACAACTT GGCAAATGGT TTACTTCTTT CTCGTGATTA CAACCATTGC 2 82 0 

AACCTTGAAC TTCTTGCGTA GCCCAATTGG TCGTTCAACC CTCTCTCTTC GTGAAGATGA 2 880 

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAATCATCG CTTTTGT CTT 294 0 

TGGTGCCATT ACTGC AAGT A TTGCTGGGTC ACTTCAGCCA GGATTTATCG GGTCTGTTGT 3000 
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ACCGAAAGAT 


TACACCTTCA 


TCAACTCAAT 


CAACGTTTTG 


AT T ATTGTTG 


TATTTGGTCG 


3060 


ACTCGGTTCC 


ATTACAGGTG 


CGATTGTTTC 


GGCTATTGTT 


CTGGGAATTT 


TGAATATGCT 


3120 


TCTCCAAGAT 


GTTGCTAGTG 


TGCGTATGAT 


TATTTACGCT 


TTGGCCTTGG 


TATTGGTAAT 


3180 


GATTTTCAGA 


CCAGGTGGAC 


TCCTTGGAAC 


ATGGGAACTG 


AGCCTATCAC 


GTTTCTTTAA 


3240 


AAAATCTAAG 


AAGGAGGAAC 


AAAACTAATG 


GCATTACTTG 


AAGTAAAACA 


GTTAACCAAA 


3300 


CATTTTGGTG 


GTCTAACAGC 


TGTTGGAGAT 


GTGACTCTTG 


AATTGAACGA 


AGGGGAACTG 


3360 


GTTGGATTAA 


TCGGTCCAAA 


CGGAGCTGGG 


AAAACCACCC 


TTTTCAACCT 


TTTGACCGGT 


3420 


GTTTATGAAC 


CAAGCGAGGG 


AACAGTAACC 


CTAGATGGTC 


ACCTTTTGAA 


TGGGAAATCA 


3480 


CCTTATAAGA 


TTGCCTCTTT 


GGGACTTGGA 


CGTACTTTCC 


AAAATATCCG 


TCTCTTTAAA 


3540 


GATTTAACAG 


TTTTAGATAA 


TGTTTTGATT 


GCTTTTGGAA 


ACCATCACAA 


ACAGCATGTT 


3600 


TTTACTAGTT 


TCTTACGCTT 


ACCAGCTTTT 


TACAAGAGTG 


AAAAAGAATT 


AAAGGCTAAA 


3660 


GCTTTGGAAT 


TGTTGAAAAT 


CTTTGATTTA 


GATGGTGATG 


CAGAGACTCT 


TGCTAAAAAT 


3720 


CTTTCCTACG 


GACAACAACG 


TCGTTTGGAA 


ATTGTTCGTG 


CCCTTGCTAC 


GGAACCTAAA 


3780 


ATT CTCTT CT 


TAGATGAACC 


AGCAGCAGGT 


ATGAACCCAC 


AGGAAACAGC 


CGAATTGACT 


3840 


GAGTTAATTC 


GTCGTATCAA 


AG AT G AGTTT 


AAGATTACAA 


TCATCTTGAT 


TGAAC A CG AT 


3900 


ATGAATCTGG 


TCATGGAAGT 


AACAGAACGT 


ATCTACGTAC 


TTGAATATGG 


CCGTTTAATC 


3960 


GCTCAAGGAA 


CTCCAGACGA 


AATTAAGACC 


AATAAACGCG 


TTATCGAAGC 


TTATCTAGGA 


4020 


GGTGAAGCCT 


AATGTCTATG 


TTAAAAGTTG 


AAAATCTTTC 


TGTGCATTAC 


GGTATGATCC 


4060 


AAGCAGTTCG 


TGATGTAAGC 


TTTGAAGTTA 


ATGAAGGAGA 


AGTTGTTTCC 


CTTATCGGTG 


4140 


CCAACGGTGC 


AGGTAAGACA 


ACTATTCTTC 


GCACCTTGTC 


AGGTTTGGTT 


CGACCAAGTT 


4200 


CAGGAAAGAT 


TGAATTTTTA 


GGTCAAGAAA 


TCCAAAAAAT 


GCCAGCTCAG 


AAAATCCTGG 


4260 


CAAGTGGTCT 


TTCACAAGTT 


CCAGAAGGAC 


GCCACGTCTT 


TCCTGGCTTG 


ACTGTTATGG 


4320 


AAAATCTTGA 


AATGGGAGCT 


TTCTTAAAGA 


AAAATCGTGA 


AGAAAATCAA 


CCTAACTT3A 


4380 


AGAAGGTTTT 


CTCACGCTTT 


CCTCGTCTTG 


AAGAACGGAA 


GAACCAAGAT 


GCAGCCACTC 


4440 


TTTCAGGGGG 


GGAACAACAA 


ATGCTTGCCA 


TGGGACGCGC 


CCTCATGTCA 


ACACCAAAAC 


4500 


TTCTTCTTTT 


AGATGAACCA 


TCAATGGGAC 


TTGCCCCAAT 


CTTTATCCAA 


GAAATTTTTG 


4560 


ATATCATTCA 


AGATATTCAG 


AAGCAAGGAA 


CAACGGTCCT 


CTTGATTGAA 


CAAAATGCCA 


4620 


ATAAAGCACT 


TGCAATCTCT 


GACCGAGGAT 


ATGTACTGGA 


AACAGGGAGA 


ATCCTCCTAT 


4680 


CAGGAACAGG 


AAAAGAACTC 


GCTTCATCAG 


AAGAAGTCAG 


AAAAGCATAT 


CTAGGTGGCT 


4740 


AAAACAATCC 


AGTGCATTGT 


TTTAGTCGGC 


AGATGGAGAT 


TACGAAGTAA 


TCATCAATAT . 


4800 
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AGTCCGGGGG 
AAAGCTTAAT 
AGCTCGATTT 
AATGTAAGCG 
AGAAGTCTCA 
AT AACAGT AT 
ATCGAAAATG 
TCTAAAGCAA 
AAAGATGTCA 
ACTTATCTGA 
TACGGAGTTA 
G AAG AAGGG A 
GTTT CTTTG A 
GATGGTAAGG 
GAAAAATTTG 
TTGTAAGAAG 
GGAAAGAAAT 
TTTCAGTAAT 
CTCTTTGACC 
CCTGATTTTT 
AAAAACACTT 
TTGGATTTTT 
CTTGATAAGA 
AACAATGCCA 
GATATGTTGC 
TTGGATTACC 
CCTAATGCCT 
CCATTTGACT 
TTGGATGATA 



ACCTTTTTAG 
TTCTAATAAT 
CAGAGCTCTT 
TTTGATAGAT 
TGGCACTTAA 
CTCATGCAGC 
ATCAATTAGT 
CAAGTCTTTC 
TGATTCGCGA 
TGTTGAAAAA 
TTACTGACCG 
TTCGTGTACG 
TTGTAGAAGA 
TGATTATCGA 
AAGCAAATGG 
GGAAGCCCAA 
GAT AAAAT AT 
TCGTCAGAAA 
TCGAAGGGCT 
GGAACGATAA 
ACAATACCTT 
TGGCTGAAGA 
TAATGACCAG 
TCTTGGAAAT 
TTCGTATGTA 
AAGCAGGTGA 
ATGGTCTCCT 
CTGCCAAACG 
CTATTGAAGT 



TCGGTAGATT 
TGAAAAAATC 
TTTGCTAGCT 
TTACAAAAAG 
AGATTTTATG 
AGATTTGATG 
TGGTTTGGTG 
TATCTATGAG 
TGTTGTCACT 
TAAGATTAGT 
TGACGTTTTC 
CTTTGTTACA 
AAATTTGAAT 
AGTGCAAATC 
TATTCAAGTG 
AGGCTTCTTT 
GCTATAATGA 
ATTGACGCAA 
AG AGG AAG AG 
TATTGCGGCC 
CCATAAGATG 
CGACTCAGTG 
CTACGAGATG 
CCATCCAGGT 
TACTCGTTAT 
TGAGGCTGGT 
CAAGTCAGAA 
TCGCCATACC 
GGAAATCCGT 



749 
GAGATTGCAA 
GAATGAAAAA 
TATTCATACT 
ATTGTATAAT 
ACCCGCAAGG 
AGAGAGCAAG 
ACTGACCGAA 
ATGAATTATC 
GTCTCAGGCT 
ATTCTCCCTG 
CAAGCCTTTC 
GAAGATGAAG 
ATCTCCCATA 
GATGGATCAA 
GAAGAAATCG 
TTTCATGAAA 
AATAATGTAA 
ATCGTGAAAA 
ATTGCCATCT 
CAAAAAACGT 
GAAGAGTTGC 
CATGATGAAC 
ACTCTACTCT 
TCTGGTGGTA 
GGTAATGCTA 
ATTAAGTCGG 
ATGGGTGTTC 
TCTTTCACAT 
GAAGATGATA 



ACAAATCTGC 
TTTCTTACCT 
TTTCTGAATT 
AGGGATAAGA 
TAGTTTATAT 
GTTTGCACCG 
CCATTGCACA 
TTCTGAATAA 
ATGCT ACTCT 
TCGTAGATAA 
TTGAAATTGC 
TTGGTGTTCT 
CAGTCAATAT 
TTGATTTACC 
CTCGCACTTC 
AGGGG ATT AG 
AAAAGGAGTA 
ATT AG CTTCT 
TGGAAAACAA 
CGCAAGAATT 
AGGATGAAGT 
TGGTAGCGCA 
TGTCAGAACC 
CTGAGGCGCA 
AAGGCTTTAA 
TAACTTTATC 
ACCGCTTAGT 
CTGTAGAAGT 
TCAAGATGGA 



ATCTACATTG 
TCATTCACAG 
TCGAAAAAGA 
ATAGAAAAGG 
TAGTCCAGAT 
TCTGCCTGTT 
AGCAAGTCCA 
GACAAAAGTA 
AGAAGATGCA 
CCATCAAGTA 
AGGTTATGGC 
TGGAAAAATT 
TCCGCGTAAG 
AGCCTTGAAA 
AGCAAAAGTC 
AG C AAAAGAT 
TTTATGGACA 
TTCAGGGGGT 
GATGACAGAA 
AAATGAATTA 
CGAAATTTTA 
GTTAGCCGAA 
TTATGACCAC 
GGACTGGGGT 
AGTGGAAGTG 
ATTTGAAGGG 
GCGAATCTCA 
GATGCCAGAA 
TACCTTCCGT 



4860 

4920 

4930 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
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TCAGGTGGTG CCCGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600 

ATTCCAACTG GAATTGTTGT CCAATCAACA GTAGATCGTA CCCAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 67 20 

GACGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCGTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 684 0 

GT AG AT AAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 69 60 

AATACGACAA CGGAACAACT GCTCTACGCG GTGTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTCCTTACAT CGTAGGACCT TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGT AT C 7 080 

GTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGCTGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TGTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG G ATTAT AAAT 7200 

TGTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATGGAAGTA ATCGGGGAAA 7260 

ATCGCCGTAA TATCAAAAGA CGAGTGATGG AAGTTTTGGA CTTGGTTGGA TTGAAGCATA 7 320 

AGGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCGATTGCGC 7 380 

GTGCAATTGT AAAT AATCCC AAACTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CCGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500 

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT GCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCCTGTCCTT CGTGACGAAT CAAAAGGAGA GTATGGATAC GATGATTAGT 7 62 0 

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7 680 

GTAGCTGCTG TCAGTTCAGT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7 800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAAGGTCA AAC TGTT AC A 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7 920 

ACCTTTTCAA GTAAAGAAGA ACAATATGAA AAATTAACCG AGATAATGGG AGATAACTGG 7 980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATGATGCCT ATATTGTAGA GGCAAACACT 804 0 

CCAAATGATG TAAAAACTAT AGCCGAAGAT GCTAAAAAAA TTGAACGTGT CTCTGAGGTT 8100 

CAAGATGGCG GTGCC AAT AC AGAAAGACTC TTCAAGTTAG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220 

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGAGCTAAA 8280 

AACAGTTATA TCCGTGGACC GTTC TTGTT A GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 834 0 
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ATCGCACCAT CTGTTTTGGT CTTTATTCTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTACTCC GTTGATGATT 84 60 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8S20 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCT CCTTT 85 8 0 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 864 0 

CCAGAGAAGA CTTC 86 54 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANGE DN ESS : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

TGTCGCGTCA AAATCATTAC TATGGCTATG TATAGCCCTT ACTATCACTT GGCTAAACAC 60 

GTTCGCTTTC AAATTTCTAG GCTCAGGCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 120 

TGCTAAAATC GTTCTTGATC GCTTTCACAT TGTACAACAT CTTAGCCGTG CTATGAGTCC 180 

TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA AGGCTATCAA 240 

GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC GATTTTATCG 300 

CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 3 60 

AGAAG ACT TG AAAC AC C ACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 42 0 

AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480 

TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540 

ACACTATTCT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600 

TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTCAACAT 6 60 

CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 7 20 

TTGACAAAGA GCCTAATTTC CATAAAAATT G AC AT GG AAA TTATAAAACC ATTACTAGTT 780 

TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 84 0 

GATTTCTACC ATAGATGGCT TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900 

CAAGACCTTC TTACGTTCCA AGATTGGATC TGGGATTGCT ACCGCTGAAA GCAAGGCTTG 9 60 

AGTATATGGG TGAATTGGAT TGTTAAACAA TTCTTCTCTT TCTGCAACCT CTACAATAAC 1020 
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ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA AC AACCG AC A AGTCATGGGC 1080 
GATGAAGAGA TAGGTCAGGC CGAGCTCTTT TTGGAATTTT TTGAGCAAGT TCAAGACTTG 114 0 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 12 60 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 132 0 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AG ACCTT C AG AAATAATATA 13 80 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

ACGAATCAAT TCCGCAGCTT GTTCACGCGA TTTCTTACCA TTAATCTTTT GACCATCAAA 15 00 

AATGATATC? CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 15 60 

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 162 0 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 174 0 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACT GAG AAT TGAGGAGCTT 1860 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 192 0 

GGTCAGTATA AAGTGACGGA CGTGTTCCTC GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AAGAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 204 0 

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

C AAT ACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AACGCACTTC 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC CGCAGGCAAG GGCAATAGCA ATAACGATAC 22 80 

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTCTTTl'TC 2 400 

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2 460 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2 640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTACTTC AAGGGAAACG CCGCGAATGC 2700 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 27 60 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820 
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TAAGTTTTGA 


CCAACTACGA 


. AAAGGGACAA 


» GGATACCAAG 


ACAAGGGTTG 




2 880 


CCAGAACAAG 


TAAGCATTGG 


TTGTTACGTT 


TTGTGAATAA 


TCCGAAATCA 


AACGACCCAA 

AA\.UAtvV.AA 


2 940 


ACTTGGCACT 


GTAATCGGTA 


ATCCAAGACC 


GAAGAAAGAC 


AAGAAG G CTT 


rGT4TTAf:4T 
irt a\j/\u/\ 1 


3000 


AAAGCTTGGA 


AGCATTTGAG 


TCATGGTTGT 


CACAATAACA 


CAT Art"* A ATT 


oAbuLn 1 \j A 1 


jUOU 


AT TTTTG GC A 


ACAATCTTCA 


AGGTTGGTGT 


TCCCAAAGTA 


CGTG ACG CCA 


nu i Ivilnl a-*— 




CAAGTCACGA 


TAGCGCAAGA 


TTTGCACACG 


GAT CATGAAC 


GfAATAr^AA 


TrTATPT^TT 
1 \— r\ lvjl l\j l 




TACGCTCATG 


G C AAAAATC A 


G ATTC C AG AA 


TCCAGC^CCG 






i "5 ^ n 


AACAATCAAA 


AGAGGTGGGA 


TGTTTGAGAT 


GACGTTGTAA 


ACTTCCATCA 




J JUU 


AACTGATTTT 


GAAATACCCC 


AAATACCA CC 


GACAAAAAfA 


CCG AT A ACC A 






TGTCGCAATC 


ACAGAAATGA 


GGATGCAGTT 


ACGAGCTCCG 


AA CC AG AC AC 






CGATTTACCG 


TTACTGTCAG 


TACCGAACCA 


ATGCTCCGCA 


TTTGGCTTGA 




J40U 


ACTAAAGTCG 


TTTACCTTGC 


TGACATCATT 


GAAATCAAAP 


TTAGAAAACA 


TTCftCT nr AT 


J 34 U 


GAAACTTATC 


AAAATGATGG 








ATTTTTTCTT 


3 600 


CATAAATTGT 


TTAAACACTG 


ATTTCCAGTA 


AGAATA^nf^ 


CTGCGCATCA A 


1 nb III LAV*A 


J ooU 


GGCAAAATCG 


TCACGTTTTA 


CAAACTGAAA 


l*fM|NIWIMly l *T l, | l *l*l 

* 1 1 1 1 \— i 1 I t\ 




At.A I I A 1 I A Li 


3 720 


CCTCCTTTCT 


CAGTCAATTT 


AATACGTGGG 


TCAATAATAG 


TCATCCA A AT 


» ipPTV'fr AAA 


J / By 


AGACGTGAGA 


AGATAGAAAT 


ACATGTAAAG 


ATGAAG AC AA 


HACCAACflAC 


riTari rtT a 

^.nl J\\jf\\» I in 


i o a rv 
J04U 


TTAGATGCTT 


TTACAGAGTC 


AATCAACATT 


TTACCCATAC 




f* AAfiprrTi* 

UAAuAL I I 1 




TCAGTAAGGG 


TTGCACCACC 


GATAACCCCA 






I oAAAULAuL 


*i a £ n 
J 7 DO 


GGAACCATGG 


CATTTTTAAA 


GATGTGTTTG 


TT^GAAATTT 


fTTTTTPAGA 
<>— i a i 4 * 


V-fWil-l. Ll.ll 




GCACGAGCGA 


AACGAACAAA 


GTCTTGAGAT 


TGCAAGT CAA 


TCATGTAAfG 


APGAATfTAA 




ATGGCTGTAC 


CAGGAGCACC 


CAACAAACCA 


AGGATGACTG 


CTGGTAAAAC 


GTAAGAACGC 


4140 


CAATCTCCAG 


CTCCCAAGAT 


AGGGAATGAA 


TCTGGAAGGG 


CAATAGATGA 


TCCAATCAAT 


4200 


CGAACGATGT 


AAACCAAGGC 


AATCGTTGGA 


AGAGCAAGCA 


AGAAGGTCAA 


AGCCCCTGTT 


4260 


G AG AGGCT AT 


CAATCCAAGT 


GTTCTTGAAA 


CGAGCCATGG 


CTGAACCAAG 


TGGCACGGCA 


4320 


AGAGCATAGG 


CAAGAACCAA 


ACCAATCAAA 


CCAGTAATAG 


CAGAGCTGAC 


AATCATAGAT 


4380 


GGATATTGGT 


AATTACTTTC 


AGTCGCTGTA 


TAAGGATCAT 


CTTTCCCATA 


CCTACCTACT 


4440 


TCACGAGAGT 


CAGCCTGACT 


AGGTGACTTG 


TAGGTTCTTG 


AGTAAATATT 


TACAGAAGAC 


4500 


GTTTTCTTAC 


CTGTTCGGAA 


CTGAACTTGG 


GCAGTTTTGG 


TTTGTCCTTG 


ACCTTGAGTA 


4560 
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ATAACCTGAA 


GAACTGGTGT 


ATTAGCATAG 


GTTGGGTAAG 


AGTCACCTAA 


ATTCAAGTTC 


4620 


AC AAAGTT TT 


GATGAACAAA 


TGGGAACTGA 


CTGTTAAAGT 


ACAAGAGATA 


TTT ATGTTT A 


4680 


GTTCCTGAAC 


CGACCAATGA 


C CAT CCG AT A 


GCTGGATCAT 


TTTCAAAACG 


AAGGTAGCGT 


4740 


TTCAAGTCTG 


GATTTTCAGG 


GTCTTGGATT 


TTATTTGTAT 


GGTCAATGTC 


AATCAAGTTA 


4800 


GCATAGAAGT 


GAAAAACACG 


TTCAAAAATT 


GGAATTTCAC 


G AGT AG CAT A 


GAATTCACCA 


4860 


CTTTCAGTAA 


ATTCTCCCAA 


AGTCCAACCA 


TGACCTAATT 


GATTGATGTA 


CTTTTCATAA 


4920 


ATAGCTTTAT 


TGGTCGCATT 


TGCTTCTACT 


GTT AC AG AAG 


AATCCATGCT 


ACTTGCCTTT 


4980 


TCTTGCAACT 


CTTTAGTATC 


GTAATACTCA 


ATGTAGCCCA 


T ACGCT C AAA 


CACAGTATTT 


5040 


TCATAGTTAT 


CACGTTTATC 


AGCCGTTGTC 


GCAATTTTAT 


TATACTTAGG 


ATCCTGCTTG 


5100 


AAAATCAATT 


TTCGAGGAAC 


CAAGGTATAG 


ATAATCGTCT 


AGGTCAAAGT 


CGTTACTAAG 


5160 


AAAATCGAAA 


CCAATGACCG 


C AAAAC ACGC 


AT AAAAAT AT 


ATTTTTT C AT 


ATT ATTTCCT 


5220 


TTAAAAATCC 


CAAAAGAACC 


TTCTCCTCAT 


GGAGAGAAAG 


TTCTATTAGA 


AATTATTTAC 


5280 


TTCACATGAC 


TTGCCAATTC 


TTTTTGAGCT 


TTCTCATTTG 


ATTCAGCTTT 


TTCTTTCAAC 


5340 


CATTTTTCAC 


GAGCTTTTTC 


AT ACT CTTCC 


TTAGTCACCA 


CTTTATCTTG 


TGATTTCAAA 


5400 


TATTTGAAGT 


AAACATCTGA 


CCCCTTAGAG 


CCTCTTTGCG 


CAGAAGCTCC 


AGTAAATGGA 


5460 


ACAATTCGTG 


AAAGCACTGG 


TGCTGCACCA 


GAAGAAGCCA 


TAGCAGGAAT 


AAAGAGTGAA 


5520 


CTATCTGTCA 


ACCATCCTTG 


AGCCGCTGCA 


TATTTTTCAT 


AACGGACATT 


CAAGTCGCTT 


5580 


GTCTCTCTGG 


CAGCTTCATC 


AACTAATTTA 


TCGTATTCTT 


TCAAACCAAC 


TTGAACTACT 


5640 


GAAGGGCTAT 


TTGGATTATC 


AAATCCTAAA 


TATGTTTTTG 


TAGTTTCACT 


GCTAGTTGTT 


5700 


TTTAAAATAT 


CCAGGTAAGT 


AGATGGGTCT 


TGATAGTCTG 


GCCCCCATGA 


AACTCCTCCT 


5760 


GATACATCCC 


AATCCTCAGA 


TGAAGCATTG 


GCAGCATAGT 


AAGTAATATT 


AAGGAATTCA 


5820 


TCACTTGTCA 


TTTGTTGAAT 


ATCAACAACG 


ACATTTTCAA 


CACCAAGAAC 


TGTTT CT AC A 


5880 


GATTGTTTAA 


AGGACTGAAT 


ACGAGATATG 


TAGTTTTTTG 


ATGCTTGGTC 


TACTGGAACG 


5940 


TCCAGATGAA 


TAGGAAACTG 


AACGCCGTCT 


GCTTCTAAAG 


CTTTCTT AG C 


TTTCGCAAAC 


6000 


TCTGCCTTGG 


CCTTGTCAGC 


ATTGAATAAA 


CCATCCTGCC 


CATCAGCTAA 


ATTCACACCT 


6O60 


TTCCACTCAT 


C AC CAT AAGC 


AGGAAGTTGA 


GCAGCGACTA 


AATCACCAAA 


GGTCTTCTCA 


6120 


CCAGCTGAAA 


CAAAGTCTGG 


TTTTACAAAT 


AAATTACGAA 


CTGCTAAAGC 


TGCTCCATCT 


6180 


TTACCATTGA 


TTTGAGCTGA 


GTAAGCTGAG 


CGATCAAGAG 


CAAAATTCAA 


GGCTTGACGG 


6240 


AAATCTTTGT 


TAAGCAATGC 


CTTCTTAGTA 


GCTACTTTCT 


CTGAATCTGT 


AGTTTTAGAA 


6300 


GT AT AGTTGT 


AACTTTGGCG 


ATCAATATTC 


ACACCCAGAC 


CAGCAATCCC 


AGAGCCTGAT 


6360 
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TGTGTGTAAT AG AT ATTGTC CTTGTATTCT TCTGCAACCT TAGAATAGTT GGAGCTGGTA 64 20 

GGGTAAAGAC GGGCATAACT ATAAGC TCC A CTAGTGAACT TACGCTCTAG CGACTCCTGA 6430 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 6540 

TATTGCTCAT TTTTTACAAA CTCTACAGAA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GGACCATTAT AAAGCAAGGA TGTCGGATCT GTTGGTTT AG CAAAATCGCT TCCTTTTGAT 6660 

GTTTCGAATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTC CAGAAC 67 20 

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GTATAATCAT CAACCGCCTT G AC AC C AACT 6780 

GTTG AAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 684 0 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TT AG CCGTC A CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCT TATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AAT CCTCTGC AACTGCAGGA 7020 

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7080 

GTTGTACTAT T TTTACTTGA AATCAGGTAG TCCAAGGTTT CTGGGTCTGC TGTATAAACA 7L4 0 

TAGCCATAAG CTTTAGGGGC TGATGAATCA GATGATTTTG AAGAACTGCA TGCTGCAAGT 72 00 

AC AC CTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 72 60 

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7 3 20 

ATCAAACAAA TTTTCAGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7 380 

ATATGATTCA AATTGTCCTT CGAAGTGTCA AAGACTACAG TGAAAATAGG AAATTTGACG 74 4 0 

CAGAAACTTT GGACTTTAGG AAGACATACA GTAAAATGAA ATACGGACGG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG T ATTCAAG AT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TATCCCGAAT TCAATTACTT TTGTGATTTA 7620 

CAT AT AT AG A TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7680 

TGCTCCTCGC TATCACGTTT CTATTCATAG CCTACGATTT C ACT ATTG CT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG ACTTAAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

CACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC CCAAGGTAGA 7920 

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040 

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA GATAAAGGCC GATAATCGGA 8100 
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CGTTTCCCCG ACTCCTGACT CATATCCATC ATCAAGCGAA CAGGAGCAAC AGAAGACAAA 8160 

ACTAATAAAA TAGTCCCCAC AATTCCGTAA CTCAGAATCG TAT C AAT AT A AAGACTGTGC B220 

GCATGTTCAT GAT AAGGAG C ATGTATCCGA GGATAAGAGT TCATATAGGT CAATGGCCCT 8280 

TCACCCCAAA AAGGATTTTG CTTAAACAAG GCCATCCCAG CATCCCAGAT AGAAATGCGT 8340 

TCTTCCATAG AAGAGTCTAA AGTACCCATT CGAACTCCCA AATCACTAGA AAAGAGGAAA 8400 

CTCAAACCAA TCGCGAAGAC CCCAATACTA AGCCAAAAGG CCTTCCAGTT TTTAATAGTC 84 60 

GTAAAGAGAT AGATAATTGC TCCAGCGATA ATAGCAGGAA AGGCAGTTCG ATTTTG AGT A 8520 

AAGTTCAAAC CAAAGAGATT AACAAAGCCT GCAATCACAC AGAATACTTT CAACCAATTC 8580 

AACTTGGTCG TTGT AAACAG ATAGAAAGCA ATCATAATAC AGAAACAACA AATAATTCCA 864 0 

TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC CTGCATATTG 8700 

GGTGAAAGAA AAGCATAGTT AAATTTCTTC ACAATTTGGA AATGTTCTAA ACTGGCAAAA 87 60 

GCAGCTGACA AGACACTACC AAACAAGACA AACTGCAAAA TCAATCGAAA GAATTTATGG 8 820 

G AT AAAATCG ACTGATAGTG CAAAAAGAAA ATACTAAATA GAAACATTCC TACTGAAGCC 888 0 

ACAAGACCCA TCCAATTTTG TGCAAGAATG GATATAACAG TACTATAGCT AAGAAAAAGA 8940 

AGCAGCATCG GATGCTCCCC CATTTTCTGA AGAATACTTT TCATGTCTCC TGTAAAAATC 9000 

AAACTGATAA T AT AT AAAC A GAGTACAACT AC AAAAAG AT AAAAGGGTAA AAAGATACTC 9060 

AGGATAATTC CCAATAAAAT CAGCTCTTTA CTAGACAACC CCTTCAGCTT TTCAATAAAG 9120 

CCTATTGATT TCAAAATGAA TCCTTTCTCT CCAAATC AGC TGATTCAGAT AATAGTAAGC 9180 

TATCCTATAT TGTACCACTT TTTTAGCAAT TTGAAAACAA AGGAAACGTT TTCCAAAATA 9 24 0 

AAAACCCTAT TTTATCCACC ATATCAAGGC TTCAAAATGA TACTTCAACT CCATTCTCAA 9 300 

TTACCCGATA AGTCTGATTT TGCAAATCAA TTT CT ACT AC TGCTGTTACG GACTTATCTT 9 3 60 

TATTTTGACG TTTGATTACA ATG CTGTG AG CTGTTGGTGT CTCTATCTCA GTAGTCCCTT 9420 

CTAGATCAAA GGCTTCTGAA CGGTTACGGA AAGAAAATAG ATTGAGAAGG GCCTTCACAA 94 80 

CAGGTCGTTG CACTTCTTTT GCTATTTC CT CGTTGCTATA GTAATGACGA TTAATATTTC 954 0 

GACCTTCTTT AGTTTCTTCT AAT AAT TT C A AGTCATTCTT GCCTGCT AAT AGACCCACAT 9600 

AGTAAATCTG AGGAATACCT GGGGCAAAAG CTTGAATTAG ACGAGCGAGA AAATACTTGA 9660 

CATCATCATC TCCAAGCGCT G AAT AGT AGG TTGAATTGAT TTGGTAGATA TCTAAGTTGT 9720 

TAT ACT CGGC ACT AG AGT AC T T ACGTTTGA CATTGGCTCC AACCTTATAG AGTTCATTTG 97 80 

AAGCATAGTC AATCTCCTCA TCGGTCAGGA TATCCTTGAC ATCTACTACT CCAATCCCAT 984 0 

CATGGGTATC TAGCGTCGTA AATTGCTTCA TCGGGCTCAT CTTTAACCAC TTAGCCAAAC 9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGGAAGAGCA AAATCATAAA 9960 

CAT AG T AAT C ATGCTCTGCT ATTTTAAACT GAATCGAATA GTGTTCATGA ATCTCAGGTA 10020 

AAAGCT CTGT CCCATACTCA GC AG CG AT AT CTCGAACTTT GTCCAATAAA TC CCAAATAT 10080 

CTGCTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT GGTCTTACGG ATAAATTCCA 102 00 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA ACCATCTTCA AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 103 2 0 

AAATTAAATC TACATCAGAC TGTGTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 10 3 80 

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 1044 0 

ATT G ACGAG A AATATGATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 10560 

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA ATAGCATCTC 1062 0 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTGAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATT C ATT TTAAATCTCT A TTT ATCATC 10800 

AAACTCGTAC T AAT AT AG AC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 10860 

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAG GTATGGTTAC 10980 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TGAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAG AG AT AG A CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGGTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220 

AAGGCAGAGT C CACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAACTCAT 112 80 

TC C AGAACC A AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 1134 0 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 114 00 

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 114 60 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA ACGTATTACC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGCA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATGAT GGT AAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TACGCTGCCA 11640 
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TAGGGATAAA 


GATCATTACT 


GCAAGTAAAG 


ACAAGACAGT 


GATGACGACA 


GAGTTCCAAT 


11700 


AATAGCCTCC 


AATCCCATCA 


GCTAAGAGAC 


GGCTAAAGTT 


GTCCCATGTG 


AAGTTGGTTG 


11760 


GAAAGCCAAA 


GAAATTATCT 


ACAATATCCT 


TAGTCGGTTT 


GAAGGAACTA 


AAGAGGGTAG 


11820 


CAAGGAGCGG 


CACTAAAATC 


AGAACCGATC 


CTAGAATCAA 


TAGAATGTAT 


TTGCCAATCA 


11880 


GGGCTTTTCT 


TTC AT CTTGT 


TTCATCATGC 


TTCTCCTCTT 


AAATTTCAAA 


TTTCTTAGAT 


11940 


ACTCTCAATT 


GGATGATCGA 


AATCACTACA 


ATTAAGAAGA 


ACAAGATTAC 


GGCAATGGCA 


12000 


TTGGCATAAC 


CGAATTGGTT 


GTTTTTAAAG 


GCATAGTTAT 


AAACCAAGAG 


CCCAAGTGAG 


12060 


GTTGTGG C AT 


TGTTTGGACC 


ACCACCGGTC 


ATGGCAAAGA 


CTTGGTCAAA 


GGCAGTCAGC 


12120 


CCACCTTTTA 


GGGCTAGGAT 


AAAG AC CAT A 


GAGACACTTG 


GTAGCAAGTA 


AGGCAATTCA 


12180 


ATGTTCCAGA 


AAACTTGCTT 


GCTAGTCGCA 


CCATCAATCC 


TTGCTGCCTC 


TGTAATCTCA 


12240 


GTTGGAATAG 


ATTGCAAACC 


AGCTAGGAAG 


ATGATGATGG 


GCATAGCCAC 


CCCTTGCCAA 


12300 


AGAAGGACAA 


AGACAGCCGC 


AAAGATTGCT 


CCCCACTTAG 


TCCCTAAAAG 


ACTGGTTTGG 


12360 


AAAAATTCAA 


TATGAAGGGC 


ATTTCCAATC 


GCTGGAAGAC 


CGTAGTTGAA 


GACTTGCTTG 


12420 


AAGATCAAAG 


CCACTGTCAA 


ACCAGATAAA 


ACAGCTGGGA 


AGAAGAACCA 


AG C ACGGAAG 


12480 


AAGGTTTGGC 


CTTTGATTTT 


AGAATTCAAG 


ACACGCGCAA 


TGAAGATCCC 


GAGTGCAATC 


12540 


TCACCAACCA 


CCATGGCAAT 


CGCAATGATT 


GCGGTAAAGC 


CAATCGCATT 


CATGAATTTT 


12600 


GGATCCATGA 


AGAGGAGCTT 


AAAGTTGTTT 


AAGCCAACAA 


ATTTGTAGTT 


ATAAGTCAAT 


12660 


CCTGTCCAGT 


TGGTAAAACT 


GTAAAAGGCT 


CCTTGAAACA 


TCGGCACATA 


GAAGAAAATT 


12720 


GCTTGTAACA 


AGAGGGGGAT 


GACCACAAAA 


GCCCATGCCC 


AATATTTTTG 


TAATACTTTT 


12780 


TTCATAGTCT 


CTCTACTCCT 


AATCCACATC 


CGCTTTCATC 


GGGTTAAAGA 


AGGCATTCAA 


12840 


ATCATTGACC 


ATGCCTTGTT 


TATCACCGGT 


CAAGACATAG 


TTCATGGTCA 


AGGTATGGAA 


12900 


GTCTGCTTCA 


CTGGTCCAGT 


ATTGTTGCAA 


CCAGACCAAG 


TGACGATCCG 


TAAAGGCATA 


12960 


T TCGGTC AT A 


CCAGCAAGCG 


GTGAATCTTC 


TCCTGCTTGT 


TTGACCCCTT 


CG ATCGCTG T 


13020 


TGGAGATCCG 


TCCACATCGT 


AGTATTTTTG 


CATGACTTCT 


GGACGGGTCA 


TATATTCCAC 


13080 


AAAGGCATTG 


GCTTCTTTTG 


GATGTTTGGT 


GGTGGCTGAG 


ATAGACCATG 


CCAAGTCTCC 


13140 


CGCACCAACG 


GTTAAGCTTT 


GTCCTTTTTC 


TTTTCCTGGA 


ATCATGAAGG 


TCCCAATCTT 


13200 


AAAGTTCGGT 


TTTTGTTCAT 


TAATCGCTGT 


GATCGCCCAA 


GACCCATTTG 


GTGTCATGAG 


13260 


GACATCCCCA 


CGTGCGAAGG 


CTCCGATAAC 


ATCGGTATAG 


CCAGCACCTT 


CCCAGTTCTT 


13320 


TTGCTTAGAT 


CCATTGATGC 


GAAGGATGTC 


CATGACCTTG 


AT ATC AT CTT 


TCATAATCGG 


13380 


ATCCGACAAT 


TTAATGGCAT 


TTGGTTGAGA 


ATAACGAAGG 


TATTGATTTG 


CTTCTTTTCC 


13440 
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TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC L35O0 

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13 560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13 620 

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTC GTTT A CAGCATATTT 13 680 

TTCAGCGTAG CCATTTTTCA CGCCTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13 740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13 800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CATTGACGAC 138 60 

TTTG AC CTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13 920 

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13 9 80 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA ACCTAAACCT GTAGTTGCAA GAAGTCCGAT 14 040 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTCAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220 

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTC TAATTTATAT ACTCCCTCTT 14 2 80 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14 340 

CGTAGGTTAC AAT CGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14 4 60 

ACAAGTTCAC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14 520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT GGTGTCATTC 14 580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGACCCCC CATAGAAATG GTTGGATAGA 14 640 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 14700 

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCACG 14760 

AC TCAAAG AG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACG AGTTC A TAAAGCCCCA 14 820 

GCATGTACTG ATGAGATTCC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14 880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGCAGTTGA7 14 940 

CT AAG AC ACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

AC AAA TC ACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 
TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA . 15180 
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CCCAATCACC 


TAAAGCACGA 


TTATCATCAA 


AACGATTGCC 


AAACCAACCA 


TCATCTAATA 


15240 


CAAAAAGTTC 


AATGCCAACT 


TTCTTAGCTT 


CATCTGCTAA 


CTCTAACAGT 


TTTTCTCTCT 


15300 


GAAAGTCAAA 


GTAAGTAGCT 


TCCCAGTTAT 


TG ATT AG AAT 


TGGACGTTCT 


TTTTTAGAAA 


15360 


ATTCACTTAG 


CATAATCTGC 


TTCAGTACAA 


AAT T CTG ACT 


TTC ATG ACT A 


ATACCAGTTA 


15420 


ATCCCTGATC 


TGAATGAGTC 


ACTAAAGCTA 


CCGGTGTTTC 


AftAGTATTCC 


TCAGGAGCTA 


15480 


ACTTCCAAGA 


AAAGTTTTCT 


GGATTAATGC 


CAATAGCCAC 


CCGAACTTCA 


TT C AATTG AT 


15540 


TTTTTTGAAC 


AAAAGCTTCA 


AAGTTG CC AC 


TATACATTAG 


TTGAATAGCA 


AACACATTCC 


15600 


CAGCATCCTC 


TGTGACTCCT 


TGTTCGC ATA 


GTAGAAGAGC 


T GGTGTTTG A 


GC ATG AC CAG 


15660 


AAGCACCTCG 


GTTTGAACTA 


ATCGAAAAGA 


TTCCTTGTTC 


TACCTGTTGA 


CGTCTAACAG 


15720 


TCTTTTCACG 


AGCATAAGCA 


CCCTGCAGAG 


TTACTATTTC 


GTAATCTGCA 


GCTGGAAAAT 


15780 


CAGCCATAAA 


AGAAAAATCT 


TTATGGATGA 


CAACTTCCTG 


ATT ACT ATT A 


TTATCTAATT 


15840 


TACTGTAGCT 


AGCAATAGTC 


GCATCATTAT 


T AAAAGT ACT 


ATAATACAAA 


GTCAGACTAA 


15900 


GTTGAGCCTT 


AG AATCTT CT 


AACATTAAGA 


CAAGAGTCTC 


TGTATCGTCC 


ATGCTATGTG 


15960 


GAGAAGGTAA 


GCCCTGTGGA 


CCATTCTGAC 


CTTTTAAAAT 


CTTTGCTTCT 


ACAAATCGAA 


16020 


AGTCT GTT AC 


TTCAGTTACA 


CTATGCTGAA 


CCTGTATGGT 


TGGTTTCCTA 


AAATCTCCTA 


16080 


AGCCATGTTG 


TCCAAAAATC 


TGTCGCTGAG 


TATCTAAACT 


AAAGGTTCGA 


TTAGTAGCCG 


16140 


TTGGATTTCC 


TGAAAAGGCA 


TGGTCTCGTT 


CATAAACACT 


ATTGGAACCT 


TTATAGTTCT 


16200 


TAATAGTCTT 


TCCTAAATGT 


TTCAAAAGTA 


AGTAGCCATT 


TCGATTTTCA 


ATAATCAAAC 


16260 


TTAGATTTTT 


ACTCTCAACA 


TAAAATAGAT 


TATTCTCTAT 


CCTAACTCCC 


ATTTACTTCA 


16320 


CCTCATCACT 


TTATTGATTA 


TATTTTATCA 


CCTGAAATCG 


CTTTCCAAAA 


TAGAAAAATG 


16380 


TCTCAAGAAT 


ATGGTAAAAT 


GTTAGGTAGG 


AGGTAGCACA 


TGTTAGTTTT 


TTC AGAAT AC 


16440 


CAGACTGGAA 


CAATCGACCT 


TGCCCTAAGC 


TTTTATGGAT 


ATGAGGAATG 


CACACCTAAT 


16500 


TACTCTTTTG 


GTCCAGCCAT 


TCGTGATACA 


TACGTTCTAC 


ATTACATTAC 


TAAAGGACi'JV 


16560 


GGAAAATTTC 


ATTACAAGGG 


T AAAATT GTT 


GATTTAAAAG 


AAGGAGATTT 


CTTTCTATTA 


l6620 


AAACCAGAGG 


AACTAACCTT 


TTATCAAGCA 


GATAGTAAAG 


AACCT TGGGC 


CT ACT AC TGG 


16680 


TTAGGAATCA 


CTGGAGGGAA 


AGCCCCTGAT 


TATTTTGCTC 


TTTCCCAAAT 


TTCTGATCAA 


16740 


TCCTATCTCA 


TCCAATCTGA 


AACTTGTCAT 


ACCCAGACTA 


CTG C AAAACT 


CATCTCAGAC 


16800 


ATTGTCCGCT 


TCGCTCAGAT 


TACAAAATCA 


AGTGAATTAG 


CTCAACTCCA 


TATCATGGGA 


16860 


CAACTTCATG 


AACTGATGTT 


TCATCTGGGA 


ACTATTGCTC 


CC AAT CAG AA 


AAAAAAGAAT 


16920 


ATTTCATCAA 


CCCACCAACT 


CTATCTTGAA 


TGCAAACGAT 


TAATTGATAG 


CCACTATCCT 


1698.0 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 1704 0 

AG CGT ATT C A AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT C C ATTTTT CG AAAGCTTATA AACAATACTT TAATCAGACT . 17220 

C C AAGTC AT A CAACAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 172 80 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17 340 

AAATCGCAGA AAAACTATCC CTAAGCCGAA C AGC AATTTG GAAAGCCATC AAG CG ACT AG 17 400 

AACAAGAAGG CATTGAAATT GATAGTATCA AAAAT AG AGG AT AT AAACTG ATGAATGGTG 17460 

ACCTTATTCT T C C AG AG ATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17520 

AAAC AAAAT C AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17 580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17 640 

CACCACAAGG TGGTATTTAT ATG AC ACT CC ATCTT AAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 177 60 

TAATAGATCT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT ATCATTATTG 17 8 80 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTC AC TTTC AC A CT AG AGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAAT AACAC AATTATAATA T AAAC GAT AT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18 300 

TAAAAATAAA AAAGAGAGTT AC AGACT CTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 183 60 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 18420 
TCTCCAATTA ATGGGCACGA GTGCACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 
CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GT AATTTG AA CAAAGTTCAA 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 186 60 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA CCCTTATTGC GGGAGCAGGA 187 20 



18480 
18540 
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TTTG AACCT A CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTCCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 1884 0 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18900 

TGTAGGACTT G AACCT ACG A CCACTCGGTT ATGAGCCGAG AG CTCTAACC AGCTGAGCTA 18960 

AAGGTCCGAC AAG AT C ATT A TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGG AAG AC A GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 1914 0 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 19200 

TATCCAGTTG AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 192 60 

TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 193 20 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTC T A TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19500 

ATATCTTAAT GCGGGTTAAG CGACTTGAAC CCCC-ACGCCC TTAAGCGCCA GATCCTAAAT 195 60 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCG AAC CAG 19620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 196 80 

G CGTT ACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4117 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 100: 
CCCTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG ACCTGAAAGA AGCCTGAGAA 60 
TAATTATGGA GAGTAGCATT CTGAGAGGTG TTACCAGAAC CATATGACAG AGCTGTTTGA 120 
AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 
GTT AGAT CAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG ACAACTTGCT 24 0 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 
CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGCT CCTAATACCT 360 
TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 4 20 
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TGGATGCGCG 


TGAGTGGGTT 


CGTGTGCTCA 


AGGAAACGGG 


CTTCAAAAAG 


TTGATTTTGG 


480 


TGCTCAAGCA 


CCACGATGGC 


TTTGTCCTTT 


ATCCGACAGC 


TCACACACAT 


TATTCGGTTA 


540 


AGGTCAGTCC 


TTGGAGGAGA 


GGAAAGGGCG 


ACTTGCTCCT 


TGAAGTATCC 


CAAGCTGCCA 


600 


CAGAGTTTGA 


TATGGATATG 


GGGGTCTACC 


TGTCACCGTG 


GGATGCCCAT 


AGTCCCCTCT 


660 


ATCATGTGGA 


CCGAGAAGCG 


GACTACAATG 


CCTATTATCT 


GGCTCAGTTG 


AAGGAAATCT 


720 


TATCAAATCC 


TAACTATGGG 


AATGCTGGTA 


AGTTCGCTGA 


GGTTTGGATG 


GATGGTGCCA 


780 


GAGGAGAGGG 


CGCGCAAAAG 


GTTAATTATG 


AATTTGAAAA 


ATGGTTTGAA 


ACCATTCGTG 


840 


ACCTGCAGGG 


CGATTGCTTG 


ATTTTTTCAA 


CAGAAGGCAC 


CAGTATCCGC 


TGGATTGGCA 


900 


ATGAACGACG 


GTATGCAGGT 


GATCCACTGT 


GGCAAAAGGT 


GAATCCTGAT 


AAACTAGGAA 


960 


CAGAAGCAGA 


GCTGAACTAT 


CTTCAGCACG 


GGGATCCCTC 


GGGCACGATT 


TTTTCAATCG 


1020 


GAGAGGCAGA 


TGTTTCCATC 


CGTCCAGGCT 


GGTTCTACCA 


TGAGGATCAG 


GATCCTAAGT 


1080 


CTCTCGAGGA 


GTTGGTCGAA 


AT CT ACTTTC 


ACTCAGTAGG 


GCGAGGAACT 


CCACTCTTGC 


1140 


TTAATATTCC 


GCCGAATCAA 


GCTGGGCTCT 


TTGATGCAAA 


GGATATTGAA 


CGACTTTATG 


1200 


AATTTGCGAC 


CTATCGCAAT 


GAGCTCTATA 


AAGAAGATTT 


GGCTCTGGGA 


GCTGAGGTAT 


1260 


CTGGTCCAGC 


TCTTTCCGCA 


GACTTTGCTT 


GTCGCCATTT 


GACAGACGGC 


CTTGAG AC C A 


1320 


GCTCTTGGGC 


AAGCGATGCA 


GACTTGCCCA 


TCCAGTTAGA 


ACTCGACTTA 


GCTTCTCCTA 


1380 


AAACTTTTGA 


TGTAATTGAG 


TTAAGAGAAG 


ATTTGAAGCT 


AGGGCAACGA 


ATCGCTGCTT 


1440 


TTCATGTGCA 


AGTAGAGGTG 


GATGGTGTCT 


GGCAGGAGTT 


TCGTTCGGGT 


CATACTGTTG 


1500 


GTTACAAACG 


TCTCTTACGA 


GGAGCAGTTC 


TTGAGGCACA 


GAAGATACGT 


GTAGTCATTA 


1560 


CAGAATCACA 


GGCTTTGCCT 


TTGTTG AC C A 


AGATTTCCCT 


TTATAAAACT 


CCTGGATTAT 


1620 


CAAAAAAAGA 


AGTTGTTCAG 


GAACTAGCAT 


TTGCAGAAAA 


AAGCCTAGCT 


GTGGCAAAGG 


1680 


GAGAAAATGt 


CTATTTTACA 




GAGAATGTAG 


TGGTCCTTTA 


GAAGCTAAGA 


1740 


TTTCGATTCA 


ACCGGGGACA 


GGTGTCCATG 


GTGTCGCCTA 


TCAGGATGAG 


ATTCAAGTCC 


1800 


TTGCGTTTCA 


AACTGGTGAG 


ACTGAAAAAA 


GTCTGACGCT 


ACCAACCTTG 


TATTTCGCAG 


1860 


GAGATAAAAC 


CTTGGATTTC 


TATCTGAACC 


TAACGGTGGA 


TGGTCAGCTT 


GTGGATCAAC 


1920 


TTCAAGTCCA 


AGTTTCATAA 


AAGAAGAACC 


TTTGCG CG AT 


GCAAAGGTTC 


TTTTGGTTAT 


1980 


TAGTGACTTG 


GTAACCAGCT 


GAGGGTGAAA 


GTTAGTTGTT 


CAGCTTTTAA 


GAGGTCTTGG 


2040 


TGTTGAATAG 


TTGATACGAG 


TGTTTTGTCC 


AGTCGGCATT 


CTTTGACAAA 


GTTAAAATGG 


2100 


TTGTGGTTTT 


GTTTAGTATG 


GATATCCAGC 


CATTTATCTT 


CTTTAGCGAG 


GTAGACTCGT 


2160 
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AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTCGACA GGTTGGATAA 2 2 20 

AATCCGAGAG CTGACCAGAT GTACCAAGCA GAG AG ACT AC CATTGTCTTC ATCTCCAGGA 2280 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGAGCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA ACGGAAGAGA TAAGGAATGT GGAAACTAGG CTGGTTGGAA 2 400 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 24 60 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT TCAAAAGATA GTTGCTAAAG 2 52 0 

GTTTC TTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2 5 80 

GCTTGAATGG CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2 640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCGTC AAATAGCTGG 2 700 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTT TCAGCGATTT CTATGTTCTC TAGTTTTTTG 2760 

CCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2820 

TGGTGGTCGG TAGAGAGGTA ACCTAGTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCGGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2940 

AGGTCGGGGG TCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3000 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGCAA GGAAACCAGT ATCGCGGTAG 3060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 3120 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAGCCAGATC CATGTGGATG CCTTCCCCTG ATTCATTAAT CTCATAAAAA 324 0 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA ACAAGGTTCG GTCAGCCTCT 3300 

CCTGTCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTCAT 33 60 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAG ATG AAA 34 20 

G AAGTGG CT A GTTG CAT CTC GGTTTGACTA CTTGCTAAGT C AATTCGCC A GTCTCCAGCT 34 80 

TCTTGGCTGA TAGCAAGAAT ATCCCTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3 540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3 600 

TCTACTGTCA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3 660 

TTCAAACGAA T AG AAGC AC C ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 37 20 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 3 7 80 

TGGCGGTGAA AGAGGCTGTC TCCCCCCAGT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 384 0 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3 900 

AGATGTGGAT CAAAAAACCA AG ATCCATC C TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3 960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4 020 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATACTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAC TAATAGAGGG CCTTTAG 4117 
[2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

( A] LENGTH: 2727 base pairs 

( B ) TYPE : nucleic acid 
(CI STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTG TT AG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 12 0 

AAAGTTCGTT TATTAGTAGC CTTTTTT AG C ATTGTCATAG GCTACATCCT GAGTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 24 0 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 3 60 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 42 0 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 4 80 

AGGTGGATGC TACTGGCGAC ATCACTCAGG AAGCCCCTTA CAAGTATCTC AGCAAGATGC 54 0 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGT AC G ATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 6 60 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATCGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 7 80 

CAGCGACTCT GGCTGATGGG GTGACAGTCA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 84 0 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAACGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 9 60 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

CAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 10 80 

AAGT AATTG A AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATCTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT C AATTT A C AG 12 00 
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1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



TCCAACACCT AGAAGAGATG CGCCGCATGG GCTTCCATTC TGAGATTATC CGTGATACAG 

CTCGTATTGT TGGTGGACAG CCTTTGCAGG GAGCAGAAGT TCTTTCAACT GACCTTCGTG 

CCAGTCCGGC C TTG ATTTTG ACAGGTTTGG TAGCACAGGG AGAAACTGTG GTCGGTAAAT 

TGGTTCACTT GGATAGAGGT TACTACGGTT TCCATGAGAA GTTCGCGCAC CTAGGTGCTA 

AGATTCAGCG GATTGAGGCA AGTGATGAAG ATGAATAAGA AATCAAGCTA CGTAGTCAAG 

CGTTTACTTT TAGTCATCAT AGTACTGATT TTAGGTACTC TGGCTCTAGG AATCGGT TT A 

ATGGTAGGTT ATGGAATCTT GGGCAAGGGT CAAGATCCAT GGGCTATCCT GTCTCCAGCA 

AAATGGCAGG AATTGATTCA TAAATTTACA GGAAATTAGG CTGGAGAACC AGCCTTTTTC 

TAAAGATAAG GAGAAATATG AACAAAAAAA CAAGACAGAC ACTAATCGGA CTGCTAGTGT 1800 

TATTGCTTTT GTCTACAGGG AGCTATTATA TCAAGCAGAT GCCCTCGGCA CCTAATAGTC I860 

CCAAAACCAA TCTTACTCAG AAAAAACAAG CGTCTGAAGC TCCTAGTCAA GCATTGGCAG 

AGAGTGTCTT AACAGACGCA CTCAAGAGTC AAATAAAGGG GAGTCTGGAG TGGAATGGCT 

CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA TGCCAAGGTT TCAAGTAAGC 

CCTACCCTGA CAATAAAACA AAGACAGTGG GCAAGGAAAC TGTTCCAACC GTACCTAATG 2100 

CCCTCTTGTC TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGGG AATCGTTCAA 2160 

CTTCTTGGAC TCCTCCAGGT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 

CAGTCGATAG AGGTCATTTG TTAGGCTATG CCTTAATCGG TCGTTTGGAT GGTTTTGATG 

CCTCAACAAG CAATCCTAAA AAGATTGCTG TTCAGACAGC CTGGGCAAAT CAGGCACAAG 

CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAACCC TTGGACCAAA 

AC AAG CGTGT CCGTTACCGT GTAACCCTTT ACTACGCTTC AAACGAGGAT TTAGTTCCCT 2460 

CAGCTTCACA GATTGAAGCC AAGTCTTCGG ATGGAGAATT GGAATTCAAT GTTCTAGTTC 2520 

CCAATGTTCA AAAGGGACTT CAACTGGATT ACCGAACTGG AGAAGTAACT GTAACTCAGT 2580 

AAAAGATACG CCTACACTCC TATGTCACTT ATGGATGTAG GAGTTCTTTT TACTAG TTT A 

AGCACCACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC TTCCAGTTTT 

GG G AG AG AGA TGGAAGTTAC TTTGAGA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS : 

tA) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



1920 
1980 
2040 



2220 
2280 
2340 
2400 



2640 
2700 
2727 
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txi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 

ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 

TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 

ATTAATTCAA TTCCTTTTTT TCAGTATGAT AGTGGTATTA TTTTAAGATA CTTGAATTCT 24 0 

AACAATAATA ACT TG AATTT TCAATATATA GTTCAACTTT TAATAGCATT TGTTATTATT 3 00 

TATTTTCCTT TGAGTCAACT ATTACAGTTT TTCACACCCA ATATTATTGT TCGTAGTATA 3 60 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT ATATTATATA TGATAGGAAG GACGAAATAT 42 0 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATT T ACGTGG 4 80 

TCGCAGTATC CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA TCAACGATCT 540 

ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 600 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAGATATTCT 660 

AG AACT AAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT ACAGAATAAA 7 20 

ACAGTATTAG TATTAGGAGC AGGAGTCTTA CGATGTTATA TATCTCTAAG TCTAAGTATG 7 80 

TATGGAGTGA GGAAACTTAT TGTCGCTGAT TACGATATAA TAGAACCATC AAATTTAAAT 84 0 

AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AGATTAATGT TCTTTCTGAA 900 

AAAATACACA AGTATAATTC AGATGTTC AG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 9 60 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGACTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CC ATT GAT AT TATAAAAATT GTCAATCAAT TTGCTGTATC GCATAAGATA 1080 : 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTATTA TTGATAATAT ATATATCCCT 114 0 

ACCATCGGTT CTTG CTTTGG TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 1260 

ACT AATTT AA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AGATAACGCT 13 2 0 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 13 80 

GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 16 20 



WO 98/18931 



PCT/US97/19588 



768 

TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGG AT ATTTT TATTGGTATG ATAATATTTT TAATGCTGTG GATATTTATT ATGTT AG GT A 174 0 

TAGTGTGTCT TACTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG T ATGT TT AAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTGCTT GAGAATGTAT 1860 

C GTATT AT C A ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTT AT TGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAAGGAGGT GCAATGAGTA TGATTGAAGT TACCCATTTA TCAAAAAGTT TTGGTGATAA 204 0 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAAGCG GTGAATTAAA 22 20 

GAGAATTGGA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA AT TTGT T AAA 2 340 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 24 00 

CCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTGT ACTCT TTCTGGATGA 24 60 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2520 

GAAAACAGCA GGGACAACGA TTTT T CTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 2580 

ATGTG ATT AT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2700 

GATAACTTTT GATTTTACAT C ACT AG AAC A GGTATCTCAG ACTGATCTGG AAAAT ATTTT 27 60 

TTCAATTCAT TCATGTGAGC CTACTTT AG A AGATATTTTT ATCACATTAA CAGGAGGAAA 2 820 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGT C AAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTCCTTT CACAT ATTTT TATAAATATC 2 940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3 000 

TACCTTTTTC TTTTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATT AT ATT AC AAT CGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TTATTGATAG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3 300 

AGGTTATCAG TCTTCCTGCT ATGATTT T AG TTGCTTTCTT ACCGATGCTA TCTGGTTTGG 3 360 

ATAAGACAGT TCCGAAGATA ACAGATTATA GTTTTATGGG ACT ATT TACT AACTTTTTCA 3420 
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CAAAATGGGA CCAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 34 80 

GGATTGTTCT TCTATTAACT TTAATTACGA T AAC TAT TAG GAAAAAGAAA ATTTCTTAAT 3 54 0 

TGAGTTATTT TAATCATTAT AAACACAAGT GGGAAGGAAA AAATGAACTG ATCTTTTTGA 3 600 

CAGCAATTCT ACAGAATACT CTT ATTGCT A TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

GCAATAAAAA AT GC AAC ATT TTTAAATCTC CTCTATAAGT GCTTCAAAAA GTGCTTCAAA 3780 

ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 3 84 0 

TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3 900 

ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAC GCATCCGCAA AATAAAACAG 3 9 60 

AATTCC CATT TGTTCAATTA AAGGGTAACA AG C AAACT CT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTGAGGTGT TCAATACCAG TCAACATGGA 4 080 

TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4 140 

AGCAGTCCAG AGACGCAGCT AAGGTTAGAC GGTGAAAGGG TGGAGACTAC CCATTTTTCC 4 200 

TGGAACCTTG CTGTTGG C AG GTTCCTTTTT TCGTGCCTTC TGTTGG CC AG ACTCTCTCAC 4 2 60 

TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTGC TCTTGATTTT 4 3 20 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAAGA AAGCCTTTAT 4 3 80 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGGGCCTG AGATTGTGTC CTACTTAAAA 4 4 40 

GGTTTGGGTC ATAGTGTCTT TTTGGATCTC AAACTTCATG ACATTCCTAA TACAGTCAAG 4 500 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATCTCCA TGCGGCTGGT 4560 

CGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGCCA GTCAAGCCAA ATTGATCGCT 4 62 0 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CAGATGCAGG ACTTTCAAAA TATCCAAACC 4 68 0 

AGTCTGCAAG AGTCTGTGAT TCACTATGCC AAGAAGACAG CTCAAGCTCC CTTGGATGGT 4 7 40 

GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG CTACCAATCC AGATTTTATC 4 800 

TGTCTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4 8 60 

ACACCTGCTG ATGCCTATCA AATCGGCAGT GACTATATCG TAGTGGGACG TCCCATTACC 4 92 0 

CAAGCTGAGG AT C CTGTTG C AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4 9 80 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTGC TAAAGATATC 504 0 

GCTAGCCACC TCTTGAAAAT CCAAGCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

GCAT.CTGGTA TCAACTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA .5160 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 52 80 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5 340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 54 00 

GGTGGTTCAG TTCTTGAAGC TCTAGCAGCA GCCAAGCGAG AAGGAGCACA TGTACTTGGA 54 60 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5 520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC C AG AGGGCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGACGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 57 1 1 

(2) INFORMATION FOR SEQ ID NO: 103: 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5558 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA ATCTTGCGAC CTGGATCAAG CCCTTCATCA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 12 0 

CTATCACACC CAATACCTAT CTAAGCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 3 60 

ATGAATTTTT G AG AGG AT C A GAATGAAAAA ACTAGCAACC CTTCTTTTAC TGTCT ACTt IT 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 54 0 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT AT AACGGT AT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 78C 

TACAGGTCCT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 84 0 
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CAAGAACGAG ATTACTCCTT ATTTCTATAA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 
TGGTCAACCA AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA CAGATACCTC 9 60 

TTCTAAACTC CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 1020 

AAACCCTAGT CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 1080 

TGTGGATAAG ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATCAC 114 0 

AATCGACAGC ATCGAACTGG TGAAAGACTA CGATTTTAAA TCTTAAAAAC CAAAAAAATA 1200 

CAGTATCCAC ATTCGGTACT GTATTTCTTT TACTCTCATT CTTAAGTTAA ATTATTAAAA 1260 

TCCCATATTT GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 132 0 

GATCCCCTAT ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 1380 

TTGGTCGGCT CGTCCAGCAC TAAAACGTTG TTTTCACGAT TCATCAAGAG ACAGAAACGA 14 40 

ACCTTGGCTT GCTCTCCCCC TGATAATACT TGAATCTGGC TTTCAATATC TTTGGTTGTC 1500 

AAACCACAAC GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 1560 

CAGACAGCTT CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

AGTTCTAAAT AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 16 80 

CTCTTCAAGA GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCCCAAC CTTTTGATTG 1740 

CGTTCGAAGG TAAGATTTAA AGGCTTAGTA AGAGGACGGT CGTAACCAAT TTGCAAGTTC 1800 

TTGGCTTGCA AGATAAAGCG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT I860 

GGTTTCTCAC TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 192C 

GCCATATTAC GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT G AGGTCTG C A 1980 

ATCTCTTTCT GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

TCTTGGAACT GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC CACATGATAG 2100 

ACAATATTAA TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

TCATAGTTTT GGAGATAGCG CTTGAGCCAA TCAATATCCT CAGCATCCAA GTAGTTGGTC 2 22 0 

GGCTCGTCCA ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 2 280 

GTTCTTTGCC CACCTGACAA AGAAGTTACA TCCGTATCCA TCCCAAAGTC CATAACACCA 2 340 

AGAGCACGCG CTACTTCGTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 2 4 00 

CGGTCTTGAA GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 2 4 60 

ATTTTCATAT AGAGGTCATT GATACGAGCT TCACCTTTGA AAAGCTCATC AAAAGCCGTA 2 520 

CGGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC CAAGTAACCA 2 580 
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3120 
3180 
3240 



GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTtcItCTG GCAGCATTTT ACCACTCACG 254 0 

AT ACT CAT AA AGCTTGATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 2700 
TTGAGGAGAC GGAACGACAC ATCTTCAAAA ATTGCACGGT CACCAAAACC GTGACTCAGA 2760 
TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 2820 
AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAATCATCAA TCTCAAAGAC 2880 
• GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAAGACTCAC 2940 
AAGAAAACTA AAAAGAACGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 3000 
TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 
TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 
AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 
TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATGACTC CCCAGACCAA 

AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCG CTGCGACTGC CTTCTGGCGG 3300 

TCACGTTTCA TTGTGTTAGA CCCCAATTGT CCACAAGCTG CGTCAATATC TCTACCATGC 3 3 60 

TCTTCACCAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TATCATAGAA AGCCAACACG 3420 

CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 3480 

TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 3540 

ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG GTTTGTTGTC 3600 

TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG GAAACGCACG GTTAATCTTC 3 660 

ATGATACPTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 372 0 

ACCCCTTCM CAGCAAAGTC ACGAATTTTA TCACCCAAAC CTGAGGTTGA AACCGTGATG 3780 

TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 3840 

TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT CATGCGTTCA 3 900 

TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 3960 

TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGACGCAC AGAAGGTACA ACCGATATTA 4020 

CACCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT CAGTACAGTC 4080 

TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGTACCATC AGCAGACTCT 4140 

TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AG CAATCAAA 4200 

TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC CT7TACGGTA GAGCCATTCC 42 60 

CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA ATACCCATTC CTGCATGGTT 4320 

TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
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GACCAATGAC AAAATGACGT TCTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 4 4 40 

CTGCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC TTTTCTAGAA GGTGTTTCTT 4 500 

CTTCCGTCTT ACGCATTTTC TTCTCAAATC ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4 5 60 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4 62 0 

TTTCAAGTTT TTCTTCTCTT CGGTCATCCT TGTAAAAACC TCGTAGGCGA AGCTGTTCGT 4 630 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CGCTGATTAA 4 74 0 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4 30 0 

TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCACC AAACTTGTTA TAGTTGTATA 4 3 60 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AAT AC TTT AT 4 9 20 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACCATT 4 980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA 504 0 

TTATTTACCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTG AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAC TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GAT T GGTGTG TCTTGGTTTG CTCTTGAAGT 522 0 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 52 80 

TCTTCCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5 3 40 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAACTGCT TGACGGATAT CAGCTGGGAT 54 60 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 5 520 

AGCTCCATTC GCATTGTTGA CATAAGTTTG ATGGTCTT 55 58 

(2) INFORMATION FOR SEQ ID NO: 104: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 
IB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: L04 : 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 12 0 

TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
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TTAGTGTTTC 


GTCCTTTATG 


GGCTATTTAA 


AGGGCAAGAG 


CAGTTTGATG 


ATTTTTGATA 


240 


AGCATGCGAA 


TTTAAAATAC 


AAATATGGCA 


ATCGCAAGTT 


TTGGTGTAGA 


GGCTATTATG 


300 


TAGATACGCT 


AGGCCCTAAT 


CAGAAAGTGA 


T AGCTG AAT A 


TATTCAGAAT 


CAATTACAAG 


360 


AAGACAGAGT 


AGCAGACCAG 


CTCACGTTAT 


TCGAGTCAGT 


AGATCCGTTT 


ACTGGCGAAA 


420 


TAAATAAGAG 


GAAGTAACTA 


AGGTGCTTTA 


GCACCTCCTC 


GGGAAAGTGG 


TGCGCGAGGA 


480 


AGCTATTTCG 


GTGGGCCTTT 


GGCCCTGGCC 


GGTAGAAGCG 


GCTTATAGCC 


GCAGAACAAA 


540 


CCACCAGTTC 


ACACTGGTGG 


TTTTGATTTA 


AAAAACTTGA 


TACATAAAAA 


TAAAAGTCTA 


600 


TATAAAGGAT 


GGTAAAATTC 


CTGTTGTCCG 


ATTTGGACAA 


TATCCTAAAT 


AGTTACAATA 


660 


TATGGTCTAT 


ACTTTTTCTT 


AGGAGAAAGC 


TAGATGTACA 


GACGTTTGAG 


AGATTTGAGG 


720 


GAGGATCATG 


ATCTGCCCCA 


AAAGCAAATA 


GCTACAATAC 


TTTCGTTTAC 


AAATTCAGCT 


780 


TATGCCAAAA 


TTCAACGGGG 


TGAGCATGCG 


TTGACGGCTG 


ATGTATTGGT 


TAAACTCTCA 


840 


GATTTCTATC 


ACGTCAGTAC 


AGACTATTTA 


TTGGGATTAA 


CTGATTTTCC 


TGATAAAATT 


900 


CG CTTT AG AA 


AAT AATCTCC 


TCAATTTCAT 


AGAGTTTGAA 


AATCAGTGAG 


ATT TTTT ATT 


960 


TGCCCTTTGA 


CAACTGAATA 


GCCTAAAATG 


GTACTTTCCT 


CATTTGTGGA 


GCAAATTTGA 


1020 


ATGGCTCGCC 


ATGATAAGAG 


CGATTTTAAA 


ATCATCAATA 


AAATAGAGCG 


ATACTTTATA 


1080 


TGCCATGATA 


CAAATGATAT 


ACAATGATAC 


TTCTGACCGT 


TCAGCCTGCC 


AACGTAAAAG 


1140 


AGCAGCAAGT 


GAAATTCTTA 


TGATGACTTC 


ATCAGTCATG 


CCACGTTGAA 


TGTGTGAGTT 


1200 


TGTTAGATAA 


ACCCAATTAA 


TCCTCAAAAG 


GTTCCCCGAA 


CCTTTTCACT 


TCTACAGACG 


1260 


CATCACGTGG 


AGTGTGTAAG 


CTTGTTGCTA 


AAAGCGTAAA 


AACCTTGGAA 


CGAAAGGAAT 


1320 


AAT AG ACTTT 


CTGCGAAACA 


AAAATATAAT 


ACAATAAAAC 


TATGAATGAT 


GAAGCAAGTA 


1380 


AACAATTGAG 


CGATAGCCGT 


TTCAAGATCC 


TTGTAGGTGT 


TCAGCGCACG 


ACTTTTGAAG 


1440 


AGATGTTAGC 


TGTGTTAAAA 


ACAGCTTATC 


AACGTAAACG 


CGCAAAAGGT 


GGACGAAAAA 


1500 


GCAAATTAAG 


CCTAGACGAT 


CTCCTTATGG 


TAACTATTCA 


ATACATGCGA 


G AAT AG AG C A 


1560 


CTTATGAACA 


AATTGCGGCT 


GATTTTGCCA 


TTCACGAAAG 


CAACTTAATC 


CGTCCGAGTC 


i.620 


AATGGGTTGA 


AGCAACTCTT 


ATTCAAAATG 


GTTTTACGAT 


TTCAAATTCT 


GCCTTAATTC 


1680 


TGTAAAAACA 


GTAAAATTCG 


AAGGATTGTA 


AGGTAAGAGT 


TTTTTTCTTT 


CTGAAAAAAT 


1740 


GGT AT AAT AG 


CAATCAAAAC 


TAGAAAATAA 


AACGGAATTT 


GGAACAGATT 


TGTCTCTATC 


180C 


CTAGTAGAGT 


GG TG AT ACT A 


TGAAGATTAG 


TAAGAGGCAC 


TTATTAAATT 


ATTCCATCTT 


1860 


GATTCCCTAC 


TTGCTTTTAT 


CTATTTTGGG 


CTTGATTGTC 


CTCTATTCGA 


CCACCAGTGC 


1920 


TATTTTAATT 


GAAGAAGGCA 


AGAGCGCCTT 


GCAGTTGGTT 


CGAAACCAAG 


GAATCTTTTG 


1980 



WO 98/18931 



PCT7US97/19588 



775 

CATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TT TTG AG AAA 204 0 

TGAGCGACTA ATCATTTTAG TTATATTAAT AG AAATG CTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCAGCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 222 0 

GCAAGAAGAA ATAGCTACTT ATG ATTTTC A AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280 

TGCTTTTAAT GATTGGCGAT TCGTTCT CCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 23 4 0 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTC ATTATGTATA CAGTTAGTGG 24 00 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 24 60 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2 52 0 

TGTAGCCAAG CCCTTTACTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGGTCTTGG 26 4 0 

AAACTC G ATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC C AG T CTT ATT TTAGCTCTCT TGTTTTTCAT 2 7 60 

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2 82 0 

ACTCGGTGTC CGAGGGATGA TGTTGGTT C A GGTATTTGTC AATATCGGAG GGATTTCGGG 2 880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2 940 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCACTGAAA AACGCCCTAA 3000 

ATTGTACCGA G AATTGG AAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3 0 60 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 3120 
AAGTCTTGAT TCTAACAGAA TTACTGGAAG AT ATT ACT AA AAATATGCTT GCCCCACAGA 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3 2 40 

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3 300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AG CTT ATG AA ATCAATCATC 3 3 60 

AAAATAATAT TGATGAGGAC TATTTAGGTA AATTATCTAC AAC G ATT AAA TTGGTAGCAG 3 4 20 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTG AC AG CCC 34 80 

ATCCAACACA AGTGCAACGC AAAAGTATGT TGGATTTAAC AAATCATATT CAT AGTCTTT 3 54 0 

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3 600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3 6 60 

TG ACT AACG A AATCACGAAT- GCTATGGAAT ATT AT AAC AG CTCCTTTTTG AAAGCTGTAC 37 20 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTCAAT TTAAAACAGG 3780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCG TGATGGAAAT CCATTTGTTA 3 84 0 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3 900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3 960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020 

AG CTTT ACC G TCGTG CCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4 080 

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 414 0 

TGATTGCC AT TCGAGATTCT CT AC TAG AAA AT AAGGGC G A GTCCTTG ATT TCAGGTGATT 4 200 

TTGTGGAATT ATTGCAGGCA G T AG AG AT AT TTGCTTTTTA CTTAGCATCA ATTGATATGC 4 2 60 

GACAAGACTC TAGCGTCTAT GAAGCCT GTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4 3 20 

ATTCTCGTT A TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 4 3 80 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 444 0 

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA G TTGGG AG AT GATGTCATCC 4 500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4 5 60 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4 620 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4 6 80 

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4 740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4 800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4 860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 4 920 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4 980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 504 0 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG C ATTTCT AT G 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 52 80 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5 3 40 

AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATCGTTTT GTCAAAATCA AATATGAATA 54 60 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 5 520 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGCC TATTGAAGGA CATGACGAAC 5 580 

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 564 0 

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCC AG TG 5 700 

ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 57 60 

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AAT AG ACT AT S820 

TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 58 80 

TTAGAGAGTC TG TGGTAGCT GAAAACAGAT AAGTGG C AAT GATGAAAATT GGGCTGAATG 5 94 0 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTT AT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 6060 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 612 0 

GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 62 40 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 6300 

AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6 3 60 

CAAGATGGAC TTGCAGAAGA AGG AT AT AAA GATGATCAAG TTAAAATTGA TTTTATGAAC 64 2 0 

TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGACTAAAC AAT TGGTTG C AAATGGGAAT 64 80 

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 654 0 

CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAACATTTG 6600 

AAAAAACCAG GTGGCAACGT T AC AGGGGT A TCTGACCACA ATCCAGCTCA ACAACAAGTT 6 660 

GAACTCATCA AGGCTCTGAC ACCGAATG TG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 6720 

GAAGACAATT CAAAA 5735 

(2) INFORMATION FOR SEQ ID NO: 105: 

<i> SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTAGAGGATC CCAGCAGGTA AATTGG CTTC AG CTGGC AAA AAAGTTGCCC TCGTTGAACG 60 

CAGCAAGGCT ATGT A CGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
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CCTAGTTGCT 


GCTGAAAAGG 


ACTTGTCTTT 


TGAAGAAGTC 


ATTGCTACTA 


AAAAC A CG AT 


180 


CACTGGTCGC 


CTCAACGGTA 


AAAACTATGC 


GACTCTTCCT 


GGTACAGGCG 


TAGATATCTT 


240 


TGATGCGGAA 


GCTCACTTCC 


TTTCAAATAA 


AGTCATCGAA 


AT CC AAGCTG 


GTGATGAAAA 


300 


GAAAGAACTG 


ACTGCTGAAA 


CAATCGTCAT 


CAACACTGGT 


GCTGTTTCAA 


ACGTCTTGCC 


360 


AATCCCTGGA 


CTTGCTACAA 


G C AAAAAC AT 


CTTTGACTCA 


ACAGGTATCC 


AAAG CTTGG A 


420 


CAAATTACCT 


GAAAAACTTG 


GAATCCTTGG 


TGGCGGAAAT 


ATCGGTCTTG 


AATTTGCCGG 


480 


CCTTTACAAC 


AAACTTGGAA 


GCAAGGTCAC 


AGTCCTAGAT 


GCCTTGGATA 


CATTCCTACC 


540 


TCGTGCAGAA 


CCTTCCATCG 


CAGCTCTTGC 


TAAACAATAC 


ATGGAAGAAG 


ATGGCATTGA 


600 


ATTGCTTCAA 


AATATCCATA 


CTACTGAAAT 


C AAAAACG AT 


GGTGACCAAG 


TGCTTGTCGT 


660 


AACTGAAGAC 


GAAACTTACC 


CTTTCGACGC 


CCTTCTCTAC 


GCAACTGGAC 


GCAAACCAAA 


720 


TGTAGAACCA 


CTTCAACTTG 


AAAAT A C AG A 


TATTGAACTA 


ACTGAACCTG 


GTGCTATTAA 


780 


AGTAGACAAA 


CACTGTCAAA 


CAAACGTTCC 


TGGTGTCTTT 


GCAGTTGGAG 


ATGTCAACGG 


840 


TGGCCTTCAA 


TTTACTTACA 


TTTCACTTGA 


TGACTTCCGT 


GTTGTTTACA 


GCTACCTTGC 


900 


TGGAGATGGC 


AGCTATACAC 


TTGAAGACCG 


TCTCAATGTG 


CCAAATACTA 


TGTTC ATC AC 


960 


ACCTGCACTT 


TCACAAGTTG 


GTTTGACTGA 


AAGCCAAGCA 


CCTGATTTGA 


AACTTCCATA 


1020 


CGCTGTTAAG 


GAAATCCCCG 


TTGCAGCAAT 


GCCTCGTGGT 


CACGTAAATG 


GAGACCTTCG 


1080 


CGGTGCCTTC 


AAAGCTGTTG 


TCAATACTGA 


AACAAAAGAA 


ATTCTTGGAG 


CAAGCATCTT 


1140 


CTCAGAAGGT 


TCTCAAGAAA 


TCATCAACAT 


CATCACTCTT 


GCTATGGACA 


ACAAGATTCC 


1200 


TTACACTTAC 


TTCACAAAAC 


AAATCTT C AC 


TCACCCAACC 


TTGGCTGAGA 


ACTTG AATG A 


1260 


CTTGTTTGCG 


ATTTAAGTTG 


AGATTTAATC 


GTATCCAACA 


GCCCTCTTTG 


GGCTGTTTTT 


1320 


ACTTCTGCGG 


AATCTCAAAT 


CTGTCTTTCT 


CCTCTTTTAT 


GATATAATAG 


AAACATGAAC 


1380 


TTAAAAACTA 


CTTTGGGCCT 


TCTTGCTGGG 


CGTTCTTCCC 


ACTTCGTTTT 


AAGCCGTCTT 


1440 


GGACGTGGAA 


GTACGCTCCC 


AGGGAAAGTC 


GCCCTTCAAT 


TTGATAAAGA 


TATTTTACAA 


1500 


AACCTAGCTA 


AGAACTACGA 


GATTGTCGTT 


GTCACTGGAA 


CAAATGGAAA 


AACCCTGACA 


1560 


ACTGCCCTCA 


CTGTCGGCAT 


TTTAAAAGAG 


GTTTATGGTC 


AAGTTCTAAC 


CAACCCAAGC 


1620 


GGTGCCAACA 


TGATTACAGG 


GATTGCAACA 


ACCTTCCTAA 


CAGCCAAATC 


TTCTAAAACT 


1680 


GGGAAAAATA 


TTGCCGTCCT 


CGAAATTGAC 


GAAGCCAGTC 


TATCTCCTAT 


CTGTGACTAT 


1740 


ATCCAGCCTA 


GTCTTTTTGT 


CATTACTAAT 


ATCTTCCGTG 


ACCAGATGGA 


CCGTTTCGGT 


1800 


GAAATCTATA 


CTACCTATAA 


CATGATATTG 


GATGCCATTC 


GGAAAGTTCC 


AACTGCTACT 


1860 


GTTCTCCTTA 


ACGGAGACAG 


TCCACTTTTC 


TACAAGCCAA 


CTATTCCAAA 


C C CT AT AG AG 


1920 
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TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 1980 

ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 2040 

GGTGCCT ATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AG AAT AC GGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 2220 

TTCCTAGGTG CCGATTCGCA ACTCATCAAA CAGGGATTTG ACAAGAGCCG TCCTGTCTTT 2280 

GGACGCCAAG AAACCTTTCA TATCGGTGAC AAGGAATGTA CCCTTGTCTT GATTAAAAAT 2 340 

CCAGTCGGTG CAACCCAAGC TATCGAAATG ATCAAACTAG CACCTTATCC ATTTAGCCTA 2 400 

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 24 60 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACG CTGGCGG TGTTCGTCAT 2520 

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT CACTGAAACG 2580 

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2 64 0 

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTCTT 2 7 00 

AG AAAGG AG A TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 2 7 60 

ATCAGCTCAA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG ACAATGGAAA 2820 

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGG AGC C CATGTGACCG TTGACATCGT 28 80 

TTCTCTCCAT GATGACTTTG ATG AAAATC A CTACGACATC GCCTTTTTCG GTGGTGGTCA 294 0 

AGACTTTGAA CAAAGTATCA TTGCAGACGA CCTACCTCCT AAAAAAGAGA GCATTGACAA 3000 

CTACATCCAA AACGACGGTG TAGTTCTCCC TATCTGCGGT GGTTTCCAAC TATTGGGTCA 3 0 60 

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG GACACTACAC 3120 

GCTCAACCAG ACCAATAACC GTTTTATCGG TGACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAAT C AC C A AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3 240 

ACCGCTGGGA CAGGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3 3 CO 

TCATTATAAG AATGTCTTTC GTTCCTACTT CCACGGGCCT ATCCTCTCTC GTAATGCCAA 3 3 60 

TCTGGCTTAT CGCCTAGTTA CTACTGCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3 4 20 

CCCTGCCTAT GAGGACATTC TCAGCCAAGA AATCGCTGAA GAGTACAGTG ACGTCAAAAG 3 4 80 

CAAGGCTGAC TTTTCTTAAA C AAAGG AAAA TGAT AT C AAA GAACTCCGTT ATCTTGTCGG 3 5 40 

AGTTTTTTGT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3 60C 

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA .3 660 
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TCAGAAAAAC TAGACGTCGC TGTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3 7 20 

TTTTACACTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTACCAGA TCAGGAACCT 3780 

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 384 0 

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3 900 

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3960 

AAAAC AGTT T TTCACACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4 020 

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TCCTGACAAG 4 080 

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4140 

CAAATCGACT TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 4 2 00 

AC TGGTG AT A ATGC TGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 42 60 

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 4 320 

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAACATA TTT AC AACTT GACTCTTGAA 4 3 80 

GCCCAACAAG CTGCTCT TG A CTTTATCAAG CCAGGTGTGA CTGCTCATGA AGTGGACCGC 4 4 40 

GCTGCCCGTG AGGTCATCGA AAAAGCTGGT TATGGTG AGT ACTTCAACCA CCGTCTCGGG 4 500 

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA CGACATGGTC 4S60 

ATCGAAGAAG GCATGTCCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4 62 0 

GTTCGTATTG AAGACTGCGG TGTTGTTACC AAGGATGGCT TCAACCTCTT TACAAGCACC 4 680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 4 74 0 

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATGCT AAAGAAATTG GCTGCAATAA 4 800 

TCTAACCCTA AGTGTCTGGA ATGATAACGA GGGTGCTCTC CGCTTTTATC AAAGACAAGG 4 8 60 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTGAT TAAGAAGTCA TCTATCAAAA 4 920 

GATGTTAGAA AAAGTTC AAT TTCACTAGAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4 980 

AT AGT AT C AG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT S04 0 

TTAGTTCTGT ACTGCACACG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG 5100 

T AGT AAT C AA TATAGTCTAT AATGACTTGT TCCAATTGGT TAAGTGATTT AAATCTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATTCCAT CATACCGTTG 5 220 

TCTTGGCTGT TTCCCTTGCG TG AC AT AG AT GCTTGAATTC CCTTATTCTC TAGGAACCGA 5 2 80 

TGATAAGAAT CGTGTTGGTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5 34 0 

TGCTTCTCTT TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 54 00 

GAAAGATTAA AAGCAATAAT TTCG CTGTT A AAGCCATCTA AAACTGGTGA TAAGTAAAGC 54 60 
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TTTTGAGTAC TTGCTCGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGC ATT TT AGCTTGT A AATTGAGTAC TTTCATCAAG 564 0 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 5 7 60 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA ACGAGCTAGT 5 820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCACTCA TTAATTCTTG AACAATTTCT 5 380 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5 94 0 

CATTCTCCGC TCTCACGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CT AGC C AG TT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 62 4 0 

AAACACAAAA AGAAAGGAAA CTCACATGAA CACTTTACCA AATCATCACT TCCAAAACAA 6 300 

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 63 50 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 64 20 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTCTCC AGTTCCTCTT 64 80 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO : 10 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
TTTTCAACCC ATATCGTGGC TCCTCAATAC TACTTACTCA CAACTATGCT AT C AG AG ACT 60 
TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGG AC AT 12 0 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 
CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 24 0 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 3 00 
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AGCATTTTCT 


ATGAATTTTT 


AAACCATGTC 


CTAGCCTTAA 


AAAATAAATT 


TAAAAAATCA 


360 


TAGGAGTTTA 


AAATGAAACA 


ACTAACCGTT 


GAAGATGCCA 


AACAAATTGA 


ATTAGAAATT 


420 


TTGGATTATA 


TTGATACTCT 


CTGTAAAAAG 


CACAATATCA 


ACTATATTAT 


TAACTACGGT 


480 


ACTCTGATTG 


GGGCGGTTCG 


ACATGAGGGC 


TTTATCCCTT 


GGGACGACGA 


TATTGATCTG 


540 


TCCATGCCTA 


G AG AAG ACT A 


CCAACGATTT 


ATT AAC AT TT 


TTCAAAAGGA 


AAAAAGCAAG 


600 


TATAAGCTCC 


TATCCTTAGA 


AACTGATAAG 


AACTACTTTA 


ACAACTTTAT 


CAAGATAACC 


660 


GACAGTACGA 


CT AAAATT AT 


TGATACTCGA 


AATACAAAAA 


CCTATGAGTC 


TGGTATCTTT 


720 


ATCGATATTT 


TCCCTATAGA 


TCGCTTTGAT 


GATCCTAAGG 


TCATTGATAC 


TTGTTATAAA 


780 


CTGGAAAGCT 


TCAAACTGCT 


GTCTTTCAGT 


AAACATAAAA 


ATATTGTCTA 


TAACCATAGC 


840 


CTTTTAAAAG 


ATTGGATACG 


AAC AG CCTTC 


TGGTT ACT CC 


TTCGACCGGT 


TTCTCCTCGT 


900 


TATTTTGCAA 


ATAAAATCGA 


GAAAGAAATT 


CAAAAATATA 


CTCGTGAAAA 


TGGGCAATAT 


960 


ATGGCTTTTA 


TCCCTTCAAA 


ATTTAAGGAA 


AAGGAAGTCT 


TCCCAAGTGG 


T AC CTTTG AT 


1020 


AAAACAATCG 


ATTTACCCTT 


TGAGAATTTA 


AGCCTTCCTG 


CACCTGAAAA 


ATTTG AT ACT 


1080 


ATTTTGACAC 


AATTTTATGG 


AGATTATATG 


ACCCTACCAC 


CAGAAGAAAA 


ACGCTTCTAC 


1140 


AGTCATGAAT 


TTCACGCTTA 


TAAATTGGAG 


GATTAGGATC 


CAATATTTAG 


AAAAAAAAGA 


1200 


AATTAAAGAA 


ATTCAACTAG 


CCCTGCTGGA 


CTATATTGAT 


GAGACTTGTA 


AGAAACATGA 


1260 


TATTCCTTAT 


TTTCTCAGTT 


ATGGAACCAT 


GCTTGGACCC 


ATCCGCCACA 


AAG GT ATGAT 


1320 


TCCTTGGGAT 


GATGATATTG 


ATATTTCCCT 


TTATCGTGAG 


GATTATGAGC 


GTTTACTGAA 


1380 


GATTATTGAA 


GAAGAAAATC 


ACCCTCGCTA 


CAAGGTTCTT 


TCCTACGATA 


CATCTTCTTG 


1440 


GTACTTCCAT 


AATTTCGCAT 


CGATTTTGGA 


CACTTCTACT 


GTTATAGAAG 


ACCATCTTAA 


1500 


GTACAAGCGT 


C ATGAT AC C A 


GCCTTTTCAT 


CGATGTCTTC 


CCAATTGATC 


GATTTACAGA 


1560 


CTTGAGCATT 


GTCGACAAGA 


GCTATAAGTA 


TGTGGCTCTT 


CGTCAACTAG 


CTTATATCAA 


1620 


AAAATCACGA 


CCAGTTCACG 


GTGATAGCAA 


ACTAAAAGAT 


TTTCTTAGAT 


TATCTAGCTG 


1680 


GTACGCTCTC 


CGATTTGTCA 


ATCCTCGCTA 


CmTACAAG 


AAAATTGATC 


AACTAGTCAA 


1740 


AAATGCTGTA 


ACCAACACTC 


CTCAATATGA 


AGGAGGACTT 


GGGATCGGTA 


AGGAAGGGAT 


1800 


GAAAGAAATC 


TTCCCAGTTG 


ATACCTTTAA 


AGAACTGATT 


TTAACTGAGT 


TTGAGGGCCG 


1860 


TATGTTGCCT 


GTTCCCAAAA 


AATATCACCA 


ATTTTT AAC C 


CAGATGTATG 


GCGATTATAT 


1920 


GACACCACCA 


TCAAAAGAAA 


TGCAAGAGTG 


GTATAG7CAT 


AGCATTAAAG 


CTTATCGCAA 


1980 


AAACTGATTG 


AGGGGG ATT A 


TACAAACTAC 


TAAGATAGAC 


GTTATTCAAA 


AACATAATTT 


2040 


TAGTAGAAAA 


TGAAATACAT 


ATTCCCACAA 


TAAAACGCAT 


CATATCAAGG 


TTTTTGAAAA 


2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT T AATTT AT AG AAATATTTTA GTATTCCTGT 2 22C 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 22 80 

AAGCTACGAC TTTCAACTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2 340 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2 4 00 

TTTGTTCCTA CTGTGTTAAT GAT AG CTTG A ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2 4 60 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGCCCTT GCAAGCCATG ACT AG C AAAG 2 520 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2 580 

AAGTTCAAGG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2 64 0 

GGCAAGTGTA CATAAGAAGC TTCAAAGGCT TT ATAGAG AG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 27 60 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2 82 0 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2 880 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2 94 0 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3 000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AAC AG CC ATT GAGTCACCTG AGTGGACACC ACCACGTTCG 3L20 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AG AACTGG GT GGTCTGGACT AGCCTTAACA 3 2 40 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3 3 00 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3 3 60 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3 4 20 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3 4 80 

A7GGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3 540 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3 600 

AATGGCTCAA AG T AAAGCTT ATCTG AT AC A GAGAAGTCTG TTGAAACGGT CTCTCGGTTT 3 660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3 720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTCGAC CTGAACCTAG GACAAGTACA 3 780 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3 840 
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GTTTCGGAGT CGAACTCTCC CGCACAAGTG TCTACCATCT TATAAACTGG AACAATCTTG 39 00 

TTTTCCAAGC GAAGTTGGCG AACTTTATCA TCAGTCGTTC CCCAGAGTTC AGCAATCTTA 3960 

CGGTCTGAAA AACCATTAAG TTTGGCTGTT TTCAAAACTT CTAAATCTTG TGGATGAGCA 4020 

CCCAATTCTT GCTCAATTTC AAAGATATGC AAGAGTTTAT CAAGATAGAA CATATCAATT 4 080 

TTTGTAAGCT CTGCAATTTC TTCAGGTGTG TAGCCACGAC GAAT6GCTTC TGATACGTAG 4140 

AAGAGACGGT CATC TTGGGC TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 4 200 

GCAAGTTCAG GTATTTCATT GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 4 2 60 

AGAGATTCCT CG ATGTT AC G ACCCATTGCC ATCACTTCTC CAGTCGCCTT CATTTGTGTA 4 32 0 

CCGAGACGGC GTTCACCCTT TTCAAACTTG TCAAATCGGA AACGTGGAAT CTTAGCAACT 4 3 30 

ACGTAGTCAA GGGCTGGTTC AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 4440 

TCATCCAAGG TCAAACCTAC TGCAATCTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 4 500 

GCTTTAGAAG CAAGGGCTCA CGAACGTGAT ACACGAGGGT TTACTTCGAT AACATAATAC 4 560 

TTGAAGCTGT TAGGATCAAG AGCTAGCTGA AC ATT AC ATC CACCTTCAAT CTTG AGGGC A 4 62 0 

CGAATAATGC TCAAGCTCGC ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTGC 4680 

GCAGGGGCAA ATACAATGGA ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 4 74 0 

TTACAAACAA CCAAGGCATT GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 4 800 

AAACCGGCAA TCGAACGCTC AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 4 860 

TCAGTGATTT CACGCAATTC TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 4 920 

GTAAAGGCTG GACGAACGAT GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTCT 4 980 

TCTACTGTGT TAACAATTTC AGATTCTGGA ATGGGTTGTT CAAGCTCTTC CATCAATTGT 504 0 

TTAAAGAGGT CACGGTCCTC CGCTTGGTCA ATGGCAGATA ATTTGGTACC CAGAAGTTCA 5100 

ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 5160 

TGACCACCGA GTGTTGGTAG CAAGGCATCT GGACCTTCCT TACGAAGAAT ACGTGTCACA 5220 

AACTCAAGTG TAATCGGTTC AATGTAAACC TTCTCAGCAA TTTCCTTGTC CGTCATGATG 5280 

GTTGCAGGAT TTGAGTTAAC CAAAACAACC TCATAACCTT CCTCTTTCAA CGACAAGCAA 5340 

GCCTGAGTCC CAGCGTAGTC AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 5400 

AT C AC C AT AA TTTTTTGAAT ATCACTACGT TTAGGCATAT ATAAGATATT AAGGGTGTCA 54 60 

AGCGGACAAA GCTAAAATAG GAGTTATGAC GAAGAACTGT CAGTTCTAGC AATAACTATC 5520 

TTTTTAGCAC CGTCCGTAGC CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 55 80 

CTATTCGTCG CCTCTCAGGG CGACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 5640 
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GAATTTTAGG 


AAATC AAGG A 


AGG ATTGAC A 


ATCGAAGTTG 


rTTTPTP m AP 
Oi A t L 1 L i AO 


A a. 1 O 1 O AOO I 


3 / Ul' 


TTCCGTCCGT 


GTTCAGTTAC 


ATAAATTCTC 


rr.ArGArrTT 

V. vJrtV_ Ort vjO 1 L 


TTAPTPPTTP 

I I A(_ 1 Lb 1 i O 


» I AO I I I OAT 




TGTTTAAAAA 


CTTCCATCAT 


CTCGATAAAC 


Tr^TPAA ATA 


OO 1 AOO I AOI_ 


O 1 OOToTOOC 


58 2 0 


CCAGGAGCTG 




GTATTGAArA 




O I L OO 1 A I O 1 


OTooCGCAoA 


58 80 


CCTTCCACTG 






THTPT AATAA 


TP A A rT^PTP 
L LAAo 1 OO i (_ 


TOOOAaAToC 


594 0 


TCGCGGCTGA 


CTGCATAACG 


ATGGTTCTGG 




PT A PTPPTrr 

O 1 AO 1 OO I OO 


TOTTOOOAl 1 


6000 


TCACGTACCf. 


PATPflTTHAA 




ppa aapttp a. 


1L I 1 A A AOOT 


CTTAGCCCCG 


6060 


tttgccat^t: 


CAAAGAGTTr, 


ollj 1 OOO A 1 A 


PAAATAPPSi 


AO A rTOOAAT 


TTTTCCTTGT 


6 120 


ACACCGCGAA 


TCATGTCGAG 




APPTPTTPTP 


Pptt sprrrr 
OO i L AO O i OO 


AOOATTTGAC 


6130 


AAfATAA PT P 


^-Vj 1 ^ nuO/\ 1 1 




Al 1 1 v, 1 I LAo 


llGTTGTCGA 


ATAAGGAACA 


6240 


ACTCTCACGT 


T JVC AP.TTp.pp. 


TTT AH AAA PT 


TPAPPTarr & 

A V_/\U.lj 1 AuOA 


I" O AoTooTT 


GAGACCAAAG 


63 00 


TCCACTAGCA 


PPAPP-PTPAA 






AAOAOO 1 \T\ I 


AoTAGAAACC 


6 3 60 


Tf"!'l i *T**T , P AT fv T 


» O i v_ i O I L.OO 


T & A A A rTPTT 




GGTCCGTCAC 


ATGGTCCATA 


642 0 




OA 1 U<JU 1 V— A/\ 


uv) 1 J. O I— AO OL- 




OoTTAooGAT 


AATCTTGGTA 


64 80 




1 Al V. AA 1 1 <k.L 


TP A A ATPPPT 
A O AAA A O O O 1 


uuAA 1 i 1 ILi 


Togo Ai To aa 


AAATTCATCC 


6540 


AACHTATTT 


pptt r* rrr n a 


fTTr fT A f P*T 
ul i tjL A AOO 1 


L. I ALULuL I I 


LTTlAAAAAC 


AACGACTCCC 


6600 




HAATAA^fT.A 
O AA 1 AA & uvn 


TTPATAATP A 


T»r APr ATT a A 


TAoCATAATi 


ToCTACCAAA 


66 60 


GGATAACT AA 


AP^TP AAr.AT 
AOO J \.nnun 1 




T A iPAPrrrT 
I AAuAU 1 Vjvj I 


LTCiTAATGGA 


TTCTTGGTAG 


6720 


CCGGTCATPP 


fTGTATT AAA 
k_ 1 i /\ 1 i AAA 


PAPPATTTPP 
OrtL OM ill OO 


PPTPTT A /** A T 
LL loll ALA 1 


O AAT ATOTOO 


TCCGAAGGCC 


67 80 


TTCCCT^CAA 


AAAr^GTCCC 


ATrTTGTAAT 


APTAPA ATTP 
AO A AOAA 1 1 L 


1 1 i 1 IIjI OA I 


ATTTTCACCT 


68 40 


CTCGTGGACG 


CTCAPTCCCG 


TG^TTTAAPG 
i v. * * i * n/\v*>j 


TPTTPTP.TTT 


TAPTTPPPPT 
1 AO 1 1 OOOO 1 


TTCTACTCGC 


6 9 00 


TAGTACGGAT 


TCTAAGATTG 


C C ATTCG AAC 


AAAGACACCA 


TTGGTCATTT 


GTTGCACAAT 


6960 


CCGTGATTTT 


GGTGCTTCAA 


CCAAGTGGTC 


TGCTATTTCT 


ACATCACGAT 


TGATTGGAGC 


7020 


TGGGTGCATG 


AGGATTGCTG 


TTTCTTTCAA 


ACGATCGTAA 


CGTTCTTGAG 


TCAAGCCATG 


7080 


TTGGGCATGG 


TAGTCTTCTT 


TTGAAAATAC 


AGCTCCACTA 


TCATGGCGTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATG AC AT 


CAACCTGATC 


AATGATTTCA 


T C AATGGTT A 


CAAACTGTCC 


7200 


ATAGTCTGCA 


AACTCTTGAC 


TTCTCCATTC 


CTCAGGTCCA 


GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA 


TATTGCATTT 


GGCAACGCGT 


GAGTGGTCCA 


AGTCACCTGC 


7320 


AATAGCAACT 


TTAAGACCCT 


CAAAGTGGCC 


AAATTCCTCA 


TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG 
CGTTGGACTA 
ATCCACTCCT 
CGAGCTAGTC 
AAAGGACTTA 
TTCATAGGGA 
TTGATCGACA 
CATCGCTACT 
AAGTTCTGTC 
GTAATCTGGA 
ACGCGCAGGA 
ATAGAGCACA 
AGTATCTTCT 
AACTGAAAGA 
ACGAGTTTTA 
ATAAGTAATA 
CATGACAAAC 
AGCGAGCCCT 
GCCTCACGGG 
TTAAAATCAA 
TGGGTACTGG 
GGCCTGATGG 
CTCAAGTGGC 
AGTATCCACT 
GCGACCAACA 
ATCGTCTAGT 
CAAGCGAGAA 
TGGAAAGGCA 
TGATAACATG 
ATCTGGGAAG 



CTAGGGTGTT GGCCCGAACC 
CCAATCAATT CTCTATACTA 
AAAGCAGACA GAGTCAAAAT 
TTCACATCAA AGTCAAGTCG 
TGTGTCCGTG TAGAATCCTC 
AGCTGGGCTC CAT TTTT AAA 
GTGAGGTCTT CCATGGACAC 
CCTTTAACTT TCTAAGCTTC 
ATCTCTACGA TGATTTCTTC 
CGGATTGGCA ATTCTCTATG 
CGACCATGAC CCACAATATT 
TCATCCACCA AG AT AACTTC 
CCACTTTTAA CATCATCACG 
TTTTCTAACT GCTTCAAACG 
ATACCAGCCA AGACGATCTT 
CGCGTAATCG CTCGTTTGAC 
CTCCAAAAAG AAAAGT CTCC 
ACTGCACACA GTATAGACTT 
ACACGTTTAA AGGAATATTT 
GCAAAAAGTT TCAATGTAGC 
TCACACTCTG GATTTTTTGG 
GCAGCTAACC ACTGCT C AG G 
GTCGCTCATT TTTTGACAAT 
GCAAAGGCTG GAATTCCAAA 
CCTGCCAAAC TCTCCAATTC 
AACTGTTGGG CACATTTTTT 
ATATGTGAAG CAATCTCACT 
ACAAAGAGAC CTGGTCTGGC 
ACCGCAACCA GGAGTTCAAA 
AGGGCAATGA TTTCTTCTAG. 
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ATCTCCACCA TTGATGATGG 
GTCGACCTCT GGATGGCGAA 
GGTGTCATAA AGTGTCTCAC 
TTCCAATCCA AGTTTAATCT 
AAAGAAGAGA TTGGAAACAA 
CTCAATTCCT CGCTTGATCA 
CACATGGTTC AATGCTTGTT 
TTCAGTAATC AGAACTCTGT 
AGAACGACTG GTTGGGATAT 
TCCACCATCG ACTAGAACTG 
ATCAATAGCA GCACGGATGG 
GCGGTCTGTC ACATCGACAG 
GAAACCTTTA GTATCCAATT 
TTCTTGGATT CGGTGGGCAA 
ATTCAAATCT TTGTTGCGTT 
GGTCAATTCG TCTACAACTT 
TTAAACAACG AGACTTGAAA 
CACCCTTCTA CTTTATCGCG 
AGTTATCATT TACTATAGCA 
ATCTTACAAA TTGCTAAAAT 
ATGGCAAATG GCTCTTCCAA 
CGGCAAGATA TCCATGACCC 
ATCGTGGTGT TTGCAAATAC 
TCCTACACTC ATGACAACAT 
TTCACGTGTC TGAGGGACTT 
AAGGAATTTA GCTTTATTCC 
CTCTGTCGCT ACAGACATAG 
CTTATTTACC GCTGCATCTG 
ATGATTGGTA AAATCAAGAC 
CACCTTTCGT GCTCGTTTTT 



AAGTCGT AAT 74 4 0 

TC AC AC AG AC 7 500 

CCTTATTAAC 7 560 

CTGCGACTTC 7620 

TCGGATGGTC 7 680 

ATTTCATTAC 7740 

GATTTTCTCA 7 800 

CTTGGTCATC 7 860 

TTTTTCCAAC 7 92 0 

CTAAACTCAC 7 980 

TACGACCTGT 804 0 

AAACCAAAGA 8100 

CCACAACAGG 8160 

TAAAGACACC 8220 

CGATAATCTC 82 80 

CTTTTGTTTT 8340 

TTTATAGCCA 84 00 

CTCCTTGCCT 84 60 

CAAAGCATGC S520 

CATATAATTG 8580 

AAT AAATCAT 8640 

GCTTTTCCAC 87 00 

GCTCCACATG 87 60 

TGGCTGTCTT 8820 

GACCATCAAA 8880 

GATACAATCC 8940 

CTTGGGGTGT 9000 

TCGTCTGGGC 9060 

TAGGCTTCCC 912 0 

TTGACAAGAC 9180 
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CATTATTCAT 
TTTACTGCTT 
ATAAATCCTT 
ATATTGTCTG 
TCATCATCTA 
ACTGGATAAG 
GTATAAGATA 
GC AAGGAT AT 
TCTAACCTTT 
TTCATACTTC 
GCACACCAAA 
TTTTTAGTCC 
GACAAGAAAC 
GGTACGGCAG 
AAAACGGAAC 
AGTCTTCTGA 
AACTATGATA 
TTTGAATTTT 
ATATAAAGAT 
AGAACCTTTA 
TTCTGGCATG 
GCTTTTTGTT 
TCTAACCATC 
CCAAGACGGG 
TAAAATTTTA 
CTATCAGTCA 
AC ACG ACT AC 
GGAGGAATTG 
AGGGGGCGAA 



CTCCGTCAAA 
TTTGAGCTTG 
GACCAGCTTC 
ATACACTCTC 
AAACTTCTTG 
ATTCAACTAA 
GTTG AT AATC 
CTTTTACATC 
GTTCAAAGTT 
CTCCATAAAG 
CTTTGTAATT 
TATACTATAG 
TCATTCAACT 
ATGATATTCG 
AGTTGAACAA 
TTGGACTGGG 
AATTCCATTA 
AGAGCATAGA 
ACTAATACTC 
TTTAGTAGAT 
TTAATAGAGT 
ATGCACTATG 
ACACAAAACA 
GAAAAGGTTG 
ATGGGGGAAG 
CTGGCCTACA 
TTCTTTTTAG 
CATTTTGATA 
GCTTTGAAAA 



TAGTCCTTGT 
GTATTCTTCC 
TTCTTCAGCC 
AGCAAGGTCA 
ATCTAGCTGG 
CTCAACTGGC 
TAAGAAATAC 
TAAAATTTCT 
CAAACCTTCA 
ATTTACTCTC 
AAAATTCAAA 
GGAAAGAGTA 
CCTATCCAAG 
ACTACAAGAG 
CAGTATCTTA 
TAGCCTAAAA 
CGATCATGTC 
ATTTCCAGTT 
GGAGGTAAGG 
CTTGTTTTTG 
TTTTTTGACA 
AACATTCTAG 
AAGATTTACG 
CTATTATTGG 
CTTTGTCTGA 
TTCCTCAAAA 
ATTCTATTGA 
GCAATCCTTT 
TTCAGCTTAT 



787 
AAGCCAGCAA 
TCTGTCATGA 
GTCAAGACCT 
AGCTCCCCAT 
TTAGTTGCGC 
TCCATACTGC 
ATACGGTCTT 
TGATTACGTG 
GACTGCTT AC 
TTC ATT A T AC 
ATTTTAACAT 
CATCAGATCA 
TTAAATAAAA 
CTGCTAAACA 
ATTGATCTTG 
CTAAACGATC 
AAACACGTAC 
TTCTGTTGAC 
GAGACATGAA 
TCTCTTTTTG 
TAGACTTTGG 
AAAGGGAAAT 
AGATCTTGTA 
TGAAGAAGGA 
TTTCACTATC 
AGTCCCTGAG 
TTTAGACTAC 
CGCAAGTGAC 
CCATGAGTTA 



AAGGACTGTT 
TTTGCCAGTC 
TGATAGGAAT 
TTTCGATGGG 
CTTCCATGAA 
GACTCGACGC 
CATATTGTAC 
CACGCAGGTC 
GAATTTCTTG 
CAT G AAAAGG 
ATTTACTATG 
AGGAGGATCC 
GCTACCAAAA 
C T ACT ATG C A 
AGAAATTTTA 
AAGCACGCAC 
TAAGTCTCTA 
AAAATTTCCT 
CAACTAAGTC 
TGTGCTCTTT 
GCTCTACTAG 
CATATGATAA 
TCTGACCTAA 
AATGGCAAAT 
AAGGGAAACA 
GACCTAAAAA 
AG TAT CCTCT 
CAAGAGATTG 
GCCAAACCCT 



TTCTTCTTTC 
ATTTCCTGAG 
GTTTAGCAGG 
CAAGACCAAG 
AACTTCCGTG 
AAGAACAATG 
TTTCCCAACT 
ATCAACTAAA 
AATATTTAAT 
CTACAAATCA 
AT AGTTTT AT 
TCACATGGAA 
CTGTAAACAG 
AGAGCTCAAA 
CCAACCTACC 
TGCTTGGCGA 
TGGACCTGTT 
TAAAGGTATA 
TATCAAATAA 
TATGCTCTTT 
GT AAACT AG A 
AAAT C AATC A 
CCATGACCAT 
CAACCTTACT 
TCCAATCTGA 
AGAAAACTTT 
ATCGTTTGGC 
GCAATCTATC 
TTGAGATTCT 



9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
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ATTTTTAGAT GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTGGC TAAAAGGCCA 109 80 

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT GATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC ACTATTGTTC ACTTGCGACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

AGT AG AG CAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGC T AATT TTGCCAAAC A 11160 

AAGTCAGCAA GCTGCTAACA AC C AAAG AG C CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 11280 

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGC AGCTC A 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA CTTTTCTTTT CAGACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 1152 0 

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TTACAGAGAC TTCTGAATGA 11580 

T AAAAG AG AG ATTTCACTTG GTTTTATGCC ACAAGATTAC CACAAAAAAC TCCAATTGCA 11640 

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 117 00 

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCC AG AAAT G CAGCATCAAA TTCGCTCCTT 117 60 

ATCTGGCGGA CAACAGGGAA AACTCCTGCT TTTGGATTTA GTCCTGCGCA AACCAAACTT 11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAATCAGAAA 11880 

ACTCTTTGCT ACCTATCCAG CCCGTCTCAT CACTGTTTCG CATGACCCTC GTTTCTTAAA 11940 

AG AAGTCTG C TCGATCATCT ATCGCATGAC AG AAC AC GGT TTGAAGCTAG TTAATTTAGA 12000 

AGATTTATAA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTT AC AT C 12 0 60 

TGTTTTAAAC GTTCAATCCG TTCTGAGATA GGTGGGTGGG TATAAAAGAG TTTTTGGAAC 1212 0 

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGC AC TGC TAG CATC ATC GACGTGGCGA 12180 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA C ACT AG AT GC ATCTGCCAGA AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12300 

ACCAAGCTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAG C AT AATG 12 360 

ATTTCAAGAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACCCCAC 12420 

CACATCATAC GACCTCCCAT ACTAGAAAGC ATGGTGATAG CACTAGCAAG GGCAACTGCA 12480 

ATAGTCGAAA TACGGATATC ATAATTACGA ATATGACTGA CTTCATGTCC CATAACAGCT 12 540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AGACCTGAAG TCGCAGCAAC AGCCGCATTT 12600 

TGAGGATTAG AACCTGTCGC AAAGG CATTT AAGGCTGGAT CATCAATGAT GAAAACACGG 12 660 

CGCATAGCAA TCTGAGCGAC CAG AGC CAT A TCTTCCACTA CAT GGT AG AG GTCTGGTGCC 12720 
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GTTTGCTCAT CCACCTCACG CCCTCCATTC ATGGACATGA CAATCTCTGT CGATTGAAAA 12780 

ATCATAGACA AAGCGTAGAT AAACCCGATA ATCAGTGCAA TAACCAAACC ACCAAGTCCA 12 840 

GATCTTATAA AGAGATAACC AACCGCATAA CCAACAAGAG CTAAGAGTAG GAAAAATACC 12900 

AGCAACAAAA TCCAGGTTTT TCGTTTATTG CTTGC AATTT GATCAAACAA CATCTTAGTC 12960 

ACCTAAACCG CTAAAATCAA CTTTAGGAAC CGACTTTTCC TCTTCAGGTC TTTGAAGGAA 13O20 

ATCTGCCCCT TTAAATCCAA ACATTCCAGC GATAATATTG CTCGGGAAAG TTTCT AATTT 13 080 

T AC ATTGT AG TTGCTGACAA CACTGTTATA GAGTTGACGA GAGTAAGAAA TTTTATTTTC 13140 

TCTGTTTGTC AACTCCTCTT GCAATTTAAC AAAGTT AG C A CTAGCTTTCA AATCTCGATA 13200 

GCTTTCTGCA ACTGCAAAAA TACCTGAAAC CTGACGAGTG AGGGCATCAC TGGCTTTCAT 13 260 

AGCTTCTGCT GGTGAAGTCG CTGCCGCCAC TTGGTTACGT AGTTCTGCCA CCTTTTCAAG 13 320 

GGTAGAACCT TCATATTTGG CATAACCTTT TACAGTCTCA ATCAAGTTTG GC AAGAGGTC 13 380 

ATTGCGACGT TTCAACTGAA CATCAATCTG ACTCCAAGCC TCCTTGGTTT GCATACGATT 13440 

TTTAACCAAA CCCTTATAGC TAACAATCAC AAAAATAACA ATAAGAGCGA TAACTCCAAG 13 500 

AATAATCCAA GTCATAATAT AAGT CCTTT C TGCTTTT AG A TTAGTACCAG TATATCAAAT 13 560 

TTTCT AT GAT TGTGGTAAAA TAAGATGATA CTAAAGAAGG AAATAACTAT GAAACCAAAA 13 620 

AC ATTTT AC A ACTTGCTTGC CGAGCAGAAT CTTCCACTTT CGGACCAGCA AAAAGAACAA 13 680 

TTTGAACGTT ATTTTGAGCT CTTGGTCGAG TGGAATGAGA AGATT AATTT GACGGCGATT 13 7 40 

ACGGACAAGG \AGAAGTTTA TCTCAAACAT TTTTACGATT CGATTGCACC CATTCTTCAA 13 800 

GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 13 860 

AGTCTACCAA TGAAAATTCT CTATCCGGAG TTAGATGTGA CCATTATTGA TTCACTCAAT 13 920 

AAGCGCATCA ACTTCCTACA ACTCTTGGCT CAAGAACTGG ATTTGAACGG AGTTCATTTC 13 980 

TACCACGGAC GTGCCGAAGA TTTTGCCCAA GACAAGAACT TCCGTGCTCA ATATGATTTT 14 040 

GTAACAGCTC GTGCGGTTGC CCGTATGCAG GTCCTATCTG AATTGACTAT TCCCTACCTT 14100 

AAGGTTGGTG GCAAACTATT AGCACTCAAG GCTAGCAATG CGCCTGAGGA ATTATTAGAA 14 160 

GCTAAGAATG CCCTCAATCT C CTTTTT AGT AAGGTCGAAG ACAATCTCAG TACGCCCTAC 14 2 20 

CG AAT AG AG A TCCGCGCTAT ATCACAGTGG TAGAAAAGAA AAAAGAAACA CCAAATAAAT 14 280 

ATCCACGTAA GGCTGGTATG CCAAATAAAC GCCCACTTTA AATTTTTTAG TAAACAAATG 14 3 40 

TTTACAAAAT CAGCCTCGCT CTTTTATTTC TAGGCTCGGG AAAAAATGAT TTACAAAATC 144 00 

AGCCTCGCTC TTTTATTTCT AGGCTCGGGA AAAAATGATT TACAAAATCA TTTTTTTCTG 144 60 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 14 5 20 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCT ATTGCTG 14 580 

TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14 640 

TCTTGATTTG TTAC 14 6 54 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i> SEQUENCE CHARACTERISTICS: 

< A) LENGTH: 6405 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND E DN ES S : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGC CTTCT AC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 1B0 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC ACTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 4 80 

GGGTTG CTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCACCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 7 80 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 9 60 

AATCTAAAAG GACGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCGTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGT AGC AG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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GGGTGAGATT TTCTATGCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 12 SO 

GGAAATCGAA GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATCATGTCA ACTGTGCAGG 1320 

TCTTGCTGAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GCTTGACCAT 13 80 

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 14 40 

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500 

TACAACAGCT TTAGTGAAAT ATGTAGCTGA AGCCCATGGA GAAGATGTTG ATCAGTGGAA 1560 

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 16 20 

CCGTATGGTT GACTATCTAG GAAAAGCTCT GGCAAATATT TGCTACGTTG CCAATCCAGA 16 80 

ACTGGTTATT CTTGGTGGTG GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 1740 

T AC AGCCTT G AAAGAGCCTT TGGTACCAAG TTTAGCAGAA AAAAC ACG AT TAGAATTTGC 1800 

CCATCACCAA AATACAGCAG GGATGTTGGG TGCATATTAT CATTTTAAGA CAAAACAATC 18 60 

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTGTCTACGA TAGCCGTTGA 1920 

GTTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCG A TTGATTTCCA 1980 

AAAAGTAGGC AT AAATC AAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 2 04 0 

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGT AT AG GTCTCGCTAT 2 100 

CTTTATTGGC AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160 

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AG AC AATTTT 2 220 

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CGTCTTGGTC TGTCAAAGCT TCGCAGACCA 2 280 

CACCTAGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCAGAA CTCCCTGTTC 2 34 0 

CAAAGAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCCTTCT AG TTGG AT TT 2 400 

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA TATTGCTATA ACTTCTGAGG ACTCGTTTGG 2 4 60 

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2 5 20 

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 2 5 80 

GAG AAAT ATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATCTGTA ATCGTTTCAG 2 64 0 

CTTG C AAAAA ATAGCGAGCG ATTTCTTCTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2 700 

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 2 7 60 

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CATAATATAA GGATAGATAA 2 820 

ATAAAAACGC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AAGCTTCTCT CTTTTCCATA 2 880 

ATTTTCTACT CAAATTGTGG TACAATTAAG AGTAAGATTT TAAGTTAGAA ATGAGACTGA 2 94 0 
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TTTGTATGAG 


AAAATTTAAC 


AGCCATTCGA 


TTCCGATTCG 


GCTTAATTTA 


TTGTTTTCAA 


3C00 


TCGTCATTTT 


ACTCTTTATG 


ACCATTATTG 


GTCGTTTGTT 


GT AT ATGC AG 


GTTTTGAACA 


3060 


AGGATTTTTA 


CGAAAAAAAG 


CTAGCTTCAG 


CTAGTCAGAC 


CAAGATTACA 


AGCAGTTCAG 


3120 


CCCGTGGGGA 


AATTTATGAT 


GCTAGTGGAA 


AACCTTTGGT 


AGAAAATACG 


TTAAAGCAGG 


3180 


TTGTTTCCTT 


TACGCGTAGC 


AATAAAATGA 


CGGCTACAGA 


CTTAAAAGAA 


AC AG C T AAAA 


3240 


AGTTACTGAC 


TTATGTCACC 


ATCAGTTCTC 


CAAATTTGAC 


AG AACGC CAG 


CTGGCCGATT 


3300 


ACTATTTGGC 


TGATCCTGAA 


ATCTATAAAA 


AAATAGTGGA 


AGCTCTCCCA 


AGTGAGAAAC 


3360 


GCTTGGATTC 


AGATGGCAAT 


CGTCTATCCG 


AATCAGAACT 


GTATAACAAT 


GCGGTCGATA 


3420 


GTGTACAAAC 


GAGTCAACTA 


AACTATACAG 


AGGATGAAAA 


GAAAGAAATC 


TATCTTTTTA 


3490 


GTCAGTTAAA 


TGCTGTTGGA 


AACTTTGCGA 


CAGGAACCAT 


TGCGACAGAT 


CCTCTAAATG 


3540 


ATTCTCAGGT 


GGCTGTTATT 


GCCTCTATTT 


CAAAGGAGAT 


GCCTGGCATT 


AGTATTTCTA 


3600 


CTTCTTGGGA 


T AG AAAGGTT 


TTGGAAACTT 


CCCTTTCTTC 


TATAGTTGGC 


AGTGTATCCA 


3660 


GTGAAAAAGC 


TGGTCTCCCA 


GCGGAAGAAG 


CAGAAGCCTA 


TCTTAAAAAA 


GGCTATTCTC 


3720 


TAAATGACCG 


TGTAGGAACC 


TCCTATTTGG 


AAAAGCAATA 


TGAAGAGACC 


TTACAAGGAA 


3780 


AACGCTCGGT 


AAAAGAAATC 


CATCTGGATA 


AATATGGCAA 


TATGGAAAGC 


GTGGATACAA 


3840 


TTGAGGAAGG 


TAGTAAGGGA 


AACAATATCA 


AACTGACCAT 


TGATTTGGCT 


TTCCAAGATA 


3900 


GCGTGGATGC 


TTTACTGAAA 


AGTTATTTCA 


ATTCTGAGCT 


AG AAAATG GT 


GGAGCCAAGT 


3960 


ATTCTGAAGG 


TGTCTATGCA 


GTCGCCCTTA 


ACCCAAAAAC 


AGGTGCGGTT 


TTGT CT AT GT 


4020 


CAGGGATTAA 


ACATGACTTG 


AAAACGGGAG 


AGTTGACGCC 


TGATTCCTTG 


GGAACGGTAA 


4080 


CCAATGTCTT 


TCTTCCAGGT 


TCGGTTGTCA 


AGGCGGCGAC 


CATCAGCTCA 


GGTTGGGAAA 


4140 


ATGGAGTCTT 


GTCAGGAAAC 


C AG AC CTTG A 


CAGACCAGTC 


CATTGTCTTC 


CAAGGTTCAG 


4200 


CTCCCATCAA 


TTCTTGGTAT 


ACTCAGGCTT 


ACGGTTCATT 


CCCTATCACA 


GCGGTCCAAG 


4260 


CTCTCGAGTA 


TTCATCAAAT 


ACCTATATGG 


TCCAAACAGC 


CTTAGCTCTT 


ATGGGGCAAA 


4320 


CCTATCAACC 


CAATATGTTT 


GTCGGCACCA 


G CAATCT AG A 


GTCTGCTATG 


GAGAAACTGC 


4380 


GTTCAACCTT 


TGGCGAATAT 


GGCTTGGGTA 


CTGCGACAGG 


AATTGACCTA 


CCAGATGAAT 


4440 


CTACTGGATT 


TGTTCCCAAA 


GAGTATAGCT 


TTGCTAATTA 


CATTACTAAT 


GCCTTTGGGC 


4500 


AGTTTGATAA 


CTATACGCCG 


ATGC AGTTGG 


CTCAGTATGT 


AGCAACTATT 


GCAAATAATG 


4560 


GTGTTCGTGT 


GGCTCCTCGT 


ATTGTTGAAG 


GCATTTATGG 


TAATAATGAT 


AAGGGAGGAC 


4620 


TGGGTGACTT 


GATTCAGCAA 


CTGCAACCGA 


CAGAGATGAA 


TAAGGTCAAT 


ATATCCGACT 


4680 


CCGATATGAG 


CATCTTGCAC 


CAAGGTTTTT 


ATCAGGTTGC 


CCATCGTACT 


AGTGGATTGA 


4740 
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CAACTGGACG TGCCTTTTCA AATGGTGCCT TGCTATCCAT TAGCGGAAAA ACAGGTACAG 4 800 

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGCTG GCCTATGCCC 4 860 

CATCTGATAA TCCCCAAATC GCTGTCGCAC TGGTCTTTCC TCATAATACC AATCTAACAA 4 920 

ATGGTGTAGG ACCTTCCATT GCGCCTGACA TTA7CAATCT GTATCAAAAA TACCATCCAA 4 980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA ACACCTATTG CCAAGTTGAT TGACAGTTAT 504 0 

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCCTTTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

ACATATTGTT CTATTTGTGG ACGTTTGACA GACGACGATC CTTGTTCTAT CTGTACTGAT 52 20 

CCGACTCGTG ACCAGACAAC AATTTTAGTT CTTGAGGATA GTAGAGATGT GGCAGCCATG 52 80 

GAAAATATCC AAGAATACCA TGGACTCTAT CATGTCCTTC ATGGCCTCAT TTCTCCTATG 5 3 40 

AATGGTATCA GTCCGGACGA TATCAATCTC AACAGCCTTA TGACTCGTCT TATGCATAGT 54 00 

G AGGTT TC AG AAGTGATTGT GGCGACTAAT CCTACAGCGC ATGGTGAAGC GACTTCCATG 54 60 

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT ATCAAGCTTA CGCGTCTAGC ACGAGGTCTC 55 20 

GCTGTCGGAG CGGACATTGA GTATGCGGAC GAAGTGACAC TCTTACCAGC CATTGAAAAT 55 80 

CCGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5640 

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTG TGATGACTAG GCTCAGGTTC 57 00 

AACTTTTAAA AAACCAAGCA AAT ATG AT AT ACTAAAGAGC GACTATTCTA GTAGAATTAG 57 60 

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATGGT GGACGGAGTG CGG AACGCG A 58 20 

AGTCTCTGTC CTTTCAGCTG AGAGTGTCAT GCGTGCGGTC GATTACGACC GTTTCACAGT 5 8 80 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACAGGAAT TTAGTCATGC 59 4 0 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TGAAACCATT G ATTGGG AT A AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GGCCAATGGG 60 60 

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 6120 

CAT TTTG T C A TCAAGTCTTG CCATGGATAA AATCACGACT AAGCGTCTTC TGGAATCTGC 61 BO 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CGTTGAAGGC GATGATGTGA CTCCTAAAAT 62 40 

CGCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 6 300 

TAGTGTCGGT ATTTCTAAGT CTGAAAACCA AGAAGAACTC CGTCAAGCCT TAAAACTTGC 6 3 60 

CTTCCGATAT GACAGCCGTG TCTTGGTTGA GCAAGGAGTG AATGC 6405 
{2) INFORMATION FOR 3EQ ID NO: 108: 



WO 98/18931 



PCT/US97/19588 



794 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 108: 

CGAGCTCCGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAACCAATC 120 

CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTCTTTGAA CGCAGTAAAG C TCG CGGTTT 180 

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 2 40 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 300 

TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 360 

AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 4 20 

AATGATTTGG CTCAGGCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTGATTAAA 480 

ACCTTGTCCT TAATGGAAAG AAAGCACCAA ATTCCTTTTC AAGAAGATGG TAGTCTGACA 54 0 

TT AG AAATT A AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GG AAGGCG AG GAGGACGACC TTTTTCTACG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAACAAACT CGCTGACCGC 7 20 

AATAAGGGAA CAGCAGCAGA AG CCAAAATT ATTGATATCC TAGAACACAC TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAACT ATGCTGGCTA TATTCGTTCA 840 

AAAAATCAGA AAATCAGTCA ACCGATTTAT GTTAAGAAAC C AG C C CT AAA ATT AG AAGG A 900 

ACAGAAGTTC TCAAAGTCTT TATCGATAAA TACCCAACCA AGAAACATGA TTTCTTTGTC 960 

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 10 20 

TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTC TTGTTAAGGA ACCACAAACT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 114 0 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATGCAGTGCA TATCAAGGCT 12 00 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAG ATGTTTCTTA TTATGTGACC 12 60 

GAGCGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA CGTGACAGAC 132 0 

CGAGTGGTGC CAATGCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 13 80 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 144 0 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 1500 
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GATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTT CTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATCTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800 

GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGACTTT TGGCTTGCGC 1860 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 19 20 

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT 1980 

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2 04 0 

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGACATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC CATAGAAGCT GAGCGTGAAG TCGAAGCCAT GAAAAAGCCT 2 220 

GAGTATATGG AAGAATACGT GGCTGAACAC TATGATGCAG TTGTATCAAG TATTGTCAAA 2 280 

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 2 34 0 

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2 400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2 4 60 

G AG ATT G ATT TTTCATTCGT ACCTAGTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2 520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2 580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA CAAGAAAAAA 2 64 0 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700 

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 2760 

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2 820 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2 880 

CGCGCCTTAC GAAGAGGGCA ATATCTGCAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2 94 0 

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3 000 

AGTTCCCCTT AAGGTCT AT A TAAAAGATGG CTACGCTAAG C TTCTTTT AG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCG AG AT AT 3120 

CGCGCGTGTG ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3 24 0 
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AAAGCACAAT ACAAAAGTTG GGACTTGGGC GAAAATTACT GTCTTGAAGG GAGCTCTCAA 3 300 

GTTTATTGAA TTGACAGAAG AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3 3 60 

AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATCATGT 3 420 

GGAATGGTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT TTTGCTAAAA AATACAATAC 34 80 

CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3 540 

GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3 600 

GACGGCTGTA GATCAAAATG CACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3 660 

AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 37 20 

TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 37 80 

TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 3 840 

GGACACGGAG GATTATCCTT GCTCGCTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3 900 

GGCACACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT GAAAATCCAG GCCATTTGCA 3960 

CCGTCGCGAT GAGAATGGCA ATCGTATTCA ACT AC G CTTT GCGACCTTAC TAGCTAAGAA 4020 

AATCAAGTAA ACACACATGA AGATTAGGAA TTTTCCTGAT CTTTTTTCTT TTTTACGAAT 4 080 

GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140 

TGTGTTAGAG GATAAACTTG AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 4200 

CCATTTGCGT CAAGGACCAA GTGTACGGAC GCATTTTGCC CATAAATCCT TAAAAGACTG 4260 

TGATTTTTTC TTTGAAAATG AAAGT CC AG A ACACCTGGCC AATAAGGAAT CCCTCTATCA 43 20 

C TGGTTG AAA AAAGAGACAA AGGTT CAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 4 3 80 

TGCGGATCTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT CAGTGTAGTC CCTTGCCTCA 4440 

GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTGTGGTT 4 500 

GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG GTTTTCTTTA 4 560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 4 620 

ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC CATTATCAAA TCAAGGAATT 4 6 80 

TTCCTATGGT CAAGGTAGTT TATTGGAAAT AT TGCGTCTT CCCTATAAGA GACAAAAAAT 4 74 0 

ATCTC AT TTT AC AGTT TCTG AGGACAAGGA CATCTGTCGC TATATCCGGC AACAACTTTA 4 800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 486 0 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGG GCAAATTTTT 4 920 

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4 980 

TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
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AAATATGGTA 
TGAAAAAGAT 
CTTAAAAGAT 
TAGTGCGGAT 
GAAGCTTTAC 
AGAGTACTAT 
TGAGCCTGAA 
AAAGCTGCAG 
TTCACAACGT 
AACCTTCGCT 
TAAAGAAGTT 
GGTTCGCCGT 
TGCCAAAACC 
CAAGAGTGCT 
TTTGGTAGCA 
AAAAATCTTG 
TGAATACAGT 
GGGTGAGGAT 
CGAAAATCAA 
TATGCTTCTC 
TCACAGTATG 
TATCTTTTTG 
GGAAGAAGTG 
CCGTGGAACA 
AGATCAAAAT 
CCAAGAGTAT 
CATTCCACAC 
CTCACCCTTG 
CTACCTCAAG 



GAATAGAAAG GATGGAGGAA 
ACATGGGATC TATCAACGAT 
TTAACAGAAC AATTGGAGAC 
AACCTACTAG AAATCACTGA 
GCTTATGCTC ATATGAAGAA 
GCCAAGGCCA TGACACTCTA 
TTTATGGAGA TTAGCGAAAA 
GTTTATCAAC ACTATTTTGA 
GAAGAAGAAT TATTGGCTGG 
ATCTTGGACA ATGCGGATAT 
CAGCTATCTC ATGGGACTTA 
GGTGCCTATC AAGCTCTTTA 
TTGCAAACCA ATGTTAAGGT 
CGTCATGCAG CCCTCGCAGC 
GCAGTTCGCA AGCATTTGCC 
GGGATTTCAG ATCTCAAGAT 
TTTACCTACC AAGAAGCCTT 
TACTTGAGCC GTGTTAAACG 
GGCAAGCGTT CAGGTGCCTA 
AACTGGCAAG ACAATCTGGA 
CATTCAAGCT ATACTCGTGA 
GCTGAGATTG CCTCAACTAC 
GAAGACGACG CAACACGCTT 
GTTTTCCGCC AAACTCAATT 
GGGGAGGTCT TGACAAGCGA 
TATCGTTTGA GTAAGGAAGA 
TTCTACTATA ACTACTATGT 
GCTGAAAAAA TTGTCCATGG 
GCAGGTAAGT CGGACTATCC 



797 

TCTAATGGTA TTACAAAGAA 
CTACCCAACT GACCAGGCTT 
AGTAGCCCAG TATGAAGGCC 
ATTTTCTCTT GAAATGGAAC 
TGACCAGGAT ACACGTGAAG 
CAGCCAGTTA GACCAAGCCT 
GCAGTATGCT GACTTTTTAG 
CAAGCTTTTG CAAGGCAAGG 
AGCTGGAGAA ATCTTTGGTT 
TGTGTTCCCT TATGTCCTAG 
CACACGTTTG ATGGAGTCTA 
TGCGACTTAC GAAC AATTC C 
GCAAAATTAC CGTGCTAAAG 
GAATTTTGTT CCAGAAAGTG 
ACTCTTACAT CGCT AT CTTG 
GTACGATGTC TACACACCGC 
GAAAAAAGCA G AAGATG CTT 
TGCCTTCAGC GAGCGTTGGA 
CTCTGGTGGT TCTTATGATA 
CAATCTCTTT AC T CTTGTT C 
AACTCAGCCT TATGTTTACG 
CAATGAAAAT ATCTTGACGG 
TGCTATTCTC AATAACTTCC 
TGCTGAGTTT GAACACGCCA 
TTTCCTAAAT AAACTCTACG 
CAATCCTGAA ATCCAATACG 
ATATCAATAT TCAACTGGCT 
TAGTCAAGAA GACCGTGACC 
ACTT AATGTC ATGAGAAAAG 



ATGAAATAAA 5100 

GGGAAGAAGC 5160 

ATCTCTTGGA 5 22 0 

GCCAGATAGA 52 80 

CTAAGTATCA 5 3 40 

TTTCATTCTA 54 00 

AAGCTCAACC 54 60 

ATCACGTTCT 5 52 0 

CAGCAAGTGA 55 80 

ACGATGATGG 5 64 0 

AAAAACGTGA 57 00 

AACACACCTA 57 60 

TTCGTAACTA 5 82 0 

TTTATCACAA 58 80 

AGCTTCGTTC 594 0 

TTTCATCTGT 6000 

TGGCAGTCTT 6060 

TTGATGTTTA 6120 

CCAATGCCTT 6180 

ATGAAACAGG 6240 

GGGATTACTC 6 300 

AGAAATTATT 63 60 

TAGATGGTTT 64 20 

TTCACCAAGC 64 80 

CAGACTTGAA 654 0 

AGTGGGCTCG 6 600 

TTCCGGCCGC 6660 

GCTATATCGA 67 20 

CTGGTCTTCA 6780 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC GTTTAAATGA 6840 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6 900 

ATGCTAACCA TAACATGCGT CGTCCTG TCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 69 60 

AGCGTCAAAA GCAGGTCACA GGTT TCTTG A AAGAATTGGA AGACTTTGCC CGCAAGGAAA 7020 

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTCC G mT CTTATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7 140 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7 200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 72 60 

CGGTGCATCT CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG GATTCTGCCA 7 3 20 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TC CTCAAAC A TTTGGAAGTT GGTGGTGTGG 7 3 80 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7 44 0 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7 500 

CAGAACTCAC CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT CGT AAAAATG 7 560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 7 620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7 680 

GTGCCATCAC ACT ATT ATC A GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT C AG C ATG AAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA GTTTTTGAAA 78 60 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATCA T ACT ATTGC C GAAGAAAAAA 7920 

AACAATATGG C G AAAACT AC CAACGTGTCT TGTCACAAGC AGGT AT G ACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG CAAGTCAAAA 82 80 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 83 40 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTT ATCTT CACTCAAAAA CAAAATGATT 84 60 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 85 20 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA . 85 80 
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GTAGTACATC 
AAAATGAAGC 
AAAGTTAGAC 
TTT AGTTTT A 
TCCAATTGAT 
ATGCC AAAG A 
GCTTGAATTC 
TGGTCACTAT 
AACGATCAAT 
AAAGCTTCTC 
AAGTTAATTT 
TGTAGTTCAT 
T CAT TTT TTT 
G CTGCTC AAA 
ATGTGAAGCT 
GATAATGCAA 
AAACTTTTAC 
ATAACTGCTT 
TCTTATTTCA 
TTGGCAAGTT 
ATATCTCCTA 
ATTCCAACTA 
ATTTGCTAGA 
ACTGGCAGTA 
TAATGATTTG 
CATTCGAGTG 
AGCTTCCAAG 
AGGAACGATT 
GACCGCTCGA 



AAACGAATAG 
AAACATTCCC 
AATTAATTTA 
TCTTAATTCT 
TAAGTGATTT 
AAGATTCCAT 
CCTTACTCTC 
GGAGAATCGT 
CAATTTAATC 
TAAGCTATAT 
CATAATAAAA 
GTACACCTGA 
AACTTGATAC 
GAACAGCTTT 
GACGTGGTTT 
GATTCCATAC 
CTTTTCCTCC 
CTAAAACATT 
TTTTGTT CT A 
GCAATACCTT 
TGGTTCTAGT 
TTGGGAGTGA 
CCGCCATCTT 
AGATTGGCGC 
AT AAATTGG A 
TCAAATTGAG 
GTTGCAATTT 
TCTAAAGAAC 
GGTAAATCAC 



TCCAAATCAA 
ACAATAAAAC 
TCCGAAGGAT 
CTTATTGTTG 
AAATGTTTTC 
CCTACCGTTG 
TAGGAAGCGA 
ATTCTCGTAG 
ATGTACCTAA 
CCTTGTTTTC 
ACACCCCAAA 
TATGATGCGT 
TCAGTGAAAA 
GAGGTTGTAG 
GAATAGATTT 
AATGGGTAAG 
CTACTCATCT 
CTTATAAATT 
CAATCCTGTT 
TTTACGAGGC 
TCAGAAGGCT 
ATGTTTCAAA 
TACGAAGAAG 
CGTGTCCGAC 
TGGTCCGTTG 
CAAGATTTGA 
TCAAACCTTC 
AGGGGGTATA 
TTGAATAAAT 



799 
TGAGTCAGGG 
GCATAGTACA 
TTAGTTCTGT 
TAATAATCAA 
TCATAGCCAT 
TCTTGGCTGT 
TGATAAGAAT 
TGCTTCTCTT 
GATTAGAATT 
TAAGTTCATA 
AGTTAGATTT 
TTTATAATTT 
GCAAAGATTA 
ATAAAACTTG 
TAGAAGAGTA 
CTAGAGTTCT 
TAGTATAGAA 
GATTTAAATT 
GAGAAGACAC 
TCTGTTGTCT 
AGGCTATAAT 
ATCATGGGTT 
TGGTTCTTTA 
AATTAGAATA 
AGTTGTACTA 
ACGAAAAGCC 
TAACTTCCCA 
GAGTTGACTT 
CTGATCAAAA 



AAAAAACTCG 
AGGTTTGTAC 
ATTGCACAGA 
TATAGTCTAT 
AAAACATTTC 
TGCCCTTACG 
CGTGTTGATA 
TGAATGCCTG 
GTTTATCCCA 
G ATCTG AACT 
TTTCTGTCTA 
T AAAG ACTTT 
AACTAGGAAG 
TGACCTCACC 
TGAGTCTGGA 
TATGTGAAGA 
AAGTGAATCT 
CTCAAATCAT 
GTGTTCATAT 
TATTTTTGTT 
TATGATTGAT 
TCTATAATGG 
TAGCCTAGGA 
CGTTCAGCTG 
TAGAGGGATT 
TGGATTTGTT 
AGTTGCCATT 
TGGATAATCT 
GGAATTTCCT 



ACTTCAGGAA 
TGCCCCCCAA 
GCTAAGTCCT 
AATG GCTCGT 
GGATTTTAAA 
TGACATGGAT 
TTGCCAGCCT 
TTCCAACATT 
AATTTATTTC 
TTATCATCAT 
ACTTTTGGGG 
TTGACCAGCC 
CTAGCTGTAG 
AACATATATA 
AGTTTT AATG 
GTTTGGGCAT 
GAAATAGTAC 
ATTATTCACT 
CAAAAAGGTA 
TCAACTGACT 
AAGAAGTATC 
TCAGGCTGGC 
GAGTACGAAG 
GACTATCTTT 
CGGCTCCGAA 
GCGGGTAAAT 
CACGGAGATT 
CAGCAGATTT 
TG AG AT ACTG 



8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9460 
9540 
9600 
9660 
9720 
9780 
9340 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA 
TTGAAAACCA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG 
TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTCCCTTTA 
GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA 
AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT 
TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT 
TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA 
ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA 
CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA 
ATAGCTGCTT TAG AT AGG AT TTGTTCCACC AATG TTTTTT 
ATTATTTATA TTTATCCTCC AAT TG ACT C A TCCAAATACC 
AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC 
CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA 
AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC 
CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG 
CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA 
GGATATGGGC ACCGTCGGTA TCCGCATCG 
(2) INFORMATION FOR SEQ ID NO: 109: 

(U SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(D> TOPOLOGY: linear 



GGAGAATCAC 
CGGACAAAGT 
CAAAGCTAGC 
CAATCATTTG 
GACGAATTCC 
TTTGTGGAAT 
AGTGCTGAGT 
AGGTTTTGAA 
TTTGAATCCC 
TGATTTTAAC 
AAGCCAGATT 
TGTGTATTGG 
TAGGACGATA 
CTCCGTCCAA 
AGGTGGGAGG 
AAATGTCAAG 



CACTAGCACC 

AGAGTTTCAT 

CAAGTCTTGT 

ATCCAAATCT 

GACAGGATTA 

CATGCTCACT 

GATAGGGACA 

AATCTGATCT 

CTGATGAGGG 

TTTTTTTGCC 

CCCAGCGCAA 

ATACTTTCAG 

CCGATGATGA 

GCGTAGGCCA 

GCAATATAGA 

AGCAAGGTCT 



10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAC 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAC AAATCTTAAT 3 60 

CCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 4 20 
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GG C ATT GG AT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 4 80 

GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600 

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTC AAG AC GCAAACTTTC 660 

TTTGTAAATA ATGTCT AG AA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGG AGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900 

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC G AAT AT C CAT CCGTTTCACG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCCGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGCC 114 0 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTCGTT TTTAGGCCAG ACACATACTG 1200 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260 

AGCTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA CCCAGTAGCC 13 2 0 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380 

TGCCATCCCT GCAATCACAA TGAGGCTAAC ACTTAGATTG ACTAGCCAAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CC AAT ATG AA CGTCCACCAA 1560 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAG ACGCT CTCTAGTCAG 1620 

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 1680 

CAGTAACTTA TTCGCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 174 0 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 1300 

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTACGG CTAAAAGGCT TGGTCAGGTA 1860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 192 0 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2 04 0 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTACAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GT C AAG C AAT TTCAGAATGT CTTCTTCATC ATCCAC7AAT AAGACTTGTT TTGTCATCTA 2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 22 20 

AT T AAATTT G CCTATCAAGA CTAGTATCTG GTCAAACCCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2 340 

GGAGCAACAA G ACAAG AAA C CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2 400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAACTGACT 24 60 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2 520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATCTG AAG GATTCGTTTC TTCTAATAAG 2 5 80 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2 640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA AT ATGGGG AG CTAACTCTAG AGTTCTTATC 2 7 00 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2 7 60 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2 820 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2 880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2 9 40 

ACT ATTGT C A TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3 000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3 060 

CACGTAAATC CAGCAAACTC AAGCCACTAA C AGTTG AAG A TAAAGCCTAT AACCATGCGC 3X20 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA ACTAAAAACG TTT AAAATG A 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGG ATT ACGAATGAAT TTGATTGCTG 3 24 0 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3 300 

AATTAGTGAA GCCTTTAGCC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3 360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3 42 0 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3 480 

CATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTt JGC 354 0 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTCTGGCG 3 600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 36 60 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CC AAG AT AC G TGTATGTGCA TAGTATTTGT 372 0 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 37 80 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATCAGTTT AATGATTTCC CAGCTTTCAC 384 0 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3 960 
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TAGCCTCTAG GTCGAAGTCG TATCCATTTG TACGGTCGGT TTTGAGGTCA TAGAATTTAA 4 020 

TGGCTCCAAT CCCAACAGCA TGTGCTACTT GGTCTTTGTT TTCT AG TTC A GGATTTTTAG 4080 

CCTCGATTTG GACCTTGGCA CGGCTAACAG CCTCTGCAAC AGTAGGCTCT AGCAAGATGA 414 0 

CATTCCCTTT ACGAGTAGAG AGTTTCTTCC CTTCTTT T GT AACCAAACCA AAAGGAACGT 4 200 

GAGTAATGTC GTCACTCCAG TCGTAGCCCA TCTCTTGCAA GACAGCTTTG AGCTGTTTAA 4260 

AGTGGGCAGA TTGTTCTTGA CCAACGACAT AGATAGATTT AGCAAATTGC TATTCGTTTT 4 320 

TACGGTAGAG GGCTGCAGCC AAGTCACGTG TGATATAGAG AGTTGCACCA TCAGACTTCT 4 380 

TGATGAGGGC TGCATGTTCA ATTCCATATT TCTCAAGATT CACAACTTGG GCACCTTCTG 4 4 40 

ATTCAAGAAG TAGTCCTTTT TCAGAAAGAA TGTCTACAAC TGCATCCATC TTATCATTCT 4 500 

AGAAGGCTTC TCCGTTATAG CTGTCAAATT CAACCTTCAA TTCATTGTAA AGGCGGTTAA 4 560 

ATT C C ACT AA ACTTTCATCG CGGAACCATT GCCAAAGAGC GAGAGCTTCC TCATCTCCAT 4 620 

TTTCAAGTTT ACGGAACCAT TCGCGCGCTT CTTCATCCAA GCTAGGGTCA TTTTC AG CTT 4 680 

C AG CGTTG AT GCGGACATAG AGTTTAAGGA GTTCATCGAT TGCATGAGCT TTTACAGCTT 4740 

CTTCGTCGCC CCATTTTTTG TAGGCAACAA TCAACATCCC AAATTGTTTA CCCCAGTCTC 4 800 

CCAAATGGTT GACCTTGACC GTTTGATAAC CGATTTTTTG GAAAATATGT GACAAGCTAT 4 8 60 

CTCCGATAAC AGTTGAACGC AGGTGGCCAA TAGAAAATGG TTTAGCGATA TTCGGACTAG 4 920 

ACATGTCGAT AACAACATTT TCTTGTTTAC CAATATTTTG GTCAGCATAG TGTTCTTTTT 4 980 

CAGTGGTAAC AGCTTGCAAT ACTTGAGCAG AAATGGC AG A TTTATCAAGG AAAAAGTTAA 5 04 0 

CGTAAGGTCC TGTTGCCACA ACTTTTTCAA ACGCTTGGCT GTTCATTTTT TCAGCCAGTT 5100 

CAGCCGCAAT CATTTGTGCT GCTTTACGTT CGACTTTTGC AAGAGAAAAA GCAGGGAAAG 5160 

CAATGTCTCC CATTTCTGAG TTTTTAGGGG TTTCCAGTAA CTTTAAAATA GCCTCTTGGT 5220 

CCAGGCTATC AATGATGCTA GATAATTCGC TAGCAATCAA TTCTTTTGTA TTCATTAAGA 5280 

GCTCCTTTTT GGACTTTTCT ACTATTTTAT CACAATTTTA AAGAAAGAAG AAAAAATTTT 53 40 

TGAAATCTCC TGTTTTTTTG GTATAATATG GTTATAAATA T AGTT AT AAA TATGCACGCA 5400 

AGACGATTTT ATGAGAAAAA GAGATCGTCA TCAGTTAATA AAAAAAATGA TTACTGAGGA 54 60 

GAAATTAAGT A C AC AAAAAG AAATT CAAG A TCGGTTGGAG GCGCACAATG TTTGTGTGAC 5520 

GCAGACAACC TTGTCTCGTG ATTTGCGG 554 8 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3132 base pairs 
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(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
tD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TACCCGGTAG TCTTAGCAGA CACATCTAGC TCTGAAGATG CTTTAAACAT CTCTGATAAA 60 
GAAAAAGTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA L20 - 

CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT TAGTAAAAAT 180 
CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA AAATTCCAAT 2 40 

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 3 00 

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 3 60 

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 4 20 

GGTAGTCCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 4 80 

TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT 540 
GGAGTTGAGG AAGCTATTGA TT AC CT AAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 600 
GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 660 
ATGAGAATCG ATGATGATCC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA CTTAAAAGGC 720 
ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 7 80 

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 840 
CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 900 
ATAGATGGAA TTGCACCTAA TG C ACAAATT TTCTCTTACA AAATGTATTC TGACGCAGGA 960 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCT AT TG AAGATTCTAT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 114 0 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 12 60 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTT AA CAT AG GTGGAGAAAG TTTTAAATAC 13 20 

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 13 80 

GCTCCTAGTA AATT AAAAT T TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 14 4 0 

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 1500 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT 1560 
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TACTACAATA 


GAGATAATTG 


GACAGAGCTT 


CCAGCTATGG 


GATATGAAGC 


GGATGAAGGT 


1620 


ACTAAAAGTC 


AAGTGTTTTC 


AATTTCAGGA 


GATGATGCTG 


TAAAGCTATG 


GAACATGATT 


1680 


AATCCTGATA 


AAAAAACTGA 


AGTCAAAAGA 


AATAATAAAG 


AAGATTTTAA 


AGATAAATTG 


174 0 


GAGCAATACT 


ATCCAATTG A 


TATGGAAAGT 


TTTAATT CCA 


ACAAACCGAA 


TGTAGGTGAC 


1800 


GAAAAAGAGA 


TTGACTTTAA 


GTTTGCACCT 


GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCGTTC 


CAGCAGGATC 


TACATCTTGG 


GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAAT AT T AAA 


TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGG 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


204 0 


TTGATTAGAC 


CGAAATTAAA 


GG AAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


T CTTAAGCGA 


2100 


GATGACAAAA 


TAGATCTTAC 


AACTCTTACA 


AAAATTGCCC 


TACAAAATAC 


TGCGCGACC** 


2160 


ATGATGGATG 


CAACTTCTTG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


222 0 


GGAGCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG 


AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT 


CTAAAGGTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCT CTT AA 


AGAAATAAAA 


2 3 40 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


* » • w^lw 4 * X L 


2 400 


AAAGTTTCAG 


CATCAGCGAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAC 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 




2 52 0 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2 580 


AGCTTTGATT 


TGAATGCGGT 


T ATAAATGTT 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2 64 0 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 




& t u u 


AAAATAAACT 


TCCAACCTTC 


TTTGTCGATG 


CCTCTAATGG 


GATTTGCTGG 


GAATTCCAAr 


z t o\j 


CACGAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


CAAGATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA 


TAGATAAATT 


TAATCCAGCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTGCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


CAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO : 111: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14672 base pairs 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111; 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACT C ATTTG C TTCTG GG AG A GTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TTAGATCCTG CAAAAGGCAG 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGTTAGAT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACGAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TT ACT AT ATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 54 0 

GTATTTATTA AGAC AGATAT ATGTTGCATC CTTAAAATAT AT AATGG AT A AAG AG ATT AG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 6 60 

ACGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 7 20 

TTTTATAGCA CCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGG TTTTATTTAG 7 80 

CTTTTCTAAG TACATGGTAA T C TTT T TAAT ATCACTAAGT CTAGCTCTAT CTATTGTAAT 840 

GTATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC T AACTGG AG A 960 

AGAGGAATTT TGGATTAACA AATGGGAT AA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020 

AAAACT TG AT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TCACGTTAuG 114 0 

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 12 60 

TAA TACT AAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320 

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAG AAAC TCTTCCTATT GTTGGAGAAT CAGGTTCAGG 144 0 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 1560 
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AGTCTTACAA 


AATTCAACCC 


TAAGTTATGG 


TACCTTAAGA 


G AG AATTTG A 


CATTTGGACA 


1620 


CTTTGTTTCA 


GATGAAGAAT 


TAATGACAAA 


TCTAAATTCA 


ATTGGTCTTA 


GC AAT GT AG T 


1680 


TAAATCTTTA 


CCTCTTGGAT 


TAGAGACAAT 


CATCGCTGAA 


GAAGGTAATA 


ACTTTTCTGG 


1740 


AGGGCACCAG 


CAAATGATAC 


TTTTAGCTCG 


TTGTCTTTTG 


TCGAAACCTT 


CGGTAGTTGT 


1800 


TTTGGACGAA 


GCAACAAGTA 


GTTTAGATAA 


TTTATCTCAA 


CAAATTACAA 


CTTCTTACTT 


1860 


AAGTGAAATC 


GGTACCACTA 


AGATTTTAAT 


TGCCCATCGA 


CTAGATACTA 


TCAAGTCTGC 


192 0 


AGATAAGATC 


TTAGTAATGC 


AT AATG GTG A 


AATTGTAGAG 


ATTGGGACCC 


ATAGAGAACT 


1980 


TCTTGAACTA 


GGAGGCATTT 


ATAAGCAATT 


GTATTCAAAT 


AATTAGTTTT 


TGATTAAAAG 


2040 


GGTAAATTTA 


TGAAGATTAT 


GAAAAAAAAA 


TATTGGACTT 


TAGCGATATT 


ATTCTTTTGT 


2100 


TTGTTCAATA 


ATTCTG TT AC 


TGCTCAAGAA 


ATACCTAAAA 


ATCTTGATGG 


CAATATAACT 


2160 


CACACTCAGA 


CTAGCGAAAG 


TTTTTCTGAA 


TCTGATGAAA 


AACAGGTTGA 


CTATTCTAAT 


2220 


AAAAATCAAG 


AAGAAGTAGA 


CCAAAATAAA 


TTTCGTATTC 


AAATCGATAA 


GACAGAATTA 


2280 


TTTGTAACAA 


CAGATAAACA 


TTTAGAAAAA 


AACTGTTGTA 


AATTGGAACT 


TGAACCACAA 


23 40 


ATAAATAACG 


ATATTGTTAA 


CTCTGAAAGT 


AAT AATTT AC 


TAGGCGAAGA 


T AAT TT AG AT 


2400 


AATAAAATTA 


AGGAAAATGT 


TTCTCATCTA 


GATAATAGAG 


G AGG AAATAT 


AG AG CATG AC 


24 60 


AAAGATAACT 


TAGAATCGTC 


GATTGTAAGA 


AAATATGAAT 


GG GAT AT AG A 


TAAAGTTACT 


2520 


GGTGGAGGCG 


AAAGTTATAA 


ATTATATTCT 


AAAAGTAATT 


CTAAAGTTTC 


AATTGCTATT 


2580 


TTAGATTCAG 


GAGTCGATTT 


ACAAAATACT 


GGATTACTGA 


AAAATCTTTC 


AAATCACTCA 


2640 


AAAAACTATG 


TCCCCAATAA 


AGGATATTTA 


GGAAAAGAGG 


AGGGAGAGGA 


AGGAATAATA 


2700 


TCAGATATTC 


AAGATAGATT 


AGGTCATGGT 


ACGGCTGTTG 


TAGCTCAAAT 


TGTAGGGGAT 


2760 


GACAATATTA 


ATGGAGTAAA 


TCCTCACGTT 


AATATTAACG 


TCTATAGAAT 


ATTTCGTAAG 


2820 


TCGTCAGCTA 


GTCCAGATTG 


GATTGTAAAA 


GCAATTTTTG 


ATGCTGTAGA 


TGATGGCAAT 


2880 


G AT ATT AT C A 


AT CTTAGT AC 


TGGACAATAT 


TTAATGATTG 


ATGGAGAATA 


TGAGGACGGA 


2940 


ACAAATGATT 


TTGAAACATT 


TTTGAAGTAT 


AAAAAGGCTA 


TTGATTACGC 


GAATCAAAAA 


3000 


GGAGTAATTA 


T AGT AGCTG C 


ATTAGGGAAT 


GACTCCCTAA 


ATGTATCAAA 


TCAGTCAGAT 


3060 


TTATTGAAAC 


TTATTAGTTC 


ACGCAAAAAA 


GTAAGAAAAC 


C AGG ATT AGT 


AGTTGATGTT 


3120 


CCAAGTTATT 


TCTCATCTAC 


AATTTCGGTC 


GGAGGCATAG 


ATCGCTTACC 


TAATTTATCA 


3180 


GATTTTAGCA 


ATAAAGGGGA 


TTCTGATGCA 


ATATATGCGC 


CTGCAGGCTC 


AACATTATCT 


3240 


CTTTCAGAAT 


TAGGACTTAA 


TAACTTTATT 


AATGCAGAAA 


AATATAAAGA 


AGATTGGATT 


3300 
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TTTTCGGCAA CACTAGGAGG AT AT ACG T AT CTT TAT GGAA ACTCATTTGC TGCTCCTAAA 3 360 

GTTTCTGGTG CGATTGCAAT G ATT ATT GAT AAATACAAAT TAAAAGATCA GCCCTATAAT 34 20 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT AT AAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TAT AAAAAT G 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3 600 

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT T ATT ACT ATT GCCCGAGAAG 3660 

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 37 20 

GTTTAGTTGG AGT ATT ACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 37 80 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3 840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3 900 

CCGTTCAGAG G AAGAAC ATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3 960 
TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA ' 4020 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 4080 

CATTGTTATA ACACCCCAAT CAACTGGTCC ACAGTCCATT* TTGTTTTGGA TAGACGCAGT 4140 

AC AG AAC T AC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4 200 

C ATT G AAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CAT T ATTGG A 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 4 320 

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTAGAC GTGCCATTTT 4 3 80 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 44 4 0 

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4 500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4 560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4 620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4 680 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 4 7 40 

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4 800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4 860 

GCTTAATTGA AAACCAAAGT ATTGAAGAAA AC CTT AAGCT AGGTCTCATT GGTCAAAAGT 49 20 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4 9 80 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040 

TGG C AAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACAGCTTCAA 5100 
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TAGACCCAGC 


AACCTCTCAG 


1 TTCATTATCG 


: AGATTTTGCT 


' ATCTCTTCGA 


. GATGATAATA 


S160 


GGCTAATCAT 


1 TATCGCAACA 


CATAATCCGG 


CAATTTGGGA 


GATGGCTGAT 


GAAGTGTTCA 


5220 


CGATGGATCA 


TCTGAAATAA 


AAATCCTTGT 


TTTTAATTGC 


ACGATGAGTT 


ACTGAAATAT 


5280 


TATCATGAAT 


CAAGAATTGG 


AGTTAATTTA 


GAATTGTACT 


TAATTTAGAA 


TTGT ACTTT A 


5340 


TTAATATTGA 


GGTAACTTTT 


TCTTGATAAA 


GGAAGAAATA 


ATGGAGAGGA 


AGTTAGAATG 


5400 


AAAAAATTCG 


ACAATTATAT 


TATTGAGAAG 


CCTTGCG ATT 


CTAATTCAGA 


TAAACTGCAA 


5460 


AAAATCTTAA 


TAATTGAAAG 


TTTGGTAGAT 


GATATTTTGC 


AATTTTCTCT 


CAGAATCAAT 


5520 


AATAGTGTAG 


GAGAGATTTT 


CCTCCTACAA 


CCGTTTTAAA 


AGAAAACTAT 


CTTTATTCCA 


5560 


TGTTATTTTG 


AGGAAGATAT 


TGTGAAAGTC 


AAAGATGATG 


ATAAAGTTGA 


GTGGAATTTG 


5640 


T T AG AATTTC 


AAAAATT TAG 


AG C ATTTTTG 


GCTTAGTAAT 


CTGTGTTGAA 


GGCTCAAAAC 


5700 


CTATCGTAAA 


AAAGTAGCTT 


TGAAAACGTA 


TTGCCTCCAA 


AGATTTAGTT 


AAATAATGAT 


5760 


TTAACACAAA 


AAGAAATTAT 


TGAAGTTCTG 


GAAAGATGTT 


GTTTCAGTAT 


TGAGAAAAGG 


5820 


TGGGAAAAAC 


TTGCGATTTT 


CACAGAGAAA 


GGAAGAAAAA 


GTATAGAAAT 


ATAGTCAATT 


5880 


GAAACAAGAA 


CAGGATAAAA 


GAACCTTTTG 


TGCCATATTT 


TTCTCCTTTC 


GCTTTACAAT 


5940 


TGGATTGAAC 


ACCTTTATTG 


TATCGCGTTT 


GGAGTTTTTT 


TGGTATAACC 


TTCGACGCAC 


6000 


ACCCGCATAG 


CGGGTGTTTT 


TTTTGTCTCG 


CACCTAACGG 


AGCGAGACAA 


ACTAATAGTC 


6060 


ACTTAATCAA 


AAAACGCACC 


ATATCAAAAA 


CTAAAAAGTT 


TGATATCATG 


CGTCATGTCT 


6120 


TAAACTAATT 


G ACT AT AC TT 


TCTATTCAAA 


TGAGCTTTTA 


ACCAATTGAT 


TGAGCCAATC 


6180 * 


CACTCTTAAA 


ACCAAAGAGC 


AATTTCTCGC 


TTAGCTGACT 


CTTCTGAATC 


TGAACCATGT 


6240 


ACAACATTTT 


GGATAATCTC 


ATTTTCTCCA 


GCAGCTTTTG 


CAAAATCACC 


TCGAATAGTG 


6300 


CCTGGTAAAG 


CTTCTTCTGG 


ACGAGTTGCA 


CCCATCATGG 


TCCGCCAAGT 


TTCGATTACT 


6360 


TTGGGACCAG 


AAATGACACC 


CACAAGAACT 


GGACCTGAAG 


TCATGAATTC 


A\.onn i J 


O *1 £ (J- 


GGGTAAAAAC 


TCTGACCAAC 


CAAGTCCTGA 


TAGTGCTGGT 


CAATCAACTC 


TTCTGAAACC 


6480 


TGTGAACGAA 


ACTCCAATTT 


TTCGATTGTA 


AATCCACCTT 


GTTCGATGCG 


CTTTAACACT 


6540 


TCACCCACTA 


GCCCTCTTTT 


TACACCATCT 


GCTTTGATGA 


TAAAGAATGT 


TTGTTCCATA 


6600 


CCCGTCTCCT 


TTGTCAGCTT 


CTTTCTTTTA 


TTTTACCACA 


TTTCGTGGAA 


AAATGGAGAA 


6660 


AGTTTTCAGA 


AGAGAGAATG 


AGAGAACCCT 


CGGGTT CTCT 


CATTCTCTCT 


T ATTCT ACT G 


6720 


TTTCTTCCAC 


AGTTTCAACG 


GCAGTATCCA 


CAACTACTTC 


TGTTGTTTCT 


TCATTTCCTT 


6730 


CTTCCTCTAC 


TGGAGGATTA 


AGGTATTCTT 


CTTCGTTGAC 


ACCATGTCCT 


TCAAGGTTAC 


6840 
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GGTAACGGGC CATACCACTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC AGTAAGGACA CGAGTTGTTT 6 9 60 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7 2 00 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 72 60 

AG AC AT C AC G AACTGCAAGG AGACGTTTTG G TT GG AT AG A ACCTTCTGTC AGAGCAGCAC 7 3 20 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7 380 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 74 4 0 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7 500 

CTTC AAAG AT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7 62 0 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7 680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 774 0 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATCTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 804 0 

CACCGTGAG7 TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACC ACGC AT A CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTG GGTI ' CT 8220 

TAGGATCTTG GTTACCAATC AAGCGTTTCT CAAGTTTTTC ACCGG C AG C A CCCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8 340 

TTTGTTCGAC AC GTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 84 60 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 85 2 0 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8 5 80 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 864 0 
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TGTTTGGTTC 
CAACTGTTGT 
CTGTTGCGAT 
CTTCGTCACG 
AGTAGTAGTT 
GGTTCAAGAT 
AAAGTGGTAC 
CAAGTGGGTG 
CCAAACGGTG 
CAAGGATATC 
TTTGCACGAT 
CCATTTCACG 
AACGTCCTGA 
CTTTAAGCAT 
CACGACGACC 
GAACGATGAT 
TAACACGGCG 
ACATTGGACG 
TGTTTCCAGA 
TTTGTCCAGT 
CTTGCTTCAA 
CATATTCACG 
GTGTATCCTT 
GAGGGCTCAT 
GAGATACAGG 
TTACTTCAAC 
CACAAGCACA 
GTTCTGGTTT 
ATGAACGGAC 



TTGCAAGTAT 
AAGCAAGACC 
ACCAACACGT 
GTCTTTGAAG 
ACCCAAAACC 
GTGCTCAGCA 
GTGGATGGCC 
CAAGCGAAGA 
AAGGGTCGGT 
CCAGATACGC 
ATCACGGGCA 
CGGCACACCA 
GAAGTCAACA 
GTGGCTCAAT 
ATTGTCAATC 
ACCTGGTGCA 
GTAAAGGTCA 
AAGATCTGGT 
CTTGTAAAAG 
AGCTGTTTTC 
AAGGTCTTGG 
CAAGCGCTCT 
AGGATCAATC 
ATCAAGGGTC 
AGCTTTCAAT 
CCCACAGCGG 
TTCCCAGTCT 
CAAGGTACGA 
TTTACTTGGA 



GGTAGCCCCT 
TTATGTCTTT 
GAGTGGAGGT 
ACCATTCCTT 
ATGTCCTGAG 
GCTAGCATGA 
ATTTGGTCCC 
GCCTTACCAT 
GCGCGGTTCA 
TCATCTCCGC 
ACGATTTCAC 
C AT TGGT AC A 
CGTTTACCGA 
GATTTCAATG 
AAAGCGTCAA 
TTTAACTCAA 
TTCAAGTCAG 
GGGATAACCG 
GCATCCAAAA 
AATTCTTCTT 
ATGGCTTCCG 
CGGTATTCGC 
ACCACATAAC 
AAGCCCATAC 
TCGATATGTC 
TCACAAACAA 
TTTGTAGGAC 
TAATTGATTG 
GAAGCTAGGG 



311 
CTGCCATGAT 
GCTC TTCTGT 
GAACATAACC 
CACCTTCCCG 
ATGGAGTAAC 
GGATACGAGC 
CGTCAAAGTC 
CAATCAAGAC 
AAAGCACTGG 
GTTCCACCAA 
GCATGACAAA 
TCTTAAGAGT 
GCAAGTTTTG 
CACGGCTACC 
CTGCTTCTTG 
GCAAACGAGC 
ATGAGGCAAA 
GAAGGATGTT 
CATCCAAACG 
TGAGTTCAGC 
CACCCATCTT 
GCTCTGTCAT 
CCGCAAAGTA 
GGCTTGGAAT 
CCATACGCTC 
TTCCTCTCTA 
CAAAGATCAC 
TTTCAGCTTT 
TGATTTGCAT 



ATCGTTGAAG 
CCAAGGCTTG 
ATTGCGGTAA 
ACCAGCTTCT 
TACCGGTTTC 
TTCTGCTTGT 
AGCATTGTAG 
TGGCTCGAAG 
GTGTTCTTTA 
GCGTTTAGCT 
TGGTTTAAAG 
TGCACCAACG 
ACGGAAGCGT 
TGGTCCTGTC 
AAGCATACGC 
CAAACGGTTG 
ACGGCCACCA 
AAGAATCATC 
ACGGATGGCT 
AATTTCTTTT 
GGCAACAAAT 
GATAGACTTG 
GATAACTTCC 
CCCCTTGAAG 
ACGACGAACT 
AC G AATGCGT 
TTCATCAAAG 
TTTGACTTCT 
ACTTTTAAAA 



AGAATTTTAC 
TTGAGGCTGT 
GCCATAACCG 
TCCATGGTCA 
CCATCTTTCG 
GCTTCTTCTG 
GCTTCACAGA 
GCTTGGATAC 
ATCACTTCTT 
GCTTTGACGT 
ACTTCAATCG 
GCCATAACTC 
CCTTGTTTAC 
ATTGG A CG AC 
TTCTCATTTT 
TTACGGTTGA 
TCCAACTGCA 
CATTCAGGTT 
TTGACACGCT 
TCAAGATCTA 
GAACCATAAC 
TGCTCAAGTG 
TCGAGGGCAC 
TACCAGATGT 
TTCGTACGCG 
TTGTACTTAC 
AGTCCTTCAC 
CCATAAGACC 
CGATTTACAT 



8700 
9760 
9820 
9880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 104 4 0 

TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA GCTTCAAAGG CTGCTTTAGC 10500 
CTCTTGGGCT CCTTTTTCCC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 
CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620 
GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 
TTTTGGAATT CGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTGATATC 10740 

CTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGC CTTCAAGAGC 10800 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGGTAACA CTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920 

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTCCTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG ATATTCATAC GTGATGGTAC 11160 

CCCAAGTCCC TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAACGTAAG GCATCTCTTC 11220 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCCTCA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 1134 0 

CAACTCATCT CCATTTACAC GTGTAAACAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATAT CACCT TCTTTAACCT CAGCACCAAT AC GG AT AATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 117 00 

CAAGCCTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 117 60 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTT ACG AG A GTGCGTTGGT TGT AAGC ACT 11380 

ACCTCACTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CAT CTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CC ATCAT ACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTG CTTGGT A TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GT TTTGGTG A 12180 
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AACGTCCATG 


TAGTCAACAA 


TATTAGCTGG 


ATACTCTTGG 


TTGACCCCTT 


GGTGACGTCC 


12240 


CATGACAATC 


TTCTCAGCAA 


AGGTTCCATC 


TTCATTCAGA 


CG AG AG TT AG 


CCTGAGCTAC 


12300 


AGTATATTCA 


TCTTCTTCAT 


CAGCTGTCAA 


CCAAACAATT 


TCGTTCGTGA 


CAACACCTGT 


12360 


TTCACGGTCA 


ACCTTACGGT 


ATGGTGTTTC 


AACAAAACCA 


TATTTGTTCA 


AGTGTCCATA 


12420 


AGATGACAAG 


TTATTGATCA 


AACCGATGTT 


AGGTCCTTCA 


GGTGTCTCGA 


TTGGACACAT 


12480 


ACGACCATAG 


TGAGTGTAGT 


GCACGTCACG 


TACTTCATAT 


CCAGCACGGT 


CACGAGTCAA 


12540 


ACCACCAGGT 


CCTAAGGCTG 


ACAAACGGCG 


TTTGT GAG AC 


AACTCAGAAA 


GCGGGTTGTG 


12600 


TTGGTCCATG 


AACTGTGACA 


ACTGTG ATG A 


ACCAAAGAAT 


TCTTTAACTG 


CAGCTGTTAC 


12660 


AGCACGGATA 


TTGATAATTT 


GTTGTGGTGT 


CAAGACTTCA 


TTGTCCTGAA 


CAGACATACG 


12720 


TTCACGGACA 


TTACGTTCCA 


TACGAGAAAG 


TCCCAAACGT 


ACTTGGTTGG 


CAAGCAATTC 


12780 


ACCAACCGCA 


CCGATACGAC 


GATTTCCAAG 


GTGGTCGATA 


TCATCTACAC 


GGCCAAGTCC 


12840 


TTCAGCCAAG 


TTGAGGAAGT 


AGCTCATCTC 


AGCAAGGATA 


TCTGCAGGAG 


TCACCGTACG 


12900 


AACCTTGTCA 


TCTGGGTTAG 


CATTACCAAT 


GATCGTTACG 


ACGCGATCTG 


GATCAGTTGG 


12960 


AGCAATAACC 


TTGAATTTTT 


GAAGAACAAC 


AGGCTCAGTC 


ACAACGGCTG 


CATCCTTTGG 


13020 


GATGTAGACA 


ATCTTGTTCA 


AGTCGCCATC 


CAAATGGCTT 


TCAATGCTTT 


CAATCACGCT 


13080 


ACGAGTCATA 


ATCGTACCAG 


CTTCTACCAA 


GATTTCTCCA 


GTTTCAGGGT 


CTACCAATGG 


13140 


CTCTGCAATG 


GTTTGGTTC A 


GCAAACGTGT 


TTTAACATTG 


AGTTTTTTAT 


TGATTTTGTA 


13200 


ACGACCAACT 


GCTGCCAAGT 


C AT AACG ACG 


TGGGTCAAAG 


AAGCGAGCTA 


CAAGCAAGCT 


13260 


ACGTGAGCTT 


TCAGCCGTCT 


TAGGCTCACC 


TGGACGAAGG 


CGTTCGTAAA 


TTTCTTTCAA 


13 320 


GGCTTCGTCT 


GTACGAGAGT 


CCATTGGATT 


CTTGTGGATA 


TCTTTTTCAA 


CAGTGTTGCG 


13380 


AACCAATTCG 


CTGTCACCAA 


AGATATCAAA 


GATTTCATCA 


TCACCTGAGA 


AACCAAGAGC 


13440 




rTTPTl A a Tf~ 
ul lul J\J\J\ 1 V7 


& ft T* 1 "^P^ r*r* 

Onnl Li i AL\> 


AGTACGGTCG 


AT ACG AGT G T 


AGGTGATATC 


13500 


TTTTGAGTCG 


CTTTCAACTT 


CCAACCAAGC 


TCCACGGTTA 


GGGATAACAG 


TTGAACCATA 


13560 


GCCCACCTTA 


CCATTTTTGT 


CTACTTTGTC 


GTTAAAGTAA 


ACACCTGGTG 


AGCGGACCAA 


13620 


CTGAGAAACG 


ATAATACGTT 


CACCACCATT 


GATGATGAAA 


CTACCCATTT 


CTGTCATGAT 


13680 


TGGGAAATCA 


CCAAAGAAAA 


CTTCTTGGGT 


CTTGATTTCG 


CTTGTTTCTT 


TATTGATCAA 


13740 


ACGGAAGCTT 


ACAAAAATTG 


GTGCTGAGTA 


CCTAGCATCG 


TGGATACGAG 


CTTCTTCTAG 


13800 


CGTATATTTT 


GGTTCCTTGA 


TTTCATATCC 


AACAAATTCC 


AACTCCATTG 


TGTCTGTGAA 


13860 


GTTTGAAATT 


GGCAATACAT 


CTTCAAACAC 


TTCCTTAAGA 


CCGTGGT CT A 


GG AAAGCTTT 


13920 
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GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13980 

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTG AACGTC A TGTCCTGCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGAGCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 14280 

CCATATTATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC T G AAAAT CAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 144 60 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14 520 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14 580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG ACCGAGTTTC TGGCAGAGTT 14 640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14 672 

(2) INFORMATION FOR SEQ ID NO; 112: 

(i> SEQUENCE CHARACTERISTICS : 

(A| LENGTH: 7902 base pairs 

[B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACCT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAC 2 40 

AAGTT AAAG A TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 3 00 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 3 60 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 4 20 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGCCATTGGC AAAGACTAAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATC ACTT A GGTGAATTTT AACAAGTTTG 54 0 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA AT AAG AAAAT AG AG AC GAG C 6 60 
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** 1 V./M3WA i M^. 


vjO/\ 1 OAAOOO 


A OTTTTOT ft ft 
AO 1 1 1 1 O 1 AA 


AAOAOAAAaO 

AAGAGAAAAG 


TGTAA TTTTC 


CAGAGCTTCA 


72 0 




* TTO A A A & 
^.unl 1 liAAAA 


A ATOTOOOO A 
AA 1L1 GGGC A 


A TO A otto a a 
A 1 O AO 1 1 OAA 


TCCCCATAAC 


ft ft ft « 

AAGGTAGACC 


780 


TTT TTO, A OO. A 


1 I Ourt 1 


aota lame 

AL» 1 AAAO AAO 


AO AOO AT AOO 
AGAGGA L AOO 


r-»m » ft m ft m k 

O a AOGATATA 


OACAGCAGTG 


840 




T'arsa ooo a t 
1 Al_ AAOOOA 1 


OO a O ft ft ft 1* ft ft 

IjLALAAAI AA 


ft a ft a r* ft /'in ft 
AAAAGAC L AO 


ft ft ft ft ^*/^Tin^^f^ 

AAAAGGTTCT 


GTTAAGATCT 


900 


TTTTTGTTAT 


CCTTGACATT 


AO TO AT AG OC 


C I 1 AAA C COT 


AGTTATAGAC 


ACCATAAGTT 


9 60 




AO & A A A a TV a 

AuAAAAA 1 \j A 


O A A a A T ft r~<T>r*- 
LAAAAlALi lO 


I LGAl i OAOT 


TGAAoTAACC 


ATAGTCAGTT 


1020 




l_ AC GO. OOO AO 


A TAOOTTOCA 


GTTAGGAvGG 


GAAAAATAAT 


ATTCAAGACA 


i080 


ooa aTTTrr & 


Ibl AAuA r AG 


AOO ATTTAAT 


TTTATACTTT 


TCATTCAATT 


TACCTCGTTT 


1140 


». 1 V-A 1 IAIA1 


LATAAAGTTA 


OOT A A T ft ft ft 

GCTAATAAoA 


ft ft T*/"* ft ft /*■*/"»/"• /"» 

AAToAAOOOC 


AGTAAGTCAA 


GTAATCACTT 


1200 


1 VjAAO 1 I L l» A 


A A TfT ft ft f~T* 
AA 1 l_ *. 1 AAb I 


TTTAAGTTTT 


/TTWTWT*- ft X ft ft 

CTTTAAOOAA 


ACT AT ATT AT 


TCTG AAGG A C 


12 60 


TOT* a & & UTTT 
I \— 1 nAnn ill 


GGCAGGCA TT 


TATTAGTAAT 


ifti*/^r ft ft it ft 

TOCTACAGAA 


TTCCTAGTCA 


TT ACT AG AAA 


1320 


1 01»»Ai_ I AVj 1 1 


TrTW a A T A 
1 l_ i I l OAA 1 A 


A*!* 1 AO A AOTOO 

AT AO AACTCiC 


AT AATTCTC C 


TATTCTAGAA 


GGGGAGGACC 


1380 


AGTATTTCTT 


TTATGA * AUG 


ACTAOATTGT 


GGTATAATAG 


AGAGAATAAG 


TTTTTTTAGT 


1440 




&OAAAATAOA 
AoAAAA i AO A 


T/'&'Mtf A TOO 

TGATTTATGC 


fti""/"* ft ft fltlf/ 11 1*ryt 

AOOAATTCTT 


GCCGGTGGAA 


CTGGCACACG 


1500 




AO 1 AAL 1 l 


OA A A AO a &<IW 

CAAAACAAn 


IIMIMII ft /"• ft f*f**T* ft 

riTAGAGCTA 


GGTGATCGAC 


CTATTTTGAT 


1560 




O A A A A A TTTP 
yAAAAAl 1 


i oi rooAooc 


a a or a tto a a 
AAGTATTGAA 


AAAATTGTAG 


TTGGTGTTC A 


1620 


TOO A O A OTOO 


G 1 1 1 CTCATG 


o a o a a o a rp^vr 
CAGAAGATCT 


rn n s+ ft ft ft ik 

TGTAGATAAA 


TATCTTCCTC 


TTTATAAGGA 


1680 


A OO.T & TO A TV* 
AL.^j 1 rt. 1 V_A I \_ 


i'n'p&rj a aoo 
Ail ALAAAOtj 


OTOOTOOTO a 

o roo rooTOA 


CCGCAATACA 


ft J o*^' k n n r* a « 

AGTATTAAGA 


ACATCATTGA 


1740 




11 ATt,t»TL 


CGCl 1ACTCC 


fty"»ft/" > ^*ff»ftf¥'^ 

AoAGGATATC 


GTTGTTACCC 


ACGATTCTGT 


1800 


tootoo & ttt 


Al lAUAti AG 


OO A TO A TTO A 
bLAluAl 1GA 


OOAOAATATO 

GGAGAATATC 


O ft ft #^»*|Wf fc^— A » 

CAACTTGCCC 


AAAATCATGA 


1860 




r\v_Mlj 1 1 AO 


ft AOOOOTTO A 


T aOT ATOOTT 
1 AL I A 1 CO I i 


o a a a ot a oo a 
OAAAOTACOA 


ftTl/^*/^^/^ft ft IIWII 

ATGGTCAATT 


1920 


T ATT AC AG AT 


ATTC C AAATC 


GTGCTCACCT 


TTATCAAGGA 


CAAACACCTC 


AAACATTCCG 


1980 


TTGCAAGGAC 


TTCATGGACC 


TTTATGGATC 


TCTTTCTGAT 


GAAGAGAAGG 


AAATCTTGAC 


2040 


AGATGCATGT 


AAAATCTTTG 


TGATCAAAGG 


AAAAGATGTG 


GCTTTGGCCA 


AACGTGAATA 


2100 


CTCAAATCTG 


AAGATTACAA 


CCGTAACAGA 


TTTGAAGATT 


GCAAAAAGTA 


TGATTGAGAA 


2160 


AGACTAGTAA 


AATGATTAAT 


CAAATTTATC 


AACTAACTAA 


GCCTAAGTTT 


ATCAATGTCA 


2220 


AATATCAGGA 


AGAGGCTATT 


GACCAAGAGA 


AT C AT ATCCT 


TATCCGTCCC 


AACTACATGG 


2280 


CTGTCTGTCA 


TGCCGATCAC 


CCTTACTATC 


AGGGAAAACG 


TGATCCCAAG 


ATTTTGAATA 


2340 


AAAAGCTTCC 


AATGGCAATG 


ATTC ACG ACT 


CATGTGGAAC 


CGTCATTTCT 


GACCCGACCC 


2400 
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GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2 4 60 

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GGATTTGATG 2 520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGT GGCTT AT GATGCTATTG 2 580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TC ACT GTGGG CATGCACGCT ATGAATCGTC 2 640 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCGCCGT TATTGGAGAT GGAAGTTTAG 2700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AG CAG AG AT T GTGGTTATTG 2 7 60 

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATT ACGG AT A 2 8 20 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATGGTACTG 2 880 

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2 9 40 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCGCGATGC CTTAGAAAAG GGCTTGATTT 3 000 

TGGTTGGGTC ATCTCGTTCT GGTC GCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3060 

TCAAGAAATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GT AAG AG AAA 3120 

TTAAACATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 3 240 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3 300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 3 3 60 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 34 2 0 

GGCTT AGATG TAAAAAATTA TCTTTTTCAT ATTGAAGCTG GTATCAAAGT AAATG AGT AT 3480 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 354 0 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3 600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3 6 60 

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 37 20 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3 7 80 

TATTTGATTG ACTGGGAATA TTCATCAATG AATGATCCAA TCTCGGATTT GGCTGCCCTC 3 8 40 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3 900 

G AC C AAAC AC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3 9 60 

TGGAGTCTAT GGACTGTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4 020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4 08 0 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 414 0 

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4 2 00 
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TTTTTTGAGC ATCTTTATCT TACTAGCTTT TCTCTTGGTA AAAGAAGGGA AAGTTCGCCT 4 2 60 

CTCAATTTTC TTAAATATTC GCAATCTCAG TGTTATCATC GGAGCCTTGC TAGCAGGCCC 4 3 20 • 

TATCGGTATG CAGGCCAATC TTTATGCAGT TAAGTATATC GGAAGTTCTT TAGCTTCATC 4 3 80 

TGTATCGGCT ATTTACCCTG CGATTTCAGT TCT ATTGGCT TTCTTCTTTT TGAAGCACAA 4 4 40 

GATTTCGAAA AAT ACT GT A T TTGGGATTGT CTTGATTATT GGAGGGATTA TTGCTCAGAC 4 500 

CTATAAGGTT GAACAGGTTA ATTCTTTCTA CATTGGGATT CTTTGTGCTT TGGTTTGTGC 4 560 

TATTGCATGG GGAAGTGAGA GTGTTCTTAG CTCTTTTGCC ATGGAAAGTG AATTGAGTGA 4 620 

AATCGAAGCC CTCTTAATCC GTGAAGTAAC TTCGTTCTTG TCCTATCTTG TGATTGTGCT 4 680 

CTTCTCTCAT CAGTCATTTA CTGCAGTAGC CAATGGACAA TTGCTAGGTC TCATGATTGT 4 74 0 

TTTTGCAGCC TTTGATATGA TTTCCTACTT GGCTTATTAT ATCG CTATC A ATCGCTTGCA 4 800 

ACCAGCCAAG GCTACAGGCT TGAACGTGAG CTATGTAGTA TGGACGGTCT TGTTTGCAGT 4 860 

TGTTTTCTTG GGTGCACCGC TAGATATGCT GACCATTATG ACGTCACTTG TCGTCATTCC 4 92 0 

TGGAGTTTAT ATTATTATTA AAGAATAAAG GAGATTCGTG TGAAAGCCAT TATCTTAGCA 4 98 0 

GCGGGATTGG GAACTCGCTT GCGTCCTATG ACTG AAAAT A CCCCTAAAGC CTTGGTTCAG 504 0 

GTTAATCAAA AACCTTTGAT TGAGTACCAA ATTGAGTTTC TCAAAGAAAA AGGAATCAAT 5100 

G AC AT CAT C A TCATTGTTGG TTATCTTAAA GAACAATTCG ATTACTTGAA AGAGAAATAC 5160 

GGTGTTC GTC TCGTTTTCAA TGATAAATAC GCTGACTACA ATAACTTTTA CTCTCTCTAT 5220 

CTTGTAAAAG AAGAATTGGC CAACAGCTAT GTTATTGATG CTGACAATTA TCTCTT T AAA 52 80 

AATATGTTCC GCAATGATTT GACACGTTCG ACTTATTTTA GTGTTTATCG TGAAGATTGT 5 340 

ACCAACGAAT GGTTCTTGGT TTATGGAGAT CACTACAAGG TTCAAGACAT TATTGTTGAT 54 00 

AGCAAGGCAG GTCGCATCCT TAGTGGTGTA TCCTTCTGGG AT GCTCC AAC TGCAGAAAAG 54 60 

ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTGAAT TTGTTGATCT CTATTGGGAC 5520 

AATATGGTTA AGGATAATAT CAAAGAGCTA GATGTCTATG TTGAAGAATT AGAAGGCAAT 5580 

AGCATTTATG AGATCGATAG TGTCCAAGAC TATCGTAAAT TAGAAGAAAT TCTTAAAAAC 564 0 

GAAAATTAAA GATTCCAACA TCTGACAAAA TAGTCGGATG TTTTTTGATT TTTTACGAAC 57 00 

TTTTACGAAT AGATAGATGA GTAGAAAAAG AAATGGAGTT ATTTATGAAA ATCACAAACT 57 60 

ATGAAATCTA TAAGTTAAAA AAATCAGGTT TGACCAATCA ACAGATTTTG AAAGTGCTAG 5 820 

AATACGGTGA AAATGTTGAT CAGGAGCTTT TGTTGGGTGA TATTGCAGAT ATCTCAGGTT 5880 

GCCGTAATCC AGCCGTTTTT ATGGAACGTT ATTTTCAGAT AGACGATGCG CATTTGTCGA 5 94 0 
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AAGAGTTTCA AAAATTTCCA TCTTTCTCTA TTTTAGATCA CTGTTATCCT TGCGATTTGA 6000 

GTGAAATATA TGATGCGCCT GTACTTTTAT TTTACAAGGG AAATCTTGAC CTCCTGAAAT 6060 

TCCCGAAGGT AGCGGTCGTG GGCAGTCGTG CTTGTAGCAA ACAGGGAGCT AAGTCAGTTG 6120 

AAAAAGTCAT TCAAGGCTTG GAAAATGAAC TGGTTATTGT CAGTGGTCTG GCCAAGGGCA 6180 

TTGACACAGC AGCTCATATG GCAGCTCTTC AGAATGGCGG AAAAACCATT GCAGTGATTG 62 4 0 

GAACAGGACT GGATGTGTTT TATCCTAAAG CCAATAAACG CTTGCAAGAC TACATCGGCA 6 300 

ATGACCATCT GGTTCTAAGT GAATATGGAC CTGGTGAACA ACCTCTGAAA TTTCATTTTC 63 6 0 

CTGCCCGTAA TCGCATCATT GCTGGACTTT GTCGTGGTGT GATTGTAGCA GAGGCTAAGA 64 2 0 

TGCGTTCAGG TAGTCTCATT ACGTGTGAGC GAGCAATGGA AGAAGGACGC GATGTCTTTG 64 80 

CTATTCCTGG TAGCATTTTA GATGGACTAT CAGACGGTTG CCATCATTTG ATTCAAGAAG 6 540 

GAGCAAAATT GGTCACCAGT GGGCAAGATG TTCTTGCGGA ATTTGAATTT TAAAAATGAC 6600 

CTAAGCTAGA ATTCTAAGAA AAAATCAATT TTAAGAGAAA ATGAACCCAA CATTTCCATA 6 660 

ATAAAACGCA TATTAGCAAG TTTTTAACAC TTGATAATAT GCGTTTTTTC TAAGTGGATT 6720 

AGTAGAGTAG AGGATTTTTC TCATATAATA CTCTTCGAAA ATCTCTTCAA ACT ACGT C AG 6780 

CTTCCATCTG CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 684 0 

CAAAGCAGTG CTTTGAGCAA CCTGTGGCTA CCTTCCTAGT TTGCGCTT T G ATTTTCATTG 6900 

AGTATAAGGG AAAGTATAGT GAATTGAAAT AAGATGTGAA CAACTCTATC AGGAAAGTCA 6960 

AATTAATTTA TAGAAATATT TTAGCAGCCA AGGTGTACTG TTATAGATTC AATTACACTA 7020 

TAATTTAGTG TAATTGAGAA AGGAGAAATG ATTGTGATTG ATGTTGGCTA GGTTATGTTC 7080 

AATGATTCCT ACCGTCTCAA ATCTTGT C AG TAAGGAAAAA TAAATTCTTC AAAAGTAGAG 7140 

ATTACAAGGC TTGTTTAAGA AAGAATTCAA AGACCTTGAC AAATAAAAAT AAAATGGTTA 7 200 

TTATAAAAAA TGGTCTGAAA TAGATGATGA TACTTTTCGA AAATCTCTTC AAATACGTCA 7 2 60 

GCTCAGCTTT GCCTTGCTGT GTTTTGAGCA AGCTACGGTT AGCTTCCGAG TTTGATTTVC 7 320 

ATTTACTAGA AATGAAACTG ATCAGAGATA TCAGT AG AC A TTTGAGTCAG GATATTATGG 7 380 

AAAATGATAA AAAGAGCTCG TGAGATTGGC ATATCAGACT ACTAAAGTAT TGAGTTTGTT 74 4 0 

AGGATTTTAG CGACTAGTTA GCTGGGAAAG CAAGATATTT GTGACAAATA ATAAACTGTA 7 500 

TTCGTTGATA GAATTTAGAA ATAAAATATA TGAAGAATTA GAACTTTCCA GAAGTGATTT 7 560 

AGCGATTTTA CTATGTGCCA TGCTTATCGC CTCTATCGGA TTAAATATGG ATTCGACTCC 7 6 20 

CGTGATTATT GGAGCCATGT TAATCTCTCC TTTGATGACA CCTATTCTGG GAGTGGGGCT 7 680 

CTCTCTAGCT ATATTTGATT TTAAATTGTT AAGAAAATCT TTTAAAATAT TAGCTATTCA 77 4 0 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7 800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 78 60 

AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 18627 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
{D> TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 
TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGGT GAT TTTTCTTGAC GATAGAAGTT 12 0 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTAGTAGGCA TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 2 40 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 3 00 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GCATATTAGG GTCTTTAATG 3 60 

TCTAGTAATT TTGTGAT AAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 4 20 

CTCGCTTTTC ATTATAGGTC ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 4 80 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 54 0 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 6 60 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 7 20 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTACG CCCCTTCCAA 7 80 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 84 0 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 9 00 

AGTCT AG CT A AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 9 60 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTCCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 1080 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 114 0 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AACCGGTGGA GCAAGAGAAA ATAAGTCACG 12 00 
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CTTATCTTTT TTCTGCTCCT CGTGGAACGG GAAAAACCAG TCTTCCTAAA ATCTTTGCCA 12 60 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC T AT ATTTGT C 13 20 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 13 80 

GGGTAGATGA AATTCGCGAA ATTCGT GAT A AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

AT AAGGTTT A TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAACATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 162 0 

CACAGGATAT TAAGGAACAT ATTC ACT AT A TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 1860 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 192 0 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 204 0 

TGTTTGAAAT GATTCGCTTA GC AACAGTG A GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

T AT CAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT AGTCGACCAG 2280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 2 340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2 4 00 

AGGTAATTGA AAGTCTAGGT GGGC CGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2 4 60 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT GGTCAAACTA 252 0 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2 58 0 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 264 0 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATG AT AC A 27 60 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTGAGACCT 2820 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2 880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 294 0 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGGGAAATC 3 060 

AGACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGCAACTA TCATGTCAGT 3120 
ATTAGAGATC AAAGATCTTC ACGTTGAGAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 
CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3 2 40 

GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA GTAACTAAAG GTGAAGTTTT 3 3 00 

GTTTGATCCC GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TGGGACTTTT 3 360 

CCTTGCTATG C AAT A CC CAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCGTGC 34 20 

CGCTATCAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA CTTCGTGAGT TTATTACTAA 3 4 80 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3 540 

CGAACCCTTC TCTGGTGGTC AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGGA 3 600 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3 660 

TGTGTCTAAA GGTGTCAATG CCATGCGTGG TGAAGGTTTT GGTCCTATGA TCATCACTCA 3 7 20 

CTACCAACGT CTTTTGAACT ATATCACACC TGATGTGGTA CACGTGATGA TGGAAGGTCG 3 7 80 

TCTTGTCCTT TCTGGTGGTC CAGAATTGGC TGCG CGTTTG GAACGTGAAG GATACGCAAA 3 84 0 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3 900 

GGAGAAGTAA ATGACTAGAG AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3 9 60 

CTGGTT GGCT GATCTCCGTC AAAAAGCTTT TGACAAGATT GAGACTTTCG AATTACCAGT 4020 

TATTGACTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA CAGAAAATGA 4 080 

GCCATCAGCA AATGTTCCAG ATTTCACAGC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTCAA ACTGTTTTCG AACAAACTCC AG TTG AGTT A GCTGAACAGG GTGTTGTCTT 4 200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 4 2 60 

ATCTGTTAAG TATGATGATG ACAAGTTGGC GGCTTACCAC ACAGCTTACT TTAACAGTGG 4 3 20 

TGCTGTACTC TATATTCCAG ATAACGTAGA AAT C AC AG AG CCAATTGAAG GAATTTTCTA 4 3 80 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 44 4 0 

TTCTAACATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4 500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCC CAAGTCAAGT TTGCTGCTAT 4 560 

CGACCGTCTA GGTGAAAACG TC ACT GC CT A CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4 62 0 

TGCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTG ATTTTG A 4 680 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAC GTTGTAGCTC TTTCAAGTGG 4 7 40 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4 800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATC GTATCGGCCA 4 860 

CATCATCAAG GGTGCTAAGG GAGCAGATCC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4 920 

AG AC C AAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4 980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA TCCAGAAGAT ATCTACTACC TCATGAGTCG 504 0 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 5100 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT AGTACTTCTC AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG T GT CC AT AC C TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 54 00 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 54 60 

CGACAACCAG CCTTAACTGG GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATCGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTT AAAG A CGGTGCCTTG GATATGGAGG 564 0 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 57 60 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 594 0 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCAGGA GCTATTGGAC TTGCGACTGC AGTTG ATT AT CTGG AAAAC A 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCC AAAAC 612 0 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG -5180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 624 0 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGCAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 63 60 

TAGTCGATGC CCT AC AAAAG AC AAAGG AG T TTTTCAATGG CACTTTCTAA ACTAGATAGC 64 20 

CTTTATATGG CAGTGGT AGC AGACCATTCG AAAAATCCAC AT C AC C AAGG GAAGTTAGAA 64 80 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 65 40 
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AAGTTTGATG 


C AG AGGACCG 


TTTGGAAGAT 


ATTGCTTTTC 


TAAATTCAGG 


ATGCACGATT 


6600 


TCAACTGCTT 


CTGCTAGTAT 


GATGACAGAT 


GCCGTTTTAG 


GAAAAACCAA 


ACAAGAAATT 


6660 


TTAGAACTCG 


CGACTATTTT 


TTCTGAAATG 


GTTCAAGGGC 


AAAAAGATGA 


GCGTCAAGAC 


6720 


CAACTTGGAG 


ACGCGGCATT 


CTTGTCAGGT 


GTTGCCAAAT 


TCCCTCAAAG 


AATCAAGTG T 


6780 


GCAACCCTAG 


CTTGGAATGC 


CCTTAAGAAA 


ACAATTGAAA 


ATCAAGAAAA 


ACAGTAAGAC 


6840 


AAGTTTCTTT 


TGTCTTATGA 


ATTATTAGAA 


ATGAAGAAAG 


AAAG G AT ACT 


ATGGCTGAAG 


6900 


AAAGAGTAGA 


ACCAAAACCA 


ATTGACCTTG 


GTGAATATAA 


ATTTGGTTTC 


CATGACGATG 


6960 


TAGAGCCTGT 


CTTATCGACA 


GGAAAAGGAC 


TCAACGAAGG 


TGTTATTCGT 


GAATTATCTG 


7020 


CTGCTAAGGG 


TGAGCCTGAG 


TGGATGTTGG 


AGTTCCGTTT 


GAAGTCTTAT 


GAAACCTTCA 


7080 


AAAAAATGCC 


CATGCAAACT 


TGGGGAGCAG 


ACTTGTCAGA 


G AT TG ACTTT 


GATGACTTAA 


7 140 


TCTACTACCA 


AAAACCATCT 


GACAAACCAG 


CCCGTTCTTG 


GGATGATGTA 


CCTG AAAAG A 


7200 


TTAAAGAAAC 


CTTTGAACGT 


ATCGGGATTC 


CAGAAGCTGA 


ACGTGCTTAT 


TTAGCAGCGG 


72 60 


CTTCTGCCCA 


GTACGAGTCA 


GAAGTGGTTT 


ACCACAACAT 


GAAGGAAGAG 


TTCCAAAAAT 


7320 


TAGGTATTAT 


CTTTACAGAT 


ACAGATTCCG 


CACTCAAGGA 


ATACCCAGAC 


TTATTTAAAC 


7380 


AATACTTTGC 


GAAGTTGGTA 


CCGCCGACAG 


ATAACAAGTT 


GGCAGCCCTC 


AACTCAGCAG 


74 40 


TATGGTCGGG 


TCGAACTTTT 


ATCTACGTGC 


CAAAAGGTGT 


CAAGGTAGAT 


ATTCCACTTC 


7500 


AAACTTATTT 


CCGTATCAAT 


AACGAAAATA 


TAGGTCACTT 


CG AA CGT ACC 


TTGATTATCG 


7560 


TTGATGAGGG 


AGCAAGCGTC 


TACTACGTAG 


AAGGATGTAC 


AGCACCAACA 


TATTCAAGCA 


7620 


ATACCTTACA 


CGCTGCCATT 


GTAGAAATTT 


TTGCTTTGGA 


CGGAGCTTAT 


ATGCGTTATA 


7680 


CAACTATCCA 


AAACTGGTCT 


GATAACGTCT 


ATAACTTGGT 


AACAAAGCGT 


GCTAAGGCTC 


7740 


AAAAGGATGC 


CACTGTTGAG 


TGGATTGATG 


GAAACTTGGG 


TGCCAAAACG 


ACTATGAAAT 


7800 




TTACCTTGAT 


GGAGAAGGAG 


CGCGTGGTAC 


CATGCTCTCT 


ATCGCCTTTG 


7860 


CTAATGCAGG 


GCAACACCAA 


GACACGGGTG 


CTAAGATGAT 


TCACAATGCT 


CCACATACCA 


7920 


GCTCGTCTAT 


TGTGTCTAAA 


TCCATCGCTA 


AAGGTGGAGG 


AAACCTTGAC 


TACCGTGGAC 


7980 


AAGTCACCTT 


TAACAAGAAC 


TCTAAGAAAT 


CTGTTTCCCA 


CATTGAATGT 


GATACCATTA 


8040 


TCATGGATGA 


CTTGTCAGCA 


TC AG AT ACT A 


TTCCATTTAA 


TGAAATTCAC 


AACTCGCAAG 


8100 


TGGCTTTGGA 


ACACGAAGCC 


AAAGTATCTA 


AGATTTCAGA 


AGAGCAATTG 


TATTATCTCA 


8160 


TGAGCCGTGG 


ATTGTCAGAA 


TCTGAGGCAA 


CTGAAATGAT 


TGTCATGGGA 


TTTGTAGAAC 


8220 


C CTTT AC AAA 


AG AACTTC C A 


ATGGAATACG 


CAGTTGAGCT 


GAACCGCTTG 


ATTAGCTATG 


8280 
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AAATGGAGGC ATCACTTCGA TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 834 0 

ACGTC AG CAT CGCCTTACCG TATGTATGGT TwCTG A CTCG TCAGTTTCAT CTACAACCTC 84 00 

AAAACAGTGT TTTGAGCAAC CGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 84 60 

AGATTTACTC AAAATCAAGG ATTTTG AAG A TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTT AAC A AAGCGGACAA ACTGATTCCA CCAAACTTTT 8 580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 87 6 0 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CAT CT TGT AC AGGGGCTTTG CTATCTTGAT AGCCATCGCC TTGTTGAACG 8 8 80 

ATTTTGCGAA GTGTAAATGT AG AAG AAAT A TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT T CCTGTT TTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GCTTGAAGTG 9120 

ATTGTCATCC C AG C AAAAGT AGAAGAAGGT TTT T TGGTGA TTT CT AAG AC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCCAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 924 0 

TCTTTTTTAG AAC C TGGGT A AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT C AT ATCGAC G 9 3 60 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 942 0 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GAT AAACAGT AATCAATTTT 9600 

GTTATGGAAG C AATTTC G AC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9 6 60 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 97 2 0 

GAAGCACCCC CTAAAAG AG A GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 97 80 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AG AAATT TT A CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 99 6 0 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
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ATCGTTTGCT TGGTATTTTA ATT AC CTTG T GTACTTTGGT CTGCGGAATG GTCATAGGTG 1014 0 

TTGTCTATCT CTTACCTATT TTGATTAATC AGTTATCTAG TTTGATTATA TCTAGTCAAA 10200 

CTATTTATAG TCGAGTACAA GACTTAATCA TAGACTTATC TAATTATCCT GCGCTCCAGA 102 60 

ATTTGGATGT AGAAGCTACA AT TC AGCAGT TAAACTTATC CTATGTTGAT ATTCTTCAAA 10320 

AT ATCCT AAA TAGCGTATCA AATAGTGTGG GGAGCGTCTT GTCAGCTCTT ATCAGTACTG 10380 

TTTTGATTTT GATTATGACT CCAGTTTTTT TGGTTTATTT CTTATTAGAT GGACATAAAT 10440 

TCTTGCCCAT GCTTGAAAGA ACGATTCTAA AGAGGGATCG CTTG CAT ATT GCAGGCTTAT 10500 

TAAAGAATTT AAATGCGACG ATTGCTCGCT ATATTAGTGG AGTTTCGATT GACGCAATCA 105 60 

TTATAGGTTG TTTG GCTTAT ATTGGCTATA GTATTATTGG TTT AAAAT AT GCTTTACTTT 10620 

TTGCCATTTT TTCTGGTGTA GCCAATTTAA TTCCTTATGT GGGGCCAAGT ATT GG TTTG A 10680 

TTCCTATGAT CATCGCAAAT ATATTCACTG ATCCCCATAG ACTGCTGATT GCAGTGATTT 10*740 

ATATGCTTGT TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAACTG 108 00 

TTATGAAGGT TCATCCAATC ACG ATTTTAG TTTTACTTTT GTTGTCAAGC AATATCTATG 10860 

GTGTAGTTGG AATGATTGTC GCAGTGCCAA CCTATTCTAT CTTGAAAGAA ATTTCTAAGT 1092 0 

TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA GAATTAGCTA 10980 

AGTAAAAGTC AGGAGAACCC TGATTTTTCT TTACTGGAAG TGGCCTTTAG ATTAGAAGAC 1104 0 

TGAAAATAAG TTAAAGTCTT AAACTAATTT TCACAGCTAA GAATAGTAGA AGTTAATCTG 11100 

AT AAAAATC G AAAAAACCAG TGGAATTCTG TGTCAGGGTA AGTTCCACTG GTTTTCATAG 11160 

TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 11220 

AATTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TCTATTCGTT CTATTCTTAT 11280 

TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT CCGAATAATA TAAGGATTGA 11340 

TTTTATAGTT TTTAAACTCA AATGAATTGA AATAAAGAGA GT ACG AAAAT T CTC ATCTG A 114 00 

AAGTATTTTA GAATAATTCT CTTCGTGAAT TTCTTCAAAA CAGATAGCTT CATCTTAGGT 114 60 

ATGTGATTTC TTTTTGCATT TTTGAGTTAG ATAAGGTATA ATGATTTTAT TGTCTTTTGG 115 20 

GGTCGTTACG G ATT CG AC AG GCATTATGAG GCATATTTTG CGACTCGTGT GGCGACGTAA 11580 

ACGCTCAGTT AAATATAACT GCAAAAAATA ACACTTCTTA CGCTCTAGCT GCCTAAAAAC 11640 

CAGCAGGCGT GACCCGATTT GGATTGCTCG TGTTC AAT G A CAGGTCTTAT TATTAGCGAG 11700 

ATACGATTAA GCCTTGTCTA GCGGTTTGAT AAGAG ATT G A TAGACTCGCA GTTTCTAGAC 117 60 

TTGAGTTATG TGTCGAGGGG CTGTT AAAAT AATACATAAC CTATGGTTGT AGACAAATAT 11820 
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GTTGGCAGGT GTTTGGACGT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 11880 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTGGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGG AT AG A 12000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC TCTTAAGCCT 12120 

CTTGAGTAAT CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGG CTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 12240 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12 300 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 12 3 60 

TAGAAAAATT AGTTAGTTTG CTAGAAACGT CAGAATCAAC CGTTCGAAGA GACTTGGATG 12 420 

AGTTGGAAGC GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC CCCTACTCCT 12 4 80 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAG AAATTG C 12540 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12 660 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 12720 

AC GTCAAGAC GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 12780 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCT AT TAT AC G ACT C 12 840 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTCGA GAATGCCAAG CAGACCTACG 12 900 

TCTTGGTGGA TTCGTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12 960 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 13 080 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCG TGTCTT GAAACGTTTG AATATACCAA ATACAGCGAC 13 200 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT 1.3 2 60 

CGAGACACGT TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGTTAAAA TCAAAGCAGA 13 3 20 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTGAGC TAGAAGAATT 133 80 

GAAAGCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

TAAAAATCTA GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTCGTGC 13500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT ACCAGCCTCT 13 560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC TCGAAAGTTT 13 620 
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AGATGAAATT 


GAGAAATACG 


CTCGTGAGTT 


ACTGGCTAAG 


GGTGCTCAAA 


ATGTTATTAT 


13690 


CTCTATGGCT 


GGTGATGGTG 


CCCTTCTTGT 


CACATCTGAG 


GG AG CTT ACT 


TCGCTAAACC 


13740 


AATCAAAGGA 


ACAGTCAAAA 


ATTCAGTTGG 


AGCTGGTGAT 


TCTATGGTTG 


CTGGATTCAC 


13800 


AGGTGAATTT 


GTCAAATCAA 


AAGACGTAGT 


AGAAGCCTTC 


AAAT GGGG AG 


TGG rTTGCGG 


13 860 


AACGGCAACT 


ACCTTCTCAG 


ATGACTTGGC 


AACGGCGGAA 


TTTATTAAAG 


AAACATATGG 


13 9 2 0 


AAAAGTTGAG 


GTAGAAAAAC 


GATGAAAATT 


CAAGACCTAT 


TG AG AAAAG A 


4 *J 4 4 *J 4 4 \J 


13 980 


CTAGATTTGC 


AGGCAACTGA 


AAAAACAGCT 


GTCATCGACG 


AGATGATTAA 


AAATTTGA<"*A 


1404 0 


GACCACGGTT 


ATGTAACAGA 


TTTTGAAACA 


TTTAAAGAAG 


G AAT T TTGGC 


C CGTG AAGCT 


14100 


TTGACTTCTA 


CTGGTTTGGG 


TGATGGAATC 


GCAATGCCTC 


ACAGCAAAAA 


CGCTGCTGTC 


14160 


AAAGAAGCGA 


CAGTTCTATT 


TG CTAAGTCA 


AATAAGGGTG 


TTCACTACCA 




1 A ? "3 fl 


GGACAAGCAA 


CTGACCTCTT 


CTTCATGATT 


GCAGCTCCAG 


AAGGTCCCAA 


TGATACTCAf* 




TTGGC AGCC T 


TGGCAGAATT 


GT CT CAAT AC 


TTGATGAAAG 


HWUsJ 4 4 4 4 


AGACAAAfTT 


14 J1U 


CGTCAAGCAA 


CATCTGCAGA 


C C AAGT TAT C 


GAACTTTTTG 


nv.v.AAVjv i is. 






GAGGAACTTG 


TTCAAGCACC 


TGCTAATGAC 


TCTGGT G ACT 


TTATCGTAGC 


TGTTACAGTT 


14 4 60 


TGTACAACAG 


GTATTGCCCA 


CACTTACATG 


G CCC AAGAAG 


CCCTTCAAAA 




1 4S20 


GAAATGGGGG 


TTGGTAT C AA 


GGTCGAAACC 


AACGGTGCTA 


GCGGTCTTGG 


AAATrAACTA 




ACTGCAGAAG 


ATATCCGTAA 


GGCTAAAGCT 


AT TAT C ATTG 


CAGCAGACAA 




14 640 


ATGGATCGAT 


TTGATGGAAA 


ACCATTGATC 


AATCGTCCAG 


TTGCTGACGC 


TA^frGTAAG 


14700 


ACAGAAGAGC 


TAATTAACTT 


GGCTCTTTTCA 


GG AG AT ACTG 


AAGTCTACCC 




14 'OU 


GGTGCCAAAG 


CTGCAACAGC 


CTCTAACGAA 


AAACAAAGCC 


TTGGTGGTGC 


CTTGTACAAA 


14 820 


CACTTGATGA 


GTGGTGTATC 


TCAAATGTTA 


CCATTCGTTA 


TCGGTGGTGG 


T ATC ATG ATT 


14880 


GCCCTTGCCT 


TCTTGATTGA 


CGCTCCTTTG 


GGTGTTCCAA 


ATGAAAACCT 


TGGCAATCTT 


14940 


GGTTCTTACC 


ATGAGTTAGC 


TTCTATGTTC 


ATGAAAATTG 


GTGGAGCTGC 


CTTTGGTTTG 


15000 


ATGCTTCCAG 


TCTTTGCGGG 


TTATGTTGCC 


TACT CT ATTG 


CTGAAAAACC 


GGGTTTGGTA 


15060 


GCAGGTTTCG 


TGGCTGGTCC 


TATTGCCAAA 


GAAGGTTTTG 


CCTTTGGTAA 


AATTCCTTAT 


15120 


GCCGCAGGTG 


GTGAAGCAAC 


TTCAACTCTT 


GCAGGTGTCT 


CATCTGGTTT 


CCTAGGTGCC 


15180 


CTTGTTGGTG 


GATTTATCGC 


AGGTGCCTTG 


GTTCTTGCCA 


TCAAGAAATA 


CGTTAAAGTT 


15240 


CCTCGTTCAC 


TCG AAGGTGC TAAATCAATC 


CTTCTATTGC 


CACTTCTTGG 


AACAATCTTG 


153C0 


ACAGGATTTG 


TTATGCTAGC 


TGTGAATATC 


CCAATGGCTG 


CAATCAACAC 


TG CT AT G AAT 


15360 
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GACTTCCTAG 


GCGGTCTTGG 


AGGAGGTTCA 


GCTGTCCTTC 


TTGGTATCGT 


C CTTGGTGG A 


15420 


ATGATCGCTG 


TTGACATGGG 


TGGACCAGTT 


AATAAAGCAG 


CTTATGTCTT 


TGGTACAGGT 


15480 


ACGCTTGCAG 


CAACTGTTTC 


TTCAGGTGGT 


TCTGTAGCCA 


TGGCAGCAGT 


TATGGCTGGA 


15540 


GGAATGGTGC 


CACCACTTGC 


AATCTTTGTC 


GCAACTCTTC 


TTTTCAAAGA 


TAAATTTACT 


15600 


AAGGAAGAAC 


GTAACTCTGG 


TTTGACAAAC 


ATCATCATGG 


GCTTGTCATT 


TATCACTGAG 


15660 


GGAGCGATTC 


CATTTGGTGC 


CGCTGACCCA 


GCTCGTGCGA 


TTCCAAGCTT 


CATCCTTGGT 


15720 


TCAGCAGTAG 


CAGGTGGACT 


CGTTGGTCTT 


ACTGGTATCA 


AACTCATGGC 


GCCACACGGA 


15730 


GGAATCTTCG 


TTATCGCCCT 


TACTTCAAAT 


GCTCTCCTTT 


ACCTCGTTTC 


TGTCTTGGTA 


15840 


GGAGCAATCG 


TAAGTGGTGT 


GGTTTATGGT 


TACCTACGCA 


AACCACAAGC 


ATAAAAAATA 


15900 


GAAAAATGAA 


AACATTGGAC 


CGTTTGGTGC 


AGTCTTTTTC 


TCTTCCCGAA 


ATGCCTGTGA 


15960 


AATATGGTAT 


AATAGAAGAA 


TGGCAAACAA 


GAATACAAGT 


ACAACAAGAC 


GGAGACCGTC 


16020 


TAAAGCAGAA 


CTGGAAAGAA 


AAGAAGCGAT 


TCAACGAATG 


TTGATTTCGT 


TAGGAATTGC 


16080 


GATTTTATTG 


ATTTTCGCAG 


CCTTCAAATT 


AGGGGCTGCA 


GGTATAACCC 


TTTATAATTT 


16140 


AATTCGCTTG 


CTAGTGGGTA 


GCCTAGCTTA 


TCTGGCGATA 


TTCGGCCTAT 


TAATCTATCT 


16200 


CTTCTTTTTC 


AAGTGGATAC 


GAAAACAGGA 


AGGACTCTTA 


TCTGGCTTTT 


TCACCATATT 


16260 


TGCTGGCTTA 


CTCTTGATTT 


TTGAGGCCTA 


CTTGGTTTGG 


AAATATGGTT 


TGGACAAGTC 


16320 


CGTTCTAAAA 


GGGACCATGG 


CTCAGGTTGT 


GACAGATCTG 


ACTGGTTTTC 


GAACGACTAG 


16360 


CTTTGCTGGA 


GGGGGCTTGA 


TCGGGGTCGC 


TCTTTATATT 


CCAACAGCCT 


TTCTCTTTTC 


16440 


AAATATCGGA 


ACTTACTTTA 


TTGGTTCTAT 


CTTGATTTTA 


GTGGGTTCTC 


TCCTAGTCAG 


16500 


CCCTTGGTCT 


GTTTACGATA 


TTGCTGAATT 


TTTCAGTAGA 


GGCTTTGCCA 


AATGGTGGGA 


16560 


AGGGCACGAG 


CGTCGAAAAG 


AGGAACGCTT 


TGTCAAACAA 


GAAGAAAAAG 


CTCGCCAAAA 


16620 


GGCTGAGAAA 


GAGGCTAGAT 


TAGAACAAGA 


AGAGACTGAA 


AAAGCCTT AC 


TCGATTTGCC 


16680 


TCCTGTTGAT 


ATGGAAACGG 


GTGAAATTCT 


GACAGAGGAA 


GCTGTTCAAA 


ATCTTCCACC 


16740 


TATTCCAGAA 


GAAAAGTGGG 


TGGAACCAGA 


AATCATCCTG 


CCTCAAGCTG 


AACTTAAATT 


16800 


CCCTGAACAG 


GAAGATGACT 


CAGATGACGA 


AGATGTTCAG 


GTCGATTTTT 


CAGCCAAAGA 


16860 


AGCCCTTGAA 


TACAAACTTC 


CAAGCTTACA 


ACTCTTTGCA 


CCAGATAAAC 


CAAAAGATCA 


16920 


GTCTAAAGAG 


AAGAAAATTG 


TCAGAGAAAA 


T ATCAAAAT C 


TTAGAAGCAA 


CCTTTGCTAG 


16980 


CTTTGGTATT 


AAGGTAACAG 


TT G AACGGGC 


CGAAATTGGG 


CCATCAGTGA 


CCAAGTATGA 


17040 


AGTCAAGCCG 


GCTGTTGGTG 


TAAGGGTCAA 


CCGCATTTCC 


AATCTATCAG 


ATGACCTCGC 


17100 


TCTAGCCTTG 


GCTGCCAAAG 


ATGTCCGGAT 


TGAAGCACCA 


ATCCCTGGGA 


AATCCCTAAT 


. 17160 
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CGGAATTGAA GTGCCCAACT CCGATATTGC CACTGTATCT TTCCGAGAAC TATGGGAACA 172 20 

ATCCCAAACG AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG TTAATGGAAC 17230 

CGCAAGAGCT TTTGACCTTT CTAAAATGCC CCACTTGCTA GTTGCAGGTT CAACGGGTTC 1734 0 

AGGGAAGTCA GTAGCAGTTA ACGGCATTAT TGCTAGCATT CTCATGAAGG CGAGACCAGA 17 400 

TCAAGTTAAA TTTATGATGG TCGATCCCAA GATGGTTGAG TTATCTGTTT ACAATGATAT . 17 4 60 

TCCCCACCTC TTGATTCCAG TCGTGACCAA TCCACGCAAA GCCAGCAAGG CTCTGCAAAA 17 520 

G GTTGTGG AT GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG TTCGGAATAT 17 580 

TGCAGGTTTT AATGCCAAGG TAG AAG AGTT CAATTCCCAG TCTGAGTACA AG C AAATTC C 17 640 

GCTACCATTC ATTGTCGTGA TTGTCGATGA GTTGGCTGAC CTCATGATGG TGGCCAGCAA 17 700 

GGAAGTGGAA GATGCTATCA TCCGTCTTGG GCAGAAGGCG CGTGCTGCAG GTATCCACAT 17 760 

GATTCTTGCA ACTCAGCGTC CATCTGTTGA TGTCATCTCT GGTTTGATTA AGGCCAATCT 17 820 

TCCATCTCGT GT AG CATTTG CGGTTTCATC AGGAACAGAC TCCCGT AC G A TTTTGCATGA 17 880 

AAATGGAGCA GAAAAACTTC TTGGTCGAGG AGACATGCTC TTT AAACCG A TTGATGAAAA 179 4 0 

TCATCCAGTT CGTCTCCAAG GCTCCTTTAT CTCGGATGAC GATGTTGAGC GCATTGTGAA 18000 

CTTCATCAAG ACTCAGGCAG ATGCAGACTA CGATGAGAGT TTTGATCCAG GTGAGGTTTC 18 06O 

TGAAAATGAA GGAGAATTTT CGGATGGAGA TGCTGGTGGT GATCCGCTTT TTGAAGAAGC 18120 

TAAGTCTTTG GTTATCGAAA CACAGAAAGC CAGTGCGTCT ATGATTCAGC GTCGTTTATC 18180 

AGTTGGATTT AACCGTCCGA CCCGTCTCAT GCAAGAACTG GAGATAGCAG GTGTCATCGG 182 40 

TCCAGCTGAA GGTACCAAAC CTCGAAAAGT GTTACAACAA TAAAAAAATA GCTTCTTTCC 18300 

AAG TTT GG AG GGAAGCTATT TTACTGGCTA TTGATTGCTT TTATTTTCTG AAGTTG GCGC 18 360 

ATTGGACTGT TTTTCGTTTT CAGTAGCAGG TTTACTTGAA GCAGGAGTAG AAGAGTCCTG 18420 

AGTTGCTGTT TTCTGATCTT CTTTTTTCTC TTCCTTCACG CTAGATTTTG GTGTTTCCTC 184 80 

TTGCTGTGTT TTTTCTTGAC TAGTGTTAGT CTCTTTAGTT GGACTGGTGT TTTCCTTAGG 185 40 

GGATTCCTTT TGGATTTCTT TGACAATGGT TGTCGTCTCC CTTGTCGTAG GTTCTTTTTT 186 00 

AATATTTTTG TTATTATCCA AGGCGTT 18627 

(2) INFORMATION FOR SEQ ID NO : 114: 

(i) SEQUENCE CHARACTERISTICS : 

I A) LENGTH; 2560 base pairs 
{ B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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{xi> SEQUENCE DESCRIPTION : SEQ ID NO: 114: 

TAAAATACGT TACCTTGCTT CTGCACGTTC AGCACGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GG AGTTG AT A TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCC AG AT GTTCCTTTGG TTGTTCCAGA 2 40 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG A CCGT ATC AT 3 60 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA CCAATTCTTG AG AC AC AACG 420 

TGAACTTCGT GAAGTCTTGA AT G ATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTGAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 54 0 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT AT ATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 72 0 

AGCAGCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT CTTGGTCGTA TCCGTAAAGA 84 0 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC T CAGTTC AG A TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 9 60 

AGCCGAATTG AAATTTGAAT TAAAATAGTC AT ATCGTTT A GGAGTTCAGA TGAACTCCTT 102 0 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG CTTCCATTAA CTTTGATGCT ATTCCAGCCT 114 0 

TGATTGAGCA TTTATTGCCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 12 00 

ACAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA C AAAAGG TT\ i 12 60 

TCAATGGACG CGTTC CTTTG ATTGCGGGTG TAGGT ACT AA TGATACGCGT GACTCTATTG 13 20 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTC CTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATC AGC ACT T T AAG ACT ATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTCACTCCAG 1500 

AAACCATGCT TCGCTTGGCT G AC CAT CC AA ATATTATCGG TGTCAAAGAA TGTACTAGCT 15 60 

TGGCTAATAT GGCTT ACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 162 0 

AGGATGGAGA TGCTTT CC AT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 
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CT CAT AC AAA TGGGGATCAA ATGCACGACA TCTTTACTGC GATTGCAGAA AGCGATATGA 174 0 

AGAAAGCCGC AGCAATTCAG CGTAAATTCA TTCCTAAGGT TAATGCTCTC TTCTCTTATC 1800 

CAAGTCCTGC TCCAGTTAAG GCAATTCTTA ACTATATGGG ATTTGAAGCT GGACCCACTC 1860 

GTCTACCTCT TGTTCCAGCA CCAGAAGAAG ATGCCAAACG CATTATCAAG GTTGTCGTAG 1920 

ATGGCGACTA CGAAGCAACT AAGGCAACTG TAACAGGGGT CTTAAGACCA GATTACTAAT 1980 

AAAGACAATA AAATCCGGCT CTTTGTCAAC TGTAGTGGGT TGAAGTCACC TAAGCTCGAG 204 0 

AAAGGACAAA TTTTGTCCTT TCTTTTTTGA TATTCAGAGC GATAAAAATC CGTTTTTTG A 2100 

AGTTT T C AAA GTTCCGAAAA CCAAAGGCAT TGCGCTTGAT AAGTTTGATG AGATTATTGG 2160 

TCGCTTCCAA TTTGGCGTTT GAATAGGGTA GTTGAAGGGT GTTGACGATT TTCTTTTTGT 2 2 20 

CCTTTAGAAA GCTTTTAAAG ACAGTCTGAA AAATAGGATG AACCTGCTTC AGATTGTCCT 2 2 80 

CAATGAGTCC GAAAAATTTC TCCGGTTCCT TATTCTGAAA GTGAAACACC AAGAGTTGAT 2 34 0 

AGAGCTGATA GTGATGTTTC AAGTTTTGTG AATAGCTCAA AAG CTTGTTT AAAATCTCTT 2 4 00 

TATTGGTTAA GTGCATACGA AAAGTAGGAC GATAAAATCG CTTATCACTC AGTTT ACGCC 24 60 

TATCCTGTTG AATGAGTTTC CAGTAGCGCT TG AT AGCCTT GTATTCGGGA TTTTCGATGA 2 52 0 

AACTGATTCA TGATTTGGAC ACGCACACGA CTCATAGCAC 2560 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TATTGGATTT CCCTTGCAAT CACTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTC TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AG C AG ACC CT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 180 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGCG CAAGTCTTTG 24 0 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 3 00 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 3 60 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 4 20 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 48 0 
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AATCTTCATA TAC ACTAAAC AAATGACTTT CTAATTATTT CAATTAGTTT TGGCTAGTAA 54 0 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 600 

AGTCATGAGA GTTTTTCGCA TTGATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 7 20 

CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 7 80 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GATACTAGAA TCTCCTTGTA 840 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 900 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGC TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 1080 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 114 0 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCT AC AAT GGGAGCCACC ATCAGTTCTG 1200 

TTCCAAAAAA GTATTGGTTT GGAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 12 60 

AG ATT GG ACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GT AT AT AG AT 13 2 0 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTTGTTTCTG 13 80 

AAAAAAACCA AGGTTCTTTA CTATTAAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 14 40 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCGTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 15 60 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 1620 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 174 0 

TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCT AG T TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA ATCAAAAATA GCAGGTTCTT 1860 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 192 0 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGA G TTTTAG CTTTCTATCA TGAAGTTGTT 1980 

GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA ACTATATCCT GTCCAGCCAC 204 0 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT ATCTAACACA CCG AT AGAT A 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACCTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2 2 20 

T C AAATGGT A AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AACAAATACA 2 2 80 
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GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 23 4 0 

AATCATCCAA TAAGGCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 24 00 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACCT TTCAATAGAT TCACCAAAAT 24 60 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 2520 

TAAATTCTCC TTTATTAAAG TGCAGATGAA AATAGTCCGT CAT AAT ATCT AGTACGTTTG 2 530 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATTATAGATA ACTTGTG TCG 264 0 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACCT TACTAGCTTG TCTGTTAATA 27 00 

CAGAGATTCG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGCTA 2760 

GCATAAAAAC AGAACGCACC AT TTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2 820 

ATCTGTTATA TCTATCACAA AT AAT AGTC A ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2880 

TTAACAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3000 

TCTTACTAGC AATCATACGA TATTCAAGCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 30 60 

TAGAACTGAA GAACCCGCAT CGGTATATAA ATTATCCGGA T C AAC AT AG T CATAAGATTC 3L20 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

ACATCACTAA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC AT AC C AAAC A 3240 

CTCTTAAAAA CGGATCGGTT TTCAAAAGCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 3300 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3 3 60 

TAAAAATTTT ATTTTGTTTG T CAT ATCT AA GTTTTCAGCA CGCTTATCCT ATTTTATAAG 3 4 20 

CCTCAAACCC AAATATAAAA CGCATTCTTT TTCCTTTTTT ACTATTGTAT C GTATTCT AC 3 4 80 

GATAACATAC TTTACTTTAT TGTTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3 54 0 

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3 600 

AAGAATATCT T AAAAAGAC C TCTTCACTGC T AAT ATT AAC ACTCATTATT T AAACT AT AT 3 660 

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 37 20 

TTTTGATCAT ATACACGGAT GAATGGAAGA TAGACTAGGA ATGCTGCAAA TGCACATACT 3 7 80 

ACAGCAACTA AT ACAGCT CG AAGATCTGCT GTC CCT AAC A AAGCTCCAAT CCCTACTGCA 3840 

GTTGGCCATG GAACCTGTGC G AT AATTGG C TTAATAAAGT TTAGAGAATT CGCTACGTAA 3 900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC TGGATTATAG 3 960 

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 4 020 
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GCTCGTCCTA 


TTGCTTTAAG 


CTGTTCAGAT 


1 TTAGAGGCAA 


AAG C AAT AT A 


TAAACATAGT 


4080 


CCTAAAGTTG 


CACCAGAACC 


ACCTGCAATT 


ACAAACATAT 


TAGAAAATTC 


ACCTGCAACA 


4140 


GCGAAGTGCC 


CGCCAGCAGC 


ATTTTCAGCC 


ATGTT AGC AA 


G AG C AATTGG 


ACTAACAAAT 


4200 


GCAAAAACAA 


TGTTCGCACC 


GTGGATACCT 


ACAATCCAAA 


GTAGTTGAGT 


CAATAGATAA 


4260 


ATAATCATTA 


AACCAATCCA 


CGAATTAGTC 


AGATTGGATA 


CAA3tA.CC AAA 


TGGAATTGCA 


4320 


ATGACTTTAA 


AAATATCTGT 


TCCCATTGCT 


ACAAGAAGAC 


C GTTG AT AAA 


GATAACAACA 


4380 


AATGCAACAA 


CAAATCCCGG 


AACCAAAGCG 


GT AAATC C AC 


GAGAAACTCC 


TTCTGG AA C A 


4440 


GCTTCAGGCA 


TTTTAATAAC 


CCAATTATGT 


TTAACACACA 


TACGATAAAT 


AAGAACAGTC 


4500 


ACAATTGCCA 


TAATGATTGC 


GGTAAAAATC 


CCTGTTCTCC 


CAAAACGTGC 


GACTACATTT 


4560 


CCCATTGCCC 


ATCCATCTGC 


AATTACTGCA 


CCTTCTTTTA 


GACTTGTCAC 


AGTCTTCATC 


4620 


ATTCCACCAT 


CAAAAATGAT 


TTGCGGTACT 


GTCATGACAA 


AAGCCATCAA 


GGCAAGCAAG 


4680 


GCACCATTAA 


GAGGATTCAT 


ATTGAGTTCT 


TCTTCCTCTG 


CATAAATTTT 


TGTCAATTCA 


4740 


TATGCAAGTG 


ATAGAACGAA 


ATAAAGAGAT 


AGAGAACCCA 


TAGTCGCATA 


GTTTGCAACC 


4800 


ATGTAAAGTG 


ATGTGAATTT 


ATCAAATGAA 


GCAGAGAAAA 


TATCTGCCAC 


AATTGGCCAA 


4860 


AATGAGAAAG 


CTTGTGGCAA 


AATACTGAAT 


ACCAAAAACA 


TTGATCCTAC 


AATAGTAAAT 


4920 


GGTACAGCAG 


CCATACCTGC 


AGCCGTGATA 


GC ACGT ACT A 


CTTTAAACTG 


AGCAAGTTTG 


4980 


CCCATTGGTC 


CCATAACATG 


GTT TTCAAGA 


AAACCAAACA 


ACCCGTTTTG 


TTGATCCATA 


5040 


AATAGACCTC 


CTTAATAAAA 


CATAATAATT 


TTTACTTTCT 


AAAGACTAGT 


TTCAAATACA 


5100 


AATTATACTA 


GATCAGGATT 


ATAAACTAAG 


TGAGTTCTTT 


TCCAATTGGA 


CAAATTGTTG 


5160 


AT AAG CCTT A 


TCTGTTCGTT 


TATAAATTTT 


TTTAATTCTT 


CTAATGTCTA 


AC AAACTC AG 


5220 


AACTAAACCT 


AATAGAAGAA 


CTACAAAAAC 


AAATAAACGT 


GCTACTTGGT 


TATTTTCAAA 


5280 


AATCGGAAAA 


AGATTCTTAA 


ACCAACTTGT 


CCAAGTTAAA 


ACAAGTAATC 


CTATTGAAAT 


5340 


AAGCATTTGT 


ATTCTAACAA 


ACATTAGTGT 


TATTCCCAAC 


TTTTCTTTCC 


TATTTCCATA 


5400 


AAGTTTAAAT 


TGTTCAACAG 


TTGCTAAAAT 


AGAAAATACT 


ATGAGCATAA 


TGGGGAAAAT 


5460 


AATAATAGGC 


GAGGGACTAA 


TAAACTGACT 


CAAAAGCCAA 


TAAATATTCC 


CAAAAAAGAA 


5520 


GAGTGCTATT 


GAATAACGTA 


G AAG AAGAT A 


TCGATTGAAA 


AAAGTATTAG 


TTAGAGCCAT 


5580 


CTCTCGACGT 


TGTTGTTCAA 


TCTTTTGTCG 


TTCTTTTTTA 


TCCATATCAT 


TTCC TCCTT A 


5640 


TATAACAACA 


CATATTTAGT 


TAACTTTCTT 


ATAAAGAGCT 


AACATTTCCT 


TTGCTACTTC 


5700 


TAATAATGTC 


ATAGTGGTCA 


TTAAATGATC 


TTGAGCATGT 


ACCATGATAA 


TTTCAATTTT 


5760 


AATTTCCACT 


CCACTTGCGT 


ATTCTTGCAA 


GAGTTTGGTT 


TCTCCATGAT 


GCGCTTCAAG 


5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 5890 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCACGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTT CCT CTTGATTCAT AT AAAC CTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGT C AG AGAG ACTATCTTTT TAGT CACAGT TGC CAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACCAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6 300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 6 3 60 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6 54 0 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT G AC ACT AT AA AATCTATTCC 6 600 

TTTAAGTTTT TCTTGATTCA AT TC AT AGT A GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 6960 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AACATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7 080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAG T A ATGTAGTCTA TTTCCTGAAC 714 0 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 72 60 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 73 20 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7 380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 744 0 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7 500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7 560 
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ATCATGAGAT 


AATCTTGAAC 


GACTAACGTA 


AAGTTCATCA 


GCTAAATCAT 


CAAAAAGAAC 


7620 


TGGAACTTGC 


TCAAATAATA 


ATTTATTTAA 


GATAAATACT 


AAACGATCAT 


CAC CTTTTG A 


7680 


AACCGCAGTT 


TTCGTATAGT 


CTTCTTCCAG 


TTCATAAGTT 


TGTCTAAACT 


CCTGGTAAGC 


7740 


GCCTTGATTC 


TCAAAAAATA 


TTTGAT AC C C 


TTGACCTTGT 


TTTGAAATCA 


ACCGGACTCC 


7800 


TTGAATAATC 


ATTGTCTTCT 


CAATTAATTT 


CAGTACATTA 


CGGACAGTTC 


TATCTGAACA 


7860 


GGATAAATAT 


TCTGCCAGTT 


CTTTGCTTGT 


AACAAAACGT 


TCCTTATTTT 


TTATTAAAAA 


7920 


TTGAAGGATA 


TCTTTCTCTT 


TAATGTTTAA 


C AC ATT C ATT 


CCCTCCTAAA 


ACGTATGTTT 


7980 


TCATATATTG 


AAGCATATTA 


TACACTTAAA 


TCAGTTTATA 


TCAAACTCAA 


AACAATTTAT 


8040 


CTTAACCTAA 


ATATTTATTG 


ACATTTCATG 


TGTTCATCAA 


ATATTCTCAA 


GAATCAAATT 


8100 


AGCCATTTTT 


TCAATTCCC A 


TTGGAATAGG 


AATATAGGCT 


TGAGGAGGTA 


TTTGTACAAC 


8160 


TGGTTTTCCT 


GCTTTAGAAC 


CAGCCTCTTC 


AAATTGCTTA 


AAGTACATTT 


TTGTTTGAGG 


8220 


ACT G ACAAG A 


TACAAATCAA 


AAGCTGCTGC 


TGCGATAGCT 


TTCCCTCCTT 


CAGTAGCACT 


8280 


AATAGCATCA 


ACTACAATAT 


CTTTCCCTTT 


TCCTTTTAGA 


AACTCTGTTG 


TTTTCTGTGC 


8340 


CATAAGTGAT 


GAAGACATTC 


CTGCTGCACA 


AATAATTAAA 


GCTTTTGCCA 


TAATATTTTC 


8400 


TCCTTTTCTT 


AAATCCAATC 


AAAGCTGTGC 


TAAGTTGGCT 


TATTTGTTAT 


CTATTTTTAT 


8460 


TATAAAATAA 


AGCGTTTCCA 


ATGACAATTC 


CCTCATTTTC 


CTAAATGATA 


TGGAAAAAAA 


8520 


TTATTTATAC 


TTCAATTTAT 


AAAATAAAAT 


TATTCCTGAG 


AGTAGAAATG 


AAACACTATT 


8580 


TGCTAAAATC 


AAAGGCAAGT 


CTCCTATACG 


AATACCATGA 


GCAAGCCACA 


ATGCAATACC 


8640 


AATAACTTGC 


ATAACATACA 


TACCTAGAGC 


AATAGATCCT 


GTGTCCTTTG 


TCTTAACTAC 


8700 


AC G AAAAACT 


TGTGGTAAAA 


ATGCAAATGT 


TGTTAAAATT 


GCTGCAATAC 


TTCCAATCAT 


8760 


ATGTCACCTC 


AATATGCTAA 


ACAAACTGAG 


AATAATCTCA 


GTTTGTTTAT 


ACTATTCTAC 


8820 


TGATTCACCG 


TTAGATGAAA 


TAACTTCCTT 


ATACCAGCCA 


AAAGATTTTT 


TCGGGGAACG 


8880 


ATTATAACTT 


CCCTTCCCAT 


TATCATCTTT 


ATCTACATAA 


ATAAAGCCAT 


AACGTTTCCG 


8940 


CATTTCACCG 


GTACCAGCTG 


AAACCAAATC 


AATACATCCC 


CATGGAGTAT 


AACCCATTAA 


9000 


ATCAACACCA 


TCTTCAACTA 


CAGCCTTTTT 


CATTTCACGA 


ATATGGGCAC 


CT AG AT ATT C 


9060 


AATTCTATAA 


TCATCATGTA 


CCATACCATC 


TGCTGCAACT 


TGATCTATAG 


CTCCAAAACC 


9120 


ATTTTCAACA 


ATAAAGAGTG 


GTAAGTGATA 


GTGGTCTGTA 


AACCAATTTA 


ACGCATAACG 


9180 


CAAACCTTCT 


GGATCAATTT 


GCCACTCCCA 


TTCAGAAGCC 


TTAACATAAT 


TATTTTTCAC 


9240 


TAAATCTTCT 


GTTTCAAGAT 


AATCAAAATA 


AGGATTATTT 


TCACGATCAG 


AGTCGATAGC 


9300 


AAAGGACATA 


TAGTAACTGA 


AACCAATGTA , 


ATCTACAGTC 


CCACCAAGTA 


AATCTTCTTT 


9360 
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ATCCTGGG C A GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATCCTC 94 20 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CCCTTCTGCA TAGCTTTCAT 9 480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGG AC AC A TCGCAATCAT 9 54 0 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGCAGAAGC 9 600 

AACT AATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9 660 

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGT AA CGTTTAAATA CGACTTCTGC 97 80 

AAAACGAGCA AAGAAATCAA T C AATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 98 40 

GTGATAAGGC ATTTCAAAAT GAG AT AG AGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9 9 60 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020 

AAAAAGTGCT AT ATCTT CTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGGATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTC CATC AC GACCAGCAGT 10140 

CATAACATCA CCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 102 60 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 103 80 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 104 40 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGCCATC CCAGAATCAT CTTCTATATA 10 500 

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10 560 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 106 20 

AGTTCCCTCT TTTTCAATTC GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 106 80 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10 800 

CAATTTTAAA AAAAC AAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 108 60 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCT T AT A 10920 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTGCTTTTTC ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTACCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGT GGTT 11160 

AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA ACTCCCTCGT CATCGTCTCT 11220 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TCATATCTTC TTCCAGATAC TTT 11303 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3112 base pairs 

( B ) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 2 40 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 3 60 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGG AGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 6 00 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 6 60 

ACTCAGAGTT CT AT CAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 7 20 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 7 80 

CCAAGTTTGA TGGG CAAAT C CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG £40 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT C AAC CAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTT ATTT C AC CTTCCTATAC TTTCTTCGCT GTTTGTAAAT CAAAATGCAA GACACATCAG 12 60 

TAGCACCATA TTTGTTACTC TTATCTGTCC TCTCAAGAGA CTATTATGAG TTATTTCAGA 132 0 

ATCATTCACT ACTTTGACCC TGACTCTCCT TAGTCTCAAA ATCAAAGACT TATACTCTTC 13 80 

AAAAATCTCT TCAAACCGCG TCAACGTCAC CTT GG ATTAT ATATGTGatC TGa CTTCGTC 1440 

AGTTCTATCT ACAACCTCAA AGCAGTACTT TGAGCAACCT GCGACTAGTT TTCTAGTTTG 1500 

CTCTTTGATT TTCATTGAGT ATTAAACAAA AAGTGAACAA ATCTGAATTC TAATGTACAG 15 60 

AAGACTAGGC TTGTTCACTT TTTTATAGTC GCTATAAGAT GACCTTATCT ATACCTTTTT 16 20 

A TAT ATAATT ATATATTCAG ACATACTATT ATCAATTTTG TCGCAGGGAG GAATCTGTTA 1680 

ACGCACCCAT TCACCATTAT CATTGACTCT ATAGCCATCT ATACTTGTAT TGACCGCTAA 1740 

CTCACCCGAT GTATTTACAT AATACCATTT ACCACCAACT TGGAACCATT GATTGACTTT 1800 

CATAGAACCG TTGCTGTTGA GGTAGTACCA TG AAC T ATT A ACTTGTACCC AACCTGTTGC I860 

CATGGAACCA TCAGTATTAT AAAAATACCA CATACCATTT TCTTGTTTCC AGTCTGTTCT 1920 

TGGAGCAACT GCTTTAGCTG GTTCTACTGC TACATCTGTT CCTTGGTTAG ATGTAACAGA 1980 

TACAGGATAC GAAGGAATAG ATGATTGCTC AGGAACAACA ACTTTTTCAG GTTCTCTCGT 2040 

CCCTCTCCTT ATACGTCTTT TTACCATCTC TTTAGTAATT TGACGAGAAG TAGTTTCTTC 2100 

AATTGTTCCA TCACGTTCAT CTACAGTATA GATTGTAGTA AGAGTAATTT ACCAATTTCT 2160 

CCTACTTCTT CTACTTCTTG ACTTTTATCA AGAGTTGGGC CATCGAGATA TTCTGTTTCG 22 20 

ATTGGAATTT CTTGGACAAG AACTTGGGGC TTGGTTCTTT TTTTAACAAC TCTTGTTTGA 22 80 

GAGTCTTTTT TTTGACTTAA AGTACTCTCA GTTACTTGTC CACTCTTTCC ATCTACATTA 2340 

TAAGTTATCG TTGTAACTGT TTTCCCATTC TTTCCTAGAG TAATCTCTTG CTCCTGTCCT 24 00 

GCAGAAAGGT CATTGTCTGC TTCATATTTA GTAGCAAATG GAACAAGAAC TTCTTCAACC 24 60 

TTGCTTTTAC CTGGAACTTT GATAACTGTA TCCGTGGCTT CTTTTCTATC AACAGTAACC 2520 

TGTTCGGTAA CATAACCAGT CTC TG G ATT A ACATCGTAGG TCCTTGTCGT AGTTACATAG 2580 

CCATCCTCTC CATCAATTGT AACAGGATTT TC ACT AC GGT CTTTTGTTTC ATCTTTTTCA 2 64 0 

TAACGAATTC GCGTACTTGA AATTTTCTTG GTTACTACCT TAGGTTTAGT CGCTACTTTT 2700 

ACAATAATAT CCCCATTGTC AG CGT CAT C A TACTCTATTC CCTCTTCTTT ATCTCTAGTA 27 60 

TCATCTCTGA CATATTGAAT CCCATCAGCA GCATGAACAA AACTTGTATT CAGATTCCTC 2820 

CTAAAAATAA AGTTAGCCCG ATT ACCGC AG AACCAAAAAT CTTTCCGAGT TTACGTATTG 2880 
CATAGCGCTT ATT ACT ATT A GATTTTGCCA TTACATCCTA CTTCTAGTAT AGCATCTTTT . 2 94 0 
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CTATCAAACG TTAAACAATA TACGTTATAT AT AAAAT AG A CTTAGAATGA TATATTCATT 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA T ATAAAGG AT TGCAGACATC 3 060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4327 base pairs 

( B ) TYPE: nucleic acid 

(C) STRAND EDN ES S : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCC 60 

TCACTTCTAT CCACAACCTC AAAACAGTGT TTTG AG CAT C ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTT CATTG A GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

AT AAA TGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGT TTTAAA AATGACGTAT AATAAATTTA 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 3 60 

AAAAATAGTA GTACCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 4 20 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 4 80 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTCGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT T C C C ATT ATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTCCATCATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGACAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG G AT AT AG AT A TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAACAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGG AAAAT GTCG AAAC AG 12 60 
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TGATAGAAAA 


AGAAGATGTT 


GAAACCAATG 


CTTCAAATGG 


TCAGAGAGTT 


GATTTATCAA 


1320 


GTGAACTAGA 


TAAACTAAAG 


AAACTTGAAA 


ACGCAACAGT 


TCACATGGAG 


TTTAAGCCAG 


1380 


ATGCCAAGGC 


CCCAGCATTC 


TATAATCTCT 


TTTCTGTGTC 


AAGTGCTACT 


AAAAAAGATC 


1440 


AGTACTTCAC 


TATGGCAGTT 


TACAATAATA 


CTGCTACTCT 


AGAGGGGCGT 


GGTTCGGATG 


1500 


GGAAACAGTT 


TTACAATAAT 


TACAACGATG 


CACCCTTAAA 


AGTTAAACCA 


GGTCAGTGGA 


1560 


ATTCTGTGAC 


TTTCACAGTT 


GAAAAACCGA 


CAGCAGAACT 


AC CT AAAGGC 


CGAGTGCGCC 


1620 


TCTACGTAAA 


CGGGG T ATT A 


TCTCGAACAA 


GTCTGAGATC 


TCGCAATTTC 


ATTAAAGATA 


1680 


TGCCAGATGT 


AACGCATGTG 


CAAATCGGAG 


CAACCAAGCG 


TGCCAACAAT 


ACGGTTTGGG 


1740 


GGTCAAATCT 


ACAGATTCGG 


AATCTCACTG 


TGTATAATCG 


TGCTTTAACA 


CCAGAAGAGG 


1800 


TACAAAAACG 


TAGTCAACTT 


TTTAAACGCT 


CAGATTTAGA 


AAAAAAACTA 


CCTGAAGGAG 


1 860 


CGGCTTTAAC 


AG AG AAAAC G 


G AC AT ATT CG 


AAAGCGGGCG 


TAACGGTAAC 


CCAAATAAAG 


1920 


ATGGAATCAA 


GAGTTATCGT 


ATTCCAGCAC 


TTCTCAAGAC 


AGATAAAGGA 


ACTTTGATCG 


1980 


CAGGTGCAGA 


TGAACGCCGT 


CTCCATTCGA 


GTGACTGGGG 


TGATATCGGT 


ATGGTCATCA 


2040 


GACGTAGTGA 


AGATAATGGT 


AAAACTTGGG 


GTGACCGAGT 


AACCATTACC 


AACTTACGTG 


2100 


ACAATCCAAA 


AGCTTCTGAC 


CCATCGATCG 


GTTCACCAGT 


GAATATCGAT 


ATGGTCTTCG 


2160 


TTCAAGATCC 


TGAAACCAAA 


CGAATCTTTT 


CTATCTATGA 


CATGTTCCCA 


GAAGGGAAGG 


2220 


GAATCTTTGG 


AATGTCTTCA 


CAAAAAGAAG 


AAGCCTACAA 


AAAAATCGAT 


GGAAAAACCT 


2280 


ATCAAATCCT 


CTACCGTGAA 


GGAGAAAAGG 


GAGCTTATAC 


CATTCGAGAA 


AATGGTACTG 


2 340 


TCTATACACC 


AGATGGT AAG 


GC G AC AG ACT 


ATCGCGTTGT 


TGTAGATCCT 


GTTAAACCAG 


2400 


CCTATAGCGA 


CAAGGGTGAT 


CTATACAAGG 


GTGAC CAATT 


ACTAGGAAAT 


ATCTACTTCA 


2460 


CAACAAACAA 


AACTTCTCCA 


TTTAGAATTG 


C CAAGG AT AG 


CTATCTATGC 


ATCTCCTACA 


2520 


GTG ATC; XCCl A 






CTCAAGATAT 


TACTCCGATG 


GTCAAAGCCG 


2580 


ATTGGATGAA 


ATTCTTGGGT 


GTAGGTCCTG 


GAACAGGAAT 


TGTACTTCGG 


AATGGGCCTC 


2640 


ACAAGGGACG 


GATTTTGATA 


CCGGTTTATA 


CGACTAATAA 


TGTATCTCAC 


TTAGATGGCT 


2700 


CGCAATCTTC 


TCGTGTCATC 


TATTCAGATG 


ATCATGGAAA 


AACTTGGCAT 


GCTGGAGAAC 


2760 


CGGTCAACGA 


TAACCGTCAG 


GTAGACGGTC 


AAAAGATCCA 


CTCTTCTACG 


ATGAACAATA 


2820 


GACGTGCGCA 


AAATACAGAA 


TCAACGGTGG 


TACAACTAAA 


CAATGGACAT 


GTTAAACTCT 


2880 


TTATGCGTGG 


TTTGACTGGA 


GATCTTCAGG 


TTGCTACAAG 


TAAAGACGGA 


GGAGTGACTT 


2940 


GGGAGAAGGA 


TATCAAACGT 


TATCCACAGG 


TTAAAGATCT 


CTATGTTCAA 


ATGTCTGCTA 


3000 
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TCCATACGAT GCACGAAGCA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3 060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTCAGTTC ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGC CT AT ACCCTATCAT 3 240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3 3 00 

AGACAGATGG GCAAAGGAGA G ATGGG C AAA GGAGTTATTG CCTTGGAGTT CGACTCAGAA 3 3 60 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3 420 

ACCCAGTATG ATAGCAAGAC CTTGTTCTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 34 80 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTC CTGT AAATCTAGCA 3 54 0 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3 600 

TTTACAGGGG GAGTTAATGG T AC AG AG CCA GCTGTTCATG AAATCG C AGA GTATAAGGGA 3660 

TCTGATTCCC TTGTAACTCT TACTACAAAA AAACATTATA CTTACAAACC TCCTCTTGCT 372 0 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3 84 0 

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT 3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3 960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTCAAAG TGAAACAGCA 42 00 

AGAGTTGATA GAGCTGATAC TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4 2 60 

AAATCT CTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4 320 

CTTTACG 4 3 2 7 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 521 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(DJ TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TT C AT AAAGG AGTTCTTATG 60 
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TTAGATATCA AACGTATTCG TACAGATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 120 

GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCGTAACATC 180 

TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 240 

GCTAAGCGCA ACAAGGAAAA TACAGATGAC AACATTCCTG CCATGCAAAA TCTATCTGCT 300 

GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG CTAAATTGAC AGAATTTACA 3 60 

ACGACTCTTC CAAATATCCC AGCTGACAGC GTTCCTGTTG GGGCTGACGA AGACGACAAT 4 20 

GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT TCGAACCTAA AGCTCACTGG 480 

GATCTCGGTG AAGACCTTGG TATCCTTGAC TGGGAACGCG CTCCTAAGGT AACAGGCGCT 540 

CCCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC GTGCTATCTA CAACTTTATG 600 

TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA CACCTTACAT AGTCAACCAT 660 

GATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG AAGATACTTT TGAACTCAGC 720 

GATACCAACT TTGTCTTGAT TCCAACTGCT GAAGTTCCTC TGAC AAACT A CTACCGTGAT 7 80 

GAAATCTTAG ACGGCAAAGA TCTTCCAATC TACTTCACTG CCATGAGTCC GTCATTCCGT 84 0 

TCTGAGGCTG GTTCTGCCGG TCGTGATACG CGTGGCTTGA TCCGTTTGCA CCAATTCCAC 900 

AAGGTTGAAA TGGTCAAATT TGCCAAACCA GAAGAATCTT ACGAAGAATT GGAAAAAATG 960 

ACAGCCAACG CTGAAAACAT TCTTCAAAAA CTCAACCTTC CATACCGTGT CGTTGCTCTC 1020 

TCTACTGGAG ATATGGGCTT CTCAGCTGCG AAGACTTACG ACTTGGAAGT GTGGATTCCA 1080 

GCACAAAACA ATTACCGTGA AATCTCAAGC TCTTCAAACA CAGAAGATTT CCAAGCCCGT 1140 

CGTGCCCAAA TCCGTTACCG TGATGAAGCA GATGGCAAGG TGAAACTCCT TCATACCTTG 1200 

AACGGTTCTG GACTTGCAGT TGGACGTACA GTGGCTCCAA TTCTTGAAAA TTACCAAAAT 12 60 

GAAGATGGTT CTGTG AC CAT CCCAGAAGCA CTTCGTCCAT ACATGGGTGG AGCTGAAGTC 1320 

ATCAAACCAT AAAAAATAAG GTTTAGCTAT TTCTAGCTAG ACCTTTTTTC GTAACCAAAT 1380 

CAGATAAGCA CCTAGTACAA AG AAT AAAAT AGTTAGGCAT ATAATGGTTT CAGCCAATAC 1440 

CAGGTAATCC AGAAATGGAA GTTTCAAAAT TCCCTGAGCC ATCTTGAGCG AGGTCGCTGT 1500 

GATAATGGTT GGGAAGGTGA GGGCTGAGAA GGCTGGTTGA AAACCTTGTT TTAAAATCTT 15 60 

GGGCAGACGA GTTAAAACAA AGAAAAAGAA GGATTGAGAA GCCAAAATCA TGACAATCAA 1620 

GACCCAAGTC GGCAGGCTGG TTCCTCCTAC TCGAACTAGA GAAGCCAAGA GT AG AG AG AA 16 80 

AGGAGCACAG TAGATTCCTT CTTGTCCAAG CAAGGCTAGT GGGAGTGGAT GTTTCTT T AA 174 0 

ATCGCTATAA ATAAGGGGAT AG AG AT AG AA GGTCAAGAGA AAACCAAAAC TCAAGGTCGC 1800 



WO 98/18931 



PCT/US97/19588 



844 

ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 136 0 

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 19 20 

GGTAAAACCA GCAAT CAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 19 80 

TACCAAAGGA AG AT AAG AG A ATACGCGAAA GACATAGGTC G AT AAAATCA TCCCAGCCAT 204 0 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TC AG ACT AAA 2160 

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 22 20 

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2 2 80 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2 340 

GACTTCATTT TACTCAGATA TGAATTAGGC AT AAGGTTG C AATTCTGGAT TAATTGGTGT 2 4 00 

ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 2 4 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC T AAAAACT C A GACAAGGCTT CATCTCCTAC 2520 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2 580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGG ATTTG 2 640 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2 7 00 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2 7 60 

GACAATTTGC TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGG 2 8 20 

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2 880 

TGACTCTACT CTATGGATAG TAAATGTTGT C AT AA GAT AC CTCTTTTCTT ATTATTGACA 2940 

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3 000 

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 30 60 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCC A TTATTTGTCA AAATACTTTT TAGTTTC AG C AAT AACGACT GGCGACAAGA 3 18 0 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3 2 40 

AGACCATATC. CAACTCGATA AATCCT CCT A ACAAGACCAT GAGCACAAAA ACCACACGGT 3 3 00 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCGTTCTCCG TAATAGTTCC 3 3 60 

AA CCT AG AAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 34 20 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT CAGTCAAGCC AACCCCATTC AAGTCACCGC 3 4 80 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3 521 
<2) r N FORMAT I ON FOR SEQ ID NO: 119: 
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fi) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 1968 base pairs 
(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi J 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


119 : 






AACCTGGGCA 


AGCAAGCTAA 


AAGCAATGGG 


ACCTGGAATC 


CTAATGGCAA 




6 0 


TGGAGGTTCC 


CACATTGTAT 


CCTCAACTCA 


AGCTGGCGGT 


TCTT ACGGTT 


GGTCTCTACT 


120 


TCTCTTGGTC 


ATCTTAGCCA 


ATGTCTTTAA 


ATATCCATTT 


TTCCGTTTTG 


GTGCTGAATA 


18 0 


CACAGCTGAT 


AC TGGAAAG A 


CTTTGGTTGA 


AGGTTATCCC 


GAAAAAGGAA 


AACTCTATCT 


2 4 0 


CTGGATTTTC 


TTTATCCTCA 


ATGTCTTTTC 


GGCTATGGTC 


AACACGGCTG 




3 00 


TCTGTGCTCA 


GCTATCATCG 


CCAGTGCCTT 


CCCAATGATT 


GGACTTAGCA 


TTACTCAGTG 


3 60 


GTCCCTCATT 


CTCGTTGCAA 


TCATTTGGGC 


TATGCTACTC 


TTTGGAGGCT 




*i 4 \J 


AGACGGCATG 


GTCAAATGGA 


TTATGTCTGC 


CTTAACCATT 


GCGACTGTTC 




i on 
** Oyf 


CATTGCGGCG 


GTCAAGCATC 


CAGAATACAG 


TTCTGATTTT 


GTCGAGAAGA 


CACCTTGCCA 


54 0 


AATGGCAGCT 


CTGCCCTTCA 


TCGTCTCCCT 


CCTAGGATGG 


ATGCCGGCTC 


CTATTCAAAT 


600 


TTCAGCCATC 


AATTCACTTT 


GGTCAGCTGA 


AAAGAGAAAG 


ACCCTCAACT 


TTAACACAGA 


6 60 


AGACGCTCTG 


TTTGACTTTA 


ACACTGGTTA 


TATTGGAACA 


GCTATCCTAG 


CCGTCTTCTT 


720 


TGTG GC ACTG 


GGAGCACTGA 


TTCAGTATCC 


TACAGGGCAG 


GCGGTTGAAG 


CTGCTTCAGC 


780 


CAAATACATC 


TCTCAATTCG 


TGGGCATGTA 


TGCCTCTGTT 


CTTGGCGAAT 


GGTCCCGTTA 


840 


CTTGATTACC 


TTTATTGCCT 


TCCTCTGTAT 


CTTTGGAACA 


GTTATAACTG 


TTATCGATGG 


900 


CTATTCTCGC 


GTTAATCAGG 


AATCTCTCCG 


ACTGCTAATC 


AGTC AAAAAG 


AGGACAATCG 


960 


TAAATCTTTG 


AACATCTGGA 


TGACCATCAC 


TGCTATCATC 


GGTATCGTCA 


TTATCAAGTT 


1020 


CTTCGCTGGT 


CAGGTTTCAA 


CCATGCTCCG 


CTTTGCCATG 


ATTGGCTCTT 


TCCTGACAAC 


1080 


ACCTTTCTTT 


GCTCTTTTGA 


ATTACGCCTT 


GGTAACGCGT 


GAAAACAAAA 


ATCTTCCTTC 


1140 


TTGGCTCAAA 


CACCTTGCCA 


TTGCGGGATT 


GATTTTCCTC 


TTTGCTTCGC 


CATCTTCTTT 


1200 


ATCTACGCAC 


TCGCAATCGG 


AAAAGCAGGG 


TAAGGGACAA 


GCGCGAGATG 


AAGATAAGGT 


1260 


TTCATTTCAA 


G AG AAAATT C 


AGCAAATATT 


TCTATGATAA 


AAAGCATAAG 


AACAAGGTTT 


1320 


TGAAGACCTG 


AACTTATGCT 


TTTTTACGTT 


CTTAAAGACT 


GTTTATACTC 


AAAAAACAGT 


1380 


TGAACAACTT 


CAACCACCTC 


TTATAAGAAC 


TTT ATACT AT 


TCGAGAATCT 


CTTCAAACCA 


1440 



WO 98/18931 



PCT/US97/19588 



846 

CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500 

TTTCCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620 

CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 174 0 

AATCTCCCAC CCGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 1860 

CCTTGACCAA ATC CTTG AT A TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 192 0 

GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1963 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 120: 



CCGCATTTTT 


TATCACTAGA 


CTCGAGACAT 


CTTTTGAGTG 


GCTCTTGCTC 


TCTGGTTTAA 


60 


TTTTCTTCCT 


TGCTCAAGGA 


CTCCTGCTAT 


TTCTCTTGGT 


CGTCCGACTC 


AAACATCAAT 


120 


TCGCTGAGAT 


TTATCCTCAA 


ATCAATAAAA 


AGATTCGCTT 


CTACTATTTA 


CGGGTTCTCA 


180 


CCATTGATTT 


TCTATTTTTT 


GTTCTCTTAG 


CCTTCATTAC 


TTCTCAGCGT 


TTTTCATCTC 


240 


TTATGCCAAT 


CATCACTCCT 


TCCCATTCTA 


CTTTTTATTA 


TATGACAGCT 


GACTACCTAA 


300 


GAGAAAACTA 


TCCAGACTTT 


TACGACAAAC 


ACATCTCTTT 


ATGGGAGTGT 


CTCTAAAGAA 


360 


AAGGAGGTTT 


TAGCATGAAA 


AAAATCATCT 


TCATCAAAAC 


C ATT C AACTC 


CTTGTCATTG 


420 


ATGGAATCAT 


GCTGGCATTT 


TTGACATTTA 


AAAGGGGGCT 


TACTTGGCAC 


TGGATTTTGA 


480 


TTT AT AG CGG 


TTGGCTCATT 


TTCTTTCATC 


CTGTGCTATT 


GACCTATCTT 


TCAAACCAAC 


540 


TTTGTGACCA 


CTTTAGTTAA 


CTCTATTCCC 


AGATTAGACC 


GAGATTCTGG 


CGTTTTGCTT 


600 


T AC AAATTCT 


CCTATGGGAT 


AGCCTGATGA 


TTCTCTCCTT 


GGTGTCTTTA 


AGTGATATTC 


660 


CACTTTTCCT 


TCAGGGAACT 


CTCCTCATCC 


TAGGACATCT 


CATCCCTTCC 


TATCGCATCT 


720 


GCCAAAGCCT 


GAAAAGAGAC 


TTCCCCCAAG 


CATATCAAGA 


ACCGATTTCT 


TTTTGGAGTA 


780 


TTTTATGATA 


GATGAGAAAG 


ACCAAGCCGA 


CTGGGCTTGG 


TCTTTCTTAT 


CTCTTTTTAG 


840 


TATCTAGGAT 


AATGGTAACA 


GGTCCATTAT 


TAACCAGCTC 


AACCTGCATA 


TCTGCTCCAA 


900 



WO 98/18931 



PCT/US97/19588 



847 



AGATGCCTGT 


CTGAACGGGC 


ACTTCTTGCG 


1 CTAATTTTTG 


ATTGAAAGCA 


. TC AT AG AAG T 


960 


CTGATGCCAT 


ATCAGGTTTA 


GCTGCCCCTG 


TAAAGgCTGG 


ACGATTCCCT 


' CTCTTAGTAT 


1020 


C CGCAAAG AG 


GGTAAACTGA 


GAAATAGAGA 


GGATTTCTCC 


TTCAATATCT 


TTGACAGACA 


1080 


GGTTCATCTT 


GCCTTCTGCG 


TCTGAAAAAA 


TCCGCATATT 


GACCAGTTTT 


CTCACAGCAT 


1140 


AGTCCAAATC 


TTCCTCTTGG 


TCCTCTGGTC 


CAACACCAAC 


CAGCAATAAA 


AGTCCCTGAT 


1200 


TGATTTTTCC 


CTGAATCTGG 


CCTTCTATAC 


TCACTTGGGC 


TTTTTTAACC 


CGTTGGATAA 


1260 


TGATTTTCAT 


AATAGCCTTT 


CTAGTAAGAG 


CTAGGACAAC 


TAGCCGTTGG 


TCCGTT TG AC 


1320 


AGAGTAAACT 


TCTGGCACAC 


TCTTAATTTT 


ATCGACAACC 


GTGGTCAGTG 


TAG AG AGG TT 


1380 


GGCAATACCG 


AAG g AC A CAT 


GGATATTAGC 


AAACTTCATA 


TCCTTGGTTG 


GTTGGGCATT 


1440 


GACCGTTGAA 


ATATTCTTGG 


TTGTATTTGA 


AAGAACTTGC 


ACTACATCGT 


TC AACAGTCC 


1500 


TGTACGGTTG 


AGACCGTAGA 


TATCGATATG 


GGCCATATAC 


TCCTTATTTG 


AGCTAGGGTA 


1560 


CTGGTCTTCC 


CATTCCACAT 


CAAGGAGACG 


TTGCTCGTAG 


TTTTCTTGGG 


CACGCAGGTT 


1620 


CATACAGTCC 


ACACGGTGAA 


TAGCCACACC 


ACGACCCTTG 


GTAATGTAGC 


CAACAATATC 


1680 


GTCACCAGGC 


ACGGGGTTAC 


AACACTTAGC 


AATCCGCACT 


AGGAGACCAG 


AAGCACCTTC 


1740 


AATAACCACT 


CCCCCCTCAT 


GCTTGACCTT 


GAGGGTTTCT 


TTATTTTCAA 


CCTTGACCTC 


1800 


GCCACCTTTG 


ACAAGCTCCT 


CTGCCTCAGC 


TTTGGCCTTG 


GCACGCTCTT 


CCTCACGGCG 


1860 


TTCCTTTTCA 


GTCAGACGGT 


TAAAGACGGT 


AATCGCACCG 


ATTTCCCCAA 


AACCAATGGC 


1920 


CGCAAAG AGG 


GAGTCTTCTG 


TCTTGTAACT 


GGTCTTTTGC 


AGAACTTGAT 


CCATGTGGCG 


1980 


CTTGTCCATA 


AATTTATTTG 


CCACATAGCC 


ATTTTCTTGG 


AACTGAGCCA 


TCAGCATCTC 


2040 


ACGACCCTTG 


TTGACAGACA 


ATTCCTTATC 


TTGGTTTTTA 


AAGAACTGGC 


GAATCTTATT 


2100 


GCGCGCCTTG 


CTAGTCTTGA 


CCATATTGAG 


CCAGTCACGG 


CTAGGTCCAA 


AGGAGTTCGC 


2160 


GTTGGCGATA 


ATTTCAACCT 


GATCCCCTGT 


CTTTAACTTG 


GTTGTCAGTG 


GAACCATGCG 


2220 


GCCATTGACC 


TTGGCACCAG 


TTGCTTTTTC 


ACCGACCTTG 


GTATGGATTT 


CGTAGGCAAA 


2280 


ATCAATCGGT 


CCTGAATCTT 


TGGGAAGGGA 


ACGGACAGCT 


CCATCTGGGG 


TAAAAACGTA 


2340 


AATCTCCTCA 


GCCAAATAGT 


TTTCCTTAAC 


AGAGTCCACA 


AATTCCTTAG 


CATCATCAGC 


2400 


CTCGTCTTGC 


AGCTCCATCA 


T CTC CTTG A T 


CCAGTTCATT 


CCAATAGCTG 


ATTCCTTGCT 


2460 


GTTAACTTGC 


CCCTTTATAC 


CTTTCTTATA 


AGCCCAGTGA 


GCCGCAACCC 


CGTACTCAGC 


2520 


CACCTCGTGC 


ATTTCCTTGG 


TTCGAATCTG 


G AATTC AATC 


GGCCCTTTTG 


GTCCATAAAC 


2580 


AGTCGTATGG 


ATAGACTGAT 


AACCATTGGC 


CTTGCGGTTG 


GCGATATAGT 


CTTTGAAGCG 


2640 
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ACCTGGCATC GGTTTCCAAA ATTCATGCAC GTAACCAAGC ATGG C AT AAA CATCACTTTG 27 00 

GGT ATCT AAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 27 60 

GTCCTGCATT TTGCGGAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2820 

CAAGTGACGT TCTGTCGT AT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880 

ACGCTCCCTG CGCTTTTCCT TCATCATATG GGTAATCTTG TAAAACTCCG TTGGATTGAG 2 940 

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000 

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3 060 

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCG ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG AC CTTGCC AA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3 240 

AAAC TCTCTT TCC AAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3 3 00 

TCCACAAGCT ACTGTTACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3 360 

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420 

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTTGGTTAA 34 80 

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGGC ATCTCTACTC TCCAATTCTT 3 54 0 

CCTACCATTT TATCACTTTT TTAAGAATAT GAAAACTAGA TTGGAACAGA ATAAGAAAAA 3 600 

AATAATTCAA AATTGCTTGA TAATTCTGAA TTATTGCTCC GTAATATACT ACGAAGTTAG 3 6 60 

ATTTTAAACT TAGGT GAT AG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 3720 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 3780 

AC ATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 3 84 0 

ACT AGG AAAC TAGCCGCAGG TTACTCAAAG CACCGCTTTG AGGTTGC AG A TAAAGTTGAC 3 900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3 9 60 

AGTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATGT 4 0 20 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4 080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 4140 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4 2 00 

TTATGCAACC TATGAAGGGC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4 2 60 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 4 3 20 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAAGA G C AAACT AG A AAGCTAGCCG 4 380 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 44 4 0 
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TTCGAAGAGT AT T AGT A C AT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4 500 

CAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4 560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4 6 20 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4 680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4 740 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4 800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTACAGGAAT TTCGTTTTAA 4 860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATCCTCA 4 920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4 980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCT A 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 522 0 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACCCCACA TTCCGTCTCC 52 80 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAACTTT GGTACTTGCA CATTGACACG 534 0 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 54 00 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 54 60 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 55 20 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 57 00 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 57 60 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG ACCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTCCA TAGCCAATAT AAGGTTTG AT 6000 

CACATTGAGC TTCACGATAG CCT GTTT AC A GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATCGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 6240 

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TACCTGGTTG AAAGATTGAC 6300 

AAC ACTC AAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 6360 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTT C A TT AG AG AG C A CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTACC 67 2 0 

C C AT AG AAC A TTGATTCTAG CATCCATAAA GTCTGGCGAT AC AC C AAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6840 

TTCTTGGAAT TCTTT AAGG A GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACCA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6 960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 7020 

TAAATCAACA AAACCTCCTG TT TG AG ACT T ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGC TGGTCTTGTT 60 

GTTTGTATCA GTTTGTTTAC AG AGGAT CTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GCTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GAT AAAAG AC 240 

TGCTAATTTT TTGAGTTTTT CTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360 

TACAGAGCGT C AG C AGGAT A TTTTAAATCT GATTATTGAC ATCTTT AC CA AAACGCACGA 420 
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ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 4 80 

TGACATGCCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AG CCTTTG AT C AGG AATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 34 0 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCCTTT T AG AT ATTC A CTACAAGATT CGGACGGACA TTCCGCAGAT 9 60 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGACT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CCGGGAGTTG G AATTCT AGC 12 60 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 13 20 

CCCTGTTTTG ACCATGAAGT TGACAGATTT TTAC CGCT AC CTCAGCAGTA ATCATTACGA 13 80 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 14 40 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGCC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCCC 15 60 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTG AG CGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 174 0 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT I860 

ACCATCCCTC AAGTCTTTCA AAAAGG CT AC AAACTCCATG ACCGCATCCT ACGCCCAGCA 192 0 

ATGGTAGTGG TGTATAACTA AG AT AT AAAG CCCGTAAAAA CCTCCCAGTA AAAATAGGAG 19 80 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 204 0 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TC AG AAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2 220 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2 2 80 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2 3 40 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTT AAG A AGAGTATCAA AAAGAAAAAT 2 400 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 24 60 

CAACAAACTC AGCAGTTCCA GTTCTTGAAG GAACTGAAAG C AAAATC AT C GCAAACCCAG 25 20 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2 5 80 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2 640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG G AAAAG AAT A CACTCCACAA GAAATCTCAG 2*700 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTCGTGAG AAAGTAACCA 27 60 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2 8 20 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 28 80 

TTGCTTATGG TTTGGACAAG ACTGAC AAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2 940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA AT G ACTTTG A 324 0 

CTCGTGCGAA ATTTGACGAT TTGACTCGTC ACCTTGTTG A ACGTACAAAA GTTCCACTTC 33 00 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3 3 60 

GTGGTT C AAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 34 2 0 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 34 80 

TCATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTC TCACTTGGTA 3540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3600 

CTAAATCACA AGTCTTCT C A ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3 6 60 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACCCTTC CAATTGACTG 372 0 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC AT CG ACAAG A 37 80 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGT AG A CCTTCGTAAT CAAGTAGACC 3 9 60 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4 0 20 

AACGTGACGC TCCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4 080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4 14 0 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4 200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGAGTATCT AAAAAATACA CCAAAAGTTT ATAATGATTT TTGTAATCAA 4 3 20 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA AT ATTT AGCT 4 3 80 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 44 4 0 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4 500 

CGAACGTTCA GCTTATAT 4 518 



(2) INFORMATION FOR SEQ ID NO: 122: 

( i ) S EQUENCE CHARACTER ISTICS : 

(A) LENGTH : 8145 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAACTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 12 0 

AACTGATTTT TAT CT ATT AT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 24 0 

GAATTGACAC TATTCCAAGA GGACAGTATG AG GCCGCC AA GGTCTTGAAG TTTAGCCCTT 300 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 3 60 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 4 20 

ACCTT AT CTT GGCTAGTCGA AC AGCTGCT A ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGC AGG AG C CATTTATTTG ATTTTGATTG GG ATTGTG AC AATTATTTCC AAAAAAGTTG 54 0 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AG AATT AC G A AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 66 0 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 72 0 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
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AATTGCAGAA GCGCAATCTA CTGCGATTTG TCTTCCAAGA TTTTC AACT A TTTCCTCATC 840 

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACC AACT T C TGCCCTGCAT CCAGAATTAC 10 80 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 1140 

TTACCCATGA TTTGCACTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 12 60 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 13 20 

ACCAGTCTAA CAAGTCTATT ACT ATTG G AT TTGATAGTAC TTTTGTTCCA ATGGCATTTG 1380 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG AT ATTG ATT T AGCT ACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC T AC AG AC G AA CG C CGTG AAA 1560 

AGGTGCCTTT CAGTAACTCA TATATGAAGA ATGAGCAGGT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC GGACTTTGAA GC AAATCC AG AAATTTTGAA GAATATTGTC GCTAATAAGG 17 40 

AAGCGAATCA ATACCAAACC TTTAATGAAC CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCCT GTCTATGCAA ACTATTATTT AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980 

GC AAGTTCC A AG AAATC AG C CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 204 0 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA AT ATGG AAT A 2160 

GCCCACTGTG ATTTCTAAAA C ATTGTT AAA AATTGATTTC ACTTCCAAAA TTAAAATGTT 22 20 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGGTTT 2 2 80 

TTGCACCTTA C ATT ATGCGT TTTTGTG AT T TTAAGACTTG TTAGCTGATT TTTTACAATC 2340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 24 00 

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 2 4 60 

GCAAATCATT CACTTCGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGC G AC 2 5 80 
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CCATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2 64 0 

TCACTTCAAC GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CCCTTTCATT 2 700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 2 7 60 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2 820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2 880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2 940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 30 60 

AGCATTGTTG CGG AATTTAG CCATAATCGC TTTGATTTGT TC AGC AC CAT CGACACCTGA 3120 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3 2 40 

AACGGCTTGG ATGGCATCTT TATCACCTAC AAATGGTTTA ATCAACTAAC CGAAGCTTTC 3 300 

TTCAAATCCC ATCATGTAAG TCTCGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTC AG C 3 3 60 

CATAAATTTC AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3 4 20 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG G CATTTTC AG GAAGAGTTCC 34 80 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3 540 " 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAG CGTCTGG 3 600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3 660 

GGCTGCTTGG CTTTCT GGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 37 20 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTCG GCAAGAGCAC GACGAGCCAA 37 80 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 38 40 *~ 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGC 3 900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 3 960 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 4 02 0 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080 

GCCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 4140 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 4 200 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 4 260 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGCCC 4 3 20 
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AACAACGTAG 


ATGTTGATAC 


GGTTTGTACC 


AGCACCAACC 


AAGCCACGCA 


TACCTGCAGT 


4360 


ACCAAATTCA 


AG AT TTGT AT 


AG AAGG CATC 


TTCCTTAGTT 


TTTTCGTCCA 


TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC 


AAAATCAACC 


CATTTCTGGT 


AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA 


AAACATTTAA 


TCAGTTTAAT 


TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT 


CGAACATCTC 


TTCAAACCAG 


GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATGT 


TGAGGCTAGG 


AAAAATTCAA 


TTT C AG T AAA 


AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC 


TTAGTTTTAA 


ACAAGCATAT 


TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC 


GAGCATTAGA 


TT A CAT C AAA 


GAGGGGCACA 


4800 


GCCATGTCGA 


GGCAGCCAAG 


TTTTTTGGTG 


TTGG CGTC AG 


AACTCTCTTC 


ACGTGGGAAA 


4360 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG 


GAAAAAGCGA 


GTCGTCAAAA 


ACCGAAAGAT 


4920 


TCCTTTAGAG 


GAATTGAAAG 


CCTTTGTAGA 


GGCTCATCCA 


GATGCTTTTT 


TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC 


AGTATGGGCA 


GCTTTAAAGC 


AGATTAAGGT 


5040 


CACTTTAAAA 


AAAGATGACG 


AGCTTTAAGG 


AACAAGACCC 


AGAAAAGTAG 


CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC 


ACCTGTTTAT 


ATTGATGAAA 


CAGGAATCGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC 


TCCTAGAGGG 


GAGAAAGTCT 


ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT 


TGTTGCAGGA 


CAAGTAGACG 


GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC 


AAGCGATTTC 


TTTGTGGAGT 


GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA 


TGTTATTGTC 


ATGGGCAATG 


CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGG AT G 


AACTCTGCAT 


CCAAGATAAA 


CACTTTTTCT 


TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA 


GCAAGCTTGG 


GCTATCTTGA 


AAAAGAAAGT 


5520 


GACGGATGTA 


TTAAGGCAAG 


TTC CAACT AT 


TTTTGAATGT 


TTGGAATGCT 


TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT 


ATCGAGTAGT 


CATTAAAACT 


AAGGATACTG 


5640 


CTGGTTAAGA 


GAAGACGGTA 


TACAATCAAA 


CCATTCACCG 


TGTAGCCGAA 


AT CGTTC AG A 


5700 


ATGAAGACTT 


GTATCAGAAT 


GAAGACTTGT 


ATAAGAAAGG 


TTTGAATGTT 


GAACTTGCGC 


5760 


ACCAACAAAT 


T AAGGG ATTT 


TTTGAAGCAG 


AGTTTAAAAA 


TCGTATTAAT 


GGAGTTCTTA 


5820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC 


GTGTAAATAA 


AAAAACTATA 


CACCAGAGCA 


5880 


ACAAAAACTC 


CATGATCAAT 


TTGAAGCAGA 


AGCAACGGAA 


GATGCTAAAA 


AACAACGCCA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT 


GAGCATATCT 


AAGTCTAATA 


AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT 


AACCAATAGG 


CTAGCAAATG 


ACTTGAATAA 


6060 


TCTCTTGAAA 


CAGGTTGATA 


AAGATACTCC 


TAATACCCCA 


ACTTGGCTAA 


ACTCAGCTGC 


6120 



WO 98/18931 



PCT/US97/19588 



857 

TTCTAAAGCT AAAGATGATG ACAGAGTATA TAAACTACTG AAGACTCTTA TACCAGGAGA 6180 

AAATTACCTA TCATGTTAAG CATAATCAGC TAGAAGTAGA AACAGATAAA TACACATATA 62 4 0 

CTGCCGCTAG AAATGGTAGT AAGGAAGTTG GTATTCAAGA GTCAGATATA GCAGCAACTC 6300 

TAAGTGCCGA TGAATATAAT TCTAATCGCC AAACTTTTGA G AG AG AAT AC AAATACAAAA 6 3 60 

GCAAATGCCC TTAATAATGG TTGGGCTAGA TCTGGTTCTG AAGAGTTCAA AAAGTTCTCC 642 0 

CACTTTGTAG GGGTAGACAA AGGGATTGTG CGAACGAATG TACTGACTGG TAAAAAACTA 64 80 

TCTCATAAGA TTAGGAAAGA AGTGGGCTCT GGAGATAGCA AACTAGGAAA AGGCGGCTAT 6540 

TTCTCTACTG GGGATGTTCT ATTAGGAAAA GATGTTGTTT CTTATACCGT ACAAGTATTT 6600 

TCAGAGAATA ATGAAAGAGT AGGAGTAAAC ACTCAAACTC ACCGTGTTCA G T AT AATCTC 6660 

C C AATTCT AG CTGACTTTTC AGTCATCCAA GATACTGTGG AACCATCACG AACCGTTGTT 6720 

GAAAAAATCA TTCCAAAACT AAATATTCCC GAAGAAGAGA AAGCGAAAAT AACCGAAGAA 6780 

ATCAAGAAAA ACAAAAAAAC CTCAGAATTG GCAGAACTAA TCTCAGAAAA TGTGAAAGTT 6840 

CGCTATGTTG ATGAACAAGG GCGTTTGCTA TCATTGAAAA ATG AT AC TGG AATTGGAGAA 6 900 

AAAG AAAGTG ACGGAACCTA CATTACCAAT AAAAAACAAC TGATTGCTAC CAGCTATAAT 6960 

CTCACAGATA AAAAACTCAG TAGCATGACT ACT ACTG A CG G AAAAT ATT A TACTTTTAAA 7020 

GAAGCAGATA CAAATTCTCC AAGTTTAACT GGGAATATTG TAACCGAAGG TAGAACAGTG 7 080 

ACCTT AGTTT ATAGAGAAAG CGAAGCGCCA ACCACTGCTA CAGTAACAGC CAATTACTAT 714 0 

AAAG AAGG T A GGCAAGAGAA GTTGGTAGAG TCTCTTATAA AAGCTGATTT AGCGATAGGT 7 200 

TCTGAGTATA CCACAGAATC AAAAACTATT GAAGGGAAAA CAACAACTGA GGACAAAGAA 7 2 60 

GACCGAGTTA TCACAAGGAA AACAACATAC ACCTTGGTAG C AACTCCTG A AAATGCGTAC 7 32 0 

CAGAAGACGG TGCAACAGTT GACTATTACT ACCGTGAGAA TGTTGAGGAA ACAGTGGTTC 7 380 

CCAAAACAGC AACCTCTACT GAGACGAAGA CTATAACGCG TATCATTCAT T AC GTTG AT A 7 440 

AAGTTACGAA CCAAAATGTA AAAG AAG ATG TTGTTCAACC TGTAACCTTA AGCCGTACAA 7 500 

AAACTGAGAA CAAGGTCACG GGAGTTGTAA CCTACGGTGA ATGGACAACA GGAAACTGGG 7 5 60 

ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC AGATATTCCA ACAGTTGAAT 7 62 0 

CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC GGT AAGGT AT GACCGTTTAT 7680 

CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA TCCAAGTGTT CCGACACCAA 77 40 

ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA ACCAACTCCA GAACCAGGTA 7 800 

CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAACT TCCGACTTAT GAGACAGGTA 7 860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCT AATG C TACCTTGGCT AGTGCTGGTA 7 920 

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7 980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA T GTGGG CTTT TTTCCTGGTT 804 0 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTCA ACTGGGAAAC 8100 

TATGAATCCT A T TTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 814 5 



(2) INFORMATION FOR SEQ ID NO: 12 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8697 base pairs 
(Bi TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



CGGTACCGGG 


AACGATACTT 


AGTCTAATTT 


TGCACCTTTT 


CCATGTATGG 


TAAAGGTTTT 


60 


TCTTTTTTTA 


AAAAGGAAAA 


CGAGAAGAGG 


AGGTTCTTAT 


GAAAGCAAGC 


ATTGCCTTGC 


120 


AAGTTTTACC 


CCTAGTACAG 


GGGATTGATC 


GGATAGCTGT 


TATTGATCAG 


GTCATTGCTT 


180 


ATCTGCAwAC 


TCAAGAAGTG 


ACGATGGTAG 


TGACACCATT 


TGAAACGGTC 


TTGGAAGGGG 


240 


AGTTTGATGA 


GCTTATGCGC 


ATTCTAAAAG 


AAGCGCTGGA 


AGTGGCAGGG 


CAGGAGGCAG 


300 


ACAATGTCTT 


TGCCAATGTC 


AAAATAAATG 


TAGGAGAGAT 


TTTAAGTATT 


GATGAGAAAC 


360 


TTGAGAAGTA 


TACTGAGACG 


ACACATTAGT 


CTATTGGGCT 


TTCTCGGAGT 


ATTGTCAATC 


420 


TGGCAGTTAG 


CAGGTTTTCT 


TAAACTTCTC 


CCCAAGTTTA 


TCCTGCCGAC 


ACCTCTTGAA 


480 


ATTCTCCAGC 


CCTTTGTTCG 


TGACAGAGAA 


TTTCTCTGGC 


ACCATAGCTG 


GGCGACCTTG 


540 


AGAGTGGCTT 


TACTGGGGCT 


GATTTTGGGA 


GTTTTGATTG 


CCTGTCTTAT 


GGCTGTGCTC 


600 


ATGGATAGTT 


TGACTTGGCT 


CAATGACCTG 


AT TT AC C CT A 


TGATGGTGGT 


C ATT C AG AC C 


660 


ATTCCGACCA 


TTGCCATAGC 


TCCTATCCTG 


GTCTTGTGGC 


TAGGTTATGG 


GATTTTGCCC 


720 


AAGATTGTCT 


TGATTATCTT 


AACGACAACC 


TTTCCCATCA 


TCGTTAGTAT 


TTTGGACGGT 


780 


TTTAGGCATT 


GCGACAAGGA 


TATGCTGACC 


TTGTTTAGTC 


TGATGCGGGC 


CAAGCCTTCG 


840 


CAAATCCTGT 


GGCATTTTAA 


AATCC C AGTT 


AGCCTGCCTT 


ACTTTTATGC 


ACCTCTGAGG 


900 


GTCAGTGTCT 


CCTACGCCTT 


TATCACAACT 


GTGGT AT CTG 


AGTGGTTGGG 


AGGTTTTGAA 


960 


GGTCTTGGTG 


TTTATATGAT 


TCAGTCTAAA 


AAACTGTTTC 


ACT A TG AT AC 


CATGTTTGCC 


1020 


ATTATTATTC 


TGGTGTCGAT 


TATCAGTCTT 


TTGGGTATGA 


AGCTGGTCGA 


TATCAGTGAA 


1080 


AAATATGTGA 


TTAAATGGAA 


ACGTTCGTAG 


AATTAGAATG 


TTTCTGAAAA 


AGAAAAGAGG 


1140 
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AAATCAAAAT 


GAAGAAAACA 


TGGAAAGTGT 


TTTTAACGCT 


TGTAACAGCT 


CTTGTAGCTG 


1200 


TTCTGCTTGT 


GGCCTGTGGT 


CAAGGAACTG 


CTTCTAAAGA 


CAACAAAGAG 


GCAGAACTTA 


1260 


AG AAGGTTG A 


CTTTATCCTA 


GACTGGACAC 


CAAATACCAA 


CCACACAGGG 


CTTTATGTTG 


1320 


CCAAGGAAAA 


AGGTTATTTC 


AAAG AAGCTG 


GAGTGGATGT 


TG ATTTG AAA 


TTGCCACCAG 


1380 


AAGAAAGTTC 


TTCTGACTTG 


GTTATCAACG 


GAAAGGCACC 


ATTTGCAGTG 


TATTTCCAAG 


1440 


ACTACATGGC 


TAAGAAATTG 


GAAAAAGGAG 


CAGGAATCAC 


TGCCGTTGCA 


GCTATTGTTG 


1500 


AACACAATAC 


ATCAGGAATC 


ATCTCTCGTA 


AATCTCATAA 


TGTAAGC ACT 


- CCAAAAGACT 


1560 


TGGTTGGTAA 


GAAATATCGG 


ACATGGAATG 


ACCCAACTGA 


ACTTGCTATG 


TTGAAAACCT 


1620 


TGGTAGAATC 


TCAAGGTGGA 


GACTTTGAGA 


AGGTTGAAAA 


AGTACCAAAT 


AACG ACT C AA 


1680 


ACTCAATCAC 


ACCGATTGCC 


AATGGCGTCT 


TTGATACTCC 


TTGGATTTAC 


T AC GGTTGGG 


1740 


ATGGTATCCT 


TGCTAAATCT 


CAAGGTGTAG 


ATGCTAACTT 


CATGTACTTG 


AAAGACTATG 


1800 


TCAAGGAGTT 


TGACTACTAT 


TCACCAGTTA 


TCATCGCAAA 


CAACGACTAT 


CTGAAAGATA 


I860 


ACAAAGAAGA 


AGCTCGCAAA 


GTCATCCAAG 


CCATCAAAAA 


ACGCTACCAA 


TATGCCATGG 


192 0 


AACATCCAGA 


AG AAGCTG C A 


GATATTCTCA 


TCAAGAATGC 


ACCTGAACTC 


AAGGAAAAAC 


1980 


GTGACTTTGT 


CATCGAATCT 


CAAAAATACT 


TGTC AAAAG A 


ATACGCAAGC 


GACAAGGAAA 


2040 


AATGGGGTCA 


ATTTGACGCA 


GCTCGCTGGA 


ATCCTTTCTA 


CAAATGGGAT 


AAAGAAAATG 


2 100 


GTATCCTTAA 


AGAAGACTTG 


ACAGACAAAG 


GCTTCACCAA 


CGAATTTGTG 


AAATAATGAC 


2160 


AGAAATTAGA 


CTACAGCACG 


TCAGTTATCC 


CTATGGTCAG 


GAGAGGATTT 


TAGAGGATAT 


22 20 


CAACCTACAG 


GTGACTTCAG 


GCGAAGTGGT 


TTCCATCCTA 


GGCCCAAGTG 


GTGTTGGAAA 


2280 


GACCACCCTC 


TTTAATCTAA 


TCCCTGGGAT 


TTTAGAAGTT 


CAGTCAGGGA 


GAATTGTCCT 


2340 


TGATGGTGAA 


GAAAATCCCA 


AGCGGCG CGT 


GAGTTATATG 


TTGCAAAAGG 


ATCTGCTCTT 


2400 




ACGGTGCTTG 


GAAAT AT CAT 


TCTGCCCCTC 


TTGATTCAAA 


AGGTGGATAA 


2460 


GGCAGAAGCT 


ATTTCCCGAG 


CGGATAAAAT 


TCTTGCGACC 


TTCCAGCTGA 


CAGCTGTAAG 


2520 


AGACAAGTAT 


CCTCATGAAC 


TTAGCGGTGG 


GATGCGCCAG 


CGTGTAGCCT 


TACTCCGGAC 


2580 


CTACCTTTTT 


GGGCACAAGC 


TCTTTCTCTT 


AGATGAGGCC 


TTTAGCGCCT 


TGGATGAGAT 


2640 


GACAAAGATG 


GAACTCCACG 


CTTGGTATCT 


TGAGATTCAC 


AAGCAGTTGC 


AGCTAACAAC 


2700 


CCTGATCATC 


ACGCATAGTA 


TTGAGGAGGC 


CCTCAATCTC 


AGCGACCGTA 


TCTATATCTT 


2760 


GAAAAATCGC 


CCTGGGCAGA 


TTGTTT C AG A 


AATTAAACTA 


GATTGGTCTG 


AAGATGAGGA 


2820 


CAAGGAAGTC 


CAAAAGATTG 


CCTACAAACG 


TCAAATTTTG 


GCGGAATTAG 


GCTTAGATAA 


. 2880 
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GTAGAAAAAT 


ACCGAGTTGG 


TGAAGATTAT 


CCTTTACCAG 


CGCCCTTTTT 


CTTTTAAAAA 


2940 


TGAGAAAATT 


TCGGTATAAT 


AGTCAAACAA 


GGTCAAGGTT 


TAAAGAGAGA 


GGTGGGTTTG 


3000 


TTATGAGATT 


TAAAAATACA 


TCGGATCATA 


TTGAGGCCTA 


CATCAAGGCG 


ATTTT AG AT C 


3060 


AATCTGGTAT 


CGTGGAGTTG 


CAACGGAGTC 


AGTTGGCAGA 


TACCTTTCAG 


GTTGTTCCTA 


3120 


G TC AG ATT AA 


CTACGTGATC 


AAGACACGCT 


TTACGGAAAG 


TAGAGGCTAC 


TTGGTTGAAA 


3180 


GTAAGCGTGG 


TGGCGGAGGC 


TACATTCGTA 


TAGGACGGAT 


TGAGTTTTCT 


AGTCATCATG 


3240 


AAATGCTCCG 


GGAGCTGCTT 


T ACTCG ATTG 


GTGAGCGAGT 


CAGTCAAGAA 


ATTTATGAGG 


3300 


ATATTCTCCA 


GCTTTTGGTT 


GAGCAGGAAT 


TGATGACCAA 


GCAGGAGATG 


AATTTGCTAG 


3360 


AATCAGTAGC 


TTTGGATCGC 


GTTTTAGGAG 


AAGAAGCTCC 


AGTTGTTCGA 


GCAAACATGC 


3420 


TACGTCAGAT 


CATACAAGAG 


GTAGATAGAA 


AAGGGAAGTA 


AGATGAACTA 


TTCAAAAGCA 


3480 


TTGAATGAAT 


GTATCGAAAG 


TGCCTACATG 


GTTGCTGGAC 


ATTTTGGACC 


TCGTTATCTA 


3540 


GACTCGTGGC 


ACTTGTTGAT 


TGCCATGTCT 


AATCACAGTT 


ATAGTGTAGC 


AGGGGCAACT 


3600 


TTAAATGATT 


ATCCGTATGA 


GATGGACCGT 


TTAGAAGAGG 


TGG CTTTGG A 


ACTGACTGAA 


3660 


ACGGACTATA 


GCCAGGATGA 


AACCTTTACG 


GAATTGCCGT 


TCTCCCGTCG 


TTTGCAGGTT 


3720 


CTTTTTGATG 


AAGCAGAGTA 


TGTAGCGTCA 


GTGGTCCATG 


CTAAGGTACT 


AGGGACAGAG 


3780 


CACGTCCTCT 


ATGCGATTTT 


GCATGATAGC 


AATGCCTTGG 


CGACTCGTAT 


CTTGGAGAGG 


3840 


G CTGGTTTTT 


CTTATG AAG A 


CAAGAAAGAT 


CAGGTCAAGA 


TTGCTGCTCT 


TCGTCGAAAT 


3900 


TTAGAAGAAC 


GGGCAGGCTG 


GACTCGTGAA 


GATCTCAAGG 


CTTTACGCCA 


ACGCCATCCT 


3960 


ACAGTAGCTG 


ACAAGCAAAA 


TTCTATGGCC 


AATATGATGG 


GCATGCCGCA 


GACTCCTAGT 


4020 


GGTGGTCTCG 


AGGATTATAC 


GCATGATTTG 


ACAGAGCAAG 


CGCGTTCTGG 


CAAGTTAGAA 


4080 


CCAGTCATCG 


GTCGGGACAA 


GGAAATCTCA 


CGTATGATTC 


AAAT CTTGAG 


CCGGAAGACT 


4140 


AAGAACAACC 


CTGTCTTGGT 


TGGGGATGCT 


GGTGTCGGGA 


AAACAGCTCT 


GGCGCTTGGT 


4200 


CTTGCCCAGC 


GTATTGCTAG 


TGGTGACGTG 


CCTGCGGAAA 


TGGCTAAGAT 


GCGCGTGTTA 


4260 


GAACTTGATT 


TGATGAATCT 


CGTTCCAGGG 


ACACGCTTCC 


GTGGTGACTT 


TGAAGAACGC 


4320 


ATGAATAATA 


T CATC AAG G A 


TATTGAAGAA 


GATGGCCAAG 


TCATCCTCTT 


TATCGATGAA 


4380 


CTCCACACCA 


T C AT GGGTTC 


TGGTAGCGGG 


ATTGATTCGA 


CTCTGCATGC 


GGCCAATATC 


4440 


TTGAAACCAG 


CCTTGGCGCG 


TGGAACTTTG 


AGAACGGTTG 


GTGCCACTAC 


TCAGGAAGAA 


4500 


TATCAAAAAC 


ATATCGAAAA 


AGATGCGGCA 


CTTTCTCGTC 


GTTTCGCTAA 


AGTGACGATT 


4560 


GAAGAACCAA 


GTGTGGCAGA 


TAGTATGACT 


ATTTT ACAAG 


GTTTGAAGGC 


GACTTATGAG 


4620 


AAACATCACC 


GTGTACAAAT 


CACAGATGAA 


GCGGTTGAAA 


CAGCGGTTAA 


GATGGCTCAT 


4680 
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CGTTATTTAA 


, CCAGTCGTCA 


. CTTGCCAGAC 


: TCTGCTATCG 


; ATCTCTTGGA 




4 7 4 0 


GCAACAGTGC 


AAAATAAGGC 


AAAGCATGTA 


AAAGCAGACG 


it TCACATTT 


ynu I — 1 


*iOUU 


GACAAGGCCC 


TGATGGATGG 


CAAGTGGAAA 


CAGGCAGCCC 


AGCTAATCfir' 


AAAAf^AAflAr 


d s fin 
** o D u 


GAAGTACCTG 


TCTACAAAGA 


CTTGGTGACA 


GAGTCTGATA 


TTTTr: arrar 

ill A 


LI 1 UAL* i U<ol_ 


T " * U 


TTGTCAGGAA 


TCCCAGTTCA 


AAAACTG ACT 


CAAACGGATG 




ill AAA 1L1 1 




GAAGCAGAAC 


TCCATAAACG 


GGTTATCGGT 


CAAGATCAAG 


v- A v» 1 1 1 \_/AJ"V\J 


LA L 1 1 


jU^ U 


GCCATTCGCC 


GCAACCAGTC 


AGG G ATTCGC 


AGTCATAAGC 


r.Trrr. itt^p 


. I LL I i 1 Alb 


5100 


TTCCTAGGGC 


CTACAGGTGT 


CGGGAAAACT 


GAATTAGCCA 




tt tt^"*T , ^**T v T* 
Ao AA*j . ILi 1 


Q 1 £ fl 
3lO(J 


TTTGACGACG 


AATCAGCCCT 


TATCCGCTTT 


GATATGAGTG 


r\\*» 1 A\ a S\ 1 VJOrt 


c tt tt & iwrrrp & 

OAAA ill A 




GCTAGTCGTC 


TCAACGGAGC 


TCCTCCAGGC 


TATGTAGGAT 






C 1 £> A 


ACAGAGAAGG 


TTCGCAATAA 


ACCCTATTCC 


GTTCTCCTCT 




ttf—ttf^tt a t~T* f*r* 


5 3 4 0 


CACCCAGATA 


TCTTTAATGT 


TCTCTTGCAG 


GTTCTGGATG 


AV. USJ 1 Vj 1 L 1 1 


VjAf. AlsA 1 AbL 


54 00 


AAGGGACGCA 


AGGTCGATTT 


TTCAAATACC 


ATTATCATTA 




I L 1 AVAj L U 


54 60 


ACTGCCCTTC 


GTGATGATAA 


GACTGTTGGT 


TTTGGGGCTA 


A^nATATTrr 


TTTTGACCAG 


55 2 0 


GAAAATATGG 


AAAAACGCAT 


GTTTGAAGAA 


CTGAAAAAAG 


CTTATA^Arf 


V*L» AA 1 1 \—A 1 \_ 


CCQA 


AACCGTATTG 


ATGAGAAGGT 


GGTCTTCCAT 


AGCCTATCTA 


CTCATfATAT 




jOi v 


GTGAACATTA 


TGGTCAAGCC 


TTTAGTGGCA 


AGTTTGACTG 


AAAAAfWAT 


l OMU I 1 lj AAA 




TTACAAGCTT 


CAGCTCTGAA 


ATTGTTACCA 


AATCAAGGAT 


ATG ACCC AC A 






CGCCCACTTC 


GCAGAACCCT 


GCAAACAGAA 


GTGGAGGACA 


AG tt dnr* An a 


tt/ ^PTPTTPTP 


coon 


AAGGGAGATT 


TAGTGGCAGG 


CAGCACACTT 


AAGATTGGTG 


TCAAAGCAGG 


C C AGTT AAAA 


5 880 


TTTGATATTG 


CATAAAAGAA 


TAAAAGTATC 


AGCATCTGAC 


CATAAGTCAC 


ACTGnAnTHA 


594 0 


a ttT"T , f~* h. a T*r* a 
AAA LLAA1UA 


AAATCAAAGA 


G C AAACT AGG 


CAGCTAGCCG 


CAGGTTGCTC 


AAAACACTGG 


6000 


TTTGAGGTTG 


CAGATAGAGC 


TGACGTGGTT 


TGAAGAGATT 


TTCGAAGAGT 


ATGAAACTAA 


6060 


AACCT AT AGC 


TTCTAAACGA 


TCCGTGGTTT 


TCATCATTCA 


ACACAAAATT 


CATATGTTTA 


6120 


TTACCCTCCG 


TCGTATTTGT 


CTTAGAGCGT 


GTGTAGTAGA 


AAAAGAGCAG 


TCTTATCTGA 


6180 


AATTTTTATT 


CTTTCAAAAG 


AGACCTGTTT 


CTTTTTTGCA 


TGTCAAATCC 


GTTCTAGCTG 


6240 


GTATTTGAAA 


AATCAAACTA 


ATATTCAATG 


AAAATCAAAG 


AAC AAACT AG 


GAAGCTAGCC 


6300 


GCAGGTTGCT 


CAAAACACTG 


TTTTGAGGTT 


GTAGATAGAG 


CTGACGTGGT 


TTGAAGAGAT 


6360 


TTTCGAAGAG 


TATAAGCTGC 


AAGATGAATG 


ATTTTCTTGT 


ATTGACGTTG - TTGTTG AC AA 


6420 
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AAAGTAGCGG 


ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA 


TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGG 


AGAGAGATGA 


6540 


ACATTTTAGT 


TGC AGATG AC 


GAGGAAATGA 


TTAGAGAAGG 


AATTGCAGCA 


TTTCTGACAG 


6600 


AAGAGGCTTA 


TCATGTCATT 


ATGGCTAAGG 


ATGGACAAGA 


GGTCTTGGAA 


AAATTTCAAG 


6660 


ATCTCCCTAT 


CCATCTCATG 


GTACTGGATT 


TAATGATGCC 


TAGGAAGAGT 


GGTTTTGAAG 


6720 


TGTTAAAAGA 


AATCAATCAA 


AAGCAC GAT A 


TTCCTGTCAT 


CGTCTTGAGT 


GCTCTGGGAG 


6780 


ATGAAACTAC 


TCAGTCACAG 


GTATTTGATC 


TCTATGCTGA 


TGATCATGTG 


ACAAAACCTT 


6840 


TTTCTTTGGT 


ACTG CTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGACGTTAC 


TACGTCATAG 


6900 


AGGATCTTTG 


GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 


CTCTTACAAA 


CCACATTATA 


6960 


AAAATGAAGA 


AATTGATCTC 


AAACCAAAGG 


AATTACTGGT 


ACT AAAGTG T 


TTGATTCAGC 


7020 


ATAAAAATCA 


AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAAGA 


AATTTCAAAA 


GATGTAGCTG 


7080 


ATTTACCTTG 


TGATAGGGTC 


GTTGATG TCT 


ATATTCGTAC 


TCTT CGCAAA 


AAATTAGCTT 


7140 


TAG ATT GT AT 


CGTGACTGTG 


AAAAATGTTG 


GGTATAAGAT 


TAGCTTATGA 


TAAAAAATCC 


7200 


TAAATTATTA 


ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 


CTAGGTGGTG 


TTGGTCTAGT 


7260 


CATTCATATA 


GCTATTTATT 


TGACCTTTCC 


TTTTTATTAT 


ATTCAACTGG 


AGGGGGAAAA 


7320 


GTTTAATGAG 


AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


ACTAAGACAT 


CTGATGAAAT 


7380 


TCCAAGCTTA 


CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 


TCTGCTCACC 


TTAAAAGAGA 


7440 


TATTGTAGAT 


AAGCGGCTCC 


CTCTTGTGCA 


TGACTTGGAT 


ATTAAAGATG 


GAAAGCTATC 


7500 


AAATTATATC 


GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATGGTAAAC 


AGGTAACCGT 


7560 


GCAATTTGTT 


CACGGGGTGG 


ATGTCTACAA 


AGAAGCAAAC 


AATATTTTGC 


TTTTGTATCT 


7620 


CCGATATACA 


TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


T TTTCTT ATT 


TTT AT ACT AA 


7680 


ACGCTTGCTC 


AATCCTCTTT 


TTTACATTTC 


AGAAGTGACT 


AGTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT 


CGTTTTGATG 


AAAGTAGGAA 


AGATGAAGTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG 


TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTCGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA 


TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA 


GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TT GG AG ATT A 


7980 


CAAAGATCAT 


CCAAAATATA 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA 


TGAGCCACTT 


3040 


ATTAGAAGAA 


GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGG 


ACAGAGTGTC 


GTGAGACCTT 


8100 


GACTGTTAAG 


CCAGTTTTAG 


TAGATATTTT 


ATCACGTTAT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA 


ATTGAAAATC 


AATTGACAGA 


TGCTACCAGG 


GTCGTCATGA 


GTCTTAGGGC 


8220 
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ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 82 80 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAAT AC A T GTGCGCCTCT 83 40 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 84 00 

TAAGGATGAA ACTTCCGCTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 84 60 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8 52 0 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 864 0 

TGATTTTTTT AAAGCGTTCG ATAGAGAATC AG AAACG AAT CCTTAGCAAT GGCGGCC 8 697 



(2) INFORHAT I ON FOR SEQ ID NO: 124: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AG ATTTTCG A AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTG AG AG AGT AATCCCTTTG 120 

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 24 0 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 3 00 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AAT ATTG AT A TAAAAGAAGG 3 60 

GAGGCCAATA TG AG AC AT AA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 4 20 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGACGCT 4 80 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTG AG A CAACCTTTAC AGG TG AC AT C 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATG TGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACCC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 7 20 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 7 80 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACCTT CCACCTTGGC 9 00 



WO 98/18931 



PCT/US 97/ 19588 



364 



TTCTTGTAAA 


TCCAAAACTA 


CCTTCTTAGT 


GACTTGAGCA 


AGATTTTGAC 


GCAAATCATC 


960 


TGTCAAAACA 


TAAACAGTTT 


GGGCTGC CTT 


CAAGATGGCT 


TCGTAAATCT 


TATCTGGATT 


1020 


AAATTCAGCA 


ATTTCGCCAT 


TACGTTTGAT 


TACTTGCATA 


GGTTTCTCCT 


T T AT TCTTTG 


1080 


TTTTCTTTGA 


TTTCTGCCAG 


CATTTTTTCT 


TCTTCTACTG 


TCAGTTGATA 


ATGTTCAAGT 


1140 


AAATCCGGTC 


TGCGCTCGTA 


GGTTTTCTTT 


AAACTCTCGT 


ACAATCGCCA 


CTGACGAATC 


1200 


TTTTCATGGT 


GGCCACTCAT 


CAATACATCT 


GGCACGACCA 


TGC CTCG AT A 


ATCATAGGGA 


1260 


CGTGTGTACT 


GAGGATATTC 


TAAAAGACCT 


GAAGAAAAAC 


TATCATCTTG 


GTGGCTAGAC 


1320 


TCCTTGCCAA 


TCACTTCTGG 


AATCAGGCGA 


ACTGTAGCAT 


CAATCATGGT 


CATAGCTGCC 


1380 


AATTCTCCAC 


CAGTGAGGAC 


ATAGTCACCT 


AGGGAAATCT 


CATCTGTTAC 


CAAGGTCTTA 


1440 


ATGCGCTCAT 


CATAACCCTC 


ATAGTGCCCA 


CAGATAAAGA 


TTAGCTCTTC 


CTCTTGAGCC 


1500 


AAATCTTCAG 


CATAAGCCTG 


ATCAAACTGC 


TTTCCACCAG 


GATCAAGGAG 


AATAACGCGC 


1560 


GGATTTTTCT 


TTTCAATAGC 


ATCAAAGGAA 


TCGAAAATAG 


GTTGTGCTCT 


GAGCAACATG 


1620 


CCCTGACCGC 


CTCCGTAGGG 


CTCATCATCT 


ACATGACGGG 


CCTTTTCAGC 


ATTTTCTCGA 


1680 


AAATTATGAT 


ACTGGATATC 


CAAGAGCCCT 


TTTTCTCGAG 


CCTTTCCAAC 


GATTGAGTGC 


1740 


TCCAGTGGAG 


AAAACATCTC 


TGGAAAGAGG 


GTTAAAATAT 


CAATCTTCAT 


CGTCTAACCC 


1800 


TTCTAAGATT 


TCCACATCGA 


CCCGTTTACT 


TGGAATATCA 


ACATTGAGAA 


CCACTGGTGG 


1860 


GATATAAGGT 


AAAAGCAAAT 


CACGTTTGCC 


TTTTCGTTTG 


ACCACCCAGA 


CATCATTAGC 


1920 


ACCTGGTTGC 


AGGATTTCCT 


TGATGGTTCC 


AACCAAGCTA 


TCACCCTCAT 


AGACTTCCAA 


1980 


ACCGATAATC 


TCGTGATAGT 


AAAATTCACC 


ATCGTCTAGG 


TCATTCAAAT 


CTTCCTCAGC 


2040 


GACCTTGAGA 


CTGT ATCCCT 


TGTACTTTTC 


GATAGTATTG 


ATATGGTACA 


TATCTTTGAA 


2100 


TTTAATAATG 


TCAAAGTTCT 


TCTGTTTACG 


GTGGCTAGCG 


ATGGTCACTG 


TTTGGACAAA 


2160 


CTGATCTTTT 


TCATCAAACA 


AAACCAGCTC 


AGCTCCTTTT 


TTAAACCGTT 


CTTCTGCAAA 


2220 


ATCCGTCACA 


GACAAGACTC 


GCATCTCCCC 


CTGTAATCCC 


TGCGTATTAA 


CGATTTTCCC 


2280 


AACATTAAAG 


TAGTTCATCT 


TGTCTCCTGT 


AATCTCCTTT 


TTTCCATCTT 


ATTCTAACAA 


2340 


TTCTCGAATA 


ATAGCCGCAA 


TTTTTTCCGA 


TTCTGACCAT 


TGTAAATAAT 


GGTGATTCCC 


2400 


TCCTAAAATG 


AGTTTAGTAT 


TGGAAGTCCA 


ATATTCTGAT 


TCTCTGTACT 


CTTTTTCTCT 


2460 


ATAAGGCTGA 


CAAAAAACAA 


ATACAGGAAT 


ATGAGCTTCT 


ATAGATACAT 


CCTCAAAATC 


2520 


TTCCTCAGTA 


ATCTCTCCAG 


ATATCTGAAA 


TTCTGGATCT 


TGATTTTCCA 


ACTCTAAGCC 


2580 


TTTTTCTTGC 


ATTAATTCCC 


AGATTTTTTT 


ATTCGTTTCA 


GGACTAAATG 


TTGCTTGAGT 


2640 


TAAGTTCTTA 


AAATAAAGTT 


CAGGACCACA 


CTCGTCAATC 


AGCCTCATCT 


GCTCTTCCAT 


2700 
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TTCTGGATAA CGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760 

TCCTACTAAA GTCTGACGCT TAATTGGTTT C7CGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCT7CAAGTA CTTCATAAAC 2380 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCCGAC TCCTACCTCT 2 94 0 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGCA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTT CAT AAA 3120 

G AAAT CC ACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGCT TTTGTCTTTC TCAAAGACAG 3 300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3 3 60 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGACCGCCTA 3 420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG AC CTTC ATC A 3 4 80 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA A TTGC CT AAA CGTTCAATCT 3 540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3 600 

ATTTGACTCT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TT TACT AT AT CTTCTATTCC 3 660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3 720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCG CAG AAA TAGTGCGACC 37 80 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA T TC AAATG AT ATTCCAAAAA 3 840 

TTCTGGTGTA TCCTCAATCT TG AT AG TT AA GGCATCTGGT TGTGAAATTA AGGGTTT C AC 3900 

AATCGCAATA ATCAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3 9 60 

AAAAT TT AG A ATCGTGCAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 402 0 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080 

CTTCGTTTTC AG CAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 414 0 

GTGGTGAACG TGAATCTGCT AC GTTG AT AC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4 200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4 317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4881 base pairs 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
(01 TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 125: 

AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATCCT TAAAATAAGG 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATAC A CGAAATCAAT 130 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 24 0 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 3 00 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 3 60 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAACCGCTGG AGATTCTTTA 4 20 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 4 80 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCACTTCTT GCCTTCTGGA 600 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 84 0 

TGTTGAAGGT CAAAGCAAGT ATCGCTTCTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900 

TAGAATTTGT GAAGAGTCTT CAAGTG7TCT TTGAAACGAG GCACTTCGTA ACGGTTT AC A 9 60 

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 102 0 

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 10 80 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTG AG A TGATGCAACG 12 00 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320 

TAAGCACCTT TTTGAGCATT T AC AAT AT AG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 14 40 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTG TTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 15 60 
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CCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 
ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 
TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 
ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 
GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG I860 
AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920 
ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 
AC AAAAT C GC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 204 0 

GTAGAGCAAC TCC CAGTTG A CCAATCAAGC CGACC GCTTT GGCAATGTAG TCCATATTTT 2100 
GTTGCATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAG CTGTT AA GAGCGTGATG GTTGCGATAC 2220 

CTCGTAGTGA C AATGG CAAA CAGATTTGGA AG AAAAT CCG GGCCTCACTG GCACCATCGA 2 2 80 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 234 0 

TCATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACACTG TCACCAAGCT 2 400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460 

GAAGGACGAA GATCCTAAAG AATCTGCGAT ACT T AAAGGT TGTCCGTGAA ATAGCGTAGG 2 52 0 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2 580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTCCCAA AAACTCAAAA CCCTCTAACC 2 64 0 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 27 60 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2 820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 28 80 

7CGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2 940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAAC GTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3 060 

AAGAGACCGA AGTCTG CGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCAT G A TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT AGAAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3 3 60 

CCTTTATCAA TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA AGACAAGAAG 34 20 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG G AGCTCG CTG 34 80 

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AG AT AAAG CC AAGATTGTAG 3540 

AGGACAGTAT TTCGTCTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3 600 

TCGAGTCCGA CCCATTTACT ATTTATCATA CTATCTATGA AACCATTACT GGTCATGTGC 3 660 

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3 7 20 

AGTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3 7 80 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 384 0 

TTGATAACGA TT AC ATT ATT AGTATACTCC TATTTGCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960 

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4 020 

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTC C AG ACT AGGG AACTC A GCGAT AG TTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 414 0 

TTAAGATTAT CAATCAACTC TG CTGCAGT A TGCTCAGAGC CTTTTTCATC TGCCAACAAC 4 200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGCA CATCCTTATC TACTGTTTCA 42 60 

AGGTTTGAGT CTTTAAGAAC TTTACTTAAT TCCTTGGCTA ATTTCTTGAG TTTGATTTGC 4 3 20 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 43 80 

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATC 4 440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTAGCCT CTGCCAAAAC 4 500 

CAGACGTGCT GAATCTGTGG TAG GATTCG A TACACCTGTC AATGATCTCA AAAGATTTTC 4 560 

TAAGGTTTGA GTCTGCTT A C TAATACTAGA CTAAAATCAA AAAG T ATT AT ATAACAGTGA 4 620 

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4 680 

AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTTGT CTGATGGGCA AATCTTATAA 474 0 

AG AG AT TAT A GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCACCAAAC 4 800 

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860 

TATACTTGGT GGTTTAGCTC G 4 881 
(2) INFORMATION FOR SEQ ID NO : 12 6: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND EDN ES S : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGT ATTGCC A AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 24 0 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 42 0 

CCCTTGCTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 4 80 

ACCGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 54 0 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA AT ATGC C ATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG CATGGTGAAG CTGTAGGAAC TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCC TTGTCT GAAATGAAAA 72 0 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGCCTAAA TATATCATTG 84 0 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTCG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 114 0 

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATCG TTCTTCAATG TATGACAGTT 12 60 

TAAACACTCG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA T ATTCT AT C A 132 0 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 13 80 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 14 40 

T T AAAGC AAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAA7CTTCAA 1500 

CTG CTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 15 60 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

C C ATTGT CAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT G ACGTT ATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 174 0 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCT ATC AC AGGCCAACAA AGCCCAATTA 1300 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 19 80 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATT ATC AT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TT G ACT AG AC 2160 

TAGCGAAACG CTT AGAAAAG ACGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 22 20 

TTTACACAAG C ACT ATGG AA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 22 80 

AGGAGATGTT GTTTGTATCA TCGCTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 24 00 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 24 60 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 264 0 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTCC 2700 

CCTTGACCCT GAGATGGTTG G AGACGT AC T T AACGTT ATC AAGGAATTGG CTGAGCAAGG 27 60 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 282 0 

TAT CT TT ACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2 880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 294 0 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC T AGG AAT AAA 30 60 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTCGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACT AT ATC AA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3 2 40 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3 3 00 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTC AATG A TTCGGAACAT TAT ATTGCTT 3 3 60 
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TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 34 20 

TCTTGT CT AT CGCACCTCAG TTCTTTGGAA CCATTGCCAA AC AC CT C AAA TCTGAAAACA 3430 
TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 
CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3 600 

GTATCGACCA TTATCTTGCT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3 660 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3 720 

CGGAGCGCTT CGGTGTAGAA GAACGTGGTG GCT ACT ATG A CCAATCCGGT GCCCTCCGTG 37 80 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 384 0 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3 9 60 

ATGGCATGAA ATACATCTCT T ATCGTAG CG AGCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 414 0 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACACAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 4 2 60 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4 32 0 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4 380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4 440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AG CC AG CTTT GACCTACTTG 4 500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA CATGGTCGCT 4 560 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TG ATT AG ATT 4 6 20 

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4 680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4 800 

TTCGAAAATT CTTTGGCCTG TACT AGGG C A TTTTGACCTG TTGATGCATC AAGTGCCAAG 4 860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 49 20 

TCAGCCATAA GGTTATCCTT ATTTT G C AG A CGACCAGCAG TATCAATCAT GAGAATATCG 4 9 80 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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CCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTCTTTGTAG 5160 

CGGTGGGCTA GTTTT CCG AT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 52 80 

AGCTCAACCA ATTTCTCAAT GATGAC AC G A CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGG CTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 54 60 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5 5 20 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GCAGCTTCTG GTTCAAGCAC TTCAGAATTA 5 580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5 6 40 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC C7GACTTTCT 57 00 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 57 60 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5 820 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5 8 80 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5 9 40 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTC ACT A CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTG CAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTG AC C ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCT GG AG AAA TCTTCTGAAT GGCTGCATCC 62 4 0 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6 3 00 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 63 60 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATCGCAT CAAGAGGCAG TGATAATTTC 642 0 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 64 80 

TCTCCTGTAT TTTTCTG AAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 654 0 

CCGTCAGTCC CTAACTCATG G AG AAAG AAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCC TCCAAGTCCA ATGCAATCAA TTTTATATCT 67 20 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 67 80 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840 

CCTGCATCAT ATCTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900 



WO 98/18931 



PCT/US97/19588 



873 

TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT ATCGACATAG TCCAGAACAA 6 960 

TATCAATGGA TTCAAAGCCA GTTGTCATGG CCTTAACACC AGGAACGTTT TCGTTTACCC 7020 

AAGCCTCCCC ATCTTCCAGC GT TT CTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7080 

CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT 7140 

TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 72 00 

GTTTATTGAT ATCTACATAA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 72 60 

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCT ACC AAA CTGCCATTTT 7 3 20 

CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA GACAGAAATC 7 3 80 

CCCGACCCGA AGCTACCGCA AAGT AAATC C CTTTTTCCTT GTAGGAAACC AAG AG AG AC T 74 4 0 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7 5 00 

CT ACT AGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7560 

CTTTGAAACA CCCGATTCTT GCATGGTCAC TCCATAGATG GAATCAGCCG CTGC C ATGGT 7 6 20 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCTGTCCTTG TCAAAGCGGT TGAGGTAATC 7 680 

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 774 0 

TGGAATAGTC TTGACACGAA TAATGGACAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7800 

ACCACCACTC ATGAGATTAA GAGACTGGAT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7 8 60 

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7920 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC T C AAAGGTTG AT TT AAAGCG 7 980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAACG AGCAGGTTTT TCGCAGACAA 804 0 

AATATCATCA CGTTGGCTAT TTAGGAAATC CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATACCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 82 20 

GATCTGGTAC TGGTCTGTTA AT TG ACT TTG TAGATGGCGC AAGCGCTCGC TAACCTTTTC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCCACCCTG 8 34 0 
ATCCAAATGA CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT . 8400 

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460 

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 8520 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTTAAGCAAA CGAGCTTGCG CTAGCTCTTC 8640 
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CTGCAAGTTT 


TGATAGCGTT 


CTTGGATGGC 


ATTTTTGTTA 


GACTTAATCT 


CTTCAATCTC 


8700 


AGCTTCCAGA 


TTTTG CTTGT 


C ACTGG AG AT 


TGCAGCAAGA 


CGCTCTTGGC 


AGTTTTCCTT 


8760 


ATCCGCTTGC 


CAATCTCCCT 


CGGAAAGACG 


ATCTATTTCC 


TCTTCTTGGA 


GTTTCCAAAG 


8820 


AGTTTCCAGT 


TCTTCAACTT 


GCTGACTAGT 


TTGCTGATAA 


GCGAGGAACA 


AGCCTTGCTC 


8880 


CTGAATACGT 


GCCTGCTCTC 


CTTGAGATTT 


AATAGCTTCT 


AATGACTCGG 


TCAATCTGGC 


8940 


CATCTCATCT 


TGCAAGGTCT 


TCAAAGTCGC 


CTCTTCTG AA 


CCCAAGCTTG 


CTTCTTCTTC 


9000 


AGCAATTTCT 


TTTTGTAATT 


GCTCCAGTTC 


TGGCTTGATA 


AAAATGCTCT 


TATTCTGGCG 


9060 


ATTGGCACCA 


CCTGCATAAG 


AACCACCTGT 


GCGCAACTCT 


GTCCCATCCA 


ATGTCACCAT 


9120 


ACGAACCTGA 


TAACGAACTT 


GGCGAGCTGC 


TGCACGCGCA 


TGTTCTACGG 


TATCAAACAT 


9180 


AGCCGTCGTA 


GCTAGCAAGT 


TCTTGAAAAT 


GGCTTCCAGT 


CTAGTATCAA 


AAGTCACCAA 


9240 


CTCATCTGCC 


ATCCCAAGGA 


AACCTGGGCT 


TACAGCGATA 


GCATCTTGGT 


TCTGACTAGA 


9300 


AATCGTACGC 


GCCTTGATAG 


TCGTCAAAGG 


AAGAAAGGTT 


GC ACGACCGG 


CTCTGTTCCG 


9360 


TTTAAGGAAG 


TCAATAGCCT 


TGGTTGCCGA 


CTCTTCATCT 


TCTACGATGA 


TATGCTGGCT 


9420 


ACTTGCCCCT 


AAGGCAATCT 


CTAGGGCAGT 


TTGATAATAA 


ACATCAAAGG 


TCAGATGCTC 


9480 


ACTGACTGCA 


CC AATAATCC 


CACCTAGGCG 


ATCTTTTTCT 


TGGAGAACAC 


TCTTAACACC 


9540 


TGCATAAAAC 


TTACTATGAT 


TTCTCAGGAT 


ATTTTCCAAA 


CTTTGAGCTC 


TGGC CTGCTT 


9600 


GTTTTTGAGA 


TTATCCAGAC 


GGTCAAAGAG 


TTGGCTTTGT 


TGAGCTTGAT 


AGGAAGTTTT 


9660 


CTGCTCCTCT 


TGCTCCTTGG 


CAATAGCTTG 


GTAGTCAGCC 


AATAATTTCT 


GAACCTCCTC 


9720 


CTTGGCACTT 


TCAAGCTCTT 


CCTTTTGCTG 


ACTAGCCTTC 


TCTTTAGCTA 


TAGCTAATTG 


9780 


CTCTTTCAGC 


TTTTCTAGTT 


GATCTGCTTG 


TTTTTGAGAA 


AGCTGACGAC 


TATTTTCCAA 


9840 


CTCATTCTCA 


ATACGGGTCA 


ACTGGTTTGA 


GACATCCGCT 


TCTTCTTGTA 


AAAGACCTAC 


9900 


AAAGCGTTCA 


CGTAAGAGCT 


CAATCATCTG 


ATCAGGATCG 


TCTGAGAAAG 


CCAGCAATTC 


9960 


AGCTTCTAAA 


CGATTGAGTT 


TTTGATTATT 


TTGGACTAGA 


TTTCCCTCTA 


ACAGAGCTAA 


10020 


AGAGCTTTCT 


TTATCAGACT 


TTTCTTTGCT 


GAG TG AATTT 


CTCTTATCCT 


CCAAAGCAGC 


10080 


CAAACGGGCT 


TGTGCCTCCT 


GTTGATTCAA 


GGCCACTTGC 


TCGGACTCCA 


GTTTCGATAG 


10140 


GGCTAATTTT 


CTTTCTAAAT 


CACTAATCAG 


ACT AGT C AAG 


TCCATCAAAC 


TGCCTTGGTC 


10200 


TTTGGCCATT 


TCAGCCTGTA 


AATCTTGGCG 


TTGCTTTTTA 


AGAGTTTGAT 


TTTCTTCTTC 


10260 


TAATTTTTCA 


CGCTTTTGGT 


AATAACTCAT 


CAAGAGTTCT 


TGAACCTGAG 


TCAACTCTTC 


10320 


TTCTGTCGAC 


TCTAGTTCAG 


CCTTATTTTC 


CTTGATTTGA 


GCAACCAGAA 


CATCTAAATA 


10380 


AATAGCCTTA 


CGTTGTCCTT 


CCAAGTCTAA 


AAACTTACGG 


GCATTCTCAG 


CTTG CTTCTC 


10440 
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AAGAGGCTTG 


ATTTGATTAT 


CCAACTCGTA 


GATAATGTCC 


TCTAAGCGGT 


CCAGATTATC 


10500 


CTGAGTTTGC 


TGCAGTTTAC 


TCTCGGTTTC 


TTTTCTGCGA 


GTCTTGTATT 


TTAAAACTCC 


10560 


AGCAGCTTCT 


TCAAAAATAG 


CTCGTCGTTC 


CTCAGGCTTG 


GAATTAAAAA 


A W 1 W W- 1 W^W\W 


10620 


CTTCCCTTGG 


GAAATAATAG 


AGAAGGAATC 


TCGTCCCAAT 


CCAGTATCCA 


AGAAnAfiGTf 1 


10680 


ATGAATATCA 


CGCAGACGGA 


CTTTCTTGCC 


GTCAATCTTG 


TATTCGCTAT 




1074 0 


AT AG AC ATGG 


CGTTCCACCC 


TGATTTCTTG 


ACCTGCATCC 


TTGATAAATC 


VJ X V* *V X VIA X X 


10800 


ATCCAGAGTC 


ACAACTACAG 


AAGCATAATT 


GAGCGGTTTG 


CGACTTTCGG 


TTCCAGCAAA 


10860 


GATGATATCC 


GGCATCTTGC 


CCCCACGGAG 


ACTCT TG AC A 


CTAGACTCCC 


CCAAAGCrrA 


1092 0 


ACGCAGACTT 


TCTGTAATAT 


TGGACTTTCC 


AG ATC CATTG 


GGT CCAACAA 


w 1 vtwwvi 1 v«/\w 


10980 


ACCTTGGTCA 


AAAACGACCT 


TGGTCTTATC 


AGCAAAAGAC 


X i VJ/VW^C^i. X 


111 w vjrV 1 


11U4U 


TTCCTTTAAA 


TACATGAATC 


CAGCCCCTTC 


TCAACGGCAT 


XXIX \*J Vw£v_ AVJ\« 


1 1 ww I V»v_ 1 \_ 1 




GCTAATTTCT 


TAGAACGACC 


T TGGCCTTG A 


CCGATGCTCT 

V* WUA X WW X k 




>v\0/W\wi 1 1 w 1 




ACATCAAAAA 


CCTTATCGTG 


AGCAGGCCCT 


GTTTCAGAAA 


TCACCTGATA 


AC* f5 AAT C C 




ACATCACCAT 


TGACCTGAAG 


CAACTCTTGG 


AGATGGGTTT 


TATACTCTCT 

X r\ X rWJ 1 L 1 \J X 


A ATP ATTT P A 


1 1280 


AACTCGCCTG 


CTTCAACCTT 


AGGAATCATG 


ACTTGATAGA 


TAAATTCCTT 

X r^r\£\ X 4 %«»V^ X X 


vj^\w w i- l uuuv 


i i Tin 


ACATCCTTAT 


CCAAAAGAAG 


GGCACCAAGA 


AAGGCTT C AA 


AGGCATCArr 


AAGAATnGTr. 


1 1400 


TCACGATTGC 


GACCACCTGA 


TTTTTCTTCC 


CCTTTACCCA 


ACTTGATAAA 

*V^w, x X X r\f\£% 


fTGGTCAAAr 


1 1 4 fin 


TGGCAATCAC 


GCGCAAAACC 


AGCTAAACTC 


TCCTCACGGA 

* WW i W«^W\JW?f^ 


CAAT PATAGC 




I 1 CTft 

II j«u 


GATAGGTCAC 


CTTCAGGCTT 


TTTAGGATAT 


TTTTTATATA 


GATATTCTGA 


AATCAATAAP 


1 1580 


TGTAGAACAG 


CGTCTCCTAA 


AAATTCCAAG 


CGTTCATTGT 


GTGAAATTTT 


TAAGAGGCGG 


11640 


TGCTCATTCG 


CATAACTCGT 


ATGAGTAAAG 


GCAGTTTCCA 


GTAACTTTTT 

W X AnW X X X X X 


GT CTGCAAAT 


117 00 


TCGATTGCAA 


AATGATTCTT 


TAGTACAGTT 


TGTAATTCTT 


TCATACCAAC 


CTCTTTCTAA 


11760 


CTGATAATAG 


TCCTTTTTAT 


TATATCAAAA 


AAAGCCCCCT 


GAGTCACTCT 


AAAACGGGAC 


11820 


TGGAAAGCAT 


TTGGGAATTC 


TTTAGACAGA 


GATTCTCAGT 


TTT AGCGG C A 


AATTTGGGTC 


11880 


AGGATAAAGA 


AAAAAGCCCT 


ATTAAAGGCT 


TTTTAGGATG 


TTTACATCCA 


CCCTGAGGGA 


11940 


ATCGAACCCC 


CATCTCAAGA 


ACCGGAATCT 


TACGTGATAT 


CCATTACACT 


AAGGGTGGAA 


12000 


ACTTGTTTTA 


TTATAACAGA 


AATTTGCTCT 


AATAACAAGT 


TTTTTGGTCA 


AAGACCCCGT 


12060 


CTTAGTGGGA 


AGCATCCCCA 


TTCCAGATGG 


AGTTTTTCAC 


GATCACATAA 


TCAACGTGTT 


12120 


f AAGGTCAGC 


AACCTGACGT 


CCACCTGCAT 


AAGAAATAGC 


ACTTTGAAGG 


TCTTGTTCCA 


12180 
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TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12 24 0 

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300 

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATCTG TCCTGCAAAG AGGGAACCAA 12 3 60 

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA CTACGAATTC 12 420 

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12 4 80 

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540 

TTCCGACCTT AGTAGCATCC GCACC AGC AT TTTCCAATTC ACCCACAGCT TCTCGTCTTC 12 600 

CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12 660 

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 12 720 

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 127 80 

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840 

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12900 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12 9 60 

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTCC AATCAATTGA ATATCTTCGT 13 020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTG T AATGACCTAC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 
(2) INFORMATION FOR SEQ ID NO : 12 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 7: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACACTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 2 40 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACCT 3 00 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TCGAGTTGAT ATTCCAAATG 3 60 

AC AAACGC GT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 4 20 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 4 80 
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AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 54 0 

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC AT AC CGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTC7TCC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720 

AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAAT C AGTTG AAGTT ACTGT 96 0 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 102 0 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCCCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 114 0 

CCACTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 12 00 

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 12 60 

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 13 20 

GTTTCACACA GTTCCAGGTG TTCGTGAAGA CCTGATCCAA ATCATTCTGA ACATTAAAGG 1380 

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATC AT C GAACTGGATG TTGAAGGTCC 14 40 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTT AAC AG 1560 

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 162 0 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT TC G AC AAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 174 0 

TATTCCAGAA GATGCTTTAG CGCTTTCAGC ACGTATTTTG ACAGAACATC TTCATTTCTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC 18 60 

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 192 0 

CTGTTTAAAA CCTGCCGGTA TCAAT ACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 19 80 

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2 040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTG AT C 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2 220 
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ATCATTACTC 


TAGGTAAACG 


TGGTGATTTG 


CATGCACGTC 


GTCAAGCAGC 


TGCTTTCGTA 


2280 


CGTAATGAAA 


TCGCATCTGA 


AAACTATGAT 


GAAGCAACTG 


ATAAGTACAC 


TTCTACTACA 


2340 


GC ACTT C AAA 


AATTGTTCTC 


AGAAATCGCA 


CCTCGTTATG 


CTGAACGTAA 


CGGTGGATAC 


2400 


ACTCGTATCC 


TTAAAACTGA 


ATCACGTCGT 


GGTGATGCAG 


CGCCAATGGC 


GATCATCGAA 


2460 


TTAGTATAAA 


ATCATCAATT 


TTGTTGAGTG 


TTATGATGAT 


GGAGTCTTGT 


GCTCTTAGTC 


2520 


TAGCTCTGGT 


CTACCGCTAG 


GATTTCGGTC 


CTAGCGGGAA 


CACTCATCAT 


AAGTTGGGAT 


2580 


AGTAGACGCT 


TGTTTACGAA 


ATTGTTTTTT 


TCTTAAGAAC 


AACTTCGTAA 


GCAGGCGTTT 


2640 


TTGAGTATTT 


TCGTTAGAAT 


TATGCTATAC 


TATTTGAAAA 


GAATCCTGTT 


TAATGTTAAG 


2700 


GTTTCTTATT 


TTAAGAAGAA 


TTGGAGTTTA 


CTTATGAAAG 


CC ATT AT AAC 


TGTTGTTGGT 


2760 


AAAGATAAAT 


CTGGAATTGT 


TGCAGGTGTT 


TCTCGTAAAA 


TTGCAGAATT 


AGGATTGAAT 


2B20 


ATTGACGATA 


TCTCTCAAAC 


TGTCTTGGAT 


GAATATTTTA 


CG ATG ATG G C 


TGTTGTATCT 


2880 


AGTG ATG AAA 


AGCAAGATTT 


TACCTATCTT 


CGTAATGAAT 


TTG AAGCTTT 


TGGGCAAACT 


2940 


TTGAATGTAA 


AAATCAATAT 


TCAGAGTGCA 


GCGATTTTCG 


AAGCTATGTA 


TAATATCTAG 


3000 


GAGGTCATCA 


TGGATATTAG 


ACAAGTTACT 


GAAACCATCG 


CCATGATTGA 


GGAGCAAAAC 


3060 


TTCGATATTA 


GAACCATTAC 


CATGGGGATT 


TCTCTTTTGG 


ACTGTATCGA 


TCCAGATATC 


3120 


AATCGTGCTG 


CGGAGAAAAT 


CTATCAAAAA 


ATTACGACAA 


AGGCGGCTAA 


TTTAGTAGCT 


3180 


GTTGGTGATG 


AAATTGCGGC 


TGAGTTGGGA 


ATTCCTATCG 


TTAATAAGCG 


TGTATCGGTG 


3240 


ACACCTATTT 


CTCTCATTGG 


GGCAGCGACA 


GATGCGACGG 


ACTACGTGGT 


TCTGGCAAAA 


3300 


GCGCTTGATA 


AGGCTGCGAA 


AGAGATTGGT 


GTGGACTTTA 


TTCGTGGTTT 


TTCTCCCTTA 


3360 


GTACAAAAAG 


GTTATCAAAA 


GGG AG ATG AG 


ATTCTCATCA 


ATTCCATTCC 


TCGCGCTTTG 


3420 


GCTGAGACGG 


ATAAGGTCTG 


CTCGTCAGTC 


AATATCGGCT 


CAACCAAGTC 


TGGTATTAAT 


3480 


ATGACGGCTG 


TGGCAGATAT 


GGGACGAATT 


ATCAAGGAAA 


CAGCAAATCT 


TTCAGATATC 


3540 


GGAGTGGCCA 


AGTTGGTTGT 


ATTCGCTAAT 


GCTGTTGAGG 


ACAATCCATT 


TATGGCGGGT 


3600 


GCCTTTCATG 


GTGTTGGCGA 


AGCAGATGTT 


ATCATCAATG 


TCGGAGTTTC 


TGGTCCTGGT 


3660 


GTTGTGAAAC 


GTGCTTTGGA 


AAAAGTTCGT 


G G AC AG AGCT 


TTGATGTAGT 


AGCCGAAACA 


3720 


GTTAAGAAAA 


CTGCCTTTAA 


AATCACTCGT 


ATCGGTCAAT 


TGGTTGGTCA 


AATGGCC AGT 


3780 


GAGAGACTGG 


GTGTGGAGTT 


TGGTATTGTG 


GACTTGAGTT 


TGGCACCAAC 


CCCTGCGGTT 


3840 


GGAGACTCTG 


TGGCACGTGT 


CCTTGAGGAA 


ATGGGGCTAG 


AAACAGTTGG 


CACGCATGGA 


3900 


ACGACGGCTG 


CCTTGGCCCT 


CTTGAACGAC 


CAAGTTAAAA 


AGGGTGGAGT 


GATGGCCTGC 


3960 


AACCAAGTCG 


GTGGTTTATC 


TGGTGCCTTT 


ATCCCTGTTT 


CTGAGGATGA 


AGGAATGATT 


4020 
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GCTGCAGTGC 


AAAATGGCTC 


TCTTAATTTA 


G AAAAACT AG 


AAGCTATGAC 


GGCTATCTGT 


4080 


TCTGTTGGAT 


TGGATATGAT 


TGCCATCCCA 


GAAGATACGC 


CTGCTGAAAC 


TATTGCGGCT 


4140 


ATGATTGCGG 


ATGAAGCAGC 


AATCGGTGTT 


ATCAACATGA 


AAACAACAGC 


TGTTCGTATC 


4200 


ATTCCCAAAG 


GAAAAGAAGG 


CGATATGATT 


GAGTTTGGTG 


GTCTATTAGG 


AACTGCACCC 


4260 


GTTATGAAGG 


TTAATGGGGC 


TTCGTCTGTC 


GACTTCATCT 


CTCGCCGTGG 


ACAAATCCCA 


4320 


GCACCAATTC 


ATAGTTTTAA 


AAATTAAGAA 


AATAGGAGAA 


ATTTTAAGTT 


CTATTTAAGA 


4380 


TTAGACGTGT 


ATACTATAAT 


CATTAAATAA 


AGACCTCCTA 


ATATTATTTG 


AAACAGATAA 


4440 


CACTGAATTA 


CTTTGAATTT 


G ATT TT CATC 


TAATATCTTT 


ATTTAATGAA 


CTC C T AAA CT 


4500 


TTTTCATAAT 


AATCTCCTTC 


AAAAGTCGCC 


TGTATGGCTG 


G CTTTT ATTT 


TATCATTCAT 


4560 


GATATAATAG 


AAGCAAACGG 


AGGACGGAAA 


ATCGTAAAAG 


TACGATTGTA 


TTTGGTACGT 


4620 


CATGGCAAGA 


CCATGTTTAA 


CACGATTCGT 


CGCGCGCAAG 


GTTGG AG CG A 


TACTCCCTTA 


4680 


ACTGCTGAAG 


GTGAACGAGG 


GATTCAAGAG 


TTAGGAATCG 


GTTTGCGAGA 


ATCTGATCTA 


4740 


CAGTTTGAGC 


GTGCTTATTC 


GAGTGATTCT 


GCTCGTACCA 


TTCAGACCAT 


GGGAATTATC 


4800 


CTTGAAGAAC 


TTGGCTTGCA 


GGGGGAAATC 


CCTTATCGCA 


TGGACAAGCG 


TATCAGAGAA 


4860 


TGGTGTTTCG 


GTAGTTTTGA 


TGGAGCCTAT 


GATGGCGATC 


TTTTCATGGG 


CATTATTCCT 


4920 


CGTATCTTTA 


ATGTGGACCA 


CGTTCACCAA 


TTGTCTTATG 


CTGAACTGGC 


TGAGGGCTTG 


4980 


G T AG AGGTCG 


ATACAGCTGG 


TTGGGCTGAA 


GGCTGGGAAA 


AACTCAGTGG 


CCGAATCAAG 


5040 


GAAGGCTTTG 


AAATGATTGC 


AAAAGAAATG 


GAAGATCAAG 


GTGGAGGTAA 


CGCCCTTGTT 


5100 


GTCAGCCATG 


GAATGACTAT 


TGGAACCATT 


GTTTATCTCA 


TT AATG G CAT 


GCATCCGCAT 


5160 


GGTCTGGATA 


ATGGTAGCGT 


GACAATCCTT 


GAATATGAGG 


ACGGCC AG TT 


TAGGGTTGAA 


5220 


GTTGTCGGTG 


ACCGTAGTTA 


CCGAGAGCTA 


CGACGTGAGA 


AGATGGAAGA 


AGGCTCTATT 


5280 


TAATCAGTCT 








i AAUj AA 1 AIL 


AAGATAAGAA 


53 40 


AAAAC AG C CG 


AGGGCACTCC 


TTTCGGCTGT 


TTTTGATGTG 


GAAAACTAAA 


GTGTAATGCT 


5400 


ATTGCTTTTA 


GAGATTTTCA 


TAAACAAGAG 


CAAGGAACCT 


ACTGTT AG AA 


CAGTCAGGAT 


5460 


AGTTGACAAG 


GTTGCGGCTA 


CACCGTAATT 


T CCTCTG AG A 


ACCTCTCTAT 


AAATAGCTAC 


5520 


ACT C ATTGT T 


CTTGTTTTGA 


CATTGTAGAG 


GAGGATAGAA 


GTAGAGACTT 


TTGAAATCAT 


5580 


TGTGACTCAA 


GATAAGATGG 


CTCCAGAAAT 


GATACCAGAT 


AGCATCATTG 


GAGTTGTAAT 


5640 


CTTAGCAAAG 


CTATTGAGAC 


GACTACTTCC 


TAAGCTTTCA 


GCACCTTCTT 


CAATACTTGG 


5700 


TGCTATTTGT 


TGTAAGCTAG 


C AACAG AT G A 


GCGAATAGTA 


TAAGGTAATC 


TTCTGGCAGA 


5760 
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TAG AG AC ATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5 820 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGC 58 8 0 

TACCATACTG AGGCTGTCAA TTAAG TTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 59 4 0 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAACGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060, 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 612 0 

T AAAAC AG AG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAACC 63 00 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGCGA AAGTCCGATA 63 60 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTCCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 64 80 

CCGAAGGACC C CATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 654 0 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTC TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATC 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 67 80 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GCAAACCTTC 684 0 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGG AT AAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 708 0 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTT TGT AG AT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7 2 00 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATT C ATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 73 20 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7 380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 74 4 0 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7 500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7 5 60 
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TCCCACTTCG 


TGT TGG ATTT 


CT CGG ATGGC 


TTPAPP.PATA 


TPPAAP.PPAA 


f TTTP P P P*T*P 


/ o 4. u 


CAGATTACTA 


AGTGGCTCGT 


CCATGAGGAG 


AAPAPTTP.P.A 


TT A A (TCfT A 
I 1 AALLUL 1 A 


ft pp PPP ATPP 


"7 £ Q A 
/ O O U 


CAAGGTGACA 


CGTTGTTGTT 

VwU 1 1 vj 1 1 U 1 1 




VjMLj 1 1 i n L Hj 


P.PPTTTPP A T 


PPP P IV. R TTP 


/ / q U 


AGCAATTTGC 


ATGAGTTrAA 


CZ A T A (""TTCITT 
un 1 AL 1 1 vj 1 1 


OLi 1 L 1 L? 1 1 Kj A 


ATPA UTTrw 
A1L.AA1 I L IT 


CTTTTGGAAC 


7 800 


\ * \_ 1 1 L 1. VjVp 


ATAAflAPPAA 


AAfifAAPGTT 


PTPTPPPAPA 


PTPA a ATPTP 
"o 1 L. AAA IL-lo 


PPJV TV IV JV^TVPP 

ggaaaatagl 




U X vJ 1111 VJO 


AAAAPfATrr 




I X I k_>L 1 tjuu I 


1 LLA T A IT AT 


TGATTTTTGT 


7 9 20 


a\ i v_,/\ i Lunnu 


T A A A iTT , r v pr 




AL 1U1 1 UAAA 


C- LTGC AATC A 


^ TV PP Tl IV P IV A P 

T ACG AAG AAG 


7 980 


1 \-U 1 1 1 l V 


t-L AL.A x ^. v_ ». Vj 


& apptpp a ap 

onbL ILLAAlj 


AAL»L*L» I AAA(j 


AOACTTCCTT 


TTGGAATTGT 


8040 


A AT CTTr AAA 


TTr^P A AT A A 
i 1 L . V.AA 1 A/\ 


rapr.r.APATr 


HTf.PT IP ATT 
u 1 L»Lj 1 A\j/\ 1 1 


III. rTOQaCGT 


TAATAATTTT 


8100 




A 1 Mo 1 MALL 


T' »* IVI IN | Wl V » P^T^P* 
1 w 1 A 1 1 AL 4 <-J 


TTTAGATTGG 


ATATCTGTAA 


TV P TV wnPP fTS ^ 

AG ACTTCGT T 


8160 




ALL»A I ni V. 1 vj 


ATT T ATTCTT 


L»A 1 LiALA I AA 


I L.ATAATCTT 


CAGTGAGTGT 


8220 


TTTO A TTTTH 
X 1 lun 1 1 * 1 vj 




1 LA lul I L 1L 


GCTTGTTTTA 


GCATTTTTAC 


G AAC AG G ACG 


8280 


ui i nvj X Av> 1 vj 






TACTTCTTGA 


CjACjATAATAA 


AATCGATAAA 


83 40 


1 1 1L1 1 VjAjL A 


TTTTCCATAT 


TTTT AG ATTT 


TTT & A PP Jl T» TV 

111 AALLjATA 


GCAGCACTAG 


CAGGTAGGAA 


8400 


V-J^%^— VJ\J 1 1^,1,1 


1 L i 1 11 uun 1 


nunt. 1 av. L. 1 1 


AA I O i 1 AbL J 


pp /""i^^ r iiwr^y iv 
LLLiTlATTTA 


AGAGTTTAAC 


84 60 


Tfi f^Tf^/^ a t 1 p*t 

1 V-»v_ l VjvjA 1 L i 


1 LA 1 AnunbA 


rArrAArArr 
u AL I- nnL AL* L 


p tv TTTPTPP a 
LA 1 1 1 L 1 LL A 


T»P *. PPP IV PW IV 

TCAGCGACTA 


CTTT AT AG AC 


8520 


Ml— i AL»A a vj AA 


L 1 lljAALLLrA 


TTTT A PP ATP 
till ALL A I L 


& A T* TV R PTPT*P 

AA I AAL» I vj 1 O 


AAAAGATC I 1 


TT AC AT AAG A 


8580 




TP AT/ "PTTP.T 
1 LA 1 L 1 1 1 vp 1 


a APPiPPTTr 


* PPTT'PT A PP 


A T IV ^f^^PTI* TV 


«v mmp fv P P iv iv n 

ATTGAGCAAA 


8640 


GGCnfTAflAA 


uAL> 1 I > \jL 1 Vj 


LL» 1 LAbLA^j 1 


TP PP A TT*T*PT 


p/ ,, T^^'i*'n tv rf" | fiHii 


P IV P*' " ^^P ^ > 

CAGGl i TGAA 


8700 


AAGATCCTTA 


T A TrY"TT f^P! A 


lull l_A 1 OLL 


TTT APTT AAA 
111 no i i AAA 


TP A r^PPTTP » 


PP IV TT IV TV A TV P 


87 60 


ACTACCA^CT 


AGTGTATAAG 

r\\9 X vj 1 /*. I 


P.AGTAP.AGTA 


flPPlPTTTP 

VjV_\_ 1 * U 1 o 


TTTTP ATA TT 
1111 OA 1 A 1 1 


PTTTP A T A A P 
LH i OA 1 AAL 


a a -I u 


ATT ATC ATTT 


TCTTTTGAAG 


TATAGTTTTC 


AAAGAGTTCT 


CCGTGGGTAG 


TATATTGTGT 


8880 


ATAAGAACCA 


CCAAAGATAA 


CATCAGCTAC 


AGGAACTTCT 


TTTTCTG ACT 


CTAGTTTTTT 


8940 


GAAAAGTTCT 


CCAGTACCAG 


CTTGAATCAG 


TTCTACTTTG 


ATACCATATT 


TTTCTTCAAA 


9000 


GGCAGGAATA 


GTTGCTCCAA 


TTAAGCCCTC 


TGAGTTTGGT 


GAATAAACGA 


CTAGCGAACC 


9060 


GCCGTCTCCT 


TTATCAGATG 


AACTGTCATC 


GGCAGATTCA 


TTAGAAGAAC 


AAGCAGCATA 


9120 


ATACATCCAT 


TTCTTTTTCA 


TGATGGATAC 


CTCCGTTGTG 


TTATTTAAGT 


TTATTTTAAA 


9180 


ACAATGTAAG 


CGTTTTTAAA 


ACATACAATT 


CTATTCTATA 


GTGTATTGAA 


TCT AT AAC AG 


9240 


TACACTTTGA 


CTGCTAAAAT 


ATTTCTATAA 


ATTAATTTGA 


CTTTCCTGAT 


AGAGATGTTC 


9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9 360 

TTTTACTATT CTTCTG AGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9 4 20 

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TG T ATCGG AG AATGCGCGTG 9480 

AGGT CAC AAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGG AG A GATTTTTTAA 954 0 

AGGAATGAGA CATCCGCTAC CTCC TTGGAA GGTTTTTG 957 8 



(2) INFORMATION FOR SEQ ID NO: 128: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
CGCGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGCT 



GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 
TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 
ATGCATATCA ATGCTCCCAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCCTCTC 240 
ATTCCAGGGA TTTTCATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 
GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCCCT AGGGATTCGT 360 

CC T AAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG T C AAAAAT AT CTTGC C AGC A 4 20 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 4 80 

ATGGAGTTGT GGAATGGGGC TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATCAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG ACAAGTTGTC CTTATCATCC GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 7 80 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC AT GG TTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 102 0 

TTGGTTTGCC AGATAAGGCA GACGCTTATC C ACAG AGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATCT TTTCCTCTTT GACGAGCCAA 1140 
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CTTCAGCCCT 


' AGATCCTGAG 


1 ATGGTTGGAG 


! AAGTTCTGGC 


TGTTATGCAA GATCTAGCCA 


1200 


AGTCAGGAAT 


GACCATGGTT 


1 ATCGTAACAC 


ATGAGATGGG 


ATTTGCCCC? 


1 GAGGTGGCAG 


1260 


ATCGTGTCAT 


CTTTATGGCA 


GACGGTGTGG 


TTGTTG AAG A 


CGGAACACCT 


' GAGCAGATTT 


1320 


TTGAACAAAC 


CCAAGGACAA 


AGGACTAAAG 


ACTTCTTGAG 


T AAGGTTTT A 


. TAAGTTAGCT 


1380 


TTGTTTAGCT 


ATTTGTAGCC 


AGCTTTAAAC 


GTTAAAGAGA 


AGATTAGTGA 


AAAGCTCAAC 


1440 


CAGAGCTTTT 


TCTTATAGTT 


TAAAGCTATA 


GGATTGCCTA 


GGAAAGAAGT 


GTTAGAGCTA 


1500 


CATTCTATTT 


TTTGCTA7AA 


TTAAAGATAT 


TTGT AAG AAA 


AG AG AAG TG A 


T ATG AC AC AG 


1560 


ATTATTGATG 


GGAAAGCTTT 


AGCGGCCAAA 


TTGCAGGGGC 


AGTTCGCTGA 


AAAGACTGCA 


1620 


AAATTAAAGG 


AAGAAACAGG 


TCTAGTGCCT 


GGTTTGGTAG 


TGATTTTGGT 


TGGCCACAAT 


1680 


CCAGCCAGCC 


AAGTCTACGT 


TCGCAACAAG 


GAGAGGTCAG 


CCCTTGCGGC 


TGGTTTCCGT 


17 4 0 


AGCGAAGTAG 


TACGGGTTCC 


AGAGACCATT 


ACTCAAGAGG 


AATTGTTAGA 


CCTGATTGCT 


1800 


AAATACAATC 


AGGATCCAGC 


TTGGCATGGG 


ATTTTGGTTC 


AGTTGCCATT 


ACCAAAACAC 


18 60 


ATTGATGAAG 


AGCCGGTTCT 


ATTGGCTATT 


GACCCAGAAA 


AGGATGTGGA 


TGGTTTCCAT 




CCTCTAAACA 


TGGGGCGTCT 


TTGGTCTGGT 


CATCCAGTCA 


TGATTCCTTC 


GACACCGGCA 


1980 


GGAATTATGG 


AAATGTTCCA 


TGAATATGGG 


ATTGACTTGG 


AAGGTAAAAA 


TCCAGTCGTC 


2040 


ATCGGTCGAT 


CCAATATTGT 


CGGAAAACCT 


ATGGCCCAGC 


TTCTTTTGGC 


AAAGAATGCA 


2 100 


ACAGTAACCT 


TGACTC ACT C 


ACGTACTCAT 


AATCTTTCCA 


AGGTGGCTGC 


AAAAGCAGAT 


2160 


ATTCTGGTTG 


TTGCAATCGG 


TCGTGCCAAG 


TTTGTCACTG 


CTGACTTTGT 


CAAACCAGGT 


2 220 


GCGGTAGTCA 


TTGACGTTGG 


GATGAACCGC 


GATGAAAATG 


GTAACCTCTG 


TCGGCATGTT 


2 280 


GATTATGAGG 


CGGTTGCCCC 


AC TTGCT AGC 


CACATTACGC 


CAGTCCCTGG 


AGGTGTCGCT 


2 340 


CCTATGACCA 


TTACTATGCT 


GATGGAGCAA 


ACCTATCAGG 


CAGCACTTAG 


GACATTGGAT 


2400 


AGAAAATAAG 


ATAAlifiTTT 
n. 1 rWu\s\ ill 




vj ruTATTTTC 


TAT AG CT AT A 


TCTAAAATGA 


2460 


TAGAAATGAA 


TATTAAATTT 


TAGAAATAAG 


TTT AT AAAAG 


GAGGTTTGCG 


CCTCCTTTTT 


2520 


GTTGTATAAT 


GGAGTGAGGT 


GATTAGATGA 


TTTTAAAAAT 


TTATAATGGG 


GAATATAGTT 


2580 


TACAATGGGA 


TGGAATATAC 


TACTTAGCAC 


TAATTGATTA 


TCCAAATATT 


CAAGAGTGGG 


2640 


AATTAGAAAA 


AATTGCTAAA 


TTTATAGCTT 


ACGAAAAACT 


TCATAAACGT 


CAAACAAGTA 


2700 


TTGAGTGTGC 


TGATTCTTGT 


TTAAAAAAAG 


AAATTTT AG A 


TTACATCTGT 


CAGCATCCCT 


2760 


TTCTGCCACC 


ATTTACTCCT 


ACAGATAAAA 


GAGTAGCCTC 


GACTTATGAC 


CTACATAAGA 


2820 


GGTTAGTGAC 


TTCAGACTAC 


TGTAGTCATA 


CTACGACTAT 


AGATGCAGCG 


ATTTCTATTT 


2880 
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TTAAAACTGG 


TCGTCTTTTA 


TCTGCTGTGA 


AAGCCTTTGG 


GCGAGATGCT 


GAGGAGTTGG 


2940 


T TTTGG AT AG 


TCGAAATGCT 


GCATCTGATC 


CCATAGATTA 


TTTTGACTAT 


GTCATGTTAG 


3000 


GGTGGTCAAA 


TACAAGTTCT 


GGTTATCGAT 


TGGCGATGGA 


GCGTTTATTA 


GGTCGAGCTC 


3060 


CTTCAGAGAA 


AGAATTACAA 


GACAAGTTTA 


TTCCTGGAGT 


AAGTTTT C AT 


TTTATCTATA 


3120 


C AG ATTTG A T 


TAAAGTTCCT 


GGTTATATTT 


TTGATGCTTA 


CL'ATGCTGTA 


AAAATTAAGG 


3180 


ACATGCTTAA 


TTTATTAAGT 


GAGTTGTATA 


TTTGCATTAT 


TCCAACTCAT 


AATAAGAGCC 


3240 


AATTTGAAAA 


TATTATTCCA 


ACCAAAATAC 


AACATAGCCT 


GTATTATCTT 


GACTATGCTG 


3300 


GAGAAGACTT 


AGAAGAGTGG 


ACTAAGAAAG 


TCTATCAAGT 


TGTTTTAAAA 


CAATCAGATA 


3360 


AAGGATAGTT 


GAGGAAAAAA 


CGATGAAAGT 


GATTGATCAA 


ACCTTACTAG 


AAAAAGTCAT 


3420 


TATTGAACGT 


TCTTGTACAA 


GTCATAAAGG 


AGACTACGGT 


CGTCTGCTGT 


TGCTTCGTCG 


3480 


GACTTATCCT 


TATGGTGGTG 


CCATCATCAT 


GGCTGCTTTA 


GCAGCTGTAA 


AAAGCGGTGC 


3540 


AGGATTGGTA 


ACCGTTGGAA 


CGGACAGGGA 


AAATATCCCT 


GCTCTACACA 


GCCATTTGCC 


3600 


TGAGGCTATG 


GCCTTTTCTC 


TGCAAGATCA 


GTAATTGTTA 


CAAGAGCAAT 


TGGAGAAGGC 


3660 


AGAAGTTGTC 


TTGCTGGGGC 


CTGGTTTACG 


AGACGATACG 


TTTGGACAAA 


ATCTTGTAAA 


3720 


ACAGGTCTTT 


GCTAGCTTAA 


AAAAGAATCA 


GATTTTGATT 


GTAGATGGAG 


GGGCCTTAAC 


3780 


CAT CCTTGCT 


AGGACAAGTT 


TGTTGTTTCC 


ATCTAACCAG 


CTTATCTTAA 


CTCCCCACCA 


3840 


AAAAGAATCG 


GAAAAACTGT 


CTGGTATTGC 


TATTGAAAAG 


CAAAACGAAG 


GTACAACATC 


3900 


TAGTGCCCTG 


ACTTCTTTCC 


CTCAAGGAAC 


AATTTTGGTA 


GAGAAAGGTC 


CAGCTACTCG 


3960 


TATTTGGCAA 


GTTGGCCAGT 


CTGATTATTA 


CCAGTTAAAG 


GTTGGCGGTC 


CCTATCAGGC 


4020 


GACTGGTGGT 


ATGGGTGATA 


CACTGGCTGG 


AATGATTGCA 


GGATTTGCAG 


GCCAATTTCG 


4080 


ACAGGCCAGT 


CTCTACGAAC 


GTGTGGCAGT 


AGCAACCCAT 


CTTCATTCAG 


CCATAGCCCA 


4140 


AGAACTATCT 


CAAGAAAATT 


ATGTGGTCTT 


GCCGACGGAA 


ATTAGTAATT 


GTCTTCCTAA 


4200 


AGTAATGAAA 


AGATATGTCT 


AAAATAGTTA 


GACAAAAAAT 


GTTGATAATT 


TGTATCATTA 


4260 


TTCTTAATTC 


ACAAAAAACG 


AACGTTTAGT 


ATTCTTCTTG 


CTAAGAAACT 


AAATTTGTTC 


4320 


G TTTTTTT AC 


TCTTGTAAAT 


CTATTTTTGT 


TAG AGTTG AT 


TTGGTTTACA 


TCCCTACTTA 


4380 


AATTG ATTTG 


TTAGAGCTCT 


ACTTTTATTA 


AAAAAATTCA 


ATTTCAAGGA 


TAAATAAGCA 


4440 


CTATTCTAAA 


GGTACTTTTA 


GATGAAATAA 


AAGCCTTTAC 


ATGGTATAAT 


AGAGGTAGCT 


4500 


CTTTAATGGA 


GGTGTTTGAG 


TGGAAAATCT 


GAAGAAAATG 


GCAGGTATCA 


CGGCTGCTGA 


4560 


ATTTATCAAG 


GATGGGATGG 


TTGTAGGGCT 


AGGAACAGGT 


TCTACTGCCT 


ATTATTTTGT 


4620 


CGAAGAAATC 


GGTCGTCGAA 


TCAAGGAAGA 


AGGCTTGCAG 


ATTACAGCTG 


TGACGACTTC 


4680 
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TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4 74 0 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCACT TTAATGGAAT 4 800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GCAACACCAT CAAAAGAATA 4 360 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4 9 20 

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4 9 80 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GAT AT G C AG A ATTTTATCAT 5C4 0 

TGACCTCGCC TTGGATCTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 52 20 

CTAAATTTAA TCCTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGCT GCAGCACCAG 52 80 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGG AC AC A 53 4 0 

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 54 00 

CTCGTGAAAC TCCTCTTAAG ACTGTAGCAG CTCAAAGCAA TCCAACTGGA TATGCAACAA 54 60 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 5 52 0 

T C AAC ATT AC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGCTTA TTCCTGAACC C AAC AAACCT TATTCAGGAA 564 0 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 5700 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 57 60 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCCTCCTGCC CTTCTTGCTC 58 2 0 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 59 4 0 

TCGATACTTA TGCTGTGGGT AAAATC AAC G AT AT CTTT AA CGGTGCTGGT ATCAACCATG 6000 

AC ATGGGT C A CAACAAGTCA AATAGTCATG GAATTGATAC ACT ATTG AAG AC T ATGGG AC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA C AAACCT AG T TGACTTTGAT GCCCTTTACG 6120 

GCCATCGTCG T AATGCT C AC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 62 40 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCG A 6 3 60 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 64 20 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTG AG AG G T A TC AATGTTGG TGGTAAGAAT 64 80 

AAGGTCGTCA TCGCGGAGCT TCCTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 654 0 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 
GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 
AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 67 20 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 
GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 
GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 
CCTTTCTACC GCCACATTAC TATTCCTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6 9 60 

CTAAAAAAAT AAT AAAGG AG ACACACAATG ACATTTTTAA AC AAAATC C A TGAAACTGCT 7020 

ACTTTCCTCA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG T AG ACT ATG C TGAGATTCCA 714 0 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7 2 00 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTCACTT TCCCAGTTCG TG TG ATG AAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 73 2 0 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7 380 

AAC AT G ACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 74 4 0 

C C AG ATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTCGCTAAA 7 500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7 560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATG CAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7 680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AG AAG TG ACT 7 74 0 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7 800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7 860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7 9 20 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7 9 80 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8 04 0 

GAATTTCTCA GTCTTATCAT A7ATAGCACA ATG AG ATTT C GCTTGAGTCT G CTTGT AAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTC AAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 82 2 0 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 3340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 84 00 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 84 60 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 85 20 

TG G ATGTT AC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCACAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATT AAG CAT T ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 87 60 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG AT CCG AAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 924 0 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAACATGTG TCAGCACAAA TTACGATTAA 93 00 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9 360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AG ACTCCTG T GAAGGATAAA ACATTGACAA TTCAACACAG 94 80 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 954 0 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 96 60 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT 


GACGAAGTCA 


G TC AC AT ATA 


TAATCCAAGG 


CGACGTTGAC 


GTGGTTTGAA 


10020 


GAGATTTTCG 


AAGAGTATAA 


ACAGAAAGGT 


AGAGCGCGTG 


TTCTAATTTG 


AACACGAGTA 


10080 


GAAAACTTTT 


CTAAAAACAA 


AAACGAAAGG 


ATGGGTAAAC 


TGTATTCGCT 


GAACTGAATA 


10140 


CGGGCGACTC 


TCCTCTAAAT 


CAAAATTAAG 


AAAGGAATTG 


ACCCCACCCT 


AAAAGTAGTG 


10200 


GGAAAAAGAT 


AGTTGATCTA 


GCGAGCATCG 


CTCACTGCGC 


CCAACTCCTA 


TTTTCCCTTC 


10260 


GCTTTTTGAT 


GGGTTTGGTA 


TCTTTCTCAA 


TATAAAATAT 


AAAATAAAGA 


AAGGTAGAGC 


10320 


GTGTGTTTTG 


AT TTGAACAC 


GAGCGGAAAA 


CTCGGAAAAT 


AGATAATCTG 


ACTGAAAAAT 


10380 


CAGGATTTCT 


CGTCAGGTTC 


CTAATTTTCA 


CTCGTTTTCT 


TCTCGCTCTT 


TCTATCATAA 


10440 


ATTATGTCTA 


TCCATATTGC 


TGC TCAGC AG 


GGTGAAATTG 


CTGATAAAAT 


TCTTOTTCCT 


10500 


GGGGATCCTC 


TTCGTGCTAA 


GTTTATTGCG 


GAGAATTTCC 


TTGATGATGC 


TGTTTGTTTT 


10560 


AACGAAGTGC 


GTAACATGTT 


TGGTTACACT 


GGTACTTACA 


AGGGTCACTG 


TGTATCTGTC 


10620 


ATCGCAACTG 


GGATGGGAAT 


GCCATCTATT 


TCGATTTATG 


CGCGTGAGTT 


AATCGTAGAC 


10680 


TACGGTGTGA 


AGAAATTGAT 


TCGTGTGGGA 


ACTGCAGGTT 


CTTTGAATGA 


AGAGGTTCAT 


10740 


GTTCGTGAAT 


TAGTTTTGGC 


GCAGGCGGCT 


GCAACCAACT 


CAAACATCGT 


TCGTAATGAC 


10800 


TGGCCACAGT 


ACGATTTTCC 


ACAAATTGCT 


AGCTTTGATT 


TGCTTGATAA 


AGCCTACCAT 


10860 


ATCGCCAAAA 


AACTTGGTAT 


GACTACTCAC 


GTTGGGAACG 


TTTTGTCATC 


TGATGTCTTT 


10920 


TACTCAAATT 


ACTTTGAAAA 


GAATATCGAG 


CTTGGTAAAT 


GGGGAGTCAA 


GGCTGTGGAA 


10980 


ATGGAAGCAG 


CAGCTCTTTA 


CTATCTTGCT 


GCCCAATACC 


ATGTTGATGC 


GCTAGCTATC 


11040 


ATGACCATCT 


CTGATAGCTT 


GGTCAATCCA 


GACGAAGACA 


CAACTGCAGA 


AGAACGTCAA 


11100 


AATACCTTCA 


CTGATATGAT 


GAAGGTTGGT 


TTGGAAACCT 


TGATTGCAGA 


ATAATTATAG 


11160 


CCAAAAAGGG 


GCTCTTTGTC 


AACTGTAGTG 


GGTTGAAAAA 


AAGCTAAGCT 


TGAGAAAGGA 


11220 


CAAATTTCGT 


CCTTTCTTTT 


TTGATATTCA 


GGGCGATAAA 


AATCCGTTTT 


TTGAAGTTTT 


11280 


C AAAGTTCC G 


AAAACCAAAG 


GCATTGCGCT 


TGATAAGTTT 


GATGAGATTA 


TTGGTCGCTT 


11340 


CCAGTTTGGC 


ATTAGAATAG 


TGTAGTTGAA 


GGGCGTTGAC 


GATTTTCTCT 


TTGTTCTTTA 


11400 


GAAAGGTTTT 


AAAGACAGTC 


TGAAAAAGAG 


GATGAACCTG 


CTTCAGATTG 


TCCTCAATGA 


11460 


GTCCGAAAAA 


TTTCTCAGGG 


TCTTTGTTCT 


GAAAGTGAAA 


AAGTAAGAGT 


TGATAGATCT 


11520 


GATAGTGGTG 


TTTCAAGTCT 


TCTGAATAGC 


TTAAAATCTT 


GTCAAGAATT 


TCTTTATTTG 


11580 


TTAAGTGCAT 


GCGAAAAGTA 


GGGCGATAAA 


AACGTTTATC 


GCTsArTTTA 


CGACTATCCT 


11640 


GTTGGATGAG 


TTTCCAGTAA 


CGCTTGATAG 


CCTTGTATTC 


ATGAGATTTT 


CGTTCAAACT 


11700 


GATTCATAAT 


TTGAACACGA 


AAACG ACT C A 


TGGCACGGCT 


GAGATGTTGG 


ATAATATGGA 


. 11760 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12 000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTCG TTGATGATAC GAGGCATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AAC CATC ATT TTTGAAsACT GATAGCACTT GAAACGGCGT TTTCTAAGGA 122 40 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CAT rTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 123 60 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 124 20 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12 480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12 540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12 600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AG AAAT TG AT CTAGCTCTCT CTGATTTCGA 12 660 

AGAATAGTGA CTTTATCTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AG AT AAC G CT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATGGCG AAAAATTG AG AAAGATGATT TG GTC AGCTT CTTGCATTCC 12 900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 129 60 

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GCAGTAGTAG TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 1314 0 

AT TTGG AG AC AAAAAAACAG CCTCTATGGA CTG TTTCTT A TTTAACAAGT TTAGCTGAAA 132 00 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13 260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13 320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13 380 

CAGCGC GTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 134 4C 
(2) INFORMATION FOR SEQ ID NO: 12 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTC ATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACC AG AC AGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 2 40 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TC AT AG ATC A CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 3 60 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAACTCTGGA 4 20 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 

TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTCATTTAAA 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG ATACATGGTC 660 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGCCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 960 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

C AG AATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 12 60 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 13 2 0 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 13 80 

CGAAGCGTTT TTGACGTGGG T CATT AG ACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 14 40 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 
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GATGATTGTC 


TGAAAAGATG 


AATT GACC AG 


AATGrPPGAT 


TTCATGAATf 

1 * V.rt X W\ 1 V. 


AAvAj 1 A 1 J\\j A 




CATCGCTCAA 


ACGGCCTGTC 


CAGCTCATGA 


GTAPATAAfTG 
vj x X rvrvv»vJ 


ulu 1 V»t_ Vj A 


1 A I VjViV* X H-Lt 




CCGCATAACC 


ACCGGAATCC 


TTGCCACFGT 


TAf;pa^raA a 

x nuLnuLKnn 


L» 1 VV. AV.V.V, Av» 


v. Vj ct c tt cti i 


1680 


GGTAACGAGC 


AACTTCCTGA 


CAATATTCTT 


GCtTrAAAGG 


Ttpt a c*r~c a r** 


1 I V.A i SjALLA 


i / 4 U 


AATCATAGGC 


ATCGTCAATA 


VJ X V-<rW» 1 X 




«L 1 Vj I LLAA^j 


TCCAATi ICC 


1800 


AGTCTGCAAA 


GGTC ATCTTT 


TCAAGACCAT 


X X n\.L L I ouv. 




AvjVi IA1L1L1 


1860 


GAGCGACTGG 


TGCAAAGTCC 


TTCATnATflA 


Vj V» 1 v_ /\>\ Ik. lu 


f\ r* CX 1 f** a a a o 


ATva AC ACGOT 


1920 


CCACTTCTTG 


TTCAGCTAGA 


AGATAGTCAA 


AfiACAnAfiTC 


ulnlLVLl i v 


A 1 A i vAOvvA 


19 80 


AGAGTTTTTC 


AG ACT TG A CC 


TfiACICCAflAT 


nuuL i V»w X V»V. 


a f*C m F*f*' f T* iftMH it 

AuVwOlAI 1 I 


IuCiTGCTTAC 


2040 


GAAGTCCCTC 


TGAGAAGGAA 


CG GAAGGAT T 




v 1 v AvjV. A 1 


1VA I VjVj riir 


2100 


GGTAGAAATT 


CTCATAGG T C 


ACAAAGCTGT 


1 1 1 lul Muu I 


L 1 1 LiV-V-A IliO 


OLnCAAAGT 


2 160 


CAGCCATTTC 


AAAATCCCCA 


VJV. X V.V1V. « 1 \* 1 


TAfiTATAAAT 

X AV* I A I AAA I 


v» I l_ V, 1 (jLVj(jA 


vTTvjTAGAAAA 


2220 


CTTCACCGAG 


ATTTGTCAAG 






1 nnu 1 AU x 


GCTTTTTTGA 


2280 


TTTTAGCCTG 


ACGAATGGCA 


GCTGTTAAAT 


va 1 OvV.AA 111 




rCUAAOACTT 


2 340 


CCTCATCTGC 


TGCCACCAAG 


GP ATPCVPC' A A 


Awnnuu x Lnn 


v»Vj 0. rACvjCTC* 


uUVTL l G n l 


2400 


CAAATTCCAT 


CCCAGCTTGG 


GCAATATTGG 


r* a a a T^rrTr 


A I LvjL 1 A 1 Avj 


T CCGT CGTCT 


2460 


GAGGCATAAA 


ACCATAnTTfl 


V- V_./AX\ L r\ 1 V*Vj V- 


x V_ A 1 v» 1 unn 1 


<j I AbATCTGT 


TCCAATTCCG 


2520 


CAAAGGCCTT 


CTCGAAATCC 


TCAAAAfiTOT 


UA/iUn i 1 Vj V>V_ 


v, 1 IblAAiLA 


v.VjV»v» I AAAL i 


2 580 


GGTTGATGTC 


TTCG CG AGCT 


X X V. X UVJn 1 1VJ 


rAr(^f*AAGA & 


A 1 v_V, i, V, AL,VjL» 


TCTTGGTATA 


2 64 0 


GGGCTGTTAA 


GTCC CAG AGT 


X V- X A X \_« X ^— ' 


GAAATTPTGA 
u.rw\ x x v» X V*y\ 


AVoij Ibl 1 1 1 


TGTTCCATTT 


2700 


TCTTCCTCTT 


ATTTCTCTAA 


TTCTACTAAA 


APArTAAfifin 


V * V*A 1 AAAkjL 


VI 1 AAAULWj i 


27 60 


GCTGTTTCTG 


CTCGCAAAAT 


ACGAGGACCT 


AGGCCTGCCA 


AAACGGCTCC 


TTT AGCTTC A 


2820 


AAACTTTCGA 


TTTCTGCAGG 


TGAGAGACCG 


CCTTCTGGAC 


CAAAGATAAA 


GAGCAGTTTG 


2880 


GCTCCTGTTT 


CAAGACCAGT 


GACTGCTTGC 


AGAAGCGCAG 


CGGCTTCTCC 


TTCTTTAGCT 


2940 


GATTCTTCAT 


AGGCTACTAT 


GATAGAGTCA 


AACTGGTCCA 


GCTGAGCTAG 


AAAATCTGCT 


3000 


TTTTTCTCGA 


AAAGTTTAAT 


ACTTGGTACA 


ATATTACGCT 


TGCTTTGCTC 


GGCTGCTCCA 


3060 


AGGGCAATTT 


TTTCTAGTTT 


TTCAACTTTT 


TTACCCAATT 


TCTTGCCATC 


CCACTTGGCA 


3120 


ACTGACCAGT 


CTGCAGGAAA 


GGCCCAGATT 


TGGCTAGCCC 


CCAGTTCGGT 


TACTTTTTGA 


3180 


GCGATGAACT 


CCAGCTTGTC 


TCCCTTGGGA 


AATCCAGATG 


CGATGGTCAC 


TTGGACTGGT 


3240 
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AGTTCCACAT TGTCATTTAA TTCTTGG AC C AACTCAAACT GACGATTTTC CATATCCAGC 3 3 00 

ACGCGCGCCA AG CGCTTG AT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 33 60 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT G AC AGGAG AG 3 42 0 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGACATATC 348 0 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATC C AG CT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3 600 

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3 660 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720 

ATCTCCAGCC GCT ACATGG A TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 3 7 80 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3 84 0 

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3 900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3 960 

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080 

GTCTG C CC AG TCTTCCTCAG CCAAGGCAGT CGTACCTATT T TT AACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4 200 

CGTGTCAGGG TAG T AG GCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 4 260 

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTG CGACTCCTTG 4 3 20 

CGCTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 4 3 80 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4 44 0 

AGGAAATTCT CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4 500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4 560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4 620 

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAACATG ATATTGTCAA ATTGTTCTTG 4 680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATT ACG C TAGGATAAAT 474 0 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4 800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4 860 

ATTGGCCCAG GGAATACTTG CCTT AT CATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4 920 

AAACAAAGCA ATCTTGCAAC CTGTG AAATC AG AAATTT C T AGTTCCATCT TCAGTTCCTT 4 980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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ACATTCGACT ACTATCCATT TTCTG T CT AG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTCTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 53 4 0 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 54 60 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5 52 0 

TTTTTGCTCC ATCGCTGATG TCGACTCGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5 530 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 57 60 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AG AAT AAAG A 5880 

TTAAATACAT GCTTAGAAGA GCG AAGG CAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000 

CTACCACATA AACCAAATAA GTCCCAATTT TCTTCAT AAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAACCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCT CT G CTTCTGGGTC TTC CAAAAAG TCCATGAGGA 6130 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 62 40 

GGGAAAATCC TTTTTTAGGC AG G T AAGG AA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6 3 00 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATG A AA 6 3 60 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCACCATTT TGCGCTTCCA ATAGAAAACT 64 20 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 654 0 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAG AT A C TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6 660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AAT C AAT AT A AAAAGCCCTC 67 20 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6 780 
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TTTTATTGAC TATTACCACG AT AC C ACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 68 40 

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6 900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 6 9 60 

TCAACAGATA CCATAGAAAT AAAAG ATT T A TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7 020 

TCATCTTTAA AAGGATCATT AT C AATT AAA G AC AAGCT AT TGATATCTGA TGGCTGAGGT 7080 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT CAGAAATCCC CTCTCCTTGT GTAGCTGAGT ATTCAAAATC TGGGAAATGG 7 200 

ACAAAATAAG TCGCTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 72 60 

ACTAACATTT CACTACCTCC AT ATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 73 20 

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7 380 

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCGC CTCTAGTCTT TATCCAACCA 7 44 0 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7 500 

G ACAAC AC C A TTATAACACG TGTTACACGT ATTGTAAAGG AGTGATACTT ATT ATT CT AT 7 560 

TATACATAAA AGCCCCTAGA TGTGGTTCTA AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7 620 

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AG T CAATTGT 7 680 

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG GAACACTGCG 77 40 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7 800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTACTTT CCTAGTTTGC 7860 

TCTTTGATTT TCATTGAGTA TGATTAACTC TCAAGTCTTC GAAATCAGCA TTTTCAACAG 7920 

TTATTACAAG GAGGCGATTT ACTACTTCAA AAACATCAAT T ATT CTATTT TTCATATTTT 7980 

TTCAACCCAT TATTAGAATG AACTTCTTGG TAAGCAAAAT C AAGTTT AC A TTTAATGTTT 8040 

TCGTACAAAT CTAAAATCTC TTTTCGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

GAAATAACTT GATCACTAAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 822 0 

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 82 80 

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAAT AT C AC A 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGT CATTCT A CTCAAATCCA 84 60 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCGTTCG CATGCGCCCn GG 8S12 
(2) INFORMATION FOR SEQ ID NO: 13 0: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA CGTTGAGTCT CTTGCAAATC TTGTTCGCGT T CTT C CTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 120 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATC ACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATT kT CTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 2 40 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACAGG CTTT A CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TT AAAGGT C A AAATGATGAG 4 20 

CAACATCGCT ATGTGACTGC C C ATGT AG AT ACGCTTGGTG CTATTGTCCG TGCTGT C AAA 4 80 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

G AAAAC TGT A CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 6 60 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCG TGC TCTTGGCATT 7 20 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC AGGTTTTATC 7 80 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCGATTT TGCTCAATCT C CTTCGC ATT 84 0 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 102 0 

AAGGATGCTT CTCGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TT TGGCG AAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 12 60 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AG AG TTGGAA ACAGGCTATC TTCTGGTTGC 13 80 

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTACCC AAGGAGCATG TCAGCGAATT 1440 
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GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500 

GGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 15 60 

TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT 162 0 

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680 

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 174 0 

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 18 60 

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920 

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 19 80 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACT AT A ATCC AGGGG A AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AG AT TATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 222 0 

CAGACAGGCT TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 2280 

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TT AT CTCAAA 23 4 0 

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 2 4 60 

GGAGACTACG TCGATTAATA CTCTT CG AAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520 

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cGATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 264 0 

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760 

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820 

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTCG 2 8 69 
(21 INFORMATION FOR SEQ ID NO: 131: 

(i> SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

S 
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CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTT C ATAGGCAGCT 
TCAACATCAT TAAAAAAAGA AxGCACTGAA GCAAGTTCTT CAGTGCTCCA CGACAAATCT 
AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCACC TCTCATTCTT GCTTCTTTTA 180 
GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 240 
TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTGCTCAA 300 
TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT TTTTGAAAAA 3 60 

TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 4 20 

GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 4 80 

TTTTTCTAAC CAGGCCTTAG CCTTACGACA TGATGTACAG CTCGGTGATA GAAATAGTGT 540 
AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 600 
GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 660 
AACGATTGAC ACAATTGGAT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 720 
ATTTTTATAT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 780 
AG AT AGC C AA AGAAGTTTTC ATAGAGGGCA AAAAAGAGGA GGAAGGCATG AAGAAAGAAG 840 
GTCTCTGGCA AAATCATAAT AACAGGATCC TTGGCTGGAT C AAAAAGC C A GGTATCATCT 900 
CCCACAAAGA GAATTTGATG GAAAAGAGTA AAGAATTGCT CAAAACCAAT CAAAACTCCC 
C C AAGTCCAA TCATCACAGG TAAGACTACT AGAGCCAGGA GACTTTTTCG ATAAAGAGAC 
AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGGCAG TGTCACTAGA 
GCTACTAGCT GAACCAAATG AAAGAGATTC TTGACCACTG CGAAATGCTG CACACCAGCT 
GCTGACGAAC GAAAATCAGG CATCTGTAAG ACCTGACTAA AAGGATTGGT C AG ATAATT C 
ATCAAGATAT GAAAATTGTA TTGAATGGTT TCTGGTTTTA GATAGACTCG ATT CGTT AAG 
TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT C AG AAGG A TG 132 0 

GAGAGGGAGA GGAGAAAGAG CATAGAGCCC CAAAAGATCA ATTTAGTTTT CATCAAAATC 13 80 

CCACTCCGCA AGG CT AG AAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 14 4 0 

TGCCTTAGTA AAACCTGTCG T C AC C AAG AG CGTTGGAATG CCATTGTCAA TCCCAGCCCC 
AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 
CTCAACCGCC TTGTC C AT AA TGATGGCATT TGGTTTTCCG AT AT AAACC G GCTTCACTCG 
TGTCGCTACT TCAAGCAGCG TAATCAGTGA CCCAGCACCT GGCAAAAGAC CGCGTTCCGT 
CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA TAGCAAGAGT 
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TGCTGTGGCA AATTTTTCAT AGTCGACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1800 

TTTTTCCTTG TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC 1860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 1920 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AAC ACT CTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 2160 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 2220 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2 290 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 23 40 

CCTCAAAAAT CCAGTCCAAG TATTTACTCT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATGGTAATCC TTGCTGATCG GTTCTTCCAA GGACTCATAC TTCGGTCCAC 24 60 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 2520 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGAGCTCCT 2 580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 264 0 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACCAT AATCCAGACC AG ATT AT ATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 2760 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AAT C ACATCT GGAAGTTTGA 2820 

TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 2 880 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2 94 0 

TTTGTCATGC GAACGTGGTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AG CAAAAG AC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3 0 60 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 3120 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGGGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 3240 

CCATAGTATT CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGGGTTTG 3 300 

GAGAAATTAG AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 3 360 

TCAAACATCT CCCCTTCTAG TTCCTCCTTA GGC AG AGGC A ATTTCCCACG TCGCATCCGG 34 20 

TTC AT AAAG A CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 3540 
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AGGTTTATCC CTAAATCATC CGCTAACTTT CTAGCTGCAG GAGTCGCTCT TAGCTTGTCA 
TCAGCCATGA CCTCTCCAAT TCTATTTATG ATACAAAGGG CCTCAAAAGC GACTGAAAAA 
TAGGAAATCG ACGATGGCTT CGATGAAGCC AAGGAGATTT ATCTTTTTTC CCATCTTTTA 
GCCCGTGCTC TAATCTAAGA TATTAATGAC GAAGAGCTCT GCACCTAAAA GAT AC AAAGT 
TTCTCGTCAG CTTTATTTTA TTTACATAAC TTATCTTATG TAACCCTATT CTTTCTTATA 
AGTTTTTCGG ATTGCATCTT TGATACTTTC AACTGTTGGA ATCATTGCAT TTTCTAGGTT 
TTGTGCATAA GGCATCGGCA CATCTTCTCC TGCACAACGG CGAATTGGTG CATCTAGATA 
GTCAAATGCT TCTGATTCTG AAATAATAGC TGAAATTTCA CCGATATAGC CACTTGTTTT 
GTGGGCATCG TTGACCAGAA CAACCTTACC AGTCTTCTTC ACTGAGTTTA TGATGATATC 
CTTATCAAGC GGAACAAGGG TACGTGGGTC AACAATTTCA ACTCAAATTC CTTCTTCTGC 
TAATTCTTCA GCAGCTTGAA CCACACGGCG AAG CATTTTT CCATAAGTAA CAACTGTTAC 
ATCCGTTCCT TGGCGTTTGA TTTCACCAAC CCCAAGTGGA ATTGTGTACT CTGGATCAAC 
TGGCACTTCC CCTTTTTGGT TAAATTCTGA CTTGTACTCA AGTATAATAA CTGGGTTGTT 
ATCACGGATA GAAGACTTAA GCAGGCCTTT CATGTCCGCA- GGTGTTCCAG GTGCCACAAC 
CTTAAGTCCT GGAATGTGAG TAAACCAAGA CTCTAGAGAT TGTGAGTGCT GGGCGGCAGA 
GCCAACTCCG TTACCAGCTG CACAACGAAC AGTCATTGGA ACCTGACCTT TACCACCAAA 
CATGTAACGT GTTTTAGCAG CTTGGTTGAC GATATTGTCC ATGGCAATAA CAGAGAAGTC 
CATGAAGGTC ATATCGACGA TTGGACGAAG TCCTGTCATG GCTG CTCCTG CTGCTGCTCC 
AGAGATGGCA GCTTCAGAAA TCGGACAGTC ACGGACACGT TCTGGACCAA ATTCTTCAAG 
CATTCCAACA GAAGTACCGA AGTCTCCTCC GAAGACACCG ACGTCTTCTC CCATCAAGAA 
CACATTTTCA TCGCGACGCA TTTCCTCAGA CATAGCAAGG AT AATGGTGT CACGGAAGGA 
CATTGTTTTT GTTTCCATTT TATCTCTTTC TCCTTAGTCT GCGTAAATAT CTTCAAAGGC 
TGATTCAAGC GGTGGGAATG GGCTTTCCTC TGCAAATTTA ACAGAAGCTT CT ACTGCTT C 
CTTTACTTGC GCTTGGATTT CTTCCAATTC TTCGGCACTT G CAATGTT AT TTTCAATAAG 
GTAATTGCGG AGGTTTTCGA TTGGATCTTT TTGTTTCCAC AATTCCACTT CTTCACGCGT 
ACGATATTTA CCAGGGTCAG ATGATGAGTG ACCGAGCCAG CGATAAGTTA CACTTTCAAT 
CAAGACTGGA CCATTGCCAC TGCGAACATG GTCCACAGCT TTCTGAAATC CTTCATAGAC 
ATCGATGACA TTGTTACCGT CTTCGATGAA CATTCCAGGA ATTCCAT AAG CGGCGCTACG 
TTGATGGATA TGTTCTATAT TGGTCATTTT CTTGATATCC GCAGAGATAC CGTAACCGTT 
GTTAATGCAA TAGAAAATGA CTGGCAGGTT CCAGATAGAA GCCATCTTCA CTGCTTCGTG 



4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

S760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 



WO 98/18931 



PCT/US97/19588 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 



987 

TCAAGCAATT CAGAACCATT TACCAAGACG TTCTTCTCTT TACTGATTGT ACCAATTCCT 
TTATGAACAG TAACTCCGTA GCTACGAAGA AGTCCTGCAA CACCACCAAC AAGAGTATTA 
ACAACTTTAG ATTTAGTTTC TAAAAGTTTT TCCATATCAA CAGTGAACTT AGGATTTTCA 
ATCACGATAC CACGATTTGC AGCATGACCG ATATTTTCAA TAATTTCAGC GTTATGAAGG 
TAGGTCTTGG TTGGAATACA TCCACGGTTT AAG CAGGTTC CACCAAGTTC AGATTTCTCA 
ACAAGGGCAA CCTTACCGCC GAATTGGGCA GCTTTAATGG CTGCAACATA ACCAGCAGGA 
CCTCCACCAA TCACAACGAT ATCAAAAGCA TCATCGCTCT TACCATCATC GTTTGAGGTA 
CTTGCTACAG GTACAGGGCT AGCTTCTGGC GATGCTGCTC CAGCTGTTGG GATGTTTTCC 
CTTTCTTCAC CAAGGTAACC GATAACTTCC GTTACAGGGA CAGTTTCACC ATCTCCTTTG 
AGAATGGCAA TCAAGTACCC ATCTTCTTCG GCTTCCAATT CCATGCTGAC TTTATCAGTC 
ATGATTTCCA AAAGGATTTC TCCTTCTTTT ACAAATTCTC CGACTTTTTT ATTCCATTGG 3780 
ACGATTTGTC CTTCTGTCAT ATCCACGCCG GCTTTTGGCA T AATT ACTT C TAAGGCCATG 3 840 

TCTTCCTTCC TTTATCTATA TCTTAAAAAT GAATACTCTT GCTCTTAAAT TAACATTGAG 
ATTGGCGTTT CAATCAACTC TTTCAAGTCC TTCATAAACT TAGCACCAGC CAT AC CATCT 
ACGACACGGT GGTCAATGGT TAATCCTAAA CTCATGATTG GGCGAATCAC AATTTCACCA 
TTGACGACAA CTGGCTTCTC GATTGTCGAA CTGACACCAA GGATAGCTGA GTTGGGTTGG 
TTAATAATCG GACCAAAGGA CTGAACACCA AACATTCCCA AATTACTGAT TGTGAATGTT 
GAATTTTGTA ACTCACTTGG AGCCAATTTA CCATCCAAGG TACGGCCAAT AACATCCTTA 
AAGGCTACAA CCAGTTCTGA AAGACTCATC TTCTCAG CAT TGTAAACAAC AGGTGTCATC 
AATCCATTAT CCATCCCAAC TGCCATGGCA AGATTGACAT AGTTGTGAGT GATAATAGTC 
TTGCCATCTT CTGTCAATGA AGCGTTGATG TATGGGTGTT TCATAAGAGT CTTAACAACT 
GCAAGCGAAA GAAGGTCTGT TACAGTAGTC TTCTTCCCAG TTGCTTCCAT GATTGGCTCA 
AGAACCTTCT TACGAAGAGC CAACATTTCA GTCATATCAA CTTCAT AG TT GAGGGTGAAG 
GTTGGCGCAG TCAAGTAAGA TTCAACCATG CGTTGGGCAA TAACCTTACG CATTGGTGTC 
ATTGGAATAC GCTCGATTTT AC CAT ATGGT GTTACGTTAT CAGGCACTTC TTCCACTTTT 
TCAATCTGAG CAGGAGATTT GATGCTATCG TTTTCGATAT TTTCAGGAAG CAGGGCCAAA 
AC ATC CTTCT T CAT GATTTT ACCACGATGA CCGGTTCCTT GGATTTCCTG CCAAGCAATG 
TTATGTTCGA GGGCAATTCG TTTTGCAAGT GGCGAAATGC GAACCACGTT TGTGTCTTTA 
TAAGTTTCCA CGTCTTCTTT GTGGACACGA CCGTTTGCAC CTGAGCCAGA AACGTCGTAG 
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CCACGACGGA CCTTAAATTC ACGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 

GTATCCTTGA TGCGATTGAT TTCGG CCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 14 40 

TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 

AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 

TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGG TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATC CAAAAG GTGTTTAAAG CCGTATTCTT CCAAGGCAAT ATT AAAAG C A 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGCATTTC TCCGTATTTC 2340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 24 00 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 24 60 

AAACCT ACT G CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2 580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2 64 0 

GTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCCA 2 700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2 760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2 820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 3120 
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CTCCAGG 

<2) INFORMATION FOR SEQ ID NO: 147: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 11340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
fD) TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


147: 






CCGGTATGTT 


' CTGGAATACT 


' ACCAATCTAA 


l GCTGGCTGTG 


CCCTACAGTT 


1 TTACAACCCT 


60 


GTACGAATAC 


CTTAAGGAAT 


ATGACCGATT 


TTT CAGCTGG 


GTTTTGGAGT 


CTGGTATTTC 


12 0 


AAACGCTGAT 


AAAATATCCG 


ATATTCCTTT 


ATCAGTTTTG 


GAAAATATGT 


CTAAGAAAGA 


180 


CATGGAATCC 


TTTATCCTTT 


ATCTACGTGA 


ACGTCCCTTG 


CTGAATGCTA 


ATACAACAAA 


240 


ACAAGGTGTT 


TCACAGACAA 


CT AT CAATCG 


AACCTTATCA 


CCACTTTCTA 


GTCTTTACAA 


300 


GTATCTAACC 


GAGGAGGTTG 


AAAACGATCA 


GGGGGAACCT 


TATTTCTATC 


GTAATGTAAT 


360 


GAAAAAAGTT 


TCCACCAAGA 


AAAAGAAAGA 


AACCCTTGCT 


GCCAGAGCTG 


AAAATATCAA 


420 


GCAAAAACTC 


TTTCTAGGTG 


ATGAAACAGA 


AGGTTTTCTA 


ACTTATATCG 


ATCAAGAGCA 


480 


CCCACAACAG 


CTTTCAAATC 


GAGC TCTCTC 


ATCATTCAAC 




AnLuAbnll 1 


540 


AGCC ATT ATT 


GCCCTTCTCT 


TGGCATCTGG 


TGTTCGCTTA 


TCTGAAGCTC 


TTAATCTAGA 


600 


TCTAAGAGAT 


CTCAATCTAA 


AAATGATGGT 


TATTGATGTT 


ACTCGAAAAG 


GTTGCAAACG 


660 


TGACTCAGTC 


AATGTCGCTG 


CTTTTGCTAA 


ACCTTATTTA 


GAGAATTATC 


TGGCCATTCG 


720 


GAATCAACGC 


TATAAAACGG 


AAAAAACAGA 


TACAGCCCTT 


TTTTTAACTC 


TCTACAGAGG 


780 


TGTTCCTAAT 


CGTATCGATG 


CTTCTAGCGT 


TGAGAAAATG 


GTTGCTAAAT 


ACTCAGAGGA 


840 


TTTTAAAGTG 


CGTGTAACAC 


CCCATAAACT 


GCGCCATACA 


CTAGCAACTA 


GGCTCTATGA 


900 


TGCGACTAAA 


TCACAAGTTT 


TAGTCAGTCA 


CCAACTAGGA 


CATGCTAGCA 


CACAAGTCAC 


960 


TGACCTCTAT 


ACCCATATTG 


TTAGTGATGA 


ACAAAAGAAT 


GCTCTGGATA 


GTTTATGATT 


1020 


TTACGTATTT 


TAAATTATGT 


AAATAAATAT 


CAAAAAAAGA 


AGTTGGCCAA 


CTTCTTTTTG 


1080 


ATTTATCCAA 


CTACCGCTTC 


AGCGATTTCT 


TCACGGCTAA 


TACCAGCGAA 


GTAGCGTGTG 


1140 


ATATCAATGG 


TTTTTAGCGC 


CTTAAGAACA 


TCTTCGCGTT 


CGTATTTCAC 


CCCACGAAGG 


1200 


ACATCTTCTA 


CTGCAGCAAC 


GTCTTCAATA 


CCAAAGAAGT 


C AC CATAAAT 


CTTGATGTCT 


1260 


TGGATTTTTG 


ATTCAGTAAC 


GTTAGCAAAG 


ACTTCAACCT 


TACCACTAGT 


GAATTTGATT 


1320 
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TTCAACATCT TAAACAGTCG CTGGCTGTGC TCCTGACGTT AAAAGATACC GT CAT ACT AC 1014 0 
TGTTT CTG AC GACCAGTATG ATTGCCATCT TGGATGTGTC CCCTCGGCTG ATTGCCCTCC 102 00 
CCTTCATCCA ACAGACACTA GCACAACTGA GCATTGGGCA ACTCCTCGCC CTGCTCTCCA 10260 
TCATCATGTC TTGTGGAGCT ATCCTTGGCA ATATGACCAG CAGTAATCTA TTTAAAAATA 10320 
TCCGTTTCAC GC ACCTCTTG GTTTTCTCTG AGATTTCCCT ATTGACTCTA ATAACTAGTA 10380 
TCCTTTGTCA AGCCTATATC GTAATTTTCA TGACCAGTTT CATCAGTTCT AC GATT ATCG 10440 
GCATTCTCAG CCCTCGCCTA CAAGCAGCTG TCTTTGCCCA TATCCCCAGT GACAAGATGG 10500 
GGACGGTGCC CTCTGCTCTG AGCACAGTGG ACATTCTCGC CCCGTCCCTG CTCTCCCTAT 10560 
TAGCCCTATC CAT AG C ATCG GGCGTTTCGC TGCAGTTAGC ATTGATATTT TTGTATCTTA 10620 
TTTTAATTGC TCTTATCTTT TGTCAATGGT TACTCAAGTT CAACACTCAT AACTAACGAA 10680 
AAAGCATGTG TAGATTTCAC ATGCTTTTAA TCTCCCCAAT CGTCAGGTCA AG T ACAAC AA 10740 
AGTCACTTCT TTGATTAAGC GAGTGTTCTA ATATAATTAT AAGCGCCCTC TCATTACCGA 10800 

ACCCATTCGC CATTATAGTT GACAGAATAG C CAT CT ACGG TCCTATTCAC TGCCAAAGCA 10860 

CCTGAGCTAT AAGCATAGTA CCAGTTGCCA TTGACCTGGA ACCAACCTGT CTTCATGTCT 10920 

CCATTACCTG CATTTAGGTA GTACCAAGTT GAACCATCTT GATACCAACC AGTTGCCATA 10980 

GCTCCTGATG AACGGAGATA GTACCATTTG TTCCCAAGGT TTTGCCAACC TGTTTTCATA 11040 

TCGCCATTTG GGTGGTCTAA ATAATACCAA GTGGTACCTT CCTGATACCA GCCAGTGGCC 11100 

ATTG CTCCTG AGGAACGGAG GTAGTACCAC TTATTACCTA GATATTGCCA ACCTGTTTGC 11160 

ATAATACCAG TTGTTGGATC TAGGTAGTAC CAAGTCGAAT CATCGTTTAT CCACCCCGCA 11220 

CGTCTTTCAC CACCAAGGTA GTTTTCTCCA TTAATTTCCG TCTTAGCTAG ATAATACCAG 11280 

TTAGACTGAT CATAAAGCCA ACCTGTCTCT AAAGAATGAT TTTGATTAAA CTAATAGTTC 11340 

GTATAATAAC GCTTCTCTTC TTTATCTTCT GAATCTTCAC GTTTTTCCCC GTACTTTCTT 114 00 

CCAACACTGT CTTTAGTTTT AATCTCTAAT GTTTTCCAAC CAACAAACTC TTCTAGCACT 114 60 

CCATTTTTAT CGAAGTAGTA CCACTCTGAC TTTGGAAAAC CTTCTAATCT GATACCATTT 11520 

GGGTAAGGAC CAATTGTACT ACCTTTAGAT GGAAACGGGA TATATTGCCA GCCGACAACC 11580 

ATCTCTCCAG ATAGAGAATC AAAATAATAG TACTTACCAT CAATCACTCG CCAGTAGGTT 11640 

TCTTTGAGGT CCCCCTTTTT GTAGTAGGTT CTTCCGTTTT CTTGGACAAA CTGCCATCCT 11700 

TCAGAATCAT CTGCAAATAC TGTACTGGTC CCTAGCAAAC CAAAGAAAAA TACT GTC AGT 117 60 

CCAACTTGCA TAGTTTTTTT CAAAATTTTC ATCTATATAC CCTCCAATAT TAAATCCACT 11820 

CACCAGATGA GGCGAAATTA TAAACTTTAC CATCGATAGT TTGGCTACCT GTAACCATTG 11880 
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AAATTAAAAA 


AATAGAGGAA 


CATAAATATG 


ATTACAAAAC 


AGAATGTAAT 


AGTGTTCTAC 


8400 


AATTTTT ACT 


AGATAAAACT 


GTAAATTCTG 


AAGGAAGGAT 


CACTTCTTCA 


AC AGAATTTG 


8460 


GAAATTTCGT 


AAGTAATTTA 


TCATTCCAAC 


ACGGAATACC 


TGGACTACTG 


TTTCCTCTAA 


8520 


ATAAATTCTA 


CCCCCCAGAA 


CTGGATTCTA 


AAATACTCTC 


TATCATCAAG 


AAGCCAGTGA 


8580 


CAATTAGAAC 


GACACACACA 


TATGAATATC 


AATACTCACT 


GCTATTTGGT 


GATGCAGGCT 


3640 


ATCTATGGTT 


ACTCCTACAT 


TTATTTTCTA 


TCAGTAAAAA 


TCAATACTAT 


CTACAATTAG 


8700 


C AAACGTC AC 


CGCTAAAAAA 


TTAATAGAGA 


ATTATGATAC 


TCTAGAGGAA 


AT AG AC TTl'G 


8760 


CATTGGGAAA 


ATCTGGTGTC 


CTATTATCAT 


TAATAAAATA 


CTATCAATTT 


ACC AAT G AC A 


8820 


ATACTCTTAA 


AATTTTCATC 


CACAATAGTA 


TAGGGGAAAT 


TTATCATTAT 


TTCCTACAAA 


8880 


G AG AT AC AG C 


CAAAGAAAGC 


ATTTTAGACT 


ATAGCTTTGC 


TCATGGATAT 


TGTGGAATTG 


8940 


CATATGCTTT 


ATTTGCCTAT 


TCTAAAGTCT 


TAGAACCTTC 


TATGTTTTAT 


AATGATCTCC 


9000 


ATACATTCCA 


TACTGAATTA 


AAAAAATTAT 


TAGAAAAAGT 


TACTTCTAAT 


ACTGAAAATT 


9060 


TAGGAAATTT 


ACAACTTTCT 


TGGTGCAAAG 


GAATTTCCGG 


AAT AATCTT A 


TATCTTTGTA 


9120 


TGTACGATTG 


TGACGGAAAC 


AAAGATATTA 


TTAGTAAATA 


TCAAGAATTT 


GTTTTTAACC 


9180 


ATCATCTAAA 


AATGATGACA 


GGATATTGCC 


ACGGAATAAC 


TAGCTTACTA 


CAAACCACTG 


9240 


TCTACAATCA 


AAACAAATTA 


CTGATGAAAA 


AAATCCAACA 


GGTAATTTTA 


GCATGTTCTG 


9300 


AACGAGATGA 


TCACGGTTTA 


CTGATGTTTC 


AAGGAGATAG 


TGGTAAAGCA 


GATTTGTTTG 


9360 


ACTTCGGAAT 


AGGAAGCATG 


GGGTATATTG 


CTGTCTATTA 


AATAATAAAT 


TCCCATTCGA 


9420 


TGTGCAGACA 


TAAGGAGAAA 


AGTATGAAAT 


TATTTTGGAC 


AAACAACATA 


TATAGACAGT 


9480 


TGCTGCTAAA 


CAGCTGTTTT 


TCATCATTCG 


GCGACAGTAT 


TTTCTACCTC 


GCCATTATCA 


9540 


ATTATGTGGC 


TCAGTACAAT 


TTCGCTCCGC 


TAGCGATTTT 


ACTGATTTCC 


ATTTCAGAGA 


9600 


i V7VJ 1 1 UV-wU 1 




CTCTTTC TC G 


OGA l I CI AGG 


AG ATTTT C AA 


GAAAATAGAG 


9660 


TCAAACACGC 


ACTCTGGATT 


GCCAAAATCA 


AAATCCTGCT 


CTACGCTATT 


TTGACAGTAT 


9720 


TTCTCGTCTT 


GTCGCCCTTT 


TCATTAGTTT 


CAGTCATTAT 


GATTGTCATC 


ATCAACCTCA 


9780 


TCTCTGACAC 


CTTGAGCTAC 


CTGTCTGCCT 


ACATGATGAA 


CGCCCTCTAC 


ATCAGTGTAA 


9840 


TTAAGGACGA 


CCTGCATGAT 


GCCATGGGGT 


TCAGGCAGTC 


TCTGATGAGG 


GTTGTCCGTA 


9900 


TTGTCGCCAA 


TCTGGCTGGC 


GCATTCCTTA 


TCAATGTTAT 


AAGTATTCAA 


ACTATTTCCC 


9960 


TTATCAACAC 


TCTGACTTTT 


GTCATTGCCT 


TTTTGGGCCT 


GTATGTTATT 


CGACATACCT 


10020 


TGTATGAGGT 


TGAAAAAAGA 


ATTGAAATGT 


CACATACAGC 


ACTGAGTTTT 


AAGAAATATT 


10080 
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CCCAATATTT GATAATAGCA ATATTTGCCT AGGAACGTAC TGTTACAGCC TCTCTATC CA 



6600 
6660 
6720 
6780 
6840 
6900 



7080 
7140 
7200 
7260 



TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCT AG T TCTTGTGCGT CTCTCAAATT 
GGTTACCAAC GCTATGATAC AACCTCCTAG CCCACCACCG CTC A TCTTGG CACCCAGAGC 
ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 
TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 
AATCGCAACT TCTCCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGGCCATC 

CTTCCCCTTA TTTTGAACCA CTTGGATGGC TTCACGACTA TGACCATAAA CACCCGTATC 6960 

GGCAATCACC AAATAGGCGG AT AAATC CAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7020 
AAAGCGAATA CCTTGCTCAC TAAGACAGGT CTTAGCATCC AAACCACTAG GATTCATATG 
GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 
ATAGTAGTCA AATACTGCAC GAATGGCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 
CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 

ATACTCCAGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 7320 

ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGAGGCA GGGAAATGCC 7380 

AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 7440 

GACACCAACT TTTTTTGTCA T"TTTTTCCTT TTACTAGACG AAAAAACGTC TTATTTTTCA 7500 

TACAAGTATT AATTCTTTCC T AT CT ATTTT ATTATATTTT CACAAAAAAA GCGATTGTTT 7S60 

CCATTCACAA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7 620 

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTAGT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 7740 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATAGGGTT C 7 8 60 

AATCTTCGCC CATTCTTGAT CGTTT AAATC AGTATC A TAT GCTTTGCG T A TCATAACTCT 7920 

AGCTTAACAT TTTTTTGTGA ATACAGGTTC TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980 

GGAAAACGCC TTATGAACTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TACATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATTATGTTGT TGTGTTATAG ATTGATTGAA 8220 

TCTAACTAAA GGATCCTATT CAATTACTAG AACTATCACA TACTCAAGGT CAGCTCACAG 8280 

AT GAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAC TCAAACAAAT AATTTAGTCA 8 340 
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AACGGTCTGC 


AAATCTTGAC 


TGGCTTCTTT 


CAACTGTCTA 


AGCAAAGGCG 


TGTAAATATC 


4860 


TGTACTCAAG CCTTCTAAAA 


GCTTGCTGGC 


TACTTCTACT 


TGATCGATAA 


TCTTTTCTGA 


4920 


TTTCCCCTGT 


TCCAAGGCTT 


CTACCAGAGA 


AGTCACCGTT 


TCTTTTGAGG 


AAGTTAAAAA 


4980 


ATTTTGATTG 


ATATTTTGCT 


TGATTTGCTG 


GACCATGTGA 


CTCGATACAG 


CCACTTCCTT 


5040 


GGTCCATCCC 


ACTAAGAAAT 


CACATTCTAA 


AGTTGGTTTC 


ACTTGTGAAA 


TTGAAAAGCC 


5100 


CCAATCACGC 


TCCAGAACTG 


TCGCCAAGTT 


TTCTTCTTCT 


AACCAAGCAG 


CCACCTTCTG 


5160 


GCGATCAAAT 


GACTGGTAGA 


GAACCAAATC 


CTCTGCCACA 


AT AC AGGC AA 


GGTCGCCCAT 


5220 


GGAACCATTG 


TCTCCTCGCT 


TAAGCAAGAC 


AGCGCTAGTC 


AGCTTGAACA 


AGAGCTCCTG 


5280 


ATCAACAGAA 


ACATCATACA 


GAG CC AG T AA 


AGCCTTGACA 


ACCAAGACAA 


CGACGCTGCC 


5340 


ACTAGAACCT 


AGACCAAACT 


TTTTCCCTTC 


TCGTTCCATT 


TTGCCACAGA 


TTTCTAGAGA 


5400 


AAAAGGTCTT 


AAATTCTGAC 


CACGAACAGC 


GAGGAAGTCT 


CCCATCAAAG 


CAATCGTTTC 


5460 


TTGAATCAAG 


CTATAGTCAG 


G ATT AGG C CT 


TAAGTCCACT 


GCGAAATCAA 


ACATATCTGA 


5520 


ATAGATACGG 


TAGCTGTCAG 


AAAAAGCAAT 


CTCAGCCCTC 


ATATAGATGG 


GAATATCCTT 


5580 


TATCAAAGCT 


AACTGCCCTG 


GCTCTAAAAT 


AGCATATTCA 


CCTGCCCAAT 


AGAGTTTTCC 


5640 


GCAAGTTTTA 


ACAGCAATCA 


TCTTGACTCA 


AATCCTTTGT 


TTTTGACACA 


ATCAAGCGAT 


5700 


AACGATGACC 


GAAAATTTCT 


GATAAATGCT 


CCAAGTCTTT 


CTCCTGACAG 


AAG AC CTT AA 


5760 


CATTGGGACC 


AGCATCCATG 


GTAAAGTAGC 


AGGCCTCTCC 


TTTCT C ACG A 


AGCTGGCGAA 


5820 


CAAAGGCCAT 


AGCCTCATAA 


GAGGCATCCG 


TCAGATAAGA 


AAAGGCTGGA 


CTAGCAGTCT 


5880 


TTGTCGTAGC 


ATGCAT AG CC 


AGGGCATTTT 


TCTCCCTTAA 


TTCTCCAATC 


TTGGCAAAAT 


5940 


CATTTTCCTT 


GAGATAAATC 


AGCATATCCT 


G ATAGTC CTT 


CTCAGACTGA 


CGAACCCAGT 


6000 


CGTCGAAAGT 


CGTCGAGGTT 


TC CAC AC AAA 


GTTTCATCCC 


GTCACGGCTA 


GAGATTGGTT 


6060 


TTTTCTTGTC 


CTCTAGCACC 


AACATAATCA 


TAGCTAGTTT 


CAAGTCTGTf* 




C 1 irt 


AAATTTCTCC 


ACTATCCTTA 


TCCCAGGCTC 


CTAGTGGTCC 


ATAAAAACTC 


CGAGAAGAAG 


6180 


AACCTGAGGC 


AAATTTGGCT 


TCCTGTGCCA 


ACTGACTTCT 


ATCCAATCCA 


AGCTTGAAAT 


6240 


AAGCATTACA 


AGCCTTGACC 


AGGGCGGACA 


AACCACTAGA 


ACTTGAGGAC 


AGACCCGCTG 


6300 


CCGTAGGCAT 


ATTGTTTTGA 


GTATCGATAC 


GGACAAAGCC 


CTCACCAGCT 


GGACGATAAC 


6360 


GGTCAATAAT 


CTT ACT CATC 


TTGGCATGCT 


CGACCTCATT 


TTGTAGCTGA 


CCATTGATGT 


6420 


AAAATTCGTC 


AGCTGTTACA 


TTGGCTGGTA 


AAGGCGACAA 


GGTCG TCTCT 


GTATACATAT 


6480 


TTTCCAAAGT 


TAGAGAAATA 


CTGCTAGTAG 


CAGGCACCAT 


CTC'TTTTTCT 


TTTTTCTTTC 


6540 
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GCTTGGGCTC GTTCAAAAAA ATCAAATCCC CATAGAGACA GTTAACGCTG AGACTGACTT 3 060 

CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT CTTTCTGAGG ATAATGACAT 3120 

TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT GTCCTTGCCC ATGAAGCGAA 3180 

AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA AAAATCATGA AGATTTCCAA 32 40 

ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC AAAAACCAAA TTGGTCTGCT 3300 

CTTTTTCCTG CTTCATCATC GGGTAAAGAA GAAAGAGGCT ATAGATAACC GC AACAAAAA 3360 

TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA GAAGAGAATG GTTGCCGCTA 3420 

CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG AATCAAAAGC AAAAAGAGGA 3480 

ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT CAAAATCAGA GCTCCTGTCA 3540 

GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT CATAGGTTCA TCCTCTCCCT 3 600 

TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG GATGGAAAAA AGGCGCTGGT 3 660 

TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC CATAAAGAAG AT AGT CT ACT 3720 

TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA TAATCAAGCG TAGATCTGCT 3780 

TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT AGGTTTCAAC CAATTCCAGA 3 840 

ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA AGCACTTAAT CATATCCAGC 3 900 

GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT CTTTCCATTC TTGGGCATTG 3960 

AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT AATCACGCTG GCCACTACGA 4020 

CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG ATAGGTCCAC TGTACGAACA 4080 

CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC CAAAGCCCAC TTCCTTGAGG 4140 

ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT GCGATTGCCA GCTTCTAAAC 4200 

TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT TGACATGCAC TTGCAATAGA 4 260 

ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA CAGGCTTGTC CAATCCAATA 4 3 20 

TTGGTTCCAA GGAGGAGATT GGGATGACTA GACTTGACAG AAAAAGAATC ATCCGTTGGA 4 380 

TTTTTGACGG CTGCGCTATA AGAACCCGTT ACAAATAAAA TACCACACCA TTCCGCCACC 4440 

TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC CACCAGTCAT GGCATTGATA 4500 

TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG AAAGATCGAT TTCATCCAGA 4560 

TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT CAAAGCTATT AT AGGAACTT . 4620 

TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT TTGTCGTCAT GTCCTATCCT 4 680 

TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA TTTTTTAAGG TTTTGGTTGA 4740 

TTGCG CATCA AAACTCAGGG CGATGCCACA GTCACCACCA CCAGCACCAC TACTCTTGGC 4 800 
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GCCCCACCAA ATGCTGCTGA AAGG CAT AG A CAGCCGCCTG GGTACGATCG CTCACTTCAA 13 20 

GTTTGGCAAG AATATTGGAC ACGTGGGTCT TGACCGTCTT GAGAGAGATA AAG AGGT CAT 1380 
CTGCGATGCG CTGATTTTCG TAGCCCTTGG CGATGAG TTG GAGAACATCT CGCTCACGCG 14 40 

CAGTCAATTC TTCATGAAGT TCCATATGAT TGCGGTCGTA TTCAACCTTC TTGCTAACCT 1500 
CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 1560 
CACTAGAAGT CTTGAGCATA TAGCCTTTGG CACCAGCATC TAAGACTGGC ATGATTTTTT 1620 
CATTGTCCAA AT AAG AGGT C ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 1680 
CTAAGGTCGC GTCAATCCCA TTCATCTCAG GCATGACAAT ATCCATGACA ATGACATCTG 1740 
GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 
CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT I860 
CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 1920 
ATATCAACCG CCAGCCCTTG CTTGGGAGCT GTCAAGAGTT GAACTGTTCC AGCCATATCT 1980 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2 040 

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG AT AG AGGT AG 2100 

ACATCTAGGC AAGATGC CTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 2220 

CTAACATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAACCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 234 0 

GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT GACTGATACC CGATAAAATC ATGTGGGCCG CAAACAACTC CTGACTGACT 24 60 

GTATCGTGCA AATCCCGAGC AATTCGCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2 520 

GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 2 580 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 264 0 

AATAAACGCT TGAGATTAGC CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAG AC AGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2820 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 2880 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGTCAAGAAA ATCTTGACAC 2940 

TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 3000 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA AC CTGTCGTG 

CC AGCTG CAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCCCTC 

TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 

(2) INFORMATION FOR SEQ ID NO: 146: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11887 base pairs 
<B> TYPE : nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCCATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 24 0 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360 

CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCACCTTC TTTAAATACC 420 

CTC CATTATC ACAT AAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 480 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTTATTGAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 660 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 72 0 

T ATT ATCTG A TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 7 80 

GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTG AC G TG GTTTGAAGAG 840 

ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900 

TACT ACT C G A TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAG CTTAGGC AAATCGGTTT CCTTTAGATA AG ACTG CATT GCTTTCAAAT AG TT AGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT C CCAATCGAT 1080 

AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCCCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 12 60 
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CCCAGTCGCG CTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAAC ATT A CTTGACTGAT 8880 

GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 894 0 

CTTGCAATTC ATC GTCTGG A TAGGCAACGT ATTCG TCTGT ATTGTAAATA TCCCACAGAT 9000 

GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 9060 

AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAACCAG 9120 

GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 9180 

TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 9240 

GCCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 93 00 

TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 9 3 60 

TTTGAGCTGC TGTCTGCATT CCATACTTCC C ATT AG AC AT GT AG AC CTT A TTTATATAGT 9420 

AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT G TTCT AACTG AATCGCTAAC CAAGCTTCCT 9480 

G AGCCTT AC G AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 9540 

ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTCCAGATTC CGCAAGAAAG 9600 

CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 9660 

AAACGATTGC CTTAACCAAA TCTGTGGGAA TAT C ATT AG C TTGGGCAT TG ACGCGGCGTT 9720 

CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 9780 

CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GT A GT AG AAA AAAACTCCTC 9840 

CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 9 900 

GGCGCAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 9960 

ATTGAGATAA GG AATTTGAG GGAAGG CACC AGC CTTGATT TCATATCCAT ATTCTCGAAT 10020 

ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 10080 

TGCCGGCAAT AAGTAGGTTT CTTCCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 10140 

TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 10200 

AATCGCACGT l i rlvm-iC TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 102 60 

GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 10320 

TCG TTGTGGA T AGTC CACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 10380 

CAAGCCCTGA GACAAATAGT AGTCGTTGGT AGCATTGATC ATCTTTTCAA AGGCTACCGA 10440 

GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 10500 

TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 10560 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 
ATC AAGG CT A AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140 
TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 
TTGTCAGGCT ATGAAGC AG C TTTGGTTAAG GCTCAGAAAT TAACACACAA AGACAATCAA 72 60 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7 320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 
CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 
GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7 5 00 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7 560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 
AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680 
GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740 
CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800 
GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860 
TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920 
TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 
TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 
TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8X00 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG G TCATCAT AT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 87 60 " 

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 8820 
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ACCCTTCCAA 


GCGATTGGGC 


AAAGAGCAAG 


GTTTACCTTT 


ACAAGCTAAC 


TGACCAAGGT 


5340 


AAGACAGAAG 


ACCAAGAACT 


AACTGTAAAA 


GATGGTAAAA 


TTACCCTAGA 


TCTTCTAGCA 


5400 


AATCAACCAT 


ACGTTCTCTA 


TCGTTCGAAA 


CAAACTAATC 


CTGAAATGTC 


ATGGAGTGAA 


5460 


GGCATGCACA 


TCTATGAC C A 


AGGATTTAAT 


AGCGGTACCT 


TGAAACATTG 


GACCATTTCA 


5520 


GGCGATGCTT 


CTAAGGCAGA 


AATTGTCAAG 


TCTCAAGGGG 


CAAACGATAT 


GCTTCGTATT 


5580 


CAAGGAAACA 


AAGAAAAAGT 


TAGTCTCACT 


CAGAAATTAA 


CTGGCTTGAA 


ACCAAATACC 


5640 


AAGT ATGCCG 


TTTATGTTCG 


TGT AG AT AAC 


CGTAGTAATG 


CCAAGGCAAG 


TATCACTGTG 


5700 


AATACTGGTG 


AAAAAGAAGT 


GACTACTTAT 


ACCAATAAGT 


CTCTCGCGCT 


CAACTATGTT 


S760 


AAGGCCTACG 


CCCACAATAC 


ACGTCGTGAC 


AATGCTACAG 


TTGACGATAC 


AAGTTACTTC 


5820 


CAAAACATGT 


ACGCCTTCTT 


TACAACTGGA 


GCGGACGTCT 


CAAATGTTAC 


TCTGACATTG 


5880 


AGTCGTGAAG 


CTGGTGATCA 


AGCAACTTAC 


TTTGATCAAA 


TTCGTACCTT 


TGAAAACAAT 


5940 


TCAAGCATGT 


ACGGAGACAA 


GCATGATACA 


GGTAAAGGCA 


CCTTCAAGCA 


AGACTTTGAA 


6000 


AATGTT G CTC 


AGGGTATCTT 


CCCATTTGTA 


GTGGGTGGTG 


TCGAAGGTGT 


TGAAGATAAC 


6060 


CGCACTCACT 


TGTCTGAAAA 


ACACAATCCA 


TATACACAAC 


GTGGTTGGAA 


TGGTAAGAAA 


6120 


GTCGATGATG 


TTATCGAAGG 


AAATTGGTCA 


CTCAAGACAA 


ATGGACTAGT 


GAGCCGTCCT 


6180 


AACTTGGTTT 


ACC AAAC CAT 


CCCACAAAAC 


TTCCGTTTTC 


AAGCAGGTAA 


GACCTACCGT 


6240 


GTAACCTTTG 


AATACGAAGC 


AGGATCAGAC 


AATACCTATG 


CTTTTGTACT 


CGGTAAGGGA 


6300 


GAATTCCAGT 


CAGGTCGTCG 


TGGTACTCAA 


GCAAGCAACT 


TGGAAATGCA 


TGAATTGCCA 


6360 


AATACTTGGA 


CAGATTCTAA 


GAAAGCCAAG 


AAGGCAACCT 


TCCTTGTGAC 


AGGTGCAGAA 


6420 


ACAGGCGATA 


CTTGGGTAGG 


TATCTACTCA 


ACTGGAAATG 


CAAGTAATAC 


TCGTGGTGAT 


6480 


TCTGGTGGAA 


ATGCCAACTT 


CCGTGGTTAT 


AACGACTTCA 


TGATGGATAA 


TCTTCAAATC 


6540 


GAAGAAATTA 


CCCTAACAGG 


TAAGATGTTG 


ACAGAAAATG 


CTCTGAAGAA 


CTACTTG CO A 


6600 


ACGGTTGCCA 


TGACTAACTA 


CACCAAAGAG 


TCTATGGATG 


CTTTGAAAGA 


GGCGGTCTTT 


6660 


AACCTCAGTC 


AGGCCGATGA 


TGATATCAGT 


GTGGAAGAAG 


CGCGTGCAGA 


GATTGCCAAG 


6720 


ATTGAAGCTT 


TGAAGAATGC 


TTTGGTTCAG 


AAGAAGACGG 


CTTTGGTAGC 


AGATGAC TTT 


6780 


GCAAGTCTTA 


CAGCTCCTGC 


TCAGGCTCAA 


GAAGGTCTTG 


CAAATGCCTT 


TGATGGCAAT 


6840 


GTGTCTAGTC 


TATGGCATAC 


ATCTTGGAAT 


GGTGGAGATG 


TAGGCAAGCC 


TGCAACTATG 


6900 


GTCTTGAAAG 


AACCAACTGA 


AATCACAGGA 


CTTCGCTATG 


TTCCGCGTGG 


ATCAGGTTCA 


6960 


AATGGTAACT 


TGCGAGATGT 


GAAACTTGTT 


GTGACAGATG 


AGTCTGGCAA 


GGAGCATACC 


7020 
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AAGCTTCGCG ATGATGCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3 54 0 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 
C AAAAG ATT G ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3 660 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG CGGCAACCAT GTCAAACAAT 37 20 

ACGCATGTCA GCGGAGATGA TCATATCGAT GTAACCAATC CAATGAAGGA TTTGGCTAAG 3 780 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 3 840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGCACTCCTT TGACAGCTTA TAAAGAAACA 3900 
GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 39 60 

AAGGGCATTG TTTTCCCAGA AT AC AC GAAG GAACTTCCAA GTGCTAAGGT TGTTATCACT 4 020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG GTGCCATTGC TTATCGTAGC 4 080 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 4 2 60 

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCCAA 4 320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTAGG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATC AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGCCTAGATC AAGGTATCAA CATTGATGCT 4 500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4 560 

GACGGTCTCG ACTTTATCT A TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4 620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4 680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4 74 0 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGC C ATCA CCCGCTTTAT CCGTAACCAC 4 800 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4 860 

CTAGGTGGCT ACAGCATGAA AG ACTTTG AA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4 920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4 980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGCTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC AGGCCGGTGC AACAACTTGG 52 80 
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CAGATAGTCC TTTGATTATT CCAACTACAT CTCGTACAGG 


: GCGTCCAATC 


1 TTGTCTAAGA 


1800 


TGGTACCATT 


' TACAATACCA 


l TTTGCATTGT 


' CAGGAAATAA 


AGGTACAAGT 


GAACCAGTCT 


1860 


TGTATAAATA 


■ CTTGGAACTT 


CAAGACAAGG 


CAGTCACTGT 


AGATGAATAC 


CAAAAAGCTC 


1920 


AGGAAAAATG 


GATGAAAGAA 


AAAGAAGAGT 


CTAATAAAAA 


GGCTCAAGAA 


GATCTCGCAA 


1980 


AACATGTGAA 


ATAACTGTTG 


CAAAATATAA 


GAAACCATTT 


AGTATTTCCC 


TTGAATGCTG 


2040 


AATCCTTTTT 


TACATTTGTA 


AAGAAAGATT 


CTAAAATGTA 


CGGACCCCCA 


AAAGTTGGAG 


2100 


CCTCTTTTTG 


TCAGAATAGA 


GAAAATTTTT 


GTTAATTTTA 


CTTGTTTCCT 


ATTGCTTTCT 


2160 


CAGCTATTAT 


TTGTTATATT 


AAAAGTATAA 


TTATTTTTTA 


TTTATCACAG 


TTAAGCATTG 


2220 


CACTTTCAGA 


GGAAGGAGTA 


TTTTTTAAAA 


AGAAAATGTA 


AACGTTTGCT 


CAAAAATGAA 


2280 


AGGATTTAGA 


AGTTTATGAA 


TAAAGGATTA 


TTTGAAAAAC 


GTTGTAAATA 


TAGTATTCGG 


2340 


AAATTTTCAT 


TAGGTGTTGC 


TTCTGTTATG 


ATTGGAGCTG 


CATTCTTTGG 


GACAAGTCCG 


2400 


GTTCTTGCAG 


ATAGCGTGCA 


GTCTGGTTCC 


ACGGCGAACT 


TACCAGCTGA 


TTTAGCTACT 


2460 


GCTCTTGCAA 


CAGCAAAAGA 


GAATGATGGG 


CGTGATTTTG 


AAGCGCCTAA 


GGTGGGAGAA 


2520 


GACCAAGGTT 


CTCCAGAAGT 


TACAGATGGA 


CCTAAGACAG 


AAGAAGAACT 


ATTAGCACTT 


2580 


GAAAAAGAAA 


AACCGGCTGA 


AGAAAAACCA 


AAAGAGGATA 


AACCTGCAGC 


TGCTAAACCT 


2640 


GAAACACCTA 


AGACGGTAAC 


CCCTGAATGG 


CAAACGGTAG 


CGAATAAAGA 


GCAACAGGGA 


2700 


ACAGTCACTA 


TCCGAGAAGA 


AAAAGGTGTC 


CGCTACAACC 


AACT ATCCT C 


AACTGCTCAA 


2760 


AATGATAACG 


CACGCAAACC 


AGCCCTGTTT 


GAAAAGAAGG 


GCTTGACCGT 


TGATGCCAAT 


2820 


GGAAATGCAA 


CTGTTGATTT 


AACCTTCAAA 


GATGATTCTG 


AAAAGGGCAA 


ATCACGCTTT 


2880 


GGTGTCTTTT 


TGAAATTTAA 


AGATACCAAG 


AATAATGTTT 


TTGTCGGTTA 


TGACAAGGAT 


2940 


GGCTGGTTCT 


GGGAGTATAA 


ATCTCCAACA 


ACTAGCACTT 


GGTATAGAGG 


TAGTCGTGTT 


3000 


GCTGCTC CTG 


AAACAGGATC 


AACAAACCGT 


CTCTCTAT C A 


CTCTCAAGTC 


AGACGGTCAG 


3060 


CTAAATGCCA 


GCAATAATGA 


TGTCAATCTC 


TTTGACACAG 


TGACTCTACC 


AGCTGCGGTC 


3120 


AATGACCATC 


TTAAAAATGA 


GAAGAAGATT 


CTTCTCAAGG 


CGGGCTCTTA 


TGACGATGAG 


3180 


CGAACAGTTG 


TTAGCGTTAA 


AACGGATAAC 


CAAGAGGGGG 


TAAAAACAGA 


GGATACCCCT 


3240 


GCTGAAAAAG 


AAACAGGTCC 


TGAAGTTGAT 


GAT AG CAAGG 


TGACTTATGA 


CAC GATTCAG 


3300 


TCTAAGGTCC 


TCAAAGCAGT 


GATTGACCAA 


GCCTTCCCTC 


GTGTCAAGGA 


AT AC AGCTTG 


3360 


AACGGGCATA 


CTTTGCCAGG 


ACAGGTGCAA 


CAGTTCAACC 


AAGTCTTTAT 


CAATAACCAC 


3420 


CGAATCACCC 


CTGAAGTCAC 


TTATAAGAAA 


ATCAATGAGA 


CAACAGCAGA 


GTACTTGATG 


3480 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5: 

CCGGAGAAAA TGATGAAAAG TTCAAAACTA TTTCCCCTTG CCGCCCTGAC ATTATTGGCG 60 

GCGACTACTT TAGCTGCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 120 

T CAT ACATTT ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 180 

ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AXSTGATCGC TACGGGAACT 2 40 

TTGTGCCGT C TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 300 

CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 3 60 

CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 420 

TTGTTCAAGA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 480 

CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 540 

AAAG CTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT - 600 

TGAATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 6 60 

GTCCTTATTT GTTGAAATCC ATTGTGACCA AATC CTCTGT TGAATTTGCG AAAAATCCGA 720 

ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC T GGG ATGGTC - 7 80 

AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA GCTCGTCTCT 84 0 

ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGTCTATA 900 

CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 960 

ACACATCTAA G AC CAGCGAC GAACAAAAGG CATC G ACT AA AAACGCTCTC TTAAACAAGG 1020 

ATTT CCGTC A GGCTATTGCC TTTGGATTTG AC CGTAC AGC CTATGCCTCT CAGTTGAATG 1080 

GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 1140 

CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 1200 

GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCAGAA AAAGCCAAGG 12 60 

CTGAATTTGC T AAAG CT AAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 1320 

TGGATATGCC AGT TGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 1380 

AATCCTTGGA AGCAACTTTA GGAGCTGAT A ATGTCATTAT TGATATTCAA CAACTACAAA 14 40 

AAGACGAAGT AAACAATATT ACATATTTTG CTGAAAATGC TGCTGGCGAA GACTGGGATT 1500 

TATCAGATAA TGTCGGTTGG GGT CCAG ACT TTGCCGATCC ATCAACCTAC CTTGAT ATT A , 1560 

TCAAACCTTC TGTAGGAGAA AGTACTAAAA CAT ATTT AGG GTTTGACTCA GGGGAAGATA , 1620 
ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG , 1680 

ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 174 0 
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TCTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATC C AGT AG CTGTATCTCC 1860 
ACTCGCTAAG AAAAATCCTG AAGACCAACG CTTTACTGAC CGTTTCG AG C TCTTTATCAT 1920 
GACTAAGGAG TACGGTAATG CCTTTACTGA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980 
TTTTGAAGCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG GAATCGACTA 2040 
TGACTACATT G AAG CTCTTG AATACGGTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100 
CGACCGTCTC TGCATGCTCC TCACTGATAC AACAACT AT C CCTCATCTAT TGCTCTTCCC 2160 
AACAATGAAA T AAAT TCTT A TCCTCTGGGT CTTATCAGAG GATTTTTTGA TTCAAAAAGA 2 2 20 

GACTGAATTT AAGGAGAAAA TGAAGTGTAG T AT ATTG AAA TTG AAATAG T ACACTTTGAT 2 2 80 

TTCTAAGACA TTGTTAGAAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2 3 40 

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGT ACT A GAAATTGACT TGGATTCCCC 2400 

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA AGTGGGATAG GAAGTTAGCG 2 4 60 

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2 52 0 

CTATTAGAGG ATGTT CTGGT GTCTTATTCA CTTGTTTTTT ATAGTATTAG TAGATAGAAT 2580 

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT TACAAAACCC 264 0 

CTGACCTCAT GAGCCACTTT CTTCCTCCTC ATGAGGTCAG TTTTACTTTC TGCTGTTCCA 2700 

GTATCGTTTT TC CTCGCT AG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2760 

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTGG 2820 

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC CTAGAATAAA GTGCTGAAAA 2 880 

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTGTTCGAAC 2 94 0 

ACATTTTCCC ACCACGTGAA GAAAAAGATG GCGGAAGCGT TT G ATTGTT A AAGTTTGGAA 3 000 

GTCACCTCCA GCT AG ATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3060 

CATACGAACT TCGTTTTTGA TTAAGGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120 

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAAGCT GAAACTGGTC 3180 

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 323 2 
(2) INFORMATION FOR SEQ ID NO : 145: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double - 

( D) TOPOLOGY: linear 
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CAGGGGCGTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 60 

TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 120 

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGGTAGAT ATTGTTCAAC 180 

AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA AT AC AG TT AT TGTAGCACTT 24 0 

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TATCATAGAC 3 00 

ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 3 60 

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 4 20 

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 4 80 

CAACAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 54 0 

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTAGTT TTTAAAGATT ATCCGTGAAT 600 

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660 

TGAGTAAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 720 

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 780 

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 84 0 

AGAACAATTA CACGATAAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900 

TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC G AAGGCC AG A TTCAGATCTA 960 

CGTTCGTAAG GATGCTGTCG GTGAAGAAAA CTACGAAATC TTCAAAAAAG CAGACCTTGG 1020 

TGACTTCCTT GGTGTCGAAG GTGAAGTGAT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080 

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTC CTGAG A AATTCCATGG 1140 

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTT AC CTT GACTTGATTT CTAATCGTGA 1200 

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 12 60 

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 13 2 0 

TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 13 80 

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 144 0 

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500 

AGTTTACCAA GCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 1560 

ACACGCTGCT AAATC AG T CA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620 

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 1680 

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740 

TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 1800 
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CTCAGCAACT 


' CTTTGCAGGA 


. AACATCAAAA 


> ACCTTTCTGT 


1 CAAAGAGCTC 


AAACAAGGAC 


3660 


TTCCTGGTCT 


GCCCAACTAC 


CAAGTACAGG 


CAGACGAAAA 


CAACAATATC 


GTGGAACTGC 


3720 


TCGTCTCATC 


TGGTATACTT 


AACTCAAAAC 


GCCAAGCCCG 


TGAAGACGTC 


CAAAACGGAG 


3780 


CCATCTACGT 


AAACGGCGAC 


CGCATCCAAG 


AGCTTGACTA 


TGTCTTGAGT 


GACGCTGATA 


3840 


AGTTAGAGAA 


TGAACTGACT 


GTTATCCGTC 


GTGGGAAGAA 


AAAATACTTT 


GTATTGACTT 


3900 


ACTAAACTAT 


TCAACATTTA 


TCTATAAACA 


AAGGAGTTAA 


CCTCGAGAAA 


GGTAACTCCT 


3960 


TTTTGCTGTT 


AATAACTCTC 


ATCTATCTAT 


TTTTAATAGA 


CAGGCTACGC 


AGGACAATGC 


4020 


GCAACGTTGT 


TAGATTATGT 


AAGATAGAGA 


GATTTGAAGG 


ACTGAACCAA 


TTAAATAAGC 


4080 


CAAACCCAAT 


CAAACTACTA 


TTTACGACAA 


CGGTATCCTG 


AATATTTTTC 


TTGATGAGTG 


4140 


TT TGCAAAG A 


TGATGATAAC 


GAATCCAACT 


CTTGGAAGAA 


AT CCAAACGA 


TTATCTAACA 


4200 


ATAAGATATC 


ACTCATCTGC 


TTAGAAATAT 


CTGCACTCTC 


ATTCATCACC 


ACACCGATAT 


4260 


CT G AT AGAGT 


TAAAGCCGCT 


GAGTCATTCA 


ATCCATCTCC 


AACCATCAAA 


ATAGTGTGAC 


4320 


CTGCTTTCTG 


CAGTTTCTCT 


ACTAACTCAA 


ATTTCCCATC 


AGGTTTCAAG 


TCTGTATAGA 


4380 


CCTGATCAAA 


GGGCAAATCT 


TTGACTAATT 


CCTCTGTCCT 


AATCAAGGTG 


TCTCCTGTTG 


4440 


CCAGAATCAA 


TTTTTTCCCC 


TGTGCCTTAA 


GTTTATCCAA 


GGCTGTTTTT 


GCTTCTTTTC 


4500 


TCAAAGGAGT 


ATCAATGCAG 


AACATTCCAA 


TCAATTCATT 


TTGATAAGCC 


AAGAATAAGA 


4560 


GATTGTAGTG 


ACTCTTGTAC 


TCTTCAATTA 


AAGCATTTTG 


TTCTGAACTG 


ATATGAATCT 


4620 


GCTCATCCTG 


CATCAAGACA 


TAATTCCCAA 


TAAGAACTGG 


TTGGCCATCT 


ATATGAGATT 


4680 


TGATCCCCTT 


GCTTGCGATA 


TATTGGAGTT 


TCCCATGCAT 


TTCCTCATGT 


TCAATTCCCT 


4740 


CTATCTCAGC 


TTGCTTGACG 


ATGGCATTAG 


CAATAGGATG 


ATAAATGTGT 


TCCTCAAGAC 


4800 


AGGCACTGAT 


TCTGAGAATA 


TCTTCCTCAC 


TATACTCTCC 


AAAAGGTAAC 


ACCTTTTCAA 


4860 


CTATAGGATA 


ACTAGTTGTG 


ATTGTTCCTG 


TCTTATCAAA 


CAAGAAAGTA 


TCAACTTCCA 


4920 


GATATTTCTC 


CCTGTTGTGG 


CCTCTGGCTG 


TCATCTCTGT 


GCTGG 




4965 



(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 
{C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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TTCTGTCCCT 


TATTATTTCG 


GCCAAAGGGA 


GCCACATTGA 


GATAGGTCGT 


TAAAATCTCA 


1860 


TCTTTATTCA 


TGGCGCGTTC 


CAAGGCAAGA 


GCATCCACAA 


TCTCTGCCGC 


CTTACGAGCC 


1920 


AAGGTCGGCG 


CATCCCCAAC 


CACCTGCTGT 


TTAATTAGTT 


GCTGGGTCAA 


GGTTGAACCC 


1980 


CCACTAGAGG 


AACCCAAACC 


TACAAATTTC 


CCCAAGGTCG 


CACGAATCAC 


CGCCTTGGGT 


2040 


ACTACACCCT 


TATGTTCTTT 


AAAGTGTTCA 


TCTTCTGTCG 


CAATGATAGC 


CTTCTTCAGA 


2100 


TTTTCCGAAA 


TTTGCTCAGA 


TGAGATAGAA 


GTGCGCAACA 


AATCACTCTC 


TATGGAAGCA 


2160 


ATCACCGTCC 


CGTCCGAATA GGTAATCTCT 


GAAATAGAAG 


AGATGTCCTT 


GACCTG ATTC 


2220 


ACCAATTCTT 


CTGTCTG AG G 


CACCCGAACC 


TTGTCAAATA 


AGGCCACTCC 


GTATCCCAAA 


2280 


GCAATCCCAG 


CTCCCAACAT 


TCCTCCTAGA 


AAACCGAGTA 


CAAAGAGTAA 


GTTAAATAAG 


2340 


GCTTTTATAC 


TCAGTAAAAT 


AGCTGGGAAA 


ATGACTGACT 


TATCTAAGGT 


TTTAGATTTT 


2400 


TTGGTACTTG 


AACCTTTCTT 


GCCAGGTCTA 


GCTGATTTTT 


TATTTTTTTG 


TTTTTGCTGG 


2460 


AAAAATTCCA 


GCATTTTTCG 


TTTTAATTCA 


TTTAATTGAT 


TTTGCATGGA 


TTT CCTC ACT 


2520 


TTATCTATTA 


TACCACAAAA 


GGGAAATTTT 


CAATAAAATA 


GCCACTTTCT 


TCCCTATTCT 


2580 


GCTAGGCTAT 


TGCCCAAGTT 


TGTGATACAA 


TAGGTAGAAA 


CAATAATTTT 


AAAAAGGAGA 


2640 


AAAAACACAT 


GCACATTTTT 


GATGAGCTAA 


AAGAGCGTCG 


TTTGATATTT 


CAAACGACTG 


2700 


ATGAAGAAGC 


TTTGCGTAAA 


GCCCTAGAAG 


AAGGTCAAGT 


TTCTTATTAT 


ACTGGCTACG 


2760 


ATCCAACTGC 


TGACAGCCTT 


CACCTAGGCC 


ACCTTGTCGC 


AATCTTGACA 


AGTCGTCGCT 


2820 


TGCAACTAGC 


AGGTCACAAA 


CCTTATGCGC 


TCGTTGGCGG 


TGCTACAGGT 


CTCATCGGAG 


2880 


ATCCGTCCTT 


CAAAGATGCT 


GAACGTAGTC 


TCCAAACAAA 


AGACACAGTA 


GATCGCTGGG 


2940 


TCAAGTCTAT 


CCAAGGACAA 


CTTTCTCGTT 


TTCTTGACTT 


TGAAAATGGC 


GAAAACAAGG 


3000 


CTGTCATGGT 


CAACAACTAC 


GACTGGTTTG 


GC AG CATC AG 


CTTCATTGAC 


TTCCTCCGTG 


3060 


ATATTGGAAA 


ATACTTCACG 


GTCAACTACA 


TGATGAGTAA 


GGAATCTGTT 


AAAAAACGGA 


3120 


TCGAAACAGG 


AATTTCTTAC 


ACTGAGTTCG 


CTTACCAAAT 


CATGCAAGGG 


TATGACTTCT 


3180 


TCGTCCTTAA 


CCAAGACCAT 


AATGTCACTC 


TTCAAATCGG 


TGGTTCTGAC 


CAGTGGGGAA 


3240 


ATATGACAGC 


TGGTACCGAA 


TTGCTTCGTC 


GTAAGGCGGA 


CAAGACTGGT 


CACGTTATCA 


3300 


CTGTTCCACT 


AATCACAGAT 


GCAACTGGTA 


AGAAATTTGG 


TAAATCAGAA 


GGAAATGCCG 


3360 


TCTGGCTCAA 


TCCCGAAAAG 


ACTPCTCCAT 


ACGAAATGTA 


CCAATTCTGG 


ATGAACGTGA 


3420 


TGGACGCTGA 


CGCTGTTCGC 


TTCTTGAAAA 


TCTTTACTTT 


CTTGTCACTT 


GATGAGATTG 


3480 


AAGATATTCG 


TAAACAATTT 


GAAGCAGCGC 


CACACGAACG 


CTTGGCTCAA 


AAAGTCTTGG 


3540 


CTCGTGAAGT 


TGTTACACTT 


GTTCACGGAG 


AAGAACCCTA 


CAAAGAAGCA 


CTTAACATCA 


3600 
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GAAGAAGGTC GTGTAGTACT TGAGTTACTG CTATCGCTAG AACTACTACT TGAACTGCTG 120 

GAG CTGGATG GAGTTGGTAG ACTCCCCACA ATACTAGACC AAGCATTCTG ATAATCCGCA 180 

TCACTTCCGC CAATAGCAAA GCGATAACTT GTCGCTGGCG CTCCTGACTT ATTAGCCCAA 240 

TACCTGCTAA CAGTCGAACC TGTGACCTCT ACTTCTTTTC CTTCAACAGA AACCTTCTCT 300 

GGTTTTTGAC CTGTTGATTT CAAGACTTCC GATTTCACTA CACTAGGATC TAAAGCAAAG 3 60 

CGCTCGTTCC CCCAAATGCT TGGGGAAGCT TGCTGAATCG CATTTACCAG ATGAGCCATG 420 

TAATTAGAGT TATTAGAATA ACCTGCTCTA CGTGACAATG AATGATTATC ATCATGCCCA 4 80 

ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGC ATG A GCCACATATT TTCGTCTTGG 54 0 

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAG T AGG ATT TAAAGAAGTC 600 

AGGTTAGACT TGAAGGTTGT TGTCACACGA GAGGATAGAA CTTCTCGTAG CAATCCCTGC 6 60 

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 720 

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTGAT GAT A AACTCCATTA 780 

TT AGCT AAGG TCTGAT AGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 84 0 

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC CATATAACCC 900 

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 9 60 

GAATAGTTCA GAGCTTCTCC CAAGGT CATC ATTCCTGTTC CCTTGCTATT AGCATACATA 102 0 

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080 

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AG CAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140 

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 1200 

CCT AGAAT AG CACCTGTTTG GTTATCCATC AAGACATTCC CTACTTCTAC ACGACCTGTT 12 60 

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATG AATTTT C 13 20 

TGATCTATGG TACTAGTAAT C TTAT AACC A CCATTTTCAA TTTCCTTGGC TGCCAAATCT 13 80 

CGATAAAACT TCTGAGTTGC CTCATTTTTC AACTC CTT AG CGGAGACATT GTCTCTCTGA 1440 

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 1500 

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGGTC ATAATCCTTG 1560 

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 1620 

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 1680 

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATGCTGC TTGAGGAACA 1740 

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 1800 
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CTGCGCAATT TCAGGAGACA ATGACATTAG 


1 AAAGATCTCG 


; ACTCTTTAGT 


CAAAAGTATC 


3720 


CAGAAAATAC ATTGATTGCG ATGGATGGTG 


TGAAGATAGT 


TGGTTTTATA 


AGTTATGGCA 


3780 


ACTGTCGTGA TGAGACTATT CAAGCTGGTG 


AAATTATTGC 


TTTATATGTT 


TTAAAAGACT 


3840 


ATTATGGAAA AGGAATCGCA CAAAAGTTAG 


TGAAAGCAGC 


TTTGACTGAT 


CTTAATCATT 


3900 


TTTCTGAAAT TTTCTTATGG GTATTGAAAG 


ATAACAAGCG 


CGCCATTGCT 


TTCTATCAAA 


3960 


AAATGGGTTT T ACTTT TG AT GGACAAGAAA 


AAATACTTGA 


ACTTGGAAAG 


CCTATAAAGG 


4020 


AAAAACGGAT GGTATTCTAT TCTAAATAAT 


TCTCAAAAGT 


AAAAGCTAAT 


ATGGTACCAA 


4080 


GTCTGAAAAT TTAATAAATT AGAAAGCGAG 


TAAATTTATG 


TCCCGTTCCC 


AATTAACAAT 


4140 


TTTAACAAAT ATCTGTCTGA TTGAAGACCT 


CGAAA CTC AG 


CGCGTGGTGA 


TCCAGTATCG 


4200 


CGCCCCTGAA AACAATCGCT GGTCTGGTTA 


TGCCTTTCCT 


GG AGGT CATC 


TAGAAAATGA 


4260 


TGAGGCTTTT GCGGAGTCTG TCATTCGTGA 


AATCTACGAA 


GAAACAGGGT 


TGACTATCCA 


4320 


AAATCCTCAA CTTGTCGGCA TTAAAAATTG 


GCCACTAGAT 


AC AGGTGGG C 


GCTATATTGT 


4360 


CATTTGTTAT AAGGCGACTG AGTTCTCTGG 


TACCCTTCAA 


TCTTCAGAAG 


AGGGAGAAGT 


4440 


TTCTTGGGTG CAAAAAGACC AGATTCCAAA 


CTTAAATCTG 


GCCTATGATA 


TGCTACCATT 


4500 


GATGGAAATG ATGGAAGCTC CCGACAAGTC 


AGAGTTTTTC 


TACCCTCGCC 


GTACAGAAGA 


4560 


CGATTGGGAA AAGAAAATCT TCTAGTCTTT 


TACTAAATAA 


CCTAGCTGAT 


CCAAGGCCTC 


4620 


CTCGATATAG TGGAGCTCTT GTTCTGTCTC 


GGCTTCAACT 


AGGTGATAAT 


GAATACCATC 


4680 


TGTTAACTCA GAAATTGGCT T AAAGTC AG A 


ACGTTCAACT 


TGTTCTAGAA 


AATGTTGCAC 


4740 


GTCGCGGCGA C AGGT CAGTT TTAGTAAGGT 


TTCAATCTCT 


CCATAAACAG 


GATGATCAAT 


4800 


CAAGATATTT TGAACGCGAC CACCATTATC 


TACGATAGCA 


AGTAATTCTC 


GTCCAATTTC 


4860 


TTCAACTTCA TGCTTGACCT TAAATAATTT 


GTGATGATAA 


GTATTTGCAT 


TAGCATCTTT 


4920 


ATAGATATAA CCACGATTGG TAGATAGAAT 


TGGAGATCCA 


TCAGCTCTTA 


AAATTGCAAT . 


4980 


ATCTTGAACA ATAACTTGTC GAGTGACATG 


AAAGTGCTCA 






5020 


(2) INFORMATION FOR SEQ ID NO: 14 3: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4965 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 



60 
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GTATGATATC 


GAAATGTTGC 


GTGAGATGGG 


CTATACCAAT 


GGGGTTGAAA 


ATTATTCTCG 


1980 


CCACATGGAT 


GGACGCAGCG 


AAGGAGAGCC 


TCCTTATACG 


CTTCTCGACT 


TCTTCCCACA 


2040 


TGATTTCTTG 


ATTATGATTG 


ACGAGAGTCA 


TATGACCATA 


GGGCAAATCA 


AGGGCATGTA 


2L00 


CAATGGAGAC 


CGTTCGCGTA 


AAGAAATGCT 


GGTTAATTAT 


GGTTTCCGTT 


TGCCGTCTGC 


2160 


TTTGGACAAT 


CGTCCTCTCC 


GTCGGGAGGA 


GTTTGAGAGT 


CACCTTCATC 


AGATTGTTTA 


2220 


CGTTTCAGCG 


ACACCTGGTG 


ACTATGAAAA 


TGAACAGACC 


GAGACAGTGA 


TTGAGCAAAT 


2280 


CATTCGTCCA 


ACGGGACTCT 


TGGAT CC AG A 


GGTGGAAGTC 


CGTCCGACTA 


TGGGACAGAT 


2340 


TGATGACCTC 


TTGGGTGAAA 


TCAATGCCCG 


CGTTGAAAAA 


AATGAGCGTA 


CCTTTATCAC 


2400 


AACTTTG AC C 


AAGAAAATGG 


CAGAGGATTT 


GACCGACTAC 


TTCAAGGAAA 


TGGGTATCAA 


2460 


GGTCAAGTAC 


ATGCACTCGG 


ATATCAAGAC 


CTTCGAACGG 


ACGGAGATTA 


TCCGTGACCT 


2520 


GCGCTTGGGT 


GTCTTTGATG 


TCTTGGTCGG 


AATTAACCTG 


CTCCGTGAAG 


GAATTGACGT 


2580 


TCCTGAAGTG 


AGCCTCGTAG 


CTATTCTCGA 


TGCTGACAAG 


GAAGGTTTCC 


TTCGCAACGA 


2640 


ACGTGGACTC 


ATCCAGACCA 


TTGGACGTGC 


TGCACGTAAT 


AGCGAAGGTC 


ATGTTATCAT 


2700 


GTATGCGGAC 


ACGGTTACCC 


AGTCTATGCA 


ACGTGCTATC 


GATGAAACTG 


CCCGCCGTCG 


2760 


CAAAATCCAG 


ATGGCCTATA 


ATGAAGAACA 


TGGTATCGTT 


CCACAAACCA 


TCAAGAAAGA 


2820 


AATCCGTGAC 


TTGATTGCTG 


TGACCAAGGC 


AGTTGCTAAG 


GAAGAAGACA 


AGGAAGTCGA 


2880 


TATCAATAGC 


CTCAACAAAC 


AAGAGCGCAA 


AGAACTAGTC 


AAAAAGCTTG 


AGAAACAAAT 


2940 


GCAAGAAGCA 


GTTGAAGTGC 


TTGACTTTGA 


ACTAGCAGCT 


CAGATTCGTG 


ATATGATGCT 


3000 


GGAAGTCAAG 


GCCTTGGATT 


AGGGGAATAG 


TATGATTTAT 


TTAAGAAAGT 


TAAAGAAAGA 


3060 


AGATTTGATG 


TCTTTATGGG 


AAATGGCTTA 


TTCACAGCTT 


AATCCAGTTT 


GGAAACAGTA 


3120 


TGATGCTCCC 


TATTATGATG 


ATTATCAGTA 


TTTTTCAAAT 


TTTAAAGAAT 


TCGAACTACA 


3180 


AAAATCAGAA 


TCCATTTTAA 


GCAACTCAAA 




» IMf 1 MfU III f W' *'|M|I/^ 

Al 1 1 1 1 vj 1 lvj 
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1 *3 A rt 
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AGTTGGGACT 


GTTTCGCGTT 


ATTGGGTATG 


TAAAGAAACA 


AGATGGATGG 


AATTGGGAAT 


3300 


TGGTATTTAT 


GATAAAAAAT 


TCTGGAACAC 


TGGTATTGGG 


AAAGTTGCTA 


TGTTGCAGTG 


3360 


GATAGATAGG 


ACGTTTCAGG 


ATTACTTGGA 


GTTGGAGCAT 


CTGGGTTTGA 


CAACTTGGTC 


3420 


AGGAAATATT 


GGTATGATGA 


AACTTGCTGA 


AAAATTAAGA 


ATGAAAAAAG 


AAGCTCAT A T 


3480 


TCCAAAAGTT 


CGTTATTATC 


AAGGTAAATA 


TTTTGACAGT 


ATTAAATATG 


GTATTTTGAG 


3540 


AGAAGACTGG 


GAGAAAATAA 


ATGACGGTTA 


TTATCAAATC 


AATGGAAACT 


CCTGAAGAGA 


3600 


TAGAAGGTAA 


ATCCTTCGTT 


CACTGGCAAA 


CGTGGAGAGA 


GGCTTATGAT 


GACCTTTTGC 


3660 



WO 98/18931 



PCT/US97/19588 



964 

ACTTTGTATG GTATTGGCTT GTTAGCTCTC ATCGGACTTG CTAGAAATTT TGGAGAGGCT 180 
GCTCAAAATC TTGCAAG CT A CTTGCAGACC TTGCATCAGA GCTTGACGGA TAAAACAAGT 24 0 

GACTTTCGTT TAATTTTAGG ATT ACT GG C C TTTGGTTATT CTTAACACTG TGTTCAGATG 300 
GACAAGAAAA GTTGAGAAAA GACCTATTCG AACCTTGGGA TTTTATAGAG AG AATTT CCT 3 60 

CAGCAATCTT CTGAAAGGAT TTAGTCTAGG CCTGGCACTT TTTCT T CTGA CCTTGTTAGG 420 
TTTAGTGGTC TTAGGTCAAT ATCGTTTGGA ATCCATTCAC TTGAATCCTT ATTCTCTTGC 4 80 

CTTTGTCGTC TTTACTATCC CATTTTGGAT TTTACAGGGG ACAGCAGAAG AAGTGGTGGC 54 0 

CCGTGCTTGG CTACTTCCTC AATTGGCCTC AAGAACCAAT CTAAAACTAG CT AT TCTT AT 600 
ATCTAGCCTG TTCTTTACCC TGCTTCATAT GGGCAATTCT GGTCTCACCC CTCTATCTCT 6 60 

AGTAAATCTC TTTTTATTCG GAGTTGCCAT GGCTCTTTAC CTTCTCAAAA CTGATACAGT 720 
TTGGGGTGTT GCAGGTATTC ATGGTGCTTG GAATTTTGCT CAGGGTAATC TCTTTGGGAT 780 
TTTAGTTAGT GGTCAACCGT CAGaACGTCT CTGATGACCT TTTTACCACA AGGCAATCAA 840 
GATTGGCTAT CAGGTGGTTC TTTTGGCATA GAAGGTTCCA TTATGACAAG TCTGGTATTA 900 
CTACTGCTGA TTGTCTATCT TGCTAATAAA TTAAAGAAAG AAAATGAAAG GATGTGACTT 960 

CGGTCCGTCC TTTTCTTCGT GAAAATACTA TAAGTATGCT AAAATAGGAA TAGCACATGG 1020 

AGAGAGGATT CTT ATGATC A AT CACATT AC AGATAATCAA TTTAAACTAG TATCAAAATA 1080 

TCAACCATCA GGAGATCAAC CCCAAGCTAT CGAGCAGTTG GTGGATAACA TTGAGGGGGG 1140 

AGAAAAAGCT CAGATTCTGA TGGGGGCGAC TGGAACAGGG AAGACCTATA CTATGAGTCA 1200 

GGTCATTTCT AAAGTCAATA AACCAACTCT GGTTATTGCC CACAATAAAA CTCTGGCTGG 12 60 

TCAGCTCTAT GGGGAGTTTA AGGAATTTTT CCCTGAAAAT GCAGTTGAGT ATTTCGTATC 1320 

CTACTATGAT TATTACCAGC CAGAGGCCTA TGTCCCTTCT AGCGATACCT ATATTGAGAA 1380 

GGATAGTTCT GTCAATGACG AGATTGACAA GCTTCGCCAC TCAGCTACCT CAGCCCTTTT 14 4 0 

GG AG CGTAAT GATGTTATTG TCGTGGCCTC AGTCTCTTGT ATCTATGGTT TGGGTTCGCC 1500 

CAAGGAATAC GCTGATAGTG TCGTTAGTCT CCGTCCTGGT CTAGAGATTT CTCGTGATAA 1560 

ACTCTTGAAT GACTTGGTCG ATATTCAGTT TGAACGTAAT G AT ATTGATT TCCAACGCGG 1620 

AAGATTTCGC GTTCGTGGGG ATGTGGTAGA GATTTTCCCA GCTTCCCGAG ATGAACATGC 1680 

CTTTCGAGTA GAATTTTTTG GAGACGAAAT TGACCGTATT CGTGAAGTTG AGGCTCTGAC 1740 

AGGTCAGGTG TTGGGAGAAG TGGATCATTT AGCGATTTTC CCAGCGACAC ACTTTGTGAC 1800 

CAATGACGAC CACATGGAAG TTGCCATTGC AAAGATTCAG GCCGAGTTGG AAGAACAATT 1860 

AGCTGTCTTT GAAAAGGAAG GTAAACTGCT TGAAG CCCAG CGTTTGAAAC AGCGGACAGA 1920 
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AAAATGTTTC 


CACGCTTCCA 


TCCTACCACT 


TGGAGGAAAG 


AGATTTCCAC 


ATT CCTCAAG 


11640 


TCAAGTGCTA 


TGAGTTAACA 


TTTGAAGGAA 


GCAAGGAAGG 


AAAGGTCTAT 


GCACGCATTC 


11700 


TTCTTCCAAA 


GAGTGAGGAG 


AACGTCCCAT 


TAATCTTCCA 


TTTTCATGGT 


TATATGGGAC 


11760 


GTGGCTGGGA 


CTGGGCCGAC 


ATGCTGGGCT 


TCACCGTAGC 


TGGTTACGGT 


GTTGTTTCCA 


11820 


TGGATGTGCG 


GGGCCAGTCA 


GGTTACTCAC 


AAGACGGCTT 


GCGTTCTCCT 


TTAGGAAATA 


11880 


CCGTGAAGGG 


G CAT ATTAT C 


CGTGGTGCTG 


TGGAAGGTCG 


GGACCACCTC 


TTTTATAAGG 


11940 


ATGTTTATCT 


GGATATTTAC 


CAGTTGGTCG 


AAATTGTTGC 


TAGTCTGTCT 


CAGGTTGATG 


12000 


AGAAGCGTCT 


TTCTAGCTAT 


GGTGCCTCAC 


AAGGAGGGCC 


TCTAGCTCTA 


GTTGCAGCAG 


12060 


CGCTCAATCC 


TCGAATTCAG 


AAAACAGTTG 


CCATTTATCC 


CTTCTTGTCA 


GACTTCAGAC 


12120 


GGGTGATTGA 


GATTGGTAAT 


ACTAGCGAGG 


CTTACGACGA 


ACTTTTCCGT 


TATTTCAAGT 


12180 


TTCACGACCC 


CTTCCATGAA 


ACAGAGGAGG 


AAATCATGGC 


GACCCTTGCC 


TATATCGATG 


12240 


TCAAAAATCT 


TGCCCATCGT 


ATCCAAGGTG 


AGGTTAAGAT 


GATTACGGGC 


TTGGACGACG 


12300 


ATGTTTGCTA 


TCCCATTACC 


CAGTTTGCGA 


TTTATAATCG 


TCTGACCTGC 


GATAAAACCT 


12360 


ATCGCATCAT 


GCCTGAGTAT 


GCTCACGAAG 


CCATGAATGT 


ATTTGTCAAT 


GACCAAGTCT 


12420 


ACAACTGGCT 


CTGTGGAAGT 


GAGATTCCTT 


TTAAATATCT 


AAAATAAGHA 




12 4 80 


AGC ACAAAAT 


CTTAAAAATT 


ACAAACACGC 


ATAGTATCAG 


GGGATTAAGA 


AAACTTTATA 


12540 


CTATGCGTTT 


TATCATGGAA 


ATATAGTAAA 


ATGAAATAAG 


AACAGGACAA 


ATCGATCAGG 


12600 


ACAGTCAAAT 


CGATTTCTAA 


CAATGTTTTA 


GAAACAAATG 


T GT ACT ATTC 


TAGTGTCAAT 


12660 


CTATTATATT 


TATAGAATTT 


TTTGTTGCTA 


GATTTGTCAA 


ATTGCTTAAA 


ATAATTTTTT 


12720 


TCAGAAAGCA 


AAAGCCGATA 


CCTATCGAGT 


AGGGTAGTTC 


TTGCTATCGT 


CAGGCTTCTC 


12780 


TGTAGGTGTT 


AATACTTTTC 


AAAAATCTCT 


TCAAACCACG 


TCAGCTTCGC 


CTTGC 


12835 


(2) INFORMATION FOR SEQ ID NO: 14 


2 ; 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 
ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 
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TCGTCTCCTA CAAGGGGATG TGGGGAGTGG AAAAACGGTA GTCGCTGGCT TGGCCATGTT 9840 

TGCGGCAGTG ACAGCAGGTT ATCAGGCTGC CCTAATGGTA CCAACAGAAA TCCTCGCAGA 9 900 

GCAACACTTT GAGAGTTTAC AGAACCTTTT TCCCAATTTG AAACTGGCTC TCTTGACAGG 99 60 

TTCCTTGAAA GCTGCAGAAA AGAGAGAAGT CTTGGAG A CC ATTGCCAAGG GTGAGGCTGA 10020 

TTTGATTATA GGAACTCACG CTCTGATACA AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080 

GAT T ATT AT C GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGGCGTATTT TACGGGAAAA 10140 

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CGCTTGCCAT 10200 

CACAGCCTTT GGAGATATGG ATCTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260 

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGCG 10320 

GGAAATTCAA AAAGGTTCCC AACTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAACC 10380 

TCTAGATTTG AAAAATGCCA TTGCCTT AT C AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440 

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC GAAAAAGACC AGATCATGCA 10500 

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10 560 

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA 10620 

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680 

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT ATCATGACAC AAACGACCAA 10740 

TGGATTTGTC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTGAGA TTTTTGGAAC 10800 

CAGACAGTCA GG ACT TCC AG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 108 60 

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920 

AGAGTGGCGC ATGATTGCCC TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980 

TAAGGAAAAC TTATACTCAA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG 11040 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100 

TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGCTAAG GCGACGCTGA 11160 

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 11220 

AGAGTCATCA AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG TAAATTACCA 11280 

AACCACTTTC CAAAAGAAGG AAGCAAAAAA CTAGTATAAA CAGAAGAGAG AGCGAAATGC 11340 

TCTTTTTTCG TTTCTAAAAC TACTTTCAGC CCATCATCCT AAAAGTAAAG AATCTAAATT 11400 

CACTTTCTAT TTACCCTTCT TTCTTGCATT G ATT AC A TAG ATATGCTACA GTTGTGGTAA 114 60 

CGATTACAAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA G AAATT AAG A 11520 

CCTATAGAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580 
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ACCCATTGAA 


CTCAGACAAG 


! CTGGACTCAG 


1 CAAGC CAATC 


CTCATTTTAG 


GAGTTTCTGA 


8100 


AATCGAAGCT 


1 GTTGCTCTAG 


CTAAAGAATA 


TGACTTCACC 


TTGACAGTGG 


CTGGACTGGA 


8160 


GTGGATTCAA 


GCACTCTTAG 


ATAAGGAAGT 


GGACCTAACT 


GG ATTG AC AG 


TCCACCTCAA 


8220 


GATTGATTCA 


GGGATGGGAC 


GGATTGGTTT 


TAGAGAGGCA 


AGTGAGG TTG 


AGCAGGCTCA 


8280 


AGATTTGCTC 


CAACAACACG 


GTGTTTGTGT 


TGAAGGAATC 


TTTACCCACT 


TTGCTACTGC 


8340 


TGATGAGGAA 


TCAGATGACT 


ATTTTAATGC 


CCAGTTAGAA 


CGGTTTAAAA 


CTATTTTAGC 


84 00 


TAGTATGAAG 


GAAGTTCCAG 


AGCTGGTTCA 


TGCTAGCAAT 


TCTGCAACGA 


CTCTTTGGCA 


8460 


TGTAGAGACT 


ATTTTCAATG 


CGGTTCGTAT 


GGGAGATGCC 


ATGTATGGCC 


TCAATCCAAG 


8520 


TGGAGCGGTC 


TTGGATTTGC 


CTTATGATTT 


G ATACCGGC C 


TTGACCTTGG 


AGTCTGCTCT 


8580 


GGTTC ATGTC 


AAGACAGTTC 


CACCTGGAGC 


TTGCATGGGC 


TATGGAGCAA 


CTTATCAAGC 


8640 


GGATAGCGAG 


CAAGTCATCG 


CGACCGTGCC 


AATCGGGTAT 


G C AG ATGG AT 


GGACAAGAGA 


8700 


CATGCAAAAT 


TTCTCTGTCT 


TGGTAGATGG 


CCAAGCTTGC 


CCAATTGTCG 


GCAGGGTTTC 


8760 


GATGGACCAA 


ATCACTATTC 


GATTGCCTAA 


GCTTTATCCG 


CTACGAACCA 


AGGTAACCTT 


8820 


GATTGGCTCC 


AATGGGGATA 


AGGAAATCAC 


TGCAACTCAG 


GTAGCGACCT 


ACCGCGTAAC 


8880 


CATTAACTAT 


GAGGTGGTTT 


GCCTCCTCAG 


CGACCGTATT 


CCGAGAGAAT 


ATTATTAGAA 


8940 


AAGAAAGGAG 


TGGAGCATGA 


ATCTACATCA 


ACCCTTGCAT 


GTCTTGCCTG 


GTGTGGGACC 


9000 


AAAGTCAGCA 


GAAAAATACG 


CCAAACTAGG 


AATTGAAAAC 


TTGCAAGATC 


TCTTGCTCTA 


9060 


CTTTCCTTTC 


CGTTATGAAG 


ACTTCAAAAC 


CAAGCAGGTG 


CTGGAGCTGG 


AAGACGGTGA 


9120 


GAAGGCAGTT 


CTTTCTGGTC 


AGGTAGTGAC 


TCCTGCTAGT 


GTCCAGTATT 


ATGGTTTCAA 


9180 


GCGCAATCGC 


CTGCGTTTTA 


GTCTCAAGCA 


GGGAGAGGTC 


GTTTTTGCGG 


TGAATTTCTT 


9240 


TAACCAGCCC 


TATCTGGCTG 


ATAAAATAGA 


GTTGGGAGCA 


ACCCTTGCTG 


TCTTTGGAAA 


9300 


ATGGGACCGC 


GCTAACGCTA 


O 1 \» 1, VlnW 1 


uAl\>AAub 1 I 


CTGGCTCAGG 


TAGAAGATGA 


9360 


CCTCCAGCCT 


GTCTATCCTC 


TGGCTCAGGG 


AATCAGTCAG 


GCCAGTCTGG 


TCAAGGTCAT 


9420 


CAAGACGGCT 


TTTGATCAGG 


GACTGGACCT 


CTTGATAGAA 


GAAAATCTGC 


CCCAGTCTTT 


9480 


ACTAGACAAA 


TACAAACTCA 


TGTCCCGTTG 


TCAGGCACTC 


CGTGCTATGC 


ATT TT CC AAA 


9S40 


GTATTTGGCA 


GAATACAAGC 


AGGCTCTTCG 


CCGTATAAAG 


TTTGAGGAAC 


TCTTTTATTT 


9600 


CCAAATGCAG 


CTGCAGATGC 


TCAAGTCTGA 


AAATAGAGTT 


CAGGGAAGTG 


GTCTGGTTCT 


9660 


GAATTGGTCT 


CAGGAAAAAG 


TGACAGCAGT 


TAAAGTAAGT 


CTTCCTTTTG 


CCCTGACCCA 


9720 


AGCTCAGGAA 


AAGAGTTTGC 


AGGAAATTTT 


AACTGATATG 


AAGTCCGACC 


ACCACATGAA 


9780 
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GAAAAAATTA 


AG CAGTTC GT 


TCGTGGTTTT 


ATGATTGAGT 


CTTATCTGGA 


AGATGGTCGA 


6300 


CAAAATGAGC 


CAGAAGTATT 


TGGTAAGTCT 


ATCACAGACC 


CTTGCCTGGG 


TTGGGATAAC 


6360 


ACAGAAGCTC 


TTGT C AG AG A 


AATTTACAAA 


ACGTTAGGAG 


AATAAGATGG 


CATTTATTGA 


6420 


AAAAGGTCAA 


GAAATCGATA 


TGGAAGTCAT 


CAAGGCTGAA 


AC CC AATTGT 


CTGCGGAAGC 


6480 


CTTGAGACTC 


AAGGAAAGCC 


GTGACAGGGA 


ATTGGCAGAT 


ATTATTTCAG 


GGGAAGATGA 


6540 


CCGTATTCTC 


TTGGTGATTG 


GTCCTTGCTC 


TTCTGATAAT 


GAAGAGGCGG 


TCTTGGAATA 


6600 


TGCTCGCCGT 


TTATCTGCCT 


TGCAAAAGAA 


GGTAGCGGAT 


AAGATTTTCA 


TGGTCATGCG 


6660 


CGTGTATACT 


GCTAAGCCTC 


GTACCAATGG 


AGACGGCTAT 


AAAGGATTAG 


TTCACCAGCC 


6720 


AGATACTTCT 


AAGGCTCCAA 


GCCTGATTAA 


TGGCTTGCAG 


GCTGTGCGCC 


AGTTGCACTA 


6780 


CCGCGTGATT 


ACAGAGACTG 


GTTTGACAAC 


GGCAGATGAG 


ATGCTTTATC 


CGTCAAATCT 


6840 


GATCTTGGTG 


GATGACTTCG 


TCAGCTACCA 


TGC CGTTGG A 


GCTCGTTCTG 


TGGAAGACCA 


6900 


AGAGCACCGC 


TTTGTGGCTT 


CTGGGATTGA 


TGCACCAGTA 


GGGATGAAAA 


ATCCAACCTC 


6960 


AGGAAATTTG 


GGTGTTATGT 


TTAACGCCAT 


CTATGCTGCT 


CAAAACAAGC 


AAACCTTCCT 


7020 


TTATCATGGG 


CAGGAAGTTG 


AGACATCAGG 


TAATCCTTTG 


GCCCATGTTA 


TCCTCCGTGG 


7080 


AGCAGTCAAC 


GAGTATGGCA 


ATTATATGCC 


GAATTACTAC 


TATGAAAATC 


TACTCCAAGC 


7140 


CATTGAACGC 


TATGAAACCA 


TGGGACTTGA 


AAATCCTTTT 


ATCCTCATTG 


ACACCAACCA 


7200 


TGATAACTCA 


GGCAAGCAAT 


ATATGGAGCA 


GATTCGAATT 


GTTCGCCAGA 


CCTTGCAGAA 


7260 


TCGTGATTGG 


AATGAGAAAA 


TTAAAAAGAC 


GGTTCGAGGA 


TTTATGATTG 


AATCTTACCT 


7320 


AGCAGATGGT 


CGTCAAAACC 


AACCAGAGAT 


CTTTGGTTGC 


TCTATTACTG 


ACCCTTGCCT 


7380 


AGGTTGGGAA 


AATACAGAGG 


CCTTGGTAGA 


AGAGATTTAT 


GTTACCTTGA 


CAAAATAAGT 


7440 


GAAAAGGATG 


GAGTTGGGGA 


ATCTCAACTC 


CTTTTGATGA 


GAATGATAGT 


TGGACACGGA 


7500 


ATTGACATCG 


AAGAATTGGC 


TTCGATAGAA 


AGCGCAGTTA 


CACGACATGA 


AGGATTTGCT 


7560 


AAGCGTGTAC 


TGACCGCTCA 


GGAAATGGAG 


CGCTTCACCA 


GTCTCAAAGG 


ACGCAGGCAA 


7620 


ATAGAATATT 


TAGCTGGTCG 


CTGGTCGGCT 


AAGGAGGCCT 


TTTCCAAGGC 


TATGGGAACG 


7680 


GGCATTAGCA 


AGCTCGGTTT 


TCAGGATTTG 


GAAGTCTTGA 


ACAATGAACG 


TGGGGCGCCT 


7740 


TATTTTAGTC 


AGGCACCATT 


TTCAGGAAAG 


ATTTGGCTGT 


CT AT CAGC C A 


CACCGATCAG 


7800 


TTTGTGACAG 


CCAGTGTCAT 


TTTGGAGGAA 


AATCATGAAA 


GCTAGTCCAC 


ATAGACCAAC 


7860 


CAAGGCTCTG 


ATTCATCTGG 


GAGCTATTCG 


ACAAAATATT 


CAGCAAATGG 


GGGCTCATAT 


7920 


CCCTCAAGGA 


ACGCTCAAGT 


TGGCTGTGGT 


TAAGGCCAAT 


GCTTATGGTC 


ATGGAGCTGT 


7980 


TGCCGTTGCC 


AAGGCAATTC 


AAGATGATGT 


TGATGGCTTT 


TGCGTTTCCA 


ATATCGATGA 


8040 
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GACGCGTCAG GTTG AAGCAG CTCAGAAACG TGTCGAAGGA AATAACTACG ATACCCGTAA 
ACAAGTCCTT CAATACGATG ATGTCATGCG TGAACAACGT GAGATTATCT ATGCTCAACG 
TTACGATGTC ATCACTGCAG ATCGTGACTT GGCACCTGAA ATTCAGTCTA TGATCAAACG 
CACGATTGAA CGTGTCGTTG ATGGTCATGC GCGTGCCAAA CAAGATGAAA AACTAGAGGC 
AATTT TGAAC TTTGCTAAGT ACAACTTGCT TCCTGAAGAT TCTATTACGA TGGAAGACTT 
GTCAGCCTTG TCTGATAAGG CCATCAAGGA AGAGCTTTTC CAACGTTCCT TGAAGCTTTA 
CGATAGTCAG GTTTCAAAAC TACGCGATGA AGAAGCAGTT AAAGAATTCC AAAAAGTTTT 
GATTCTACGA GTGGTGGATA ACAAGTGGAC AGATCATATC GATGCCCTTG ATCAATTGCG 
TAACGCGGTT GGACTTCGTG GCTATGCTCA GAACAACCCT GTTGTTGAGT ATCAGGCAGA 
AGGTTTCCGT ATGTTTAATG ATATGATTGG TTCGATTGAG TTTGATGTGA CACGCTTGAT 
G ATGAAAGC A CAAATTCATG AACAAGAAAG ACCACAGGCA GAACGTCATA TCAGTACAAC 
AGCGACTCGC AATATCGCTG CTCACCAAGC AAGTATGCCA GAAGATTTGG ATTTGAGCCA 
GATTGGACGC AATGAACTTT GCCCATGTGG TTCTGGTAAG AAATTTAAAA ACTGTCACGG 
TAAAAGACAA TAAAATGAGA TAGTTTAGAG GCGGATATCT TGTGAAAAGT AAATTTTTAC 
TGGGTATCCG TTTGCTTTAT AAGGAGATGA GTTATGGTAT TTACAGCAAA AAGCTCTAAA 
ATAAATATAG AAGAAGTTCG TGCCTTGTCA AAATTAGAAG GTCAGGCTTT GGAGAGGAAA 
TCACAGCGAG ATCAAGAGCT AGAAGCCATT ATACGTGGAG AAGACCAGCG AATTCTCTTC 
GTAATCGGGC CATGCTCATC TGACAACGAA GAAGCTGTCC TTGAATACGC TAAGCGTTTG 
GCAGTCCTAC AAGAAGAAGT GGCAGATCGT ATCTTTATGG TTATGCGTGT TTATACTGCC 
AAACCCCGTA CCAACGGAGA TGGCTATAAG GGCTTGATTC ACCAGCCTAA CGCGACAGAA 
GCGCCTAGTC TTATCAATGG AATCAAAGCC GTTCGCCATC TTCACTATCG TGTCATCACA 
GAAACAGGGA TGACAACTGC TGATGAAATG CTTTATCCTG AAAACCTTCC GCTTGT AC A T 
GATTTGATTT CTTACATGGC AGTTGGTGCC CGTTCAGTTG AAGACCAGCA ACACCGCTTT 
GTGGCAAGTG GGGCAGGATT TTCTACTGGT TTTAAAAATC CAACCTCTGG AAATCTCAAT 
GTCATGTTTA ATGGGATTTA TGCTGCTCAA AACAAACAAA GTTTCCTTTT CTTAGGAAAA 
GAAGTAGAAA CAACTGGGAA CCCGCTTTCA CACGCTATTC TTCGTGGTGC TCTTAATGAG 
TATGGAAAAA ATATTCCCAA C TACT ATT AT GACAATTTAA TTGATACCAT TG CCCAGT AT 
GAGAAAATGG GCTTGGAAAA TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 
AAGCAGTATA TTGAACAGAT C CGAATTGTC CGCCAGACCT TGATTAACCG TGCTTGGAAT 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 
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CTTCCTTGAA AATAATCAGT AATTGTGCTA AAATTAAACG AACATTCTAA AATATTCGGA 27 60 

ATTTAAAGTA AGGAAAAACA TGGCTAATAT TTTAAAAACA ATTATCGAAA ATGATAAAGG 2820 

AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAACCTTTTC AAATACGAAG ACCAAATGGC 2 880 

TGCTTTGACT GACGACCAAC TAAAAGCAAA AACAGTTGAA TTTAAGGAAC GTTATCAAAA 2940 

T GGAGAATC A CTGGATTCAT TGCTTTACGA AGCATTTGCG GTTGTCCGTG AAGGTGCCAA 3000 

ACGTGTCCTA GGTCTCTTCC CTTATAAGGT TCAGGTCATG GGGGGGATTG TTCTTCACCA 3060 

TGGTG AC GTG CCAGAGATCC GTACAGGGGA AGGGAAAACC TTGACTGCGA CCATGCCGGT 3120 

ATACCTCAAT GCCCTTTCAG GTAAAGGGGT TCACGTAGTT ACCGTTAATG AATACCTGTC 3180 

AGAACGTGAC GCGACTGAGA TGGGTGAATT GTACTCTTGG CTTGGTTTGT CAGTAGGGAT 3240 

TAACTTGGCT ACCAAATCTC CAATGGAGAA AAAAGAAGCC TATGAGTCTG ATATTACTTA 3300 

CTCAACTAAC TCAGAAATCG GATTTGACTA CCTTCGTGAC AACATGGTCG TTCGCGCCGA 3 360 

AAACATGGTA CAACGTCCGC TTAACTATGC CTTGGTCGAT GAGGTTGACT CTATCTTGAT 3420 

TGACGAGGCT CGTACACCTT TGATTGTATC AGGTGCCAAT GCGGTTGAAA CCAGTCAGTT 34 80 

GTATCACATG GCAGACCACT ATGTAAAATC TTTGAACAAA GATGACTACA TCATCGATGT 3 540 

GCAGTCTAAG ACTATTGGTT TGTCTGATTC AGGGATTGAC AGGGCTGAAA GCTACTTCAA 3600 

ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG ATAACGCCCT 3660 

TCGTGCCAAC TACATCATGC TTCTCGATAT TGACTATGTG GTGAGCGAAG AGCAAGAAAT 3720 

CTTGATTGTC GACCAATTTA CAGGTCGTAC CATGGAAGGT CGTCGTT ATT CTGATGGATT 3780 

GCACCAAGCT ATTGAAGCCA AAGAAGGTGT GCCAATCCAG GATGAAACCA AGACATCTGC 3840 

CTCAATCACG TACCAAAACC TCTTCCGTAT GTACAAGAAA TTGTCTGGTA TGACGGGTAC 3 900 

AGGTAAGACT GAGGAAGAAG AATTCCGTGA AATCTACAAC ATTCGTG TT A TTCCAATCCC 3 960 

AACAAACCGT CCTGTTCAAC GTATTGACCA CTCAG AC CTT CTTTATGCAA GTATCGAATC 402 0 

TAAGTTTAAA GCGGTTGTCG AAGACGTTAA GGCTCGTTAC C AAAAGGCT C AACCTGTCTT 4080 

GGTTGGTACA GTAGCGGTTG AAACTAGTGA CTACATTTCT AAGAAATTGG TTGCAGCTGG 414 0 

TGTTCCTCAC GAAGTCTTGA ATGCCAAAAA CCACTATAGA GAAGCCCAAA TCATCATGAA 4200 

TGCTGGTCAA CGTGGTGCCG TTACCATCGC AACCAACATG GCGGGTCGTG GTACCGACAT 4260 

CAAGCTTGGT GAAGGTGTTC GTGAACTTGG AGGACTTTGT GTTATTGGTA CAGAACGTCA 4 320 

TGAAAGTCGT CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG GAGATCCAGG 4 380 

TGAGTCACAA TTCTACCTAT CTCTTGAAGA TGATTTGATG AAACGTTTTG GTTCTGAACG 4440 

CTTGAAGGGA ATCTTTGAAC GCTTGAACAT GTCTGAAGAG GCCATTGAGT CTCGCATGTT 4500 
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GCGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020 

TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCGTAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTGTCAACA U40 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 1200 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 12 60 

TTCATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGG CAG A CCATTACCAT 13 2 0 

CACTTGATTC GCTATCAATC AAGCTCCTTT TT CG ATT AT A CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 14 4 0 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGCCGCTG GTTTTTCCTT TTTTCTCTAT 1560 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TT GT AAC ACT GTATTTAGTA 1620 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGC AAGAGAAATT TAAGAAAGGA 1680 

TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG A CCT ATTTTA 174 0 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 1800 

TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 1860 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT AT ATT CAGG A TGCAGGAATT 1920 

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 204 0 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTC GATC A ACCTATGAAT 2160 

GCTTT GGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 22 20 

GAAAATGGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG AT ATTT C AG A CCTTTGTACA 2280 

GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 2340 

TGGCTTATGG CACATCTAAA ATCATTTATT ACAC GAT ATT C CAAGGTTT A TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGG AT A 24 60 

AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 2520 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2640 

ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2700 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTG6TATCT TCTTCACATC 285 60 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 2 8620 

AACTCT ATT G ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 2 8680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 2 8800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 2 8882 

<2> INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCCTATGTCT 


TTTTCAAAAA 


AATGCTTGAC 


TTGAGACGGG 


AACTAGGGAA 


GTCTAAAGGC 


60 


GGAAGGCATT 


GATTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAACG 


TCGCCTTGGA 


120 


TTATATATGT 


AACTGACTTC 


GTCGATGCTT 


ATCTACAACC 


TCAAAGCAGT 


GCTTTGAGCA 


180 


ACTTGCGGCT 


AGTTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATTATA 


TTACTTTCTA 


240 


TTTGTAGGAG 


GTGGCTTATG 


AAGATTCCTC 


TCTTAACTTT 


TGCAAGGCAT 


AAATTTGTTT 


100 


ATGTCTTGCT 


TACTTTGCTT 


TTTCTTGCTT 


TGGTTTATCG 


TGATGTTTTG 


ATGACTTATT 


3 60 


TCTTTTTTGA 


T ATT CATGCG 


CCCGATCTAG 


CTAAATTCGA 


TGGACAAGCA 


ATTAAAAATG 


420 


ACT T ATT AAA 


ATCAGCATTA 


G ATTTTCGT A 


TTCTCCAGTT 


CAATCTAGGT 


TTTTATCAAT 


480 


CATTTATTAT 


TCCAATCATC 


ATTCTPl-ltjC 


TAGGTTTTCA 


ATATATTGAG 


CTGAAAAATA 


540 


AAGTTTTACG 


ATTGAGTATT 


GGAAGAGAAG 


TGAGTTATCA 


AGGGTTAAAA 


AGAAAGTTGA 


600 


CTTTG CAAGT 


TGCAAGTATC 


CCTTGTTTGA 


TATATTTAGT 


GACTGTGCTG 


ATAATTGCAA 


660 


TTATAACCTA 


TTTCTTTGGG 


ACTTTTTCTC 


CTCTTGGATG 


GAATTCTCTA 


TTTTCTGATG 


720 


GAAGTGGTTT 


ACAAAGACTC 


CTAGATGGAG 


AGATAAAAAG 


CTATTTGTTC 


TTTACTTGTG 


780 


TCCTACTAAT 


CGGTATTTTC 


ATCAATGCAA 


TCTATTTTTT 


ACAAATAGTT 


GATTATGTGG 


840 


GGAATGTGAC 


TCGTTCGGCA 


ATCACCTATT 


TGATGTTTCT 


TTGGCTTGGT 


TCTATGCTGC 


900 


TTTATAGTGC 


CTTGCCTTAC 


TATATGGTTC 


CTATGACGAG 


TTTGATGCAA 


GCTAGCTATG 


960 
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TTCTGTTCCT 


TCTCCTAATC 


CACCTTTGGG 


TAACACATCC 


TGAACTGATA 


AGCCCTTACC 


26760 


ATCTTCATTA 


TATGCTCCCT 


CTACTTGATT 


AGCTGCAACA 


GCTCCACCCC 


AAAGAAAATC 


26820 


ATCTGGAAAA 


ATGGTCATAA 


CTTTCCTCCA 


TTATAATATT 


ACCAGTAATT 


CCTTAGAATG 


26880 


CTCGATTGTC 


TG ATT ATT AG 


GTAATACTAA 


TACATCTAGA 


AAATCATTGG 


TATTCGTTAC 


26940 


AATTACTGCT 


GTAACTGTTT 


CGTAGCCTTT 


AGTCTTGATT 


AAATTCAAGT 


CC ATTTC AAA 


27000 


AATCAACTGA 


TTTTTGAAAA 


CTCTGTCTCC 


TTCTTCTACA 


TGACTAATAA 


AACCTTGACC 


27060 


TTTTAGCTCA 


ACAGTATCTA 


ATCCAATATG 


AATTAGTAAC 


TCAACACCCT 


CATCACTCTT 


27120 


CAATCCAATT 


GCGTGCTTAG 


TCGGAAAAAT 


ATTTGTAATT 


TTCCCATCAA 


ATGGTGCATA 


27180 


AACCTTACCT 


TCACTTGGGA 


TAATCGCTAC 


TCCGTCTCCA 


ATTAGTTTAT 


CTGAAAATGT 


27240 


TTTATCCTGG 


ACATCGCTTA 


ACGGAATGAT 


TTCTCCTGAT 


ATAGGAGAAA 


ATATCATTTT 


27300 


TTTATTTGAA 


ACTCCAGCTT 


CAACTTCTAA 


ATTGCTAGAA 


CTCTCTTCTT 


CATCGATTCC 


27360 


AAATATATAA 


GCTAATACAA 


AGGTAATAAC 


AACCGAAATG 


ACCGCCACAA 


TTAAAGCATT 


27420 


TACAATATTT 


GATGGCACAT 


CAGAATAAAT 


AAATTGAGGC 


AACGCTATCA 


AAGATGGGAC 


27480 


AGCAAATAGA 


TATGCTTTAA 


CACTAGTAAG 


ACCTCCAAAT 


AATCCCGCTA 


ATCCACCACC 


27540 


AATCATAGCT 


GCATAAAGCG 


GTTTTTTATA 


TTTTAAAGTC 


ACACCATATA 


ATGCAGGTTC 


27600 


GGTAATCCCT 


GCAAGTAAGG 


CTGAGAAACC 


TGCTGCAAAA 


GCAATTTGTT 


TTGTATTATT 


27660 


ATTTTTACTC 


TTTAATGCAA 


CAGCCATCGA 


AGCAGCCCCT 


TCAGCTAACT 


TTGACCCTAA 


27720 


CATTGCTGGA 


AGAATTAATA 


CGTCTGGAGT 


AGCAATAGAT 


GCCGCCAAAA 


AAATAGGTGC 


27780 


AAAAGCCCAA 


TGCATTCCAG 


T CAT AACAAT 


AAATGGCATA 


ATAGCACCAA 


GAATAGCTAA 


27840 


TGTAAGCCAT 


CCAGCTACAC 


CATACATTTG 


CCCAACTAGA 


TTTGATAATC 


CTTCACCAAC 


27900 


AATTACTCCA 


ATAGGTCCGA 


CTACAACTAA 


GGCAATACAG 


CTTGATACTA 


ATAATACTAG 


27960 


CGTAGGTTGC 


AAAAAACTCT 


TAGTAATAGC 


TAGTGTTAAT 


TTAGCAATTA 


TTTTTTC A \T 


28020 


ATATTTCATC 


AACCAAACCA 


TAATAAGAAT 


TGGAACGACT 


GATGAACCAT 


AACTAGCTGG 


28080 


TGTCACAGGT 


GCACCAAATA 


AACTAAGAGG 


ATTCCCTGAT 


TGCACCATTT 


GAACAAAATT 


28140 


TGGATGGAGA 


AGTACACCTG 


CTACAGACAT 


AGCTAATGTA 


GATGTTACTT 


TTAATTTTTG 


28200 


TGATGCAGAA 


TAAG CTAAT A 


ACAGCGGTAA 


GAAATAATAT 


GGAGCATCCC 


CAAAAAATGT 


28260 


CAAAAAAGCA 


ATAGTCTGAG 


AATCTGATTG 


CAATATACCA 


AGCATTGGTA 


AAATGATTAC 


28320 


CAAGACTTTC 


AACATACCTC 


CCCCTAACAT 


TGCTGGAATG 


ATTGGAGTCA 


TGGAACCAGC 


28380 


GATATACTCA 


ATGATTCTTT 


CTAAAATATT 


CCCTTTGTGC 


CCTTGAACAA 


CTGAATCGGA 


28440 
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TTTCTTCGCG 


AAGCGCTTTT 


AATTGTTCTT 


CAATAGTTGA 


CATATTTCCT 


CC ATCAGT CT 


24960 


CTCGTAGATA 


AAAAGAAAAC 


CACATGCCAA 


AAACTCCACT 


CGGAGCGTTG 


ACACGCGGTA 


25020 


CCATCCGTTT 


TCATCTCACA 


AGTCAGACCT 


TCATTTCTAA 


ATCC ATGCG C 


AAGTGAATTC 


25080 


ACCCAGCTTT 


CATATAGAGA 


GCTTGCAGTC 


ACGGCTCTCC 


TCCCTGATAT 


ACTTCCCTTG 


25140 


GGCTACTAGT 


CTTTCAGATT 


C CT ATTCAAT 


TACTACTTAC 


TTTATCAGAT 


TTTTACCATT 


25200 


CTTGCAAGAC 


CTATCTTACT 


TCTGCTTGTT 


AGCTTATTCT 


TATCTAAATT 


TATATAAACc 


25260 


TTATCTAAAT 


TAACTATTTA 


TAATTTTTGT 


AACAAAATTA 


AATTAATTGA 


CACTCCCCTA 


25320 


TAAAATAAAG 


AAGTTTAGAA 


TTTAATGTCT 


TCCAAACTTC 


TTTATTCCAT 


ATTTAATGAA 


25380 


ATGCCACCTT 


AACCGTGATA 


ATAGCTAGTC 


ATCAATAAAA 


AACTATTTGA 


ATAAGGATTC 


25440 


TCCATTTGAT 


TCAATCACTT 


CTTTATACCA 


AGTAAAAGAC 


ATTTTCTTAT 


ATCGATTTAA 


25500 


TGTACCACTT 


CCATCATCGT 


TTCGATCAAC 


ATAAATGAGA 


CCGTACCTTT 


TAGAAAGTTG 


25560 


TGCAGTGGAC 


ATAGAAACAC 


AGTCAATACA 


TCCCCAAGAC 


GTATAGCCCA 


TAATTTCAAC 


2S620 


ACCATCCTGT 


AGAGCTTCAG 


CAACTTGCAA 


TAAATGTTCT 


TTCATATACT 


GAATTCTATA 


25680 


ATCATCTTGG 


ACGGTTAAGT 


TATTAAGTTC 


ATCTTTTATT 


AGTTGATCTT 


TAGCACCTAA 


25740 


TCCATTTTCT 


ACTATAAATA 


ATGGGATTTG 


ATAACGGTCA 


TAATATCTAT 


TTAAAATTAT 


25800 


ACGTAGTCCA 


ATTGGATCAA 


TTTGCCATCC 


CCACTCTGAA 


GACTCTAAAT 


AAGGATTTAC 


25860 


TAAACCACCA 


ATAATATTCC 


CTTCTCCTGA 


ATTATACTGT 


GTTGGAAGAG 


CAGATTGAGT 


25920 


CACACTCATG 


TAATAGCTAA 


AGGATAAAAA 


ATCTACGGTA 


taatttttta 


AT AACTCTG C 


25980 


ATCTTCAGCT 


GCAAACTCTA 


TGTTAATGTC 


ATTTTCCTTA 


AAATATCTTT 


TTGCATAATT 


26040 


CGGATAATAA 


CCTCTAACAT 


GCACATCTGA 


AAATAGATAA 


TTTAGATTCT 


CATACTCATG 


26100 


AGTCGCCCAT 


ACATCTTTTG 


GATTTGGAGT 


CATTGGATAA 


GCTGGCATAG 


CTAATACCAT 


26160 


ACATCCCACC 


TTAAACTCTG 


AATTAATCTC 


ACGAGCAATT 


TTTGTAACCA 


AACTTGAGGC 


26220 


G ACT AATT C A 


TGATGTATAG 


CTTGATATAA 


TTCTTGTTTC 


GAAAGATTCT 


CCTTAGGTAT 


26280 


ATCTATTCCT 


CCACTAGTAA 


ATGGTAATTC 


CAAAACAGAG 


TTTACTTCGT 


TAAATGTAAG 


26340 


CCAATATTTA 


ACTTTATCTT 


TATACCTTTC 


TAAAACTGTT 


CGAGCAAATT 


TTTCATAAAA 


26400 


ATGAATCATT 


CTCCTATCAA 


CCCATCCATG 


ATATTTTCTT 


GCTAAATATA 


ATGGAGTCTC 


26460 


ATAGTGTGAA 


AGAGTTACAA 


GTGGTTCTAT 


CCCGTGAGCA 


TGTAGTTCAT 


CAAACAATTC 


26520 


ATCATAATAT 


TTCAACCCAG 


CTTCGTTAGG 


TTCTTCCTCA 


TCTCCTTTTG 


GAAAAATTCT 


26580 


ACTCCATGCA 


ATAGAAGTAC 


GAAAAACATT 


AAAGCCCATT 


TCAGAAAACA 


AGGATATATC 


26640 


TTCCTTATAT 


TTATGATAAA 


AATCAATACC 


TATCAATTTT 


AAGTTATCTT 


CTCTAGGATT 


26700. 
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AGACCTCACC GACGACAATT T TTG AG AG AC CAGCAGCTGG TGATTCGACA CCCTCTACCT 2 3220 

CGATCCCTCT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 23280 

ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG ACAGTTGTCA 2 3340 

CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA CCTTAATTTC 23 400 

TTTCTCAGTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG TTGGCTAAAT 234 60 

TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG T ATT ATCCTC CCAAACACCT 23520 

AGCCAAGCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 23580 

TTACCTAGTC CAAATCCTTG GAATTTTTGT AGCACATAGA GACGTTGAAT TTCAAAAGCG 2 3640 

TCCTCTAATT CTCTCTCAGT TTG AGCACTT CCCCAGTTGA CTTTGAGAAA ACCAGCTATC 2 3700 

TCCTCCTCAT GCATAATGAA ATAGGTTTCA GAGTCAGGAT TTCCCAACTC AGTTGACAAA 2 3760 

GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATTATCATAC 2 3 820 

GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 23880 

TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 2 3 940 

CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTGTCCA AGACCAAAGG 24000 

CAAAGCCAGA GTATACAGTC GCATCGATAC CACTCATTTC AAGGACACGT GGGTGAACCA 24060 

TACCGGCCCC CATAATTTCG ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 24120 

CACACTTGAA GCAAGAAACA TCCACCTCAA CAGATGGCTC TGTGAATGGG AAGTAAGATG 24180 

CACGCAAACG AATTTGACG C TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 24240 

CTTGAAGATC AGCCATAGAG ATATTT TT CC CAACTACCAA GCCTTCGATT TGGTGGAATT 24300 

GGTGACTGTG GGTCGCATCG TCCGTATCGC GACGGAAG AC ACGCCCTGGC GAGATCATCT 243 60 

TCAAAGGACC TTTAGAAAAA TCATGGGCAT CCATAGCACG CGCCTGAACT GGAGACGTGT 24420 

GGGTACGGAG CAAGATTTCT TCAGTGATAT AGAAAGTATC CTGCATATCA CGAGCTGGGT 24480 

GGTCTTTTGG AAGGTT CAT A CGTTCAAAGT TATAGTAGTC TTGCTCCACT TCAAAACCAT 24540 

CCACGACTTG ATAACCCATA CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 24 600 

AAACGTGACG GTGACCAGTC GCAACTGGAC GACCTGGAAG CGTCACATCT AT ACT CTCGC 24660 

TAGCCAGTTG AGCCGCGACT TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 24720 

CAGTCAAGAC ATCACGAGCT TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 24780 

AAACATCTTT CATCCCTTTG AGCATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 24840 

CACG CAAATC TTGCATCTCT TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CT AG CCAGCG 24900 



WO 98/18931 



PCT/US97/19588 



952 

GAAGGCTAAC GATTTCAGAT GTTGCTGTAT AGGTTACTTG GAGTCCCAAA CGAGTAAATA 21420 

GGGCTTCAAG GATTCCCTTA GCATAGAAGA AATCAACTGG AACTGCTGCT GTTTGGAAAT 21480 

CTTTTTCAGC AACCAAGCCT GTCAAGGCAA AGGCAAAGCT GTTGATCTCA TTTGGAAGTT 21540 

CTTCTTTTGG ATTACCTGTT TGTTCAAAGA CTTTTCCAAT CTCATAAAGG GCCAAGTTTT 21600 

TATTCTTACG AGCCACGTTG TAGGCAACGG TAT CAAGG AT CCCTGAAATC ATATTTTGAC 21660 

GGAGGACTGA ACGATCCACA GTCATTGGCC ACATGAGTTC AGTAAGGTTA CTTGGTTGAG 21720 

CTGTGAACTC AACTGCTTTT TCAGGAGTTG TCAGAGCATA GGTGATGATT TCTGTCAAAC 21780 

CTGCTCCTTC AGCAATGGTA CGAACTTGAC GGCGGAGTTT TTGTATCACA GTCAATTCAC 21840 

CAGCTGTACC ATC GTCTTTT GGAAGGCTGG TTGGCAACCG GTCATATCCA TAGATACGAG 21900 

CGATTTCTTC AAAGAGATCA GCTTCGATTC TGATATCCCA ACGACGACGT GGTACGCTGA 21960 

CTGTAAAGCT GTCTGCATTT CCAGAAAGAC CAAAGCCAAG ACGACGGAAG ACGTCTTCTA 2 2020 

CATCAGCATA AGACAGCTCA GTTCCGAGGA CACGGTTAAC ATCAGCAAGG GTTGAAGAAA 22080 

CTTCCACATC AGAGGTATCA AGCTCACCCG CTGAAACGAT ACCCTTACGC ACCGTCGCGC 22140 

CTGCAAGCTC TGCAATCATG CTAGCTGCCG CATCAAGGGC TTCATTAACT GTTGCCACAT 22 200 

TAATTCCTTT TTCAAAGCGA GAAGATGACT CAGAACGAAG GTTCAGGC G A CCACTTGTCT 2 2 260 

TACGGATAGA TTTGCCATTA AAAACAGCAG CTTCAAGGAT AACACGACTA GATTTTTCAG 223 2 0 

AAATTTCTGT AGCCTGAC C A CCCATAACAC CGCCAAGGCC TACTG G TTTG TCAGCAACTG 22380 

TAATCACGAG GTCTGTCTCA GCCAAGTCTC GTTCTTCACC GTCCAGGGTC ACTAATTTTT 2 2440 

CACCATCACG CGCTTCACGC ACACGGATGT CAGTCCCTTC AAATGTGTCC AAGTCAAAAG 22500 

CATGCATAGG TTGACCAAAG TAGAGCAGGA TGTAGTTTGT CACGTCTACA ACGTTATTGA 2 25 60 

TGGGACGGAT GCCTTCGTTC ATGAGAAGGT TTTGCAACCA TTGTGGACTT GGTGCGATAG 22 620 

TCACATTGTC CAAGATACGA GCTGCATAGT AAGCCGCCTT GTCTGTCTCA ATGCTGACAG 22 680 

AAAGG GCATC TGCCGCAGCT TCATTAGTTT CTGTTAGAGT AAATTTTTTA AAGTT GACTG 22740 

CCTTGTCATA GATGGCTGCC ACTTCGTGAG CCACTCCACA CATAGAAAGG GCATCTGCAC 22800 

GGTTTGGTGT GATGGAAAGT TCGATGATTT CATCATCCAA GTCTAGGTAA GAAAAGACTT 22860 

CCTCACCTGG CACGGCATCT TCAGGCAAGA TTTGGATGCC ATCTGCGAAT TCCTTAGGCA 22 920 

CAACTGAGTC AGAAATTCCC AATTCACCAA GTGAACAGAT CATTCC AAGT GACTCCAAAC 22980 

CACGGATTTT TCCTTTTTTG ATTTTGTAGT TATCAGCGAT ACGAG CTCCT GGAAGAGCCA 23 040 

CCATGACCTT GATCCCAGCA CGCACATTTG GGGCACCACA AACGATCTGA CGCTCTTCTT 23100 

CTTCGCCAAC GTTAATCTGA CAAACATGGA CGTGACTCTC TGGCACATCT TCGCAAGACA 23160 
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CGATTAAGCG ACGGCTATCG 
TTTCTGTGCC TTTTAATCAT 
AATCTATGAG GAAGACAAGA 
CTCCGAGCAA CCATTGCAAA 
CTTTCGGTCA TTCTCTTTTC 
CTTAACATTT GGCTATCTAC 
. ACTGCAACTA TGGCTGCTAT 
AATAAACAAA TACGGTTTTA 
TTCGGAACAG ATAAAACCCT 
ACTTGGAACG TCGCTAATCA 
G AC G CCG AT A TGGCTATATT 
CAGAGAATCA AACTATTGTT 
ACTTCTCCAC CTACCAATAG 
GGTTTCTATA CAGAAGCTAA 
TCGAGAAAAC AAAATATACC 
TTAATGGAAA TCTGGAAGCA 
CCAAAGGCTA TTATTGCAGG 
ATAAGCTCTC ATAGGGACGC 
AGCCAAAGTC CAAAACTTTT 
TACTATGTTA AAGATTTAGA 
ACAGAAATCA CATTTTAATT 
GGGGGAATTT GTATCCTACT 
ACGCTTGGAT TTTTTCCATA 
TTTGGAAGGT CAAGCTATAA 
AGACGTCAAA GAGTTTGATA 
CAACTTCTTG GTGAGTCACT 
GGAATTTGGT GATTTCCACA 
GGTTAAGCTC AG CT ACAT AC 
CTTGG CCAAG GAAACCAAGA 



ATTGTCAAAT 
TTCAATCTTA 
AAAAGAATAT 
TTGAGGTACT 
TATCCCTATC 
CCAACCTATC 
TTTAAGTTTC 
CTCTGGCATT 
TTCTTCTGCA 
AATAGAAGCA 
CCCTGAACTA 
TCATCAAGTT 
TGGAATAGCT 
AACTTTTCAT 
AGATATCATT 
AGACTTAAAC 
TGATTTTAAT 
ATTAAATGCA 
TAATGCAACA 
CATTGTAAGT 
ATTTTATATA 
ATCGTTTAAC 
TAGCGTGCGA 
GCCATTGACT 
TCTGTCAAAC 
TCTGCCTTGA 
AATGGAACAG 
GTTTCTGGAA 
ACTTGGTCAC 
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GTTTGCCTTT 
AATTGGACTT 
CAATCAAGTA 
CACACAATGA 
ATAACTTATT 
TTGGCACAGA 
TTATTTTTTT 
TTGCTCTTAC 
TCAAATAAGA 
CAACATATTG 
GCTACCAATA 
GGACTTTCTA 
CCTGTGACTG 
ACAACACGGT 
GCCTTGCATA 
ATCATTCATA 
GCAACTATGC 
CTGCCACCTT 
ATAGATCATA 
TTTCAAAACT 
AAATCACCCC 
GCACTTCTGC 
CTTCTTCGTC 
TCATACCAAG 
GTTTCACGCC 
GGAGAAGGGC 
CAGGTTGGAG 
TATCGTAAGC 
CGAGTGAAAT 



TGTAGTATAA 
TTTTTACTTG 
AAGTCACAAA 
TTAAAACATT 
CTTTTTTCCC 
TTTATGCCTT 
TCCTATCTTT 
TATCGCTCAT 
CTAAAACCTT 
AGCGAATTTT 
TCAGAGGTGA 
TGGCCAACTA 
TGATTGTCAA 
TCGGGACAAT 
CTGCGCCTCC 
ATCAATTGGC 
GTCATGGAGC 
TTGAAAGAGG 
TTTTATTGCC 
CTGATCATAG 
TCTAATGTTC 
ATTGACTTTT 
CGTTAAGCTG 
TCC CAGTTTT 
GGCAGCTTGG 
AACGTCACGG 
CGCCCCTTCG 
CTTGGCAGTG 
CACGGCTGTA 



TTGTCTTGCA 
GGTTGTACTT 
GTCACATTAG 
TCTCTCTGCC 
ATCTTCTAAT 
CCCCTTAGCT 
TTACAAGAAA 
ATT ACTATT A 
AAAATTAGTA 
TAGCCATTTT 
GCAAGAAAAC 
TGATATTTTC 
GAAAAGTTAT 
TGTATTACAT 
TCTGCCAGGT 
TTCAAAATAT 
ACTTGCAAAA 
AACTTGGAAT 
TAAAAACCAC 
ATGTATTTTT 
ATAAACTAGA 
TCTTCGAG A G 
TCTTCTCGAT 
TCACCTGAGA 
ATAGCATCTA 
CTGACTGCTG 
ATGGCTGAAA 
ACTGGATGCA 
CGACCTGGAT 



19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 

21300 

21360 
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AGTCAAGACC GATAGCATCT ACTGGTAAGT TTACAAGGTC AGCGTATACG TCACGAACAT 17 880 

CACCGAAATA AGTTTGAAGC AAGACTTCAA GACCTTTTTT GTCAGCCAAG AGTTTGTTGT 17940 

AAAGGTTCAA GAAGAGACCT TTTTCTTCAG CTGTCAAGTC TTTTACAAGA GCCGCTTCAT 18000 

CCAATTGGAT GCGAGTCGCA CCAAGTTCAG CCAATTTAGC AAAAACTTCT TGGTAAGCAG 18060 

CCACTAAGCT ATCTACGAAG TCGTCTGCTT TCACGCCTTC TTCAAAGTCT GACAATTGAA 18120 

GGAAAGTGAA GGGACCTACA AGAACAGGAC GAGTGTTCAA TCCAAGTTCT TTGGCTTCTT 18180 

GGAACTCATC GAAAATCTTG TGACCAGCCA ATTTTACTTG AGTGTCTTTT TCAAATTTAG 18240 

GAACGATGTA GTGGTAGTTA GTGTTGAACC ATTTCTTCAT TGGAAGGGCG CGAACGTCCC 183 00 

CTTTTTCTCC CTGGTAACCA CGTCCCAAAG CGAAGTAGCG CTCAAGGTCA GACAAGTCCA 183 60 

AGTTTTGAAC GGATGCAGGC ACCACGTTGA AAAGGAAAGC CGCATCTAGG AAGTTATCAT 18420 

AGTGAGAAAA GTCATTTGAT GGAATTTCAG TGATGCCTTT TTCT T TGACA ATGTTCCAGT 18480 

GTTTAGCACG CAAGTCTTTT GCTGCTGCTA AAAGTTCTTC TTCTGAGATT TCTTTTCTAA 18540 

AGTATTTTTC AGTTGTAAAT TTTAATTCAC GGAATTCGC C CAAACGAGGG AAACCGATGA 18600 

TTGTAGTTGA CATGATGTGT CCTCCAAAAT TTGTTGTTGA AACTATCTTA ACAGAAAAGA 18660 

AAGCGTCTGT ATAATTGTAA AAAATTAGGG TTTGATATAG TTTGAAACTA TATATCTGTT 18720 

TCGGACAAAA GAAAAAGACT TGAAGCAAAC GTCTCAAATC CTTTGTAATT CTTACTTTAC 18780 

AGCTATATTC CAATTAGAAT ACTAAAACAT GTTATTAGTA ATTCTTATAA GTG ACT ATG A 18840 

CCTGTTATTA GAAAAGACTA TAACTGATTC TAGTCAACTT TTTCCCTGTT CAAGTGGGAC 18900 

GATTGCTAGT GTCTTTCCTA AACTGGCTAG GACTTTTAAG ACTGTATCCA ACTGAGGACT 18960 

AGTCTTTCCT GTCTCCATCC TAGCTATGAC AGGCTGGCTT ATTCCACTGA CTTCTTCCAG 19020 

CTTTTTCTGA CTGATTCCTT GTT CAT AC CT AGCCTCAATC AACTCGCTCA TGATAGCCAC 19080 

TCGCATATCA CTTTCAAGGA TTTCCTCCTT GCTAAAGAGC TCAGATGGAC ATCCTTCCAA 1914 0 

TTACTCCCAA TAGCACTATT CTTCATCACT TAACCCTCTT TTTTTTACGT CTATGTATTT 19200 

TTAAAAAAAT GAGCGAATTA TGATTCGATA GATTGACCAG TGGGTTTAAA GTTGGTGCTA 19260 

GCCTATTTCT TAAGCGATTT TCCTTTTCTA GGATAAAGCA CTTCCTGCTT GCTTAACCCC 19320 

AATTTTCCAC GATGAATCCA ATAGTAAATG GTTGAAATTC CCACGTTAAC CCCTTTAGCC 19380 

ATC AC CAT CA TTTCAGGCGA AAATTTTTGG TTATGTTTTT GGTTATGTAT AGTCGAGAAT 19440 

CTTTTCCTTT AGTTTCTTAA GACTGTTGAG CGTAGTCGGC AGAATAAATC TCTTTGAAG C 19500 

GCCCTTTTCC AAGACATTGT CGGACTGTCC CACGCTTGAT TTCAGTGTGG ATAGTTTGAG 19560 

GAGCTTTTCC AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTC CATCGTT 19620 
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GCAAGTGGGC AATGGTCGGA 


, ATCGCCAAAT 


' CATTTTGGAT 


AAAGTCAGCC 


AAACGAACCG 


16140 


TCGTTTCCTT 


1 GAT ATT AAA T 


TTATTATTGC 


TGGCAGTTAC 


ACTGATAAAA 


TGGGGACCCA 


16200 


ACTCCTGCAT 


1 ATCCTGCAAG 


GCTGAAATAA 


TGTTATCATT 


ACCCACGGCT 


GGGTTTGGAG 


16260 


GGAACACTTC 


AAATGAGAGT 


GACGGTGTTT 


GGCGTGACAT 


ATGTAATAAC 


CTTTTCTAGT 


16320 


TGATTTCTTT 


TTGAACAACC 


ACTGTATGGA 


GAGAAATCCA 


ATCTTACAAT 


TTCTCACGCG 


16380 


CAGCTTTAGC 


TGCTTCAACA 


AGGCGGATCA 


AGCTTTCTTT 


TGTTTCTGGG 


ATACCACGTG 


16440 


TTTTCAAACC 


ACAGTCAGGG 


TTGATCCAAA 


CTTTCTTGCT 


TGGCACTTTA 


GCAAGGATGG 


16500 


CTTCGATTGT 


GTTGTCGATT 


TCGCCTTCAT 


TTGGTACACG 


AGGTGAGTGG 


ATATCGTAAA 


16560 


CCCCAGGTCC 


CACTTCTGTT 


TGGAAGTTTT 


TCGCTTTGAG 


TTCGTCCAAG 


ATTTCAAGGT 


16620 


TTGAACGGTT 


AGCTTCAAAG 


GAAATAACGT 


CTGCATCCAT 


GTTATCGATA 


GCTGGGATGA 


16660 


TATCTGTAAA 


TTCTGAGTAA 


CACATGTGAG 


TGTGGATTTG 


TGTCTCTCGC 


GCTACTGTTG 


16740 


AGTGTACCAA 


GCGGAAGGCA 


GGAATAGCCC 


AGTCAAGGTA 


GTCTTCGTAC 


CAGTCGCTAC 


16800 


GGCGGAGTGG 


CAATTTTTCA 


CGAAGAGCAG 


CCTCGTCGAT 


TTGGATGATT 


TTCACACCAG 


16860 


CAGCTTCAAG 


GTCAAGTACT 


TCATCCTTGA 


TAGCAAGGGC 


GATTTGGAGA 


GTTGAATCCT 


16920 


TGATAGAGAT 


GTCTTCACGT 


GCGAATGACC 


AGTTAAGGAT 


GGTAACAGGT 


CCAGTCAACA 


16980 


TACCTTTAAC 


AGGTTTGTTT 


GTACGACTTT 


GTGCATAGCT 


AGACCATTTA 


ACAGTGATAG 


17040 


GGTTAAGACG 


AGTGACATCA 


CCCCAGATGA 


TTGGTGGTTT 


TACCCCACGC 


ATACCGTATG 


17100 


ATTGTACCCA 


TCCATTTTTA 


GAGAAGAGGT 


ATCCTGACAA 


GTTTTGACCG 


AAGTACTCAA 


17160 


CCATGTCATT 


ACG CTCAAAT 


TCACCGTGAA 


CAAGGACATC 


AAAGTCAATA 


TCTTCTTGCC 


17220 


ACTTGATCCA 


TTCGTCAATC 


GTTTCAGCAA 


GGAAAGCGTC 


GTACTCTTTT 


TGAGACAATT 


17280 


CACCTTTACG 


GTAAGCCAAA 


CGTTTGGCAC 


GAACTTCTTT 


TGTTTGAGGG 


AATGAACCAA 


17340 


TCGTTGTTGT 


TGGAAGAGCT 


GGAAGTTTGA 


AAGCTTCTTC 


TTGGATAGCT 


TCACGTTCTG 


17400 


CAAAGGCTGG 


CAAACGAGTG 


TAGTCTGCGT 


CTGTCAAGCC 


AGCGATACGC 


GCACGAAGTT 


17460 


CAGCATTTTC 


ACCAACACGC 


TCAGTCGCAA 


AGAGTTCTTT 


GTTGGCTGCA 


AGAGCTTCTG 


17520 


AACCTTGACC 


ATTTCGGATA 


GCATCCAAGT 


CACGGATTTC 


ATCCAATTTT 


TCAACTGCAA 


17580 


AGGCAAAGTG 


GTTCAAGAGT 


GCTGGTTCAA 


ATTCTTCATT 


AGCAGTTGTA 


AATGGCACAT 


17640 


GAAGAAGTGA 


GCAAGAGCTT 


GTCAAAACGA 


TGTTTTCAGC 


TGGAATTTGC 


TCAAGAACAG 


17700 


CCAAGCTCTT 


TTCGTAGTTG 


TTGCGCCAGA 


TGTTTTTACC 


ATTGACAATA 


CCTACATAGA 


17760 


GAGTCTTGTC 


AGCTGGGAAG 


CCACCTTTAA 


CGAGTTCAAG 


AGTTTTCTTA 


CCTTCAACAA 


17820 
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CATCTACATA 


. TCCCACTTGT 


' ACCCCAGCAA 


TTGGTCCGTC 


AAATGGAATA 


, TCTGAAATAG 


14340 


ACAGTGCCAA 


. AGATGAACCA 


> AACATAGCAG 


CCATTGGTGC 


AGATGCATTT 


TCATCATAAG 


14400 


AAAGCACTGT 


ATTGATGACT 


TGGACTTCAT 


TACGGAAACC 


TTCCGCAAAC 


ATAGGACGAA 


14460 


TCGGACGGTC 


AATCAAACGC 


GCTGTCAAGG 


TCGCATCTGT 


TGAAGGACGT 


CCTTCACGTT 


14520 


TCATAAAGCC 


ACCAGGAAAC 


TTCCCAGCCG 


CATACATTTT 


TTCTTCGTAG 


TTGACTTGGA 


14580 


GTGGGAAGAA 


ATCCCCAGTT 


GCCATTTTCT 


TAGACATAAC 


GGCAGCAGTC 


AAGACAGTTG 


14640 


ACTCACCGTA 


ACGTACGACA 


ACAGATCCAT 


TTGCTTGCTT 


AGCAACCTGA 


CCAGTCTCTA 


14700 


CAATTAACTC 


ACGACCCGCA 


AAAGTCGTTT 


GAAACACTTG 


TTTTGCCATT 


TTAATCCCCT 


14760 


TTGGATTGAT 


GAAATTATAC 


GCCTTGCCTA 


CAAAGATCAA 


GATACCAAGG 


ACGTCAAAAG 


14320 


CAAAGTAAAA 


ATAGGAAACT 


GACGAAGTCT 


TCGATGAAGA 


CAAGACAGTT 


TATCTTTTTT 


14880 


ACACAGCTTT 


TCGGCCGTGT 


TCAATTACAC 


AAGATATTTT 


GGACGGTTCG 


GCTTGCCGAA 


14940 


CATTTCTCTA 


GAAAAATAGG 


AAGGTGACGT 


CGCACTCGAC 


GAGTGCTAGG 


AAGCTTATCT 


15000 


TTTTTCCTAA 


GAAATGAGAC 


CAAAATTCAA 


GTCATCAAGA 


TACCAAGCCG 


TCAAGCAACT 


15060 


CAAAGGAAGA 


TAGGAAATCG 


AACGACGGAG 


CGACTACTCC 


TAGGGAGATT 


TATCTTTTTC 


15120 


CACAGAGTTG 


TAGGCAAGTT 


CAGTTTTCAA 


GATACATCAT 


TAGAAAGGTT 


TAATACTAAA 


15180 


GTATCTAAAG 


CTTTCACGCT 


AATCGCTATC 


GGGCGATTAG 


CTAAATGCTT 


TACTAACTCT 


15240 


CTCGTCAAAT 


AACATCGATT 


TGACTCACTC 


GTGTCGTTAA 


ATCTTACAGT 


TTAAATGCAT 


15300 


TGTATTATTT 


AATACCTTCA 


TCTTTGTATC 


AAGTACGTAC 


AGAATTTATT 


TT AT CAT ATT 


15360 


TTTCTTAAAA 


AGTGAGGTCT 


TTACCATTAA 


AAAGGAACCA 


TTCCCCTCAC 


CTGAGAAGAA 


15420 


TGGTTTGCTT 


TTATTATCCT 


AGAGACTGGT 


GATTAAACAA 


GGCATGGCTT 


GCTTGATGGA 


15480 


TGTATTTTGC 


TGTATCAGCA 


TTATTCATCG 


TATAGAGATG 


CACACCGGCA 


ACATCCTGAG 


15540 


TTACCAAGTC 


CACGATT TGG 


TCCACTGCAT 


AGGCAAGTCC 


TGCTGCTCTG 


ACCGACTCAG 


15600 


GGTCATGCTC 


ATACTT GTCT 


AAGATGGCTT 


TAAATTTGCG 


TGGAAGATGG 


ATATTCTCAC 


15660 


AAGTCTTCAA 


GAGTCGGAGA 


GCCTGATTTC 


GATTCAGAAT 


TGGCATAATT 


CCTGCATGAA 


15720 


TGGGAACATC 


AATCCCAGCC 


AAGATACACT 


TGTCCTGAAA 


ATCATAGAAG 


CGCTCATTGT 


15780 


CAAAGAAGAG 


CTGAGTTACG 


AGGCTCGAAC 


AGCCTGCATC 


CACTTTCTTC 


TTAAGATTTT 


15840 


GAATATCTGA 


AATCTGATTT 


GGCGAATCTG 


GATGCC CTTC 


TGGATAGCAA 


GCTCCAATAA 


15900 


TATCAAAGTG 


AGGGGTTTGT 


TCCTTGATAA 


ACTCAATCAA 


GTCGGTTGCA 


TAGCGGAAAT 


15960 


CCTTTTGTGG 


TTCCACGTCT 


GGAATAATAT 


CCCCACGAAG 


AGCCAAGATT 


TTCTGCACCC 


16020 


CAACTTTGTC 


CAAGTCAGCA 


ATAGTTTCAG 


CAACCTTGTC 


CTTAGTTAGA 


TAAATAGCTG 


16080 
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CTGAATCTTT 


TGATGTTTCT 


GTAAATTCTT 


TCTTAGGTTT 


GTAATCCTTT 


TGATGACGTG 


12600 


GGCGGTGAGG 


GCGCTCAGAC 


TTTTCACCTT 


TTTCATCATG 


CTCAGGTTTT 


GGCGGACGAG 


12660 


GTAGAAGAGC 


CTTCATAGAG 


GCATCGATAC 


GGCCTTTTTC 


ATCAATTTTG 


ATAACCTTAA 


12720 


CATCAACTTC 


AT CCCCGATT 


TCTACCAAAT 


CCTCTACACG 


ATTGGTACGA 


GTCCAAGCCA 


12780 


T CTCAGAG AT 


ATGAACAAGG 


GCATCTGTCT 


TATCAAAGAG 


GTTAACAAAG 


GCACCAAATT 


12840 


TCTCGATACG 


AACGACTTTA 


GCACGGTAAA 


CTTCATCCAC 


TTTGGCTTCA 


CGAACCAAAC 


12900 


CAGCAATAAT 


TTCTTTGGCA 


CGGTTAATAG 


CATCTTGGTC 


ACTAGAGTAG 


ATAGACACAT 


12960 


TTCCTTCTTC 


GTCTATATCA 


ATCTTAACAC 


CTGTTTCAGC 


GATAATCTTG 


TCGATGGTTT 


13020 


CTCCACCCTT 


ACCGATGACA 


ATCTTAATCT 


TGTCCAC AT C 


AATCTTGATC 


GTATCAATTT 


13080 


TCGGAGCAGT 


TGGAGCCAAT 


TCTGGACGAA 


CTTCTGGAAT 


GGTTGCTTCA 


ATGACATCAA 


13140 


GGATTTCAAA 


ACGCGCTTTC 


TTGGCTTGAG 


CAAGAGCCTC 


CGTCAAGATT 


TCTGCAGTAA 


13200 


TCCCTTGAAT 


CTTGATATCC 


ATTTGAAGGG 


CTGT AATCC C 


ATCACGAGTA 


CCTGCAACCT 


13260 


TGAAGTCCAT 


ATCTCCAAAG 


TGATCTTCCA 


AACCTTGGAT 


ATCTGTCAAT 


ACTGTGTAGT 


13320 


TATTTCCATC 


TGAGATAAGC 


CC CAT AGCAA 


TACCAGCTAC 


TGGCGCCTTG 


ATTGGCACAC 


13380 


CACCAGCCAT 


AAGGGCAAGA 


GTTCCCGCAC 


AGATAGAAGC 


TTGAGATGAA 


GAACCGTTTG 


13440 


ATTCCAAAAC 


TTCTGCTACT 


AGACGGAT AG 


CGTAGGGGAA 


TTCTTCCAAG 


CTTGGCAAGA 


13500 


CTTGAGCAAG 


AGCACGCTCA 


CCAAGGGCAC 


CGTGACCGAT 


TTCACGACGA 


CCTGGCGCAC 


13560 


CGTAACGACC 


TGTTTCCCCT 


ACAGAAT AT T 


GAGGGAAGTT 


ATAGTGGTGC 


ATAAAGCGTT 


13620 


TCTTGTACTC 


TGGATCCAAA 


CCATCAATGA 


TTTGAGTTTC 


TCCCATCGGA 


CCCAAGGTCA 


13680 


AGACTGAAAG 


AGCTTGAGTT 


TGCCCACGAG 


TAAAGAGACC 


TGAAC CATGT 


ACACGAGGAA 


13740 


GGAAGTCAAC 


AACCGCATCC 


AAAGGACGGA 


TTTCATCGAC 


CTTACGACCA 


TCAGGACGCA 


13800 


CCTTGTCTTC 


TGTAATTAAA 


CGTCGCACTT 


CTGCGTGTTC 


CATTTGTTCC 


AAGATTTCAG 


13860 


CCACATCACG 


CATAATACGG 


TCAAATTCTT 


CGTGGTCCGC 


ATATTTTTCT 


TCGTAAACGG 


13920 


CAGTCACTTG 


GTCTTTCACT 


ACTTGAGTCG 


CAGCTTCACG 


GGCCAATTTC 


TCTTCTACTT 


13980 


GAACTGCCTT 


TTGGAGGTCA 


CTGTTGTAGG 


CTGCAATGAT 


TTCAGCTTGC 


AATTCAGCAT 


14040 


CCACGTGAAG 


CAATTCCACT 


TCTGCTTTTT 


CTTTACCGAC 


AGCAGCAACG 


ATTTCTTCTT 


14100 


GGAAGGCAAT 


CAATTCTTTG 


ACAGCTTCGT 


GCCCTTTAAG 


GAGCGCTTCC 


AACATGATTT 


14160 


CTTCTGACAA 


TTCTTTGGCA 


CCAGACTCTA 


CCATGTTGAT 


AGCGTGCTTG 


GTTCCAGCTA 


14220 


CTGTCAATTC 


AAGAAGAGAT 


TGCTCTGCTT 


GTTCTTGACT 


TGGGTTGATG 


ATGATTTGGC 


14280 
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GCAACCTTAT ACCCACGGTA CTCAAAATAG CGACGAATCG TATCAAAAGC TACCGTCGAA 10800 

CGGGCGTTTC CTACGTGGAT ATAGTTGTAC ACCGTTGGCC CACAAACATA CATCTTGATC 10860 

TTGCCCTCCT CAATCGGGAC AAATTCTCGC AAATCACGAG ACATGGTGTC ATAGATTTTA 10920 

ATCATAAATC ATAATCAGGA AAGCTGAAAT CCAAGAACAA TTAGTTTCAT CACTAAAAGT 10980 

TCAAGTAAAT TTCAGTCCGA ATATCTCTAC ACTTCGGAAT CeeTTGCTCC TTTCTCATTC 11040 

AGATAAACCA CCTGAGTCTG TTTGACAAAG CCAATTTTTT CATACAAACG TTTGGCACCT 11100 

ACATTGCTAT CTTCCACTGC AATCTGAAAT TCCTTGTCAT TTTGCTCAAT TAGTTGGTTG 11160 

ACGAGGGATT TTGCTAAGTA GCTTCCATAG CCTTTTCCAC GTTCAGGTTC CAATATTGCT 11220 

AAACCGT AG A GGTAATTCGT ATTAGTCGAT AAATCAACCG TACAAGTTCC AATAACCTGA 11280 

CCAGCTTTTA ATAAAATATA TAGTCGGCTT TCTGGATCTT TCAGAGCTTC AGCGACATAT 11340 

CTATCCACAA CTTCTCTCGA TTCATGTTCC TCTGAAAATG CCTGAAATTT TAATTGACTA 11400 

ATTTGATCCT GATACGAACT ATCTGCTAAC AAAACTTCAA GATGGGAAAC ATTTGCTAAC 11460 

GGATAAGGTC TTCTATCCTT ACCTAACCAA GTTTCTGTCT CTTCATCCTC GATTAGTCCC 11520 

CAGTTACTGG CAAAGTCAGG ATGATTCTCT AAAAAAATAC GTTCTGTCTG AAAAGTGACT 11580 

GACCGAATGG GGAAAGAAGC TGTTTCTCTC TCAAAACTAG TAAACAATGC ACGCGCAATC 11640 

CCCTGACGGC GATGACCTGG ATGAACCAGT ATCGTCACTT CTACATCTTG GTCATCTGCA 11700 

TAGACAGTTA ATAAACCAAC AAGTTCGCCT TTTTCATAAT AAAGGAAAAA GGCGGGCATG 117 60 

TTTGGGTCAA AATTAAGCAT GTTAGAGAGA TAGGGATCGC GATAGGTACC GTCATAGTTT 1182 0 

TGGCAACAGT TAATTACTTT TTTCGCCTCA GATAGCTCCT CTTGGCTTAA CT rGTTT C TT 1188 0 

GCTTGAATCA TATAGGTATC CTCTACAAAC CAGACGATCT GTGACTGGCA TCTTTAGCCT 11940 

GCTCGAGTTT ATTGACATAA TACTCTCGTT TTTCTTCGAC TTCGTGAATG ACAGGCTCAT 12 000 

CTTTCTTACC ATGAAGACGG ACAATCTTGG CCGGAATACC GACAACCGTC ACGT CACTAG 12060 

GTACATCTGC TACGACAACT GCTGCAGCAC CGACCTTGGC ATTTTCACCA ATTT CC AC AG 12120 

GCCCGATAAC TTGGGCATGG GCTGATATGA GGGCTCCCTT TCGTACAGTC GGATGGCGTT 12180 

TGCCACAGTC TTTCCCTGTT CCCCCGAGAG TCACTCCGTG ATAGAGAAGA ACGCCTTTTT 12240 

CAACAATCGC TGTCTCTCCA ATCACCAGAC CAGAACCATG GTCAATAAAA ACACCTGAAT 12300 

CAAT CTGGGC TCCTGGATGA ATCTCAATCT GAGTCCAAAA GCGCCAAAAC TGACTGTACA 12 3 60 

TACGAGCTAA TAGTTTGAAG CCGTGCTTCC AGAGAAAATG CGAGAGACGG TGGGCCGCCA 1242 0 

AGGCCTTGAC ACCTGGATAA GTCAGCAAAA CCTCCAAAGT GGTGCGGGCC GCTGGATCAT 12 4 80 

TTTCTTTTAC AATATCAATG GTTTCGCGCC ACCACCCCAT ACATTTCTCC TTTTCTTATT 12 540 
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ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AAGGGCAACC AACTGCTTTT 9060 

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGAGAACCT AAAGGGAGCG 9120 

TGAGATAGCC CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9180 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG GATGCACCAT GAAACCAAAC 924 0 

TCTCAAGACG TTCCAGATTC TC AG TC AT AT GGAGATAGCC CATAACCGCT TCAAATCCCG 9300 

TGGACATACG ATAAGTCACG ACATCTGCAT TTTTAGCCTT TGTGTGGCTA TTGGTATTGC 93 60 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGCGTTTCGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCCTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAG GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAAGTCACG ATTGGCACGC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 9720 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CATATCTGCA 9780 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9900 

TTAAAGTTTT GTAACTCTTG GGCATCCACA TTTCCTGTAA ATGGTTGTTC GTAACTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCG CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT GCCCATCAAG AGTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG GACTTAGACA TTTTGACATT GTCGATATTG 10140 

ACAAAGCCAT TGTGCATCCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260 

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 10320 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TACAGCAAAG 10380 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGGTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 10680 

CCTTCTTCCC TGGCACGGTT GATAATCTTA T CATCCAC AT CTGTAAAATT GGAAATATAG 10740 
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AAAATATTGT 


TAGAAATCGA 


TTTGACTGTC 


CTGAACAATT 


CGTCCTATTC 


TTATTTCATT 


7260 


TTACTATAAT 


TCATACTGCT 


CGCCCCAGCC 


AGATACCTTA 


TCTGCTATCC 


ATTTAGGAAC 


7320 


CCCTAACTTA 


AGCAATCCCC 


ATAATCGTCT 


CGATTTCTTC 


TTCCATTGCT 


TCCAGATAAT 


7380 


CACTCGTAGG 


CGAGTACGCA 


AGCGCTCATC 


TATGCTAGTG 


ACTATACTTT 


TCATATTTAT 


7440 


AATTCATTCC 


TTTCGTTTCA 


CTCAAGGCAC 


AACACAGAAT 


GAAAAAGTGT 


TGTGATCTTT 


7500 


ATTTTGTTTT 


ATAATAATAG 


TGAGAAAACC 


TAT CACT ACT 


ACAAATCACG 


GGGAGGTGAA 


7560 


TAAGTGAGTG 


GTACAGCCAC 


TACCTCGCAT 


ATTTTGTCAC 


ATCATTTAAC 


GGTACATAAT 


7620 


AAGTTGTACC 


ATCTGAATAA 


GTTGCTACAA 


TATCATTTGC 


ATGCTCTCCT 


TCACCTTTAG 


7680 


CAAAGGTTGG 


AGCTCCTGCT 


GGATGATTTT 


TATTTGCCTC 


TTTCAATTTT 


TCAATAATGG 


7740 


CATTTTTTCT 


GTATCTTTTA 


TATTATCAGG 


ATTTTTCACT 


AAGATTTTGT 


CTGGATATGT 


7800 


CGGTTTAGCA 


GAAACAATTT 


TTACTGTTAC 


TTCTTTTTTA 


TTCGAAGCAC 


TTGTCCAGTT 


7860 


TCCAGCATTA 


TCTTTAGCAT 


TTAATTTTAC 


AGTAATTCCT 


GAACTAGGAA 


CTTCAGTAGC 


7920 


AGGTTGATTA 


TCAACATTAT 


TCAACTTTAA 


TTTCAAAAGA 


GCTGTTGCAT 


CAGACGTTTT 


7980 


ATCAATCGTT 


ATATATAATG 


ATGAATTGTT 


ATTATAAACA 


GTTCCTTCAT 


ATTTAGCTGT 


8040 


TTGTGAGCTA 


CTTGAAACAG 


AACTGAAATT 


ATACCCACTA 


CCTCCCTGAT 


TATCTTCAAT 


8100 


GCTTACGTCT 


AAATGAACTT 


CCCCACTATT 


ATTTGGCTTA 


GCAACAACTG 


TTATAGTAAA 


8160 


ATAACATAAA 


ATTTGCATAA 


ATAGATTAGG 


GAAATCAAAG 


CAGCTTCTAG 


GAATGTTTTA 


8220 


GCAGTCACAG 


TGTACTTTCC 


CAGCATCAAG 


CCACTATAAC 


TCTGCACATA 


AAAATGGAGA 


8280 


AGATGGCAAT 


CCTCTTCTCC 


AAATATTAAC 


TTCTTTACAA 


ACCAACTATA 


GTTGACAAAG 


8340 


AACCTAAAAT 


CAATTGATAA 


CACAAGGTCA 


GGTCGGTCAA 


CTCTTTCAAC 


TGAAGCCCTG 


8400 


TCAACTCTTC 


CCATTTATCA 


ATCTTGTATT 


GGAGAGAATT 


CCGGTGCAGA 


TAGAGTTGCT 


8460 


GGGCTGTTTT 


AGTGAGAACA 


GCACTATTTT 


CCCAAAGAGA 


GAGAATGATT 


TCCTGAATCT 


8520 


GATCTTGATC 


CAAAATCATC 


TGGTGTAGAC 


ATTCCTTGAT 


TGGCTTCAAG 


TCCACGAGTC 


8580 


TTTCTCCCAT 


ACTCCAAAGA 


TAGAGCTGAG 


AAAAAGTATG 


AACACCTTGG 


TGACCCTGAC 


8640 


GCCACCATGT 


CTTGAACAAA 


TCCCGCTCAG 


CTTTGATTAA 


GTCTGATAGG 


GCTTGATGTC 


8700 


CCGTCTGAGA 


CCAAACCTGA 


CCCAACATGA 


TAGAAAGACG 


AAGTCCAAAG 


TCATACTCAA 


8760 


CCGCTTCAAT 


CGTATCACTT 


AAAATATCTC 


TTACAGAACT 


GTATTTGTCT 


TGTTGAAGCA 


8820 


CGAAAACATA 


ATCCTGAGCT 


CCGACCTGTA 


GCACTGTCTG 


ACAATTCGGA 


AAAAGAGTCC 


8880 


GCATCATATC 


TAGCCAAGAA 


GCCAGATTTT 


CCTGCTGAAA 


ATAAGAAAGA 


TGGCAATAAA 


8940 


CCAACTGAAT 


CTTTTTAAAA 


ACTTGCGGTG 


CCTGTCCCTT 


GCCCTCAACC 


AGATAGGAAT 


9000 
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GACCAGCAAG 


TAGGGACAAG 


AG TGTGG ATT 


TTCCTGCTCC 


TGACTCCCCA 


ATAATACTGT 


5520 


AAAATTTTCC 


GGGTTCAAAA 


TTATAATTGA 


TCTGATATAG 


G ACTGCTTC A 


GCAGTATTCT 


5580 


TATAACGGTA 


GGTAACATCT 


TGTAATTGTA 


ATAAAGTCAT 


CATTTCTCCT 


TCTTAACTAA 


5640 


TAGATGATAA 


AATTTCTTTC 


GGTGATTTTC 


TAAATAAGAA 


TAGGAAACAA 


AGGGCTACAG 


5700 


ATAAGCAACT 


AAGCAGAACT 


AGAAAAACAT 


AGGATTCTGC 


AAAAGATAAG 


ATGCTAGTTG 


5760 


AT AAACT GCT 


TGCTTTGGCT 


AGTGTATCTT 


GTAAGCTTGC 


CTGATCTCCA 


CTTGCTAGTA 


5820 


GAGTTTGGAG 


TAGGTAAGTT 


GTGATTGCGT 


TTCCTGCAAC 


AAATGCTGGA 


AGCAAAGCTC 


5880 


CAAGAGATAC 


CAAAACTACC 


TCTAAACAGA 


ATTGTAGGAA 


GATCGAGCTC 


TTGCCTTTTC 


5940 


CAAGTGCAAG 


TAAAATCCCC 


ACTTCATAGA 


CCCGTTCTCT 


CAACCAGAGA 


GACAAAACCA 


6000 


G AATTAAGG C 


TCCAGCTCCT 


GCTATCAACA 


TCCCATAAAG 


GAAGATGGTC 


AGGAAGGTTT 


6060 


GGAAAGTTGC 


AACTGAGTCT 


TTGATTTGTT 


CAAAAGCCTT 


GTTTTCCTTT 


TCGACTTGGT 


6120 


AGCCTTGATT 


TTCCAAGGCC 


AAGTTTTCTA 


CCTGCTT CAT 


GAGTCCGTCC 


ATTTCCTTAG 


6280 


GATTTTCTAC 


ATAGAAGCGT 


GCTGCACTGA 


CTTGAGCTTC 


ACTATTGCCC 


AAAAGGGTTT 


6240 


GGCTACTTTC 


ATAGTCTGTA 


AAGACTTGAT 


TTTCACTGAA 


GTCAGAAGAC 


AAGCCTGTGA 


6300 


ATTTCTCTTG 


TTTTTTACCA 


GAAAAGATGC 


CGATAATCTC 


AAACTCTACT 


GTTTGTCCTT 


6360 


TTCCAG ATT C 


AGACTGACCA 


GCATCCAAGC 


CAATCTTGTC 


ATGAAGCGAA 


AG AC CGTTCT 


6420 


TCTTAGCCAA 


TTCTTCGTGG 


ATAAGGATTT 


TCTTGGAATC 


CCCTTTTTGA 


AGGTGTCGCC 


6480 


CTTCTTTTAG 


ATTGAAAGCC 


GAACTGGTAA 


AGGTTACATC 


CTTGGATGAA 


TCCTCAAGAG 


6540 


CCGTTAAGCT 


AACCAAGTTA 


TTGTCTGCAG 


CTGATAAATC 


ATCACGCTCC 


ACGCTCTGCT 


6600 


CGCCAGTCAC 


TGCTTCCTTG 


TCTTTTAGTT 


TTGCGACCGT 


CTCAAGTTCA 


GGAGAGACAT 


6660 


TTTCCAGCCC 


CTTAATCTTG 


CTTACAGATG 


CTAGGTCTGA 


CAACTTGAAT 


GTCTGACCAT 


6720 


TCTCTATCTT 


CTTAATAGAA 


AAAGATGTAT 


TGAGTGATTT 


ATAAAGATTG 


CTTTCTAC* T *G 


6780 


TTTTGTTGGA 


CTTCATCAGA 


GTCAAACAGG 


CTGAAATTCC 


GGCCAATAAG 


ACCAATAAAA 


6840 


TCAGAAATAA 


AATAAAACTT 


CTCAGTCGCT 


TTCTGCTGAC 


AT AAGCC CAA 


GATCTTTCGA 


6900 


TTGGATT CAT 


TTGTCACCTC 


CATATTTGTA 


AGACTATTAT 


AAAACCCAAA 


TATGAAATAT 


6960 


TTATGAAATA 


CGAAAAAAAA 


ATATCGAGTA 


GGGGATAATC 


TCTAGCCCCT 


CTCACACCAC 


7020 


CATACGTGCC 


GTTCGGCATA 


CGGCGGTTCA 


ACTAACTTTT 


AACGCATGTC 


GTTCAAGGTA 


7080 


ATAATCCAAA 


CACGAAACCA 


GTCCACGTTT 


TTCAAGGACT 


GGTTTTGATA 


TAGCACGTTT 


7140 


AAGTACCGAC 


TTCTGAGCTA 


CTATAGTAGA 


TTG AAACT AG 


AATAGTACAC 


CTCTACTTCT 


7200 
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AA X vj I LTAGG 


CT CG ACAAGG 


TCTTGCTAAA 


GCCATCTACT 


TCTGCACCAC 


3720 




AGAGGtrrrGA 


GCCGCCTTAC 


TAGCCTGTTT 


GGCAACACCT 


GAAGTCACAT 


3780 


I \jOV»AAGGA*» 


AGTGTTTCCA 


ATTGCACGGG 


CAGTGTAATT 


AGCTAGGAAG 


TAAGCAGAAA 


3840 




GATAGCAATC 


AAGATAGATT 


CGGTGATGAA 


TTGACCCAAG 


ATACTTGC CT 


3900 


GL. I IajAGGCC 


GATAGAGAGG 


AGAATTCCCA 


CTTCCTTGCG 


ACGGGCGTTG 


ATCCAAAGGC 


3960 


I vJAG*_ AAGAG 


GGCAAGGAGG 


AG AACTG AG A 


AGCTCAAGCT 


ACCCCAGAAG 


AGGAGGTTGG 


4020 


t. C ATCTTGT A 


CATACCAGAG 


ATAGATTGCT 


CAAGAGCTGG 


GTAGTTAGAG 


GAGCTCTTGA 


4060 


LGAGTGTGTA 


GCTCTTCCAG 


TTGATACCAC 


TGATGCCATT 


CAACTCTTTC 


ATAACATC AT 


4140 


v_ CAAGTT CTT 


GTCTGCTGTT 


ACAAAGAAGG 


TTGCGTCCCC 


ATAAATGGCT 


GTGTCTTCTG 


4200 


TGTATCCATA 


AAGTTTTGCA 


GCAGTGTGAA 


TGTCTGTAAT 


AGCTGTGTTT 


TCGTAAAGTT 


4260 


Cl l GTG AGT A 


GGTTACTGCT 


GACTTATTAT 


GACCATCAAA 


GAGTCCCTTG 


ATTGTCACTT 


4320 


CAACTGTTTC 


CTTGGCTCCT 


TTTT C ATT AT 


CTGCATCGTA 


GATATTAGAG 


TCCAGTTTAA 


4380 


C CTTGTC C CC 


T ACTTTC C AG 


CCGTGTTTGG 


CTGCCAAGTC 


CTTGTGCAAG 


AGGATTTTAT 


4440 


C CTTGTCGTC 


GTTGGTTAAG 


TGCTCTCCTT 


CGACTAGTTT 


ATAAGAACCA 


GAGACAAACT 


4500 


TGTCTTCTTT 


AGAGGAGTCA 


TTGACACCTG 


TAATCATCAA 


GCTACTTCCA 


AAACGCTTCG 


4560 


CACGATCAGC 


AGTGAGATTC 


TTCTTGGTTT 


CTGGCGTTTC 


AATCAGGTCA 


TATCCAGTCA 


4620 


AATCTCCGAT 


AGCGTTGATA 


CGTTTGACAT 


AAGACTCAAT 


GGCCTTGTTT 


TCGGTGATTT 


4680 


TTTTGATGTC 


TTCACCCTTG 


ATATTCCCAG 


CACCACGAGG 


CGTTCCTTGG 


TTGACGCGAC 


4740 


G ATT G ATTTG 


CATGGAGAAG 


CTATTGGTGA 


TATTTTTAAA 


GGTCTCCTGA 


GAAGCCTTGG 


4800 


CAGTAGCTCC 


CTTGATTGAC 


AAGCCGACCA 


AACTCAAGCT 


CGCCATGAGG 


AGAATAATCA 


4860 


GGAAGATGAC 


AATCGATTTG 


AAAAACTTCC 


TTGTAACATA 


GGCAAATGCG 


TTGTGTAACA 


4920 


i Av*Arrcc-L r 


TTCTAGATTT 


TGTTTTAATC 


ATTCTATTAA 


AATAAGCTCA 


AATTATTTAC 


4980 


rAGTATTGCG 


CGTTTCAGTC 


AGTTTCTTAT 


CCTTTAATTC 


AAGTGTAATA 


TCTGACGCTT 


5040 


GTGCCACTTC 


TTTACTGTGA 


GTTACGACAA 


TCACACATTT 


ACCTGTTTTC 


TGGGCAAGTG 


5100 


ATTTGAGTAG 


TTCGACAATA 


TCTCCAGCAG 


TTTTAGGATC 


CAGATTTCCT 


GTTGGCTCAT 


5160 


CAGCTAGAAT 


AACTGGAGCT 


TCTGAGACCA 


AACTGCGAGC 


AATGGCAACA 


CGTTGCTGTT 


5220 


GACCACCTGA 


TAACTGGAGA 


ACATTCCGCT 


TGATCTGGCT 


TTCATCCAAA 


CCAAGCTCAA 


5280 


GAAGTGTATT 


CTTGCTTGCC 


ttittg'itga 


CCAATCGGAT 


ATTTTCCAGC 


GGAGAAAGAT 


5340 


AATCTATCAA 


GTTATAATTT 


TGAAAGACCA 


GGGAAATATG 


GTGCATGCGA 


TGGTAAGAAT 


5400 


AGCCCTTCTT 


ACGAATATCC 


TCTCCTTGAA 


AAAGGATAGA 


ACCTTCAACA 


GGACTATCTA 


5460 
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ATGCAAGTCC CCAATAACAG TCAAGAGATG CTGGTAGAGG CTATTGATTT GT TCCTTG AG 
ATTACCTATC TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA GGTCCATCAT 
CCGACGGGTC ACCCGCTTGA TTTCCAAAAT CGGTG CAACA ATAGTCCGAG CGTAGATGTA 
GGCCACCAAA AGGGAAATCA GAAAGGAGGC CAGCAAGGTA TAGGGAAGAA ACTGGAGACT 
GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCAT AGTACC 
ACCGTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 
GTCCAGAGGA AG ACTGTC CT TGACTTCTAA CTTGTCCTCG GTCATCTCAC CTTTGACGGT 
CCCCTTGATA TCACTAGTCT GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 
TTTCCCTTCT AGGCACTGCG CAATGGCTGT TGCCTTTTGA CCAATGGTTT CCTGACGATG 
ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 
ACT AAAT ATC GAGAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 
CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 
CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 
CAGACGGCAT CGATAATCTG AGATCGAGTC AAGGCCCGGC CTTCATTTTT CACTAGATAG 
TCCAGAATTT CCAACTCTTT GGCATTGATA GGCACTT CTT GACCTGCGAG GCTTGCACTG 
TAGCTTTCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCGTAGTAG 
CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA GGGAGAAAGG TTTTTCCAGA 
TAGCCATCTG CCAAAGAGGC AAAGGCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 
AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAG CTAGGACTTC TAAGCCGTTG 
AGCTTGGGCA T CTGGAT ATC CAGTAAAACC AGGGCCACCT CATAGCTAGA AAATTGCTCC 
AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 
CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAATTTTCAT ACTTTAACTG 
CTCTCTATTT TTTATTTTTC TTAGAATAAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 
GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA C GTTGTG AAA TTCCTTTCTC 
ATAAATTCCA TAACTTTAGT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 
AAGCAATGAT TTTCACCACT GCTTTCGGAT TTATTTTCAA TTGTTAAATA GCCATTCCTA 
TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 
TTTATT C ACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGGAGATTG CTTGAAGCAA 
G CG C CAT AAC GAGAACCACT AGAACCAAGC CAAGGACAAA AATGATGATA AAGTCTGATG 



1980 
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3060 

3120 

3180 

3240 

3300 
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Al^HJAGCA i 1 G 


TGCTAGTCkA 


GATTCAGTTT 


ACTATACTAA 


AACGAGTAGC 


TTGAAATCAA 


180 


AAAAL CC AC C 


CTCACAGGCA 


GGTTTTATCT 


GTATTATTCA 


GCTAGATTAT 


GCTTTACCTT 


240 


CTGAACCGAA 


TACGTCGATA 


CGTTCTTCAA 


CCGATGCTTG 


GATAGCTTTT 


ACACCGTCAG 


300 


CCAAGAATTT 


ACGTGGGTCG 


AAGAGTTTTT 


TCTTGTCGTA 


TTCTGCTTCG 


TTTGCTTCGT 


360 


AGTCACGAGC 


AAATTTACGA 


GTTGCGTTAG 


CGAATGCGAT 


TTGGCATTCT 


GTGTTAACGT 


420 


TAACTTTGGC 


AACACCAAGT 


TTGATAGCTG 


CTTGGATTTG 


CTCATCAGGA 


ATACCTGATC 


480 


CACCGTGCAA 


TACGATTGGG 


AATCCTGGAA 


G AGC TTCTGT 


CAATTTTTGC 


AAGTGGTCAA 


S40 


GGTCAAGACC 


TTCCCAGTTT 


ACTGGGTAAG 


GACCGTGGAT 


GTTACCGATA 


CCAGCTGCCA 


600 


AGAAGTCGAT 


ACCAGTTTCA 


ACCATTGCTT 


TAGCGTCTTC 


GATTGGAGCC 


AATTCACCTT 


660 


T AC CG ATG AT 


TCCATCTTCT 


TCACCACCGA 


TAGTACCAAC 


TTCAGCTTCT 


ACT GAG AT AC 


720 


CTTTAGCGTG 


TGCTTTTTCA 


ACAACTTCTT 


T AGC CAATTT 


AAGGTTTTCT 


TCAACTGGAA 


780 


GGTGTGAACC 


GTCAAACATG 


ATTGAAGTAT 


AACCAACTTC 


GATACACTCA 


AGTGCATCTT 


840 


CGTAGTGACC 


GTGGTCAAGG 


TGGATAGCTA 


CTGGTACAGT 


GATACCCATT 


GATTCAACAA 


900 


GGTTAGCGAT 


CAAGTTGCGA 


GCAACTTTGT 


AACCACCCAT 


GTATTTAGCA 


GCACCCATTG 


960 


AAGTTTGGAT 


CAAAACTGGA 


GCTTTTTTAG 


CTTCTGCTGC 


GCGCAAGATA 


GCTTGAGTCC 


1020 


ACTCAAGGTT 


GTTTGTGTTA 


AATCCACCAA 


CTGCATAACC 


GTTGTCACGG 


GCTGCTTGGA 


1080 


CAAATTTTTC 


TGCTGAAACG 


ATTGCCATTT 


TATCAGGCCT 


CCTGTATATT 


TTTATGGGTC 


1140 


ATCCCATTTA 


CATTGTTCAT 


TTTATCACTT 


TTTGCCAAAA 


AAATCTAGTT 


TTTC CCGC AG 


1200 


TTTCGATTGA 


TTTTCTTCTA 


ACTCCATCTA 


TGTAAACCCT 


TTCTCTCCCT 


AGTCTTGGAC 


1260 


GACTTTTGGA 


AAATCTATAA 


AGAAGGTTAA 


ACTATTCTCC 


TCCATCTCGA 


AACGATAAGC 


1320 


TAATTTTTCA 


TGTTCTAATA 


GACTCTTAAC 


CACAAAGAGC 


CCCATACCAG 


ACCCCTTGAC 


1380 


CTTGCGACTG 


GCATTGTCAG 


AAAAAGACTG 


GGCTAGTTTT 


TCTTGTTCCT 


CTG AGCT AC A 


1440 


GCTATTTTCG 


ATAAAAAGTT 


CTCCTTCTCT 


TTCTCCAATT 


CGAACTAAGC 


CACCTGGAAC 


1500 


AGAGTGCTTA 


ATGGCATTGC 


TGATGAGATT 


AGAAAGAATC 


AACTTCATAA 


CTGATGGGTT 


1560 


TAGATAAGCC 


TGCTGATGGG 


TCAAACTATT 


GTCTATCTGG 


AGCTCTCTTT 


CCTTGGCTAC 


1620 


CAAGGCATAA 


TCTTTGACCA 


GATTTTGCGT 


CATCTGGAGG 


AGGTCAATTG 


TTTCCCTATC 


1680 


ATCTCGCAAT 


TCCTGCACAG 


AAGAGAGGGA 


AAGTATCTGC 


AGAACATGGT 


GATTGAGTTC 


1740 


ATCCACAATC 


CCCAAGGCAA 


CTCCCAGATA 


CTGGTCTCTA 


TCCTTATAAC 


GACCGATATT 


1800 


CTCTCTCATA 


TTTTCGATTA 


GGATTTTCAA 


ACT AG CCAGC 


GGTGTTTTCA 


ATTCATGAGA 


1860 


AGCTCCTCGT 


AGGAATTCGA 


CCTTCATCTT 


CTCCAGCTGG 


AGAATGGCTT 


CATTCTTTTC 


' 1920, 
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GATGAGCAAG AAGTATAGTG TCCAAGCAGA ACCCCAAGTG ATTGTAGCAA GTGGTGCCCA 14 220 

ACCAACGTCG GTAATACTCA ATTGGATACC AGTGTTTTCA ACGAATTTTG CTAGTGATGC 14280 

TGAGAAAGCA GTGTTTAGCA TACCGATCAT AGC ACCG AT A CCTGTAAGAG CGATGGCAAG 14 340 

TTTGAT AC C A CCTTCAAGCG CTTTGGAGAA TTTCACTCCA AAAAGTAAAG CCAATACTGT 14400 

CAAAATGATT AACATGATGA CAGGTCCACC CATTTCTAAG ATGGGATTGA AAACCTTTCC 144 60 

GATTAGGTCA AAGATTGCAT CCATAACAGT TCCTCCCTTT TTGATGTTAT ATGAATGTTA 14520 

ACAAATTAGA ATTAGCTTAA TCCGTGTTCT TTAATAGCTG CTTCAATATT GTCAAATACT 14 580 

GGAGCGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 14 640 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGC 14700 

TCTTCGTTTA CATCTTT CAC CATGACTGCA TCACAGTGAA CAT C AT AACC ACGGTTTGAA 14 760 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 14820 

GCAAGAATTT TAATCATTTA GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 14 880 

GGTTGCTTCA GCAATGTAAG TATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 14 940 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 15000 

ACTTGAATTA TTAATGATAA AGAAGAGTAG GGATACTTCT ACTTCCTTAT CAGGAGCTAT 15060 

CATATTGTGA AAAGTTATTG GTTTTTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 15120 

ATGAACAATA TCTGTGTGAG GAATCGCT AC ATTTGGCAAG TCCTTTCCTA GAAATTCCAT 15180 

ATCTAAACCA GTTGGAAATG ACTTTTCACG CGTGATCAAG GCTTCACGAT AAGTTGGAGT 15240 

GACAATTTCT CGTTCTTCCA ATAAAGTTGC AACCTGATCA AAGAGTTGTT CTTGACTATC 15300 

CGCTTCTAAG CAAAACACAA GGTTTTTGTC AAAGAAATAA TCTAATACCA TAAGTTTTTC 15 360 

CGG 15363 

(2) INFORMATION FOR SEQ ID NO: 140: 

" (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28882 base pairs 
• (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 

TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 

GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
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TCACTTTGTT 


CTGTCCTTTC 


AACTATATTT 


TTAGTTTCCA 


AAGCTTTATC 


AGCCTTTTCT 


12420 


TCTACTATCA 


TTTTTTCCTC 


TTTAGGTTTC 


TCAGCAGTAT 


GAGTAATAAG 


TGTTTCATCC 


12480 


GCATAAACTA 


CAGATTCTCC 


AGCTATATTT 


CCT CCTAAT A 


AAACTG C AC A 


AGTCCCAATC 


12540 


ATTACTGAGC AAGCTCCCAC 


AG C AAA CTT A 


CGAATGCTAT 


AAACTCTTTT 


CCGATTCCAA 


12600 


TGGCCTTTC C 


CCATAAAAPf* 


CTCCTTATAT 


TATATTTAGT 


GCAGTTAGCT 


ACTACCAAAG 


12660 


CCCAAGTGGT 


at Af* ATrirrr a 


TGACAACCTA 


GTTTCAACAA 


TTTACACTCT 


GCG AAAATCC 


12720 


AATTCAAACrr 




CGCCTTGCCG 


TAGATATGAT 


TACTGACTTC 


GTCAGTTTCA 


12780 






TTTTGAGCTG 


ACTTCGTCAG 


TTTCATCTAC 


AACCTCAAAA 


12840 


CCA TGTTTTG 




GTCAGTTTCA 


TCTACAACCT 


CAAAACCATG 


TTTTGAGCTG 


12900 






AACCTCAAAA 


CTGTGTTTTG 


AGCAACCTGC 


CGCTACCTTC 


12960 


* /WJ A A i. Uv 1 


<>i i iWAl 111 


CATTGAGTTT 


ATATTTTATA 


GGAGCGCATT 


ATTTTGCTTT 


13020 




TCTTCG TT AC 


GTTTGATCAT 


TTGTTTTCTG 


TACCAAGCAA 


AGATACCGAT 


13030 


AT AG AAT AC* A 




CTGCACCAAG 


GATTGCTTTG 


ATATCACCAG 


TTGTAGTGTT 


13140 






GTTTTTCGAT 


TGGTCCTTCA 


AGAGTAGAGT 


GACTAATCAA 


13200 




CTCAC AC CTT 


CTGGGAAGGC 


ACCTACACCT 


TTAGCAAGTT 


CTGTTGCAAA 


13260 


TfifJIV^MiATl 
i vjw 1 A 




AAAGAAGGAA 


GAGTGGCAAC 


AAGAGTGTTC 


CGAAGATAAT 


13320 




AATTTACCAC 


GAGTTACAAC 


CAAGAG AG CT 


GGAGTAACAC 


CCATAGCGAT 


13380 


GATACfTnT'A 




TACCATTTCC 


AACTTTTGAA 


AGAAGCACTG 


CTTCAATCAA 


13440 


C ATG ATTGGT 


G<~AA£yFAfWT* 


TGGCACAAGC 


CCAGATTTCA 


GCACGACCAG 


CGATGAATGG 


13500 


CCAGTCAAGA 


CCGATATTGA 


ATTTACGTCC 


TTGAAGACGT 


TTAGTAGCAA 


CGTTTGTAAT 


13560 


ACCTTGTGAT 


AGTGGTTCTA 


CGGCTGCGAT 


GAACCATGAA 


CCGATAAGTG 


AGAAGAGTTC 


13620 


CAAAGATACA 


CCGGCAGTCA 


AACCAAGAGA 


CAACCATCCT 


TTGATAACAA 


GACGCCATTT 


13680 


ATCTGCATCT 


GCAACACCTG 


CAATTGGATG 


TGGAGTTCCC 


AT AAT AC CG A 


TAACGATACC 


13740 


AAGGATGAAA 


CCGATGAAGA 


ATTTAGATCC 


CCAGAAACCG 


ATTTTCTTGT 


TCAATTTAGC 


13800 


AGCATCAAAG 


TCATATTTAT 


CAAGGCCTGG 


GAAGAATTTT 


TCAAAAATCT 


TATCCAAAAC 


13860 


C ATG AT AA CT 


GGGTTCATCA 


TGTAGTTCAT 


GTGAGTTGAT 


GTCATTGGTG 


ATGAACTTGG 


13920 


GGCGTTAAGA 


AGGTCATCAA 


ATGTAGGTTT 


CATCAAGTCA 


GAGTTGATAA 


TTTTCAACAC 


13980 


ACCGACAAGG 


ACGATAGCTG 


CTGTAGCAAT 


AAAGAGTGAA 


ACCCCTTGAC 


TCACACCATT 


14040 


GTTATCAGCA 


TACCATTTAA 


TCAAGAGACC 


TGTGATAGAC 


AAGTGCCAGA 


TATCAAAGAT 


14100 


ATCGACATCA 


AGTGTATCTG 


TTTTCTTCAT 


AGCTAGCATC 


ACTATGTTGA 


CAATCAACAT 


14160 
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TGTTTCAATT CACGTTCAGG GCGATAGTAA CTTCCACGTG TACGGGTAGC TGAAGATGTT 10680 

TCTGATCCAT AAATCAACCA TTTTG GATGC TTAGCTCTAA GGGCTTTGTA ATTATCTTCA 10740 

GAATAGTTAA ATCCAACAGC ATCGAGTTCA TCAGCAATTT TCTCATGCCC TCCGCTACCA 10800 

TT AC CG AAA C GGAATTT AT C TGCTCCCATG GTAACATAGC GAGTCTTATC AACATCCTTG 10860 

ATAACCTTAA CCAAACGTTT AACAGTTGCT AAAGAGTGGG CATCACCATT AGCTTCACCT 10920 

ATTTCATTAC CAATTGACCA CATGAAGATA GCAGGGTTGT TTTTG CCTCT TTCGACCATG 10980 

GTACGTAGGT CAAAATCAGA CCATTTTTCA CCTTTTCGAG CTTCTGGGTG AGTGGCATCT 11040 

TTTTCAAAGA AACGTCCATA GTCATAAGGT TTCTTGCCAC CATACCACGT ATCAAAGGCC 11100 

TCTTCCTGAA CGAGTAAACC TAGTTCTGCT GCGATTTGCA AGGTTTGCTC ACTAGCAGGG 11160 

TTGTGGGTTG T ACGGATGG A GTTAACTCCC ATCTCCTTCA TTTGTTTGAG ACGGCGATAT 1122 0 

TCTGCTTTAT AGTTTTCTTC TGCTCCAAGC GCCCCATGGT CGTGGTGCAA GG AT ACT CCA 1128 0 

TGGAATTTAA TACGTTCACC ATTCAAAGAG AAAC CTTC AT TTGGAGTCCA GTGATAGTAA 1134 0 

CGGTAACCAA ACAAATCCTT CTTAGCATCA ACCAATTGAC CGTCACCCTA AACACGCGTA 11400 

ATCAATT CGT ACAAGGCAGG TTTGTCATTT AAAACAGTCC AGAGTTTTGG TCTTTCAACT 114 60 

TCTAAAATCG CATCTAGGCT TGTTGATTCA TGTGCTTTTA AGGTACGACT CGCTCTACGA 11520 

ACTAAGCCTG TTACAGCATG ACCACCTCGT TCAACGATTT GATATTCGGC TACAAGTTCA 11580 

TGGTCTTTGT CGTCCGTATT GACGATTTTG CTGGTCACAT GAGTTTCAAC CTTGCCATGT 11640 

TGTTGTTCTT CAAGTTTTGG TGTTAAAATA GTTGTCCCAT TTTTCTCAAC ATGCACCTTA 11700 

TCTGTCACTT GTAAAGTCAC ATCACGATAG ATACCACTTC CTGAATACCA ACGGCTACTT 117 60 

GGCTGTTTGT TGACTGCATG GACAGCAATC ACATTCTCAC GACCATCTTT TTGAAGGTAT 11820 

TTGGTGATAT CATATGAGAA CTGGTTATAA CCATTTGGAT AATGCCCCAC TAACTGACCA 11880 

TTGACATAAA CTTGAGAATC CATGTAGACG CCATCAAAAG TAAGGCGAAC ATTTTTCTTG 11940 

AGGTCTTTTT CAT CT AGTTT GAAAGTCTTG CGATACCAAG CTTCCCCACC GTTGAGCTGT 12000 

CCACCTTCAT TTTGTGCAGG AGATTCATGA TCGAAATCGT T AAAGAT A CT CCAGTCATAC 12060 

GGTAAATCTA ATTTTTTCCA CGTAGATACG TCTGCATCAG GTTTAATGGC TTCCTTAGAA 12120 

TTTGCATTGA GTTTAAAGTA CCAATTTTGA TTAAAATCCA CTTTCCTGTC TTCAATCATT 12180 

TGATTCACTT CTTCATTTGT TACAGCTTTA GCATCTTCCT TGAGCGGTTT TTCTTGATTT 12240 

GAAGCTTGTG ATTCTATCCT TGGAGCTTTT TCTTCCGGTT TAGCAGACAC TTTTTCCTCT 12300 

TTTGGAGTTA CGGCTTCATC TTCTTTCTTC T CAG ATGCAA TAGCCTCAGT TGAACTAGGT 12 3 60 
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G TT AC T TT AT 


CAGCTGGTAA 


TACAGCTTGC 


GTTCCATCTT 


GATAGTGAGC 


TCGAACCGAC 


3880 


AATTTGACAG 


TTTGGTCTTC 


TTTGAGACTG 


TCAGCTTTTT 


CCACTTGCAA 


GCTCAAGTGA 


8940 


GCAATTTTTG 


GCGCTTCTTC 


AAGGAATTGA 


ATTGCATAGG 


TTTGAAGAGG 


GCCACCATCT 


9000 


TTAGGCTGAA 


TAAAGATGCT 


CGCACGCATG 


CCGTTTGCTG 


CGCTTGCTTG 


AAGAACTGTA 


9060 


ACAGCTGCAT 


TTTTAGCACT 


TGCTGTGACT 


TCTGGCAACT 


TAGCTCCATA 


AGCAAGAGTG 


9120 


CGGTATTGCA 


TTGGTTTTTG 


ACTAGTAAGA 


CCTGTTACTG 


CCTCACCACC 


AACCGTTACA 


9180 


GTTGGTACTG 


CAGGTGCCGC 


AGGATTGCCT 


TCTTCTACCA 


CAAGGGTTGC 


ATGAATTGGT 


9240 


TGACCTTCTA 


AATAACCGGT 


CGCTTGAATA 


CGAGAACCTG 


GAATTGCTAA 


CTTAGCTTTA 


9300 


TCTTCTTCGG 


CAATCTCCCA 


CTTGTCCACT 


TCATACTCTT 


CAACACTTCC 


ATCAATCAAA 


9360 


ACATAGGAAA 


CAGATTTGTC 


TACAGAATTC 


AAGTCAGTAT 


TTGGAGCAAT 


ACGTTTCACA 


9420 


ACTGGTAGCT 


CTGATTTAAG 


AGCAATCACT 


TCTACACGAG 


CTTCTACTTC 


TCGTCCGTCA 


9480 


GCCATACCTT 


TCACCGTTAC 


AATACCAGGC 


TTGCTCACAT 


CTACTGAAGA 


C C AGGTTAC A 


9540 


GGACGTTCTG 


CACGGCTACC 


ATCACTGTAT 


ACAAACGGAA 


CAGTGGTAGG 


CATTTCAGGT 


9600 


GCCTCTCCAA 


TAATGGTCTG 


TACTTTTGGC 


ACTTCTGTCC 


CCAAAACAGT 


CTTCTCTTGT 


9660 


CCTTCTTTCT 


TACCAGTAAA 


GACAGTGACT 


TGGTTCGATT 


TCAAGAGATC 


AGAGTGGGCA 


9720 


GTCAGGGTGA 


ATTTCCCTGC 


TTGTTCAGTT 


GATTTGACAA 


TGGCAACACC 


TTTAC CATT A 


9780 


AATGCTTTAC 


GAATCCAAGA 


ACCATCTGCT 


TGC GCCTTAT 


AGCGTTCACG 


GCTGGCTTGT 


9340 


TCTCCGTTAT 


CTACACCGAC 


CAGTTGACCT 


TGGCCATGCA 


ATTGGAAGCG 


AACCAGATTA 


9900 


TTACCAGTTG 


GAACCACATT 


CCCCTGCCTC 


TCAACAATTT 


CATAGTAGAT 


GTAAGTCAAG 


9960 


TCTTTT CCAT 


CTGCTGCAAT 


CGCATGGTCT 


TCCTTAATAA 


GACGAACTGC 


CGCTGGCTTA 


10020 


CCAGCAGTCG 


TAATCTTATC 


TCGAGCAATT 


TCCTTGC CAG 


ATTCATCACG 


AGCAATTGCT 


100BO 


TCCAAGGTAC 


CTGGTTGATA 


GGCAACTTTC 


CATTCAAGAT 


AAAGTTCATT 


AGCATTTGCA 


10140 


CCTTCTTGGT 


AAGTCCGCCC 


ATCGCTGGTT 


TGTTTTTTAT 


TGAAAGTCTT 


AAGACCAAGA 


10200 


GATTTTCCAT 


TCAAGAACAA 


TTCTACACTA 


GAAGCATTCG 


AATAAGCACG 


AACTGGAATC 


10260 


TTACCTTCTG 


AGTCAGCTAC 


TTTGGATCCT 


AATTCTTTGT 


TTTCCCAGTT 


CCAGTGAGGA 


10320 


AGAAGGTGTA 


CCATCGGTTT 


CTTCTTAACA 


GAAACC CATT 


GGCTTTGGTA 


GAGATAGAAG 


10380 


TCATGTTTTG 


GAATGCCGGC 


TGTATCTACG 


ATACCAAAGT 


AAGAGCTCTT 


AACAGGAGTT 


10440 


TGATTTTGGT 


TGTGCCATGG 


TGTAGGTTCA 


CCAATATAGT 


CCGTACCTGT 


CCAGATAAAC 


10500 


TGTCCAGCAT 


AGCCAGCGTT 


GTCACGGTCA 


AAAGTCCATG 


AAGCGGTTGC 


TGTTTTCCCC 


10560 


CAACCCACAC 


GATCATTTCC 


ATAATCTGAC 


TGTTCATAAT 


TACGCTCAGG 


TCCATTGCTA 


10620 
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TCATGAGAGT 
TCATAGTTAG 
TCAGCAACAA 
AGATTTTCCG 
TTACCTGTGA 
AAGCGAGCTT 
AAGTGCAGAC 
GGCTCACCTT 
GTGACTGCCG 
AAGTTTGC T A 
ACTTGTTTCG 
ACCATGCGAA 
CCAGCCTTGA 
TTAAAGACAT 
TTACCAACAT 
AATCCGACAC 
ACACCGACTG 
TTATTGTAGG 
GAAGCAAAGG 
TCAGTTTGAG 
CCATTAACTG 
ACCTTAGCTG 
GGTGCTTCTG 
GAGACTTTTG 
TGATAGGAGT 
AAACCTTTGT 
GTCACTACAT 
GTGAGTTCAA 
TTATGCAACT 



TTTTGTCAAT ATCATTGGTT GCTGAAGCAA 
GGTTATCTGA AAGAGTCTCA CCAAGTTTGT 
GG TT ACT AC C AAGGACACGG CCTCGAACAG 
CTGGAACTTC TTCCCATTCA ACTGTCAGGT 
AGTAAACTGG AACCTTAGTC GGCAATTCAA 
GTTTAACCGC AGCAACTGGT TTATGAGAAA 
GGTATTCTCC TAAGATGTCG CCATTTTCAG 
CACGAACGCT TGGAACGACG GTAGCGAGAC 
GAACTTTTCC ATCTACAGAC TCAAGGTAGT 
AGTCTTTGCC GTCAACTTGG ATTCTTGTTT 
CAAAGATTTG TACCTCTGTG ATAGACGTTC 
TACGAACAGC ATAGGTTTCA ACTTTATCAA 
GTTGAGCAGG GGCTTTTAGA TTAGTAACTG 
GGTCCTCATT ACCAACAAAA CTAGGGTTTT 
AAT ACT C AAT CACATAAGAC TTCGGTACAC 
TTAGATTATC AACGGAGCGT TTGCTCAAGA 
AAGCTTCTGG ATTAGTACGA TTCCAGTTTG 
AAATGAGCTT GTCATTAACA TTTGAAACTG 
CAAGTGGCAA TTCTGAACCG GTCCATTGGT 
CAGATACGCG AACATGAAGT TTAGTTGTTA 
TAAAGACACC TTCCTTAGCG TATTGCTCTG 
ATGAAACGTG ACCATTTGAA TCATATGTCC 
CGATTGGAGT TGTCACACTG ACTTCTTCAA 
CACGCGCTTC AAGGTCAATT CCTTCAACTT 
CTAGTTTTTC TTTCGGAATA GCTTGCCAAG 
CATACTCAAC TGTTACTGTT GCTGGAAGAC 
TTACAGGACG GATGGATTGC GCAATCTTCT 
CTTGGTCTTT AGCTCCCTCA TATTCAGCGT 
CAAGCATTCC TTTACGAATT GCGACTTCCC 



AGGCCTGGTT 
CTGTCACTCG 
TAAATTGACC 
CTTTTGTTTC 
GTGCTTGACC 
GTAAGCTCTT 
CTTTCGCGAT 
CATTGTTGCT 
AGTCTGTCAA 
GTCCTTGCTT 
CACGCTTGTT 
AGCTAAAGTG 
GTTTCCAGTT 
TAG G AGCTGT 
CAACTCCATG 
TACCTGAATC 
TCCAACGATT 
GGTCGCTTGG 
CAGAAATGTT 
ATTGCGTACC 
GACGAATCGC 
GAACACTTTC 
CTGAAACGAT 
TACCTAGTAC 
TGACTTTATG 
TTGGTTCCTG 
TCTCAGTATT 
TCAGAGTGAC 
CTTCACCACT 



ACTGTTTTCA 
TACAGTGATC 
TGCTTTTGTC 
GTAGCCGTCT 
TACTTGTAGC 
ATCCTTAGTG 
GACACGAACT 
AACACTTGCT 
ATCAGGGTTG 
GGCTCCCGCA 
ATCTGCTTTA 
GTTCATTTCT 
GGCAGAATCA 
TGGGACAGTC 
GTCTTCATGG 
TCCAAACAGA 
GGCTGGTTGG 
ATTTGAGTCT 
TGCACCTTGC 
TTCTAAGCGA 
AT CCCATGC A 
TGGTAATTGT 
ACCTTCTACA 
TTCAAATGTT 
AGTTTTAGGG 
ATGCAAATCT 
GGCTTGGATA 
TGCTCCTGGC 
TGTAGAGAAG 



7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 
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CTATAATGAC 


AAGGAGGATA 


GCTACACACA 


TCCTGATGGC 


TGGTATTATC 


GTTTTCACCA 


5340 


TACCAAATAT 


CAGAAAACAC 


AGACAGACTT 


TCAACAAGAA 


ATCAAGGTTT 


ACTACGCCGA 


5400 


CGAACCTGAA 


TCAGCCCCTC 


AAAAGGGACT 


GTATATGAAC 


GAACGCTATC 


AAAACTTGAA 


5460 


AGCTAAAGAA 


TGTCAGGCGC 


TTTTATCTCC 


CCAAGGTAGA 


CAGATTTTCG 


CTCAACGCAA 


5520 


GATTGATGTG 


GAACCTGTCT 


TTGGGCAGAT 


AAAGGCTTCT 


TTG€GTTACA 


AGAGATGTAA 


5580 


TCTGAGAGGG 


AAGCGTCAAG 


TGAGAATTGA 


CATGGGATTG 


GTACTTATGG 


CCAATAACCT 


5640 


CCT AAAAT AT 


AGTAAAATGA 


AATAAGAACA 


GGACAAATCG 


ATAAGGACAA 


TCAAATCGAT 


5700 


TTCTAACAAT 


GTTTTAGAAG 


TAAAAGTGTA 


CTATTCTAGT 


TTCAATCTAC 


T AT AC AAT AA 


5760 


GAGAATGACT 


CAAAATTAAA 


AAGCT AGAGT 


TCCACAATTG 


G A AAT AT CT A 


GCTTTTTTGT 


5820 


GGTTGAGAAC 


TATTTTGTCT 


CAGGCTCTTT 


ATCTTCTATT 


TAGGACAAGA 


GTTTTTCTTT 


5880 


GGTCTTTAAT 


GATAAAGAAG 


GTATC AAAAT 


TTCTAGTCTT 


CTTTTTT AC C 


TTTAGTAACT 


5940 


ACT AAT CCTG 


CACTCAAACC 


TAGAAGAGTT 


AAACCTGCTG 


CTACTGCTGC 


TTGGCTTGCC 


6000 


GCACTACCTG 


TACTTGGTAA 


CTGGGCTTTA 


TTAGTTTGAC 


TAGCTTCACT 


TGAATCAATT 


6060 


GGTTTTGTAT 


CTGCTTTTTC 


TGACACTTGT 


GGTTTTTTAG 


CTTCTTGAGC 


TACTGGTTTG 


6120 


GTT C CAACC A 


AGACGATGCG 


GTCTGTCGGA 


ACTTCTACCA 


CTTCACGGAG 


TTTTTCTTCC 


6L80 


TTACTTCCAT 


CAGGATTAAT 


CGCTGTAAAG 


ATACGTTCTT 


TTCCAACTTT 


TCCTTCTTGT 


6240 


TCTACACGAG 


TTTCACCTAG 


ATACAGTGTT 


GAATCTTTTT 


TCTCAACTGT 


CTTGTATGCC 


6300 


AAATCTTTTT 


CAACAAATTC 


GATTTTTGGA 


AGATCTTCTT 


GTACAGCAGC 


AACTGTCTTC 


6360 


TCAGAAACTG 


GTTTTTCCTT 


AGTCAAGTGG 


ATACGCTATT 


CCTTGACTTG 


TTTTCCACTT 


6420 


TC TGAAACG A 


GGCGAACAAG 


TACTGGAAAG 


CTATCTTCTC 


CACTATCTAC 


CACAGTTGAA 


6480 


GCTACTTGAT 


TGTTTTCTTC 


AACTGAGACT 


TTTGGCCGTT 


GACCTTTATA 


GGTAATTTGA 


6540 


TAGTCTTGAC 


GATTTTCAGC 


GAAATCAGCA 


AGTTCTTTTC 


CATCTACAAG 


AATCTTTGAT 


6600 


TGAGTGCTTT 


CTTGAGGCAA 


TTCACTTGGT 


GCAAGGAAGG 


TCATCTCAAT 


CATCGCAACA 


6660 


CCGCTCTTAT 


CTGCTTTACG 


CTCCATACGC 


CATCTCATAG 


CTTTGGCTTT 


GATAGCTTTA 


6720 


AATGTTACGT 


TGATTTCATC 


ACCAGCTGCA 


ATGTCTTTAT 


CCGCACGATA 


AGGAACAGCT 


6780 


TCCCAATTTT 


CTGGATTGTT 


GAATGGATGG 


TCTGCGTCGT 


AGGCTTGGTA 


CTTTGAATAG 


6840 


T AGGTTGGC A 


CTTCAAACTC 


TGGACCGACA 


TAGCGTTCTA 


AAACGAGTTT 


AGATGGTGCA 


6900 


TCCGTACCAC 


TATCTGCAAA 


GAACTGAACT 


TTTCCTTGTG 


TAACAGTCCG 


TTCTACAATC 


6960 


TT AC CATTTT 


CACGGAAAAT 


CACACCCGCT 


GATACTTCTG 


GATTAGAAGA 


TGGTGTTGGT 


7020 


GACCAGTTTG 


TCCAACGACG 


ATTTTCTGAA 


TGATCTCCGT 


CATTGAGATA 


GTCAACGCGG 


7080 
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TCAACCAT C A 


T>/-«»T I r» ft TVL/"" CJV 
1 V. 1 LA i«J<j , o 1 




AAACCATTGA 


TCAAATGGGA 


AACAATCTCA 


3600 


A L JTU GGA T 


ACTTTCTCAA 




GTTTC C ACCT 


ACAATTCATA 


AGAATGCGCA 


3660 


CGOU *AAH_A 


ULi 1 V_AIjAL»Vj i 


- vjv_ 1 1 X r\/^ 


GT AGT TTGC A 


AGCCCAATTC 


AACCGTCACA 


3720 


TGCATGCACT 


CLuA i AAL i 




TCG ATGGT TT 


CGTCTGGTAA 


ACAGTCTGGG 


3780 


CGCGTTCCAA 


TA1 T (jAl I Lt_ 




P^CTCATTGA 


TAGCCTGTTC 


ATAACGCTCT 


3840 


CGAATAACTT 


/-•ft TV /—* t •ip'P^'P/** 

LLALL I l IIU 




GTAAAATTTT 


GAAAATAAAC 


CAGATACTTC 


3900 


C G AAC ATCCG 


GCC AC nvjLb 




TPAATTTCCT 


TATAAAATTG 


CTCACGGATA 


3960 


GGCGCATCLG 




nnrATCTC c a 

VjVJ A\ ^ V^ ± 


GAACCAGAAA 


CCGTACAAAA 


AGTACAGCCC 


4020 


k_l_A 1 AL>v_t_ A 


^.AVj * LLIwAl v- 


ACGATTGGGA 

nv»WA 4 * VJ\J/1 


CAATCAAATC 


CCGCATCAAT 


AGGGACTTTA 


4080 


AAAGTCn x T 


V_ 1 LLAAAbAu 


TTTTCGATAA 


TAATCATTCA 


AGGTATTATA 


AGATTTCATG 


4140 


ACTTTCATTA 


1 AALAAnnAl 




CTCAAAAGCC 


TGACTTTC CT 


AT AAATT CCT 


4200 


CTG l I I CTCC* 


111 \_A I I AU 




TGATACAATA 


TGCGTATGAT 


TTTAATGAAA 


4260 


TTAGCATCTA 


TTTTATTATT 


flATAPTfiAPC 


TTAGTCGTCT 


GCATTATCCT 


AACCAAACTT 


4320 


TTTAGATTAA 


TV ft ft 7A ft /^T 1 ft 

AAAAAL 1 Avju 


ALvjArtA*- III 


GCGGATTTGG 


CTTTTCCAGT 


CTTGGTATTT 


4380 


GAGTATTAUT 


IvjAX 1 /\\— AVjVv. 


TAAAACCTTT 


ACCCATAATT 


TCCTCCCTAG 


ACTGGGGCTA 


4440 


GCCCTCTCGA 






TTTTTCTTCC 


TTTTGAAAAA 


ACGCAGCTTT 


4500 


TACTAC(_(_ 1 A 


AAl 1 lAlv,Art 


ATTCTTCTGG 


CGTGCAGGAT 


TCTTATTAAC 


CCTTATCATG 


4560 


TATATAGAAA 


TV™" ITTTTTflt 
L Aj A 1 IVJ 1 i \ir\ 


ATTGTTCTTA 


ATGAAATAGT 


CGAATCCCTA 


AGCATTTTCT 


4620 


AGGGATTTIl 


GCTTTCTCTA 


P A A AAT AGTA 


TAGACAATAA 


C ACT AT AC AA 


TTTTATACAA 


4680 


J»/-»Jva ft ft i"" ft /""P 




AGT CTCTT AT 


AT C C AAAAAG 


GCAACGGATT 


TGCCGTTGCT 


4740 


TTTTTGGATG 


vjI 1 a\v»»vj/\ 1 rW* 


TCTTGGTAAA 


ATAGAATTGC 


CCAATAAACC 


ATTT AG AAAG 


4800 


GCTATCCCAT 


GCATATTCAC 


TATAACACAA 


ATCAAACAAC 


TTTACCACTA 


GAAATCAGTT 




CCTTCTTACC 


ACAAGATCAT 


CT CGTTTTT A 


CTATTGAAAA 


AGTGGTGAAT 


ACCTTGGAGG 


4920 


AACGTCACTT 


CTACACCTCC 


TATCATGCCT 


TTGATCGCCC 


GTCTT AT C AC 


CCTAAAATGC 


4980 


TTGTATCTAC 


TCTTCTATTT 


GCCTATTCAC 


AAGGGATTTT 


CTCTGGTCGA 


AAAATTGAAA 


5040 


AATCGAAGAG 


TTAGTGACCT 


TAGATTGTTT 


GTTTATTGAC 


AGAACTAAGA 


TTGAAGCCAA 


5100 


TGCCAACAAG 


TATAGTTTTG 


TGTGGAAGAA 


AACG AC AG AG 


AAATTCTCCG 


CCAAACTTCA 


5160 


AGAACAGATA 


CAGGTCTATT 


TTCAAGAAGA 


AATCACTCCC 


CTTCTGATTA 


AATATGCCAT 


5220 


GTTTGATAAG 


AAACAAAAGA 


GAGGGTATAA 


AGAGTCAGCT 


AAAAACTTAG 


CGAATTGGCA 


5280 
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TGGCAATATC 


CGTTGCGCGC 


ACACTTAGCA 


AGAAGGTATG 


GAGGAAGCGG 


TTGGTCATGG 


1800 


CTGTTAAAAA 


TCCAATTAAA 


AAACCGCCTA 


GGATTGAAAA 


GATGAGCGAA 


CTGCTAGCTT 


I860 


GCCCCAGAGA 


AAAAGCTCCA 


GTTGTCCAAG 


CTGTCAAAGC 


TACCTGAAAA 


CCCACCAAAC 


1920 


CAGAAGCATC 


ATTCAAGAGT 


CCTTCGCCCT 


TAAGAATATT 


GGACACGCGC 


TTAGGAAAGC 


1980 


TAAAACGCTC 


CGAAAGAGAG 


GCAAAGGCCA 


CCAAGTCCGT 


AGGACCAAGG 


GCTGCCCCAA 


2040 


CAGCCAAGCA 


AGCTGCCAAG 


GGAAGGCTGA 


ACCAAAGAAG 


ATGGGCCAAG 


CCACCCAAAC 


21O0 


TCAGGGTCGA 


GATAAAAATC 


ACTGGAAATA 


TGAGATAAAC 


AAT GATTCGC 


CAGTGTTTTA 


2160 


AAATAGCCGT 


AACATCTGCT 


TCTTCAGCCT 


CTCGGAAAAG 


CAAGGGTCCG 


ATAACCAGTG 


2220 


CCAAAAACAA 


CTCCGTATTA 


AGGTGAAAGT 


CAGTATTGGG 


T AAAAAG AG A 


CCAATCACAA 


2280 


TTCC CAAAAG 


AATTTGCACC 


AAAGGGAGAG 


GCAAAAAGGG 


CAGGAGCTTA 


TTGGTTGTAC 


2340 


TTGAGACAAT 


CAAAACCAGT 


AAAAATAGGA 


TGAGGTAAAT 


CAGTAATTCC 


ACGCACGTCC 


2400 


TCCTTAATCT 


TTTTTACAAC 


AGGATTCAAA 


TATCTCCTTC 


TGCTCTTTGA 


TTTTTTGGTC 


2460 


AATCTTGGAA 


CAGTCTTTGT 


GCTCAATTTT 


TCTCTGGCAC 


CGTTCCATTT 


CAAGAGCAAC 


2520 


TAATTTTTTC 


TTGATTTTAA 


GCATTTTTTT 


GCTCATATGC 


GCTTGGTCTA 


GCACGCCCAT 


2580 


CGCTCGTTCG 


TGGTGGGTTG 


ATTCAACAAA 


ATTCTGGCGC 


ATGGCATCCA 


GCTTTTCGTG 


2640 


TAAGTATTGT 


TTATCCATGT 


CTGTATCTCT 


CTAATTTTTC 


AATCATCACT 


AAAAACGGCG 


2700 


CGTTGTTGAC 


TTGGTTTAAA 


GTTCGGTAAA 


TGGCAGCTGT 


GTACTCTTGT 


TGGTTCAACT 


2760 


GGATCACAAA 


ATCCAAGACA 


GCATCTCTCT 


CGAGATCGCC 


TCCTTCATGA 


CC AT AG T AAA 


2820 


TCATAATAGC 


AATTCGTCCA 


CCTTTGACAA 


GTAAGCCACA 


TAGCTTTTCT 


AATGCCTCAA 


2880 


TCGTTGTCTG 


CGGTCGGGTG 


ATGACAGACT 


TATCAGCTGC 


CGGCAAATAG 


CCCAGATTAA 


2940 


AAATCCCTGC 


CTTAGCTTTT 


ATCACAAACT 


GGTCCAGTGT 


CTCATGGCCT 


TGCAAGATTA 


3000 


ACTGGGCATT 


TGTCAAGTCA 


GCCTGATGCA 


AACGCTCTTG 


GGTCTTTTCC 


AAGGCTTGCT 


3060 


TCTGAATATC 


AAAGGCATAG 


ACTTGCTTGG 


CTAGCTTGGC 


TAAAAAAAGC 


GTGTCATGAC 


3120 


CATTTCCCAT 


AGTCGCATCC 


ACTACGACAT 


CCTCTTTTGT 


CACGACCTCA 


GCCAAAAAAT 


3180 


CATGTGCCAT 


CTCAAGTGGT 


CTTTTCATTT 


TCAAACTCCT 


GTTTTACAGC 


CTTGCATCCT 


3240 


TGAACACTTC 


CACGACGTCG 


CATCTCCATC 


TCAATGCTGT 


TGAGGACTTC 


CCATTTATTG 


3300 


AGGCTCCACA 


TAGGACCAAG 


CAGCATATCC 


CTAGGCGCAT 


CTCCTGTAAT 


TCGATGGATG 


3360 


ACGATATGTT 


TGGGAATAAT 


TTCCAGTTGG 


TCACAGATGA 


CCCTGACATA 


TTCGTCCTGA 


3420 


CTCATCAATT 


GTAAACGCCC 


CTCATGGTAA 


TCTCGTTCCA 


TACGAGTATT 


TGTCATAAGA 


3480 


TGGAGCAAAT 


GCAGTTTAAT 


CCCTTGAATA 


TCGTTATCCG 


TGACACAACG 


GCGGACATTT 


3540 
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CCGGAGGATA 


TTGACCACCA 


CCAAAAGCAG 


GGGGAAAATC 


G AAATC AAC C 


AATAGTAGGC 


60 


TACTGCGACA 


CTGGTCAACT 


CACTATCTGA 


TGCTTGATAA 


TAATGCAAAA 


AACCTTTTAA 


120 


TAAAGGTTTG 


TCTATCAGCT 


CTTTCCACCA 


CTTTTTCATG 


TCATACTCCT 


TCACTTATAA 


180 


TCTTATACTC 


AATGAAAATC 


AAAG AG CAAA 


CTAGAAAGCT 


AGCCGCAAGC 


TGCTCAAAAC 


240 


ACTGTTTTGA 


GGTTGTAGAT 


AAGACTGACG 


AAGTCGATCA 


CAT A CAT AC G 


GTAAGGCGAC 


300 


GCTGACGTGG 


TTTG AAG AG A 


TTTTCGAAGA 


GTATTAACTA 


ATTTCTTCTT 


ACCAATTCCA 


360 


C C AT ATC AT A 


CGGTAGGGTA 


TTGGCAGCTT 


CCTTCAAGGA 


ATAGTTCTCT 


AAGTT ATTT A 


420 


C ATT TTGTC G 


TAATTTCTTG 


GCATACTTAG 


TCGTAATCAA 


TCGTTTTTCT 


TCGTATTCGA 


430 


AAATC AAC TT 


GCGCTCCAGA 


TAATAGCCTC 


TCAGCATTTC 


ATCGATATTG 


TTGGGTTTGA 


540 


CACGATTGAT 


AAC C CGTTCG 


ACAAAGG C AC 


CACTGCTGAT 


AAT AG CTGTT 


TCTCGAAGAC 


600 


GAGACTCCTG 


CATAAAACTA 


ATC AAAG AGC 


GTCTGTAGAC 


TCCCTTCAGG 


TTTTCCAAAC 


660 


TTTCAATAAT 


CATCTCTGTA 


TTGGCAAGAT 


AGAGCTCTGC 


AATTTGGTCA 


T AATC AAG AG 


720 


CACGGAGACG 


GCTTTGCTCC 


TTGTTCTTCC 


AGCTACGGAA 


GGTCTTTCCG 


AGAGTAAAAA 


780 


CTT C ATG AAG 


GAGAAAACGT 


AAAATCCTCA 


AGGAAACAAG 


AAAATAATAG 


GTCAGTCTTG 


840 


AG GC AAGTTT 


ACGATTGATT 


C CTTGTTCT A 


TATTTTTCAG 


ATAACGTTGG 


TAAACTCGGT 


900 


AAGCACGATT 


GCTAATGTTC 


CCCTCTTCAT 


AGGCCTGTTC 


CAAACCATCA 


CTTTCAATAC 


960 


TAAGAATCAA 


GAGTTTCAAA 


GCAGCCCAGT 


CTTCTTGATC 


ATCCTGGTTT 


TCTTGGCTTA 


1020 


AAAT G AG AT T 


TTCAATACGT 


CCATGATAAT 


TGTCAATAGC 


CGCATAGAGG 


GG AAGTTT AT 


1080 


TTCTGGTGTC 


TTCCAACTCT 


TTTTCCAACT 


CTAGCGTTAC 


TTC ATT CAAA 


ATGGCGATAT 


1140 


LK-ATAAGATA 


ATCCTTGCTT 


TCTTCCTCTT 


CATC AG AAAG 


ATGAGGCAAG 


ACCAAGAGAC 


1200 


CTGTTAAAAA 


GCTAACAAGC 


GTCACACCTG 


CAACAAGGAA 


AAGCAAAAGA 


GGATACTCCT 


1260 


GTTCT AG AT T 


ACTTGGT AT C 


AAGAGAATCG 


TAGCAATCGA 


CACCGTTCCC 


TTAACACCTG 


1320 


AAAAGGTCAA 


GAGAAACATG 


TCCTTCATAT 


AC TT ATTT AG 


CTTTTTCTTG 


AGGCGTCGGG 


1380 


TTCTATAGGC 


ATAATAGCCA 


TAGATCATAA 


TAAAACGAA? 


G AC AAAAAGG 


ACAAAGGTAA 


1440 


GGGCGATAAG 


AG AT AGCAAT 


AAAAGTAGAG 


GATTATAGAT 


TGGATTGGTC 


AAG AT AGGTT 


1500 


CTGCTATCAT 


TTCCAACTCC 


ATCCCTAAAA 


TCACAAAGAC 


AGAACCGTTG 


AGCATAAAGG 


1560 


TCACTGTATG 


CCAGACCGTC 


TCGGTCACCG 


TATCCACTTG 


GGCTTCGACG 


AG CGTG ATTT 


1620 


T CTT G AAG CG 


ACTTGCCTTT 


AAAATTCCAG 


CAACTACGAC 


GGCAATAATA 


CCTGAAACAT 


1680 


GAACTTCTTC 


TGCCAGAAAG 


AAGGTCACTA 


GAGGCAAACT 


CAATTCTAAT 


AAAAGTTCAC 


1740 
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CTTCTCTTTC AAGCGTAATA CCTGAGTCTT CCTTGACTTT TTCGATAACC GATTGGATCA 17 40 

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1800 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AG CTTCTC AA ATTAACTGAC 18 60 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 19 20 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGGTTC . 198 0 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AG ACTC CTG A ATAGCTGAAT 204 0 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACACCGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAC ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2150 

AGACAGCACC GCCAACGCTT CCTGGAATAC C AC AAGC AAA CTCAAACCCA GTTAAACTAT 22 2 0 

CACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2 2 80 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2 34 0 

CATCACGAAC CATGATATTG CTTCCATTGC CAACAACCAT CCAAGGGATA TTTTCTTCCT 2 400 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAAT AATC AG 24 6 0 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGATATCAA 252 0 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 258 0 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 264 0 

TTTTGCATAA GAAAAAGAGG TTCCCCCCTT TTTATCATTT TTTACAAAAC ATTTCCTTGG 2 70 0 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2 7 60 

TCAACCAGCC TACTTGCAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AAGCCATAGG 2 820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACACCTTG GTAAACCCAA ACTCTCAAAA 2 83 0 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2 94 0 

AAAGTG AC TG AATGGTTTTT AACATATTTT CAGACAGTAA TTGCATAACC TGTGGATTC: "i.CCO 

CACTTTGAAC TGCCCAAAGA ACAATGTAAA CATTGAGTAA ACT AGT AAGC CCTAGAAAAA 3 060 

TCAACTCCAA GCCACCATGC CCC 3 08 3 

(2) INFORMATION FOR SEQ ID NO: 139: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15363 base pairs 
( 8 ) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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( xi > 


SEQUENCE DESCRIPTION: 


^ FO T n mo • 


1 JO . 






AG C AACTGTT 


GTGAACCAAT 


TCCGATAAAT 


Tpr 1 A/^ a ATT 


\->Uj 1 1 AA 1 AOA 


GCCATTTTGA 


60 


CCAAAAATCC 


CGATAAAAGC 


AT AGGCTTT A 




1 VjA r L. E.AGGT 


11 ft ft. *» rrl ft 

AGGAAGGATA 


120 


ATCAGCATGA 


GCCAGAGTTG 


ACGGTGTTTG 


AOACOGGTCA 


AAAAGAGGGC 


CGTCGGATAA 


180 


CTGATAAGCA 


GTGCCACAAA 


CGTCACAATC 


rrrrr a t a a a 

vL 1 AAA 




GAAACTCATT 


240 


TTAAGATACG 


TCAAGTTTTG 


TGACGCAAAG 


1 AAuA i * l\) 1 


AATTTTCTAA 


ACTGAACTGG 


300 


CCTTCGATGT 


TGAAAAAGCA 


TTGACCGAAA 


A 1A„ AA*J At-t» A 


AGGGTGCCAA 


TACAAAGAGC 


360 


GCAATCCAAA 


GCATGTAGGG 


TACTACAAAG 


r\\j ill AunuL 


TTGTTTTCTT 


CATCTCTTTC 


420 


CTCCTCGATT 


GCATTGATCA 


AACCTGCTTC 


I 1 1L 1 1 \.w 


A iTTLTACGT 


ACTCCTCAAT 


480 


ACGAGCATCG 


AACTCTTCTT 


C GGTTTC ATT 


f* a c a cc c a or 


A I OTGG ATGT 


CTTCTGGTTC 


540 


AAAGTCCAGA 


CCGATTTCCT 


CACCCACGAT 




o i i vtAU rt»t» A 


TCATCCATTC 


600 


ATTTCCAAGT 


TCGTCATAGG 


CGATAATTTC 


A i AA 1 Vj nAL 1 


L. L. ACGG AAAA 


GCTGGGTATC 


660 


GACCTTAACT 


TCGAGCTTGC 


CTTCTTCAGG 


a a nr^r^T a i<pr 
AAVjtjLt 1 AA I O 


CGCAAGTCCT 


CTGGACGAAT 


720 


AACGACCTCA 


ACAGGTTCAT 


TTGGCTTCAT 


1-V.l-At.^. A l^A 


nLLbU 1 I LAA 


AGCGTTTGCC 


7 80 


GTTAAATTCG 


AC CAAGT AGT 


CCTCAATCAT 


1 Av-v. 1 


ax. » mpiMMr/* 
AACjA rvi l l l u 


ACTCCCCGAT 


840 


AAAGGTGGCA 


ACAAAGTGGT 


TGATTGGCTC 


A 1 Lu 1 Akj A 1 Li 


TCCACAGGGG 


TTCCAGACTG 


900 


GACAATCTCG 


CCATCATTCA 


TAACGAAAAT 


L(.Ab 1 LAl. 1. L. 


ATGGCAAGAG 


CTTCTTC CTG 


960 


ATCGTGAGTG 


ACAAAGACAA 


AGGTAATGCC 


^.AA 1 L.Vj i i I 


(pp»p * TV nwfip x /— 

I tfTAATTCAC 


GC AATTCGTA 


1020 


CTG C ATGT CT 


GTTCTC AATT 


TCAAGTCCAG 


^v»V_ 1 uM 1 AAA 


WjCTCGTCCA 


ACAAGACCAC 


1080 


ACGGGGTTGG 


TTGATGATAG 


CACGGGCGAT 


Vj *J W A l» A V- Vj 


1 1» AC Vi i I GT C 


CTCCAGAAAG 


1140 


TTTGCGGATG 


GAACGTTTTT 


CATAACCTTC 




a t nwrr' x /*• x x. 
AiLl 1 \jA\j AA 


CTTCCGCTAC 


1200 


ACGCTGCTCG 


ATTTCTTTCT 


TATCAATTTT 


ACGCAAGCGA 


AGTGGAAAGG 


CAACATTTTC 


1250 


AAACAC ATT C 


AT ATGTGGG A 


AC AAGG CAT A 


GGATTGGAAG 


ACGGTATGTA 


CGTCGCGCTT 


1320 


GTTGGTTGGA 


ATATCATTGA 


TACGAACACC 


GTCTAGCATG 


ATATCTCCTG 


TCGTCGCATC 


1380 


CAGTAAACCT 


GCAATAATGT 


TTAGGATAGT 


TGATTTCCCC 


GAACCAGATG 


CACCTAGAAG 


1440 


GGTGTAGAAT 


TTCCCTTCTT 


CCAACTCAAA 


GTTGATGTCT 


TTGAGAACCT 


TGGTGTTGCT 


1500 


GTCTTCAAAA 


ACTTTAGAGA 


CGTTTTTGAA 


TTCGATAATT 


GGCTTTTTCA 


ATTGGCATAA 


1560 


ATTCCTTCTT 


TTTCATAGAT 


TAACCGATCG 


GGGCTCTGTC 


AGGTCCCCAC 


TACCTCTTGC 


1620 


AGGGAGTAAA 


ACCACCTGCA 


TACATCTTCG 


CTACCGATAG 


GCTTTCACCC 


AAGATCCGGA 


. 1680 



WO 98/18931 



PCT/US97/19588 



928 



ATTAATTTCT 


AGAAATATGT 


TAGAAATTGG 


TTTGAATTCC 


GCAATCAATT 


TGTT C AGTTT 


11280 


TTATTTCATT 


TCATTTTATT 


TAATTAGATT 


TTCCAATTTT 


TTAATTCAAG 


CTAAAAATCC 


11340 


CCAATCGTAG 


TGATTGAGGA 


TTGAGTAAAT 


AAATCTTAAA 


CAATACCTTG 


TGCAATCATG 


11400 


GCATTTGCTA 


CATTTTCAAA 


GGCAGCAATG 


TTAGCTCCTG 


CAAGGTAGTC 


TTTATCAAGA 


11460 


CCGTATGTTT 


CTGAAGTCGT 


TTTAGCTGTG 


TTGAAGATGT 


TTGTCATGAT 


GTCTTTGAGA 


11520 


CGGCCATCAA 


CTTCTTCACG 


AGTCCATGAG 


AGGCGAAGAC 


TGTTTTGGCT 


CATTTCAAGA 


11580 


GCTGAAACGG 


CTACACCACC 


AGCGTTGGCA 


GCTTTTGCAG 


GTCCGTAGAA 


GATACCATTT 


11640 


TCTTTGTAAA 


CTTTGATGGC 


ATCAAGGTCG 


CTCGGCATGT 


TGGCACCTTC 


AGATACACAG 


11700 


ATAACGCCTT 


GAGCAACCAA 


ACGTTTAGCT 


GCTTCACCGT 


TGATTTCGTT 


TTGAGTGGCA 


11760 


CATGGAAGAG 


CAATGTCATA 


GTTTCCAGCG 


TAAGTCCATA 


CAGTACCTTC 


GTGGTAGGTT 


11820 


GCAGTTGCTT 


TTTCAGCTGC 


ATACTCAGTC 


AAACGAGCAC 


GACGTTTTTC 


TTTAACATCA 


11880 


AC C AAAAG AT 


CGAAGTCGAT 


ACCATTTTCA 


TCGATGACAT 


AACCATTTGA 


GTCAGAAACA 


11940 


GAAATAACAG 


TTGCACCGAG 


TTCAGTTGCT 


TTTTGAAGAG 


CATATTGAGC 


AACGTTACCA 


12000 


GAACCTGAAA 


TAACGACTTT 


CTTACCAGCA 


AAGCTGTTAC 


CGTTAGCTTT 


GAGCATTTCT 


12060 


TCAGTATAGT 


AAACCAAACC 


GTAACCAGTT 


GCTTCTGGAC 


GAATCAAGCT 


ACCACCAAAT 


12120 


CCAAGAGGTT 


TACCAGTCAA 


GACACCAGCA 


TCAAATTGGT 


TAAGACGTTT 


GTATTGACCG 


12180 


TAAAGGTAAC 


CAATTTCACG 


TCCACCAACA 


CCGATATCAC 


CAGCAGGTAC 


GTCAAGTGAT 


12240 


GGTCCGATGT 


GTTTTTGCAA 


TTCAGTCATG 


AAGCTTTGGC 


AG AAGCG C AT 


CACTTCAGCA 


12300 


TCTGTTTTAC 


CTTTAGGATC 


GAAGTCTGAT 


CCACCTTTAC 


CTCCACCGAT 


AGGAAGTCCA 


12360 


GTCAAGACAT 


TTTTAAAGAT 


TTGTTCAAAT 


CCGAGGAATT 


TCAAGATCCC 


TTGGTTTACA 


12420 


GTTGGGTGGA 


AACGAAGTCC 


ACCTTTGTAT 


GGTCCAACAG 


CTGAGTTGAA 


TTGAACACGG 


12480 


TAACCACGGT 


TTACTTGAAT 


TTTTCCATCA 


CGGTCAACCC 


AAGGAACACG 


GAAAGAAACC 


12540 


ACGCGCTCAG 


GCTCAGTAAT 


ACGTGCCAAG 


ATATTTTCTT 


CGATATACTC 


AGGGTGTTTT 


12600 


TCAAATACAG 


GTTCTAAAGT 


GTTGAAAAAT 


TCTTCAACAG 


CTTGGAGGAA 


TTCAGCCTCG 


12660 


TGCCGG 












12666 



(2) INFORMATION FOR SEQ ID NO: 138: 

tx) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
CD) TOPOLOGY: linear 
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TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCG T 954 0 

CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTC TTACGATTTC AAGAACTCCT 9 600 

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9 6 60 

AAAAGAATTA GCGAATTATT CTGGTAAAAA G AAATTT C AC GCTATGAAGG CTCAAGCGAT 9720 

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780 

GAAGTTGTTC AAAATGAGTC GC AG AAAT AT CAGACAAGCT GGTAAAATCT TGGCTGACAG 9 84 0 

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900 

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCGCTATCTA AGGAGAGAAG 99 60 

CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020 

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080 

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAACAT TATATAAGAT 1014 0 

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAAT ATT AAT CAAATAGATA AAAAAATATT 10200 

AAGTCAAAAT TTAGGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 102 60 

TAATATAACT TTAAAGCACG AAGTT ACTT C ACAAAAGATA GAGGAAGTTT GTAAAGCAGT 10320 

TCAAATCTAT G ATGAAATC A TGGCTATGCC GATG AAATTT AATACTATCA TCTCAGAGAT 10380 

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATACCA CTGGCACGTG CATTAATAAA 10440 

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500 

AAGAATAACA AAGTATATAC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560 

GTCAACGATT AAGGATGCGG ATGTTATTTT TGT AATG AAA GGTGGTAAGA TTGTTGAGTC 106 20 

AGGAAATCAT AAGTACTTAA TGGATCTTGG TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10 680 

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT CTAATTTTAA CAACAGCCTC 10740 

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 108 00 

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTG ATTGTTT ACAAAAACAA 10860 

TAAGAATTAA GCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TAGTTGAAAT 10920 

TATAAATCTA ACAAAAAGCT TTAAAGATAT TG AAGTT ATT C AT AA C AC TT AAATAATAGA 109 80 

GCAACTACAG TAGTAGCTTA AAAACATGAT TAAATCGCTA TTCTTAGGAG TAGCGGTTTT 1104 0 

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100 

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160 

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220 
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AAATGAGGAC 


TTTCTTTTTA 


TACTCATCTG 


CTTTCAAAAA 


GCATTCTAGT 


CCATCTCCGA 


7740 


TTAACGATGG 


ACTTTATCAC 


CTCCTTCTCC 


AGTCCTTGTA 


TAACATCTTG 


GAGTTGATTC 


7800 


ATGACATCTT 


CC AAAGTTTA 


AAAGGCTTTA 


TTCTTAAATC 


CACGTTTACG 


AATCTCTTTC 


7860 


CACACTTGTT 


CAATGGGGTT 


CATCTCTGGT 


GTGTATGGAG 


GAATAAATGC 


AAAGCCAATA 


7920 


TTAGTCGGAA 


TCTTTAAGGT 


ACTTGATTTA 


TGCCATATAG 


CATTGTCCAT 


AACGAGTAAA 


7980 


AGATAATCAT 


CTGGATAAGC 


TTGTGAAATC 


TCCTATTCCT 


AAAGCCCCTT 


TAGCGCATAA 


8040 


CT TTGGCTC A 


GCTTCTATTA 


TCGCTCACAC 


CATCCATCAG 


AAGTTTAATC 


TGAAGGTACC 


8100 


CAATTATCGC 


CAAGAAGAAG 


ATTGGGCTAG 


GATGCCTTTA 


CCAATCACAC 


GTAAGGAAAT 


8160 


CTCTAATTGG 


CATATCAAGG 


CGAGTCAATA 


CTATTTGGAG 


CCCCTTTATA 


ACCTCTTGCG 


8220 


AGAGAGACTA 


TTGACTCAGC 


CCTTACTTCA 


TGCGGATGAA 


ACTTCTTATA 


GGGT GCTAG A 


8280 


GAGTGATAGT 


CAGCTGACTT 


ACTATTGGAC 


TTTTTTGTCA 


GGTAAAGCAG 


AGAAACAAGG 


8340 


GATTACGCTT 


TACCACCATG 


ATCAGTGTCG 


AAGTCGTTCA 


GTAGTACAAG 


AATTCCTAGG 


3400 


AGATTATTCT 


GGCTATGTGC 


ATTGTGATAT 


TTTGCGGCAG 


TAACTTACGA 


CTTTAGTCCT 


8460 


CTAGTTCTGC 


CTATGCGATA 


GCAGTCCAAG 


GTTT AGG A GC 


AAGGCGACCC 


TAAGCTTGGT 


8520 


AAACTTCGAA 


CCGCTCGTCT 


GCTTATCGTC 


AACTGGAAGA 


AGCTGAACTT 


GTTGGATGTT 


8580 


GGGCGCATCT 


GAGAAGGAAG 


TTTTTTGAAG 


CGCCCCCCCA 


AGCAAGCGGA 


TAAATCATCC 


8640 


TTAGCAGCTA 


AAGCTTTACC 


TTATTGTGAT 


CAGTTATTTT 


CCTTGGAAAG 


AG A C TGGG AG 


8700 


GCTTTGCCAG 


CTGATGAACG 


ACTACAGAAA 


CGTCAAGAAC 


ATCTCCAGCC 


CTTAATGGAA 


8760 


GACTTCTTTG 


CTTAGTGCCG 


GCGTCAGTCA 


GTTTT AGCAG 


GTTCAAAACT 


AGGAAGGGCA 


8820 


ATTGAATACA 


GCCTCAAGTA 


TGAAGAAACC 


TTTAAGACCA 


TTTTGAAAGA 


CGGACATCTG 


8880 


GTCCTTTCCA 


ATAATCTAGC 


TGAACGCGCC 


ATTAAATCAT 


TGGTTATGGG 


ACGGAGTAAA 


8940 


AGAGTCCAGT 


GGACTCTTTT 


AGCCTAAGCT 


CACTTTAAAA 


AAGCG AG GGT 


GGTTATTTTC 


9000 


TCAAACTTTT 


GAAGGAGCTA 


AAGCAAGAGC 


TATTATTATG 


AGTTTGTTGG 


AAACAGCTAA 


9060 


ACGTCATCAA 


TTAAATAGCG 


AGAAATATC? 


ATCCTATCTT 


CTAGAATGTC 


TTCCAAACGA 


9120 


GGAAACTCTC 


GTAAACAAAG 


AG GTTTT AG A 


GCCTTATTTA 


CCATGGACTA 


AAGTTGTACA 


9180 


AGAAAAGTGC 


AAATAAGAAA 


TCTCCAGATT 


AGGAACTATC 


CGTGAGTTCT 


CCAGTCTGGA 


9240 


GATTTTTCAA 


TAGACTTCCT 


GCGAAACAAA 


ATATGGTATA 


ATAGTTCTAT 


GAATGATGAA 


9300 


GCAAGTAAAC 


AACTAACCGA 


TGCACGATTT 


AAGCGTCTTG 


TTGGTGTTCA 


ACGCACGACT 


9360 


TTTGAAGAGA 


TGTTAGCTGT 


ATTAAAAACA 


GCTTATCAAC 


TTAAACACGC 


AAAAGGT GG A 


9420 


CGAAAACCTA 


AATTAAGTCT 


AGAAGACCTT 


CTTATCCCCA 


CTCTTCAATA 


TGTGCGAGAA 


9480 
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GG AG AATCGG CTTTATCTGA TGTAGATAAA TTGTATGTGA GGTTTACAAA CCGTTTTGAA 6000 
GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 
GGGTGGGAAT TACTATCAAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 6120 
CTTG ATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 
ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGGAAC GTTTGACAAC 6240 
AGCTGAACGT GGACATAAGT T ATT AAAGG A TAAAAGAGAT GAATTGATGA GGCGATTTAT 6 3 00 

T T CTTTG ATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG GAAACTTATC TAATTGATAA 63 60 

TCTAAAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 64 2 0 

TTCAATTCCA TCGAAAGAAA TTGAATTATT TGTTGAGAAA G AAAAT AT C A TGAGTGTAAC 64 80 

AGTTCCTAGA ATGCATATGA AT ATT AC TT C TC AAAATG AG AACAGTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG ATG AT GT ATT TGCTACAATG AATAGTTTAA TTTATAAATT 6600 

ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GCTGATGAAA TAGAAAAAAC 6 660 

ACGTAGACGT GTAAATGGTT TAGAATACTC GATTATTCCA AACTTGTCGG AAACTATTCA 67 20 

TTATATAGAA TTGAAACTAG AGGAGGCAGA AAGAGCCAAT TTACTTCCTA TTATGAAAGT 67 80 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATCAACAA ATATCAGCTT GGATAAGGCT 6840 

TTAAGCCTTT CTAAGCTTTT TTTATTGACA GTATCAGGAT ATCTTTTTCA AAATTTTGGT 6900 

TTGTTAGATA ATGAAAATGT TTCTACTAAT CTACATTTAG GATTAGTAAA TCGTAAATGT 6960 

AATTATATAG AAAGTAAGCG CGTCATAACA AGGTATCTAT CATTCATGGA GCTCCTCCTC 7020 

TAT ACT ATT A GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7080 

GTAGAGATTC CACAATCTCG TCCTTTTGAT TCTAAAAAGA GAAATGATAT TCTGCTTAAA 714 0 

ATTCGTATTG GCAAGCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7 2 00 

CAGCTTTTGG ATAAGGTGTT GCTCTATGAC AATTCATCTA TCTAGCCTAG GGG AGGTCT A 72 60 

TCTCCTGTGT GGGAAAACTG ATATGAGACA AGGAATCGAT TCACTGGCTT ATCTGGTTAA 7 3 20 

AACCCACTTT GAATTGGATC CTTTCTCCGG TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7 3 80 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT TGGCTACTAT ATAAACGCTT 744 0 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAAAAGGAT GTCAAAGCTC TCACACCAGA 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAAT AT AGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7 620 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 7680 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTCGAG AT ATTT C AG A AC CAGTT ACT 4 200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 42 60 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4 32 0 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4 380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4 440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4 500 

CAGAACGCTT TTGATTCGGT AGATACATTC ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4 560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4 620 

ACAGACATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATCTT 4 680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AG ATT AC AC A TCAAATTCAT 4 740 

TTGATATTAG AAACAGGAGG T CT AT AAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4 800 

CAAGTTCTTG GGCCTCTTAT GATTGTTCAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4 860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAG TTTT AG A GATCCACGAA 4 920 

GATAAAGCAA TGGTTCAGCT TTTTGAACGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4 980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT T GGTCGT ATT 504 0 

TTTAATGGGA TGGGAAAACC AATTGATGGT GGACCAGATT T AATTC C AG A GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

AC AG GG AT CT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTC AAAA ATTACCAGTA 5 2 20 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AG AT AG C AAG AC AAG CG ACT 52 80 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG C AATGGGT AT TACTTTTGAA 53 40 

G AAG CTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 54 00 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 54 60 

ACTCCGGCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5 5 20 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5 5 80 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 57 00 

GACATAACAC ATCCAATTCC TGATTTAACT GG AT AC ATT A CTGAAGGCCA AATTATTTTG 57 60 

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 582 0 

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 58 80 

AATCAACTGT TTGC AGCCT A TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5 94 0 
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ACAATTTCAT 


CTTCTGAGTT 


AG AATTTTT A 


TGTGATTTAC* 


TATTf!TATA & 


AA CTTT AG AT* 




CAAGGAAGCT 


ACAATGTAGA 


vj \_ \_ i fc r\ 




r.A?ATT" , A , l v ? 
1 A I 1 ,Al 1 


VjOvjA 1 L» 1 \j Ata 


"> Q "5 r» 


TTTGAACTAA 


AGAATCTCAG 


AATGATPATA 






TCCCTTTCjAA 


J 3 oU 


TCAATAAAAG 


AAAGGATACG 


V-V.l.S\l_Al I A 1 


l^l^AAlal, 1 AA 1 


aaU 1 A i AAAA 


rTCiGCATAAT 


2640 


TGGTAGCCGT 


GATAT^ATTT 


TArrATTTAf 
1 nLL/\ I 1 1 nlj 


TATTATTrrr 
^ A A V_»M 1 1 Uuu 


111 lAlftl 


TTCCTGCCTA 


2 7 00 


C C AAG AAC AA 


GAAGPTATA A 


1 Al_ I AAv» 


AAAA 1 1 Au L. 1 


I- AA 1 1_ 1 CiA 1 1 


ATGG TGTC AT 


2760 


TTATATCACT 


GAAGACATTG 


f^TCAATflA™ 




All (.uLLA 1 1 


ATt»A iTuCCA 


2 8 2 0 


AGTTGTGCCT 


GCTATTATTT 


TATTACCGAC 


TGATAAAPAA 


Uu ill AAA L 1 


T 1 A fir"* iTTi a a 
1 AUOA 1 i AAA 


2 880 


ACGTATAGAG 


GATAATGTAG 




A^r.AfArAAT 


A 1 1 1 i A i AA 1 


a a r T^f~' r v a a a a 
AA i CjTAC- AAA 


2940 


ATTGTCTGTA 


ATATTATTCT 


ATA ATTT'T'IY^ 


Ortv. 1 1 1 r\A 


WjAuAft 1 AAL 


TTTtjACTCAA 


3000 


GGGAAGATTA 


TAAAACTATC 


GGGAfTTCTA 


GTTATTGrAT 


rar^T a rrr a 

k_/\Lrvj 1 A 1 A 


0(jAlj<jt. I AAT 


3060 


ATTCAAGATA 


TTTCCCGTGT 


AGHTAAnrTA 


UOvi I l AA 1 Lo 


AAA TT A T* 
L» i VjAAA 1 I A 1 


1 liAAA 1 VjA(j A 


3120 


AGAGATCAGG 


CATCTATCCA 


A GT CT ATG A A 


r.AAAPATrTr 


(jlv.il v>o 1 LL 


/^*/™*a/* , a a 
tK*t*AC*AACCT 


3 180 


GTTGTTACAA 


CTGGAGAACC 




GAATTAGGGP 


V-A*j(jA 1 1 VjA i 


T»T*f*T , <" AAA TP 


3 240 


TTTGATGGCA 


TArAArnrcr 


AT^AGATGGA 


111 AAA 1 1 vjlj 


4- 1 Al_ AA 


TGATTTT CT A 


3 300 


GTTCCTGGGG 


TAGAAGTTCC 


AAGTTTGG AT 


AG AG AT ATT A 




TP A TTPP A PT 

iuA L 1 LLAL 1 


J JdU 


ATAGCAATTG 


GTC AAAAAG T 


GAGTAGGGGT 


GATATTr^TG 


r.AaPTPTPa a 

U/VnL 1 vj i LrtA 




34 20 


G T AGTT AATC 


AT AAA ATT AT 




uunu 1 n 1 L 1 \j 


r* AP A Af*"TT*^"T» 
VjAuAAu 4 k_L> * 


TTCTATTGCA 


3 4 8 0 


TCTGGCGATT 


TTACAATTGA 


TCAAGTTGTA 


TATGAAATAA 


AAAAATTrra 


PPPTaP^TTP 
1 AO 4 i 11. 


J 34 U 


TATAAAGGAA 


CGCTTATGCA 


AAAATGGCCT 


GTCCGCAAGG 


rGCGTGfTGT 


TTPTAA A (~*(~"V 


T *t PI A 


TTAATTCCAG 


AAGAACCATT 


AAT C AC AGG T 


GAAGGACTT*A 




PTTTPPAPTa 
^.111 V_ AV_» 1 A 




AC C AAAGGG G 


GAGCTGCAGC 


AGTTCCTGGA 


CCGTTTGGAG 


CAGGAAAGAC 


AGTT GT AC A \ 


3720 


CACCAAGTAG 


CTAAATTTGC 


CAATGTTGAT 


ATTGTTATTT 


ATGTCGGTTG 


TGGAGAACGT 


3780 


GGAAATGAAA 


TGACGGATGT 


AC TG AAT GAG 


TTTCCTGAGT 


TGATTGACCC 


TAATACCGGA 


3840 


CAATCAATTA 


TGCAACGGAC 


AGTTCTGATT 


GCTAATACTT 


CAAATATGCC 


TGTTGCTGCT 


3900 


CGTGAGGCTT 


CAATTTATAC 


AGGAATTACC 


ATG GCTG ACT 


ATTTTCGTGA 


TATGGGCTAC 


3960 


TCTGTCGCCA 


TTATGGCTGA 


TTCAACTTCA 


CGTTGGGCAG 


AAGCGCTACG 


TGAAATGTCA 


4020 


GGACGTCTAG 


AAGAAATGCC 


TGGTGATGAG 


GGTTATCCTG 


CTTATCTCGG 


AAGTCGTATC 


4080 


GCTGAATATT 


ATGAAAGAGC 


AGGACGTTCT 


CAGGTTCTAG 


GGCTTCCAGA 


ACGTGAAGGA 


4140 
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AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAAGGAT 66 0 

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTT CCTTT AG 720 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTG G CT AAA AGACCAAAAG 84 0 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTG CTT TTG 900 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGC.AATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC ATG AACG AAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA AT AAGGAA CG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG AT ATT C AC AA CAAGAGGTCA 13 20 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TC C AAATT AT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 1440 

TTATTTCAAT AGGTAAAATT G ATGATAA CT ATTTGTATAA AACATTAATT GGATCGATTT 15 00 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 15 60 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 162 0 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT T ATT AC AG AA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 174 0 

TCAACTCCCT ATCATTTATC GATTGACATC TTACAACATC CTAGTCAGAC AG AG ATTTC G 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAACTCC G TCTG AT AT A I8 60 

ATTGTGACTT TATTTGCTTT ACGATATGTT TATCATAATA TCAAAGTTTT ATTAAAATCT 1920 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

G AAAGTTT AA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 204 0 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTT AG ATG AG 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT GACTTTTATA AT ATT ATT AC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT ATACATTGTA GAAAATCAAG AAATATTTGT GTGGTTCAAT 2 340 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2 400 
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AAA CAT ACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3 300 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3 3 60 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 342 0 

TTTTTTATCA AATGCTCATT AG AAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 34 80 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3 54 0 

G AG AT A CTCT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3 660 
CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTT AGTTT AG 
CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT G AAT TTCTAG 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3 900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3 960 

3964 

(2 1 INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12666 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

TGAGACCGTT ATTTGT ATT A GGGAAATGGG TATCTATTTT TAATGCTGTG GGGATTTTGA 60 

TTG TTTCTAT TATTCAAACC AAAAGCTTGT CAGGTATTGG AGCAGGATTG TTT AAT CT AT 120 

ATAACATTTC ATCTTATATA GGTGATTTAG TT AGTTT C AC TCGATTGATG GCATTAGGAT 130 

TATCTGGAGC AAGTATAGCA TCAGCTTTCA ATTTAATTGT TGGTTTGTTT CCGGGAATAT 2 40 

TGGCTAAACT GACAATTGGA TT AG T ATT AT TCATTCTTTT ACATGCGATC AATATTTTTC 300 

TATCGTTACT ATCAGGATAT GTTCATGGAG CACGTCTGAT ATTTGTTGAA TTTTTTGGTA 3 60 

AGTTTTATGA GGGTGGAGGA AAACCATTTC AACCTTTGAA GGCTTCTGAG AAATATATTA 42 0 

AGGTTATTAC AAAGAATTAA TGGAGGATAT ATATAATGGA ACATTTAGCA AC T T ATTTTT 4 80 

CAACCTATGG AGGAGCTTTC TTCGCTGCAT TGGGAATTGT ATTGGCGGTT GGATTAAGCG 54 0 

GTATGGGGTC TGCTTATGGA GTTGGTAAGG CTGGGCAATC TCCCGCAGCT TTACTGAAAG 600 
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ATTGAAGAGA GGAGGGGAAA CCGAAAAATT AGGTGCCCCT CCTCTTTTTT GGT AT AA TAG 1500 

AAGATAGAAA ACGAGCTTAG AAGAGATGAT TTTTGATACA CATACACACT TGAATGTAGA 15 60 

AG AATTTG C A GGTCGTGAGG CAGAAGAAAT TGCCTTGGCT GCTGAGATGG GTGTGACACA 1620 

GATGAATATT GTT GG TTTTG ATAAACCGAC GATTGAGCAT CCCTTGGAGT TGGTAGATGA 1680 

GTATGAGCAG CTCTATGCGA CTATTGGTTG GCATCCTACA GAAGCTGGTA CTTATACAGA 17 40 

GGAAGTTGAG GCTTACTTCT TGGATAAGTT AAAACATTCC AAGGTTGTGG CTTTAGGTGA 1800 

AATTGGCTTA GATTACCATT GGATGACAGC GCCCAAAGAG GTGCAGGAGC AGGTTTTTCG 1860 

CCGTCAGATT CAGCTATCTA AGGACTTGGA TTTGCCTTTT GTTGTCCATA CCCGTGATGC 1920 

GCTGGAAGAT ACCTATGAGA TTATCAAGAG TGAGGGCGTT CGTCCTCGTG GTGGTATCAT 1980 

GCATTCATTT TCAGGGACGC TTGAGTGGGC AGAGAAGTTT GTGGATCTTG GTATGACCAT 204 0 

TTCCTTCTCA GGAGTGGTGA CTTTTAAGAA GGCAACTGAC CTCCAAGAAG CAGCTAAAGA 2100 

GTTACCTTTG GACAAGATGT TGGTGGAAAC AGATGCGCCT TACTTAGCAC CTGTACCCAA 2160 

GCGTGGTCGT GAAAATAAAA C AG CCT AT AC T CGCT ATGTG GTCGACTTTA TCGCTGACTT 22 20 

GCGTGGTATG ACGACAGAAG AGCTGGCGGT AGCAACGACT GCAAATGCAG AACGAATTTT 2 2 80 

TGGACTGGAC AGCAAGTAAT GAAAGAGAAA ATTTCTCAAG TTATCGTGGT TGAAGGGCGT 2 3 40 

GATGATACGG TCAATCTCAA ACGTTATTTC GATGTGGAGA CCTATGAGAC TCGAGGTTCT 2 400 

GCCATCAATG CTCAGGATAT AGAGCGGATT CAGCGCCTGC ACCAACGTCA TGGAGTCATT 2 4 60 

GTCTTTACAG ACCCAGATTT TAATGGGGAA CGGATTCGGC GCATGATCAT GATGGTCATT 2 520 

CCAACAGTTC AGCATGCCTT TCTCAAGCGA GATGAAGCTG TTCCCAAGTC CAAGACCAAG 2S80 

GGG CGTTCTC TGGGAATTGA GCATGCCAGC TATGAAGACC TGAAAACGGC TCTAGCTCAA 2 64 0 

GTGACAGAAC AATTTG AACA TGAGAGTCAG TTTGACATTA GTCGTAGCGA TTTGATTCGC 27 00 

CTTGGTTTTC TAGCAGGGGC AGACAGCCGT AAGCGTAGAG AATATCTCGG AGAGACTCTC 2 7 60 

CGAATCGGCT ATTCCAACGG CAAGCAACTC CTCAAACGCC TAGAGTTGTT TGGGGTTACT 2 82 0 

TTGGCAGAAG TGGAAGAAGC TATGAAATCT TATGAGTAGG AAAGATGTAG CCGTTACAAT 2 8 80 

TTTTTAAGTT TCACAGTATT TTTCGAAGCA GGTAGAAGAG GAGGCGTCTG ATGTTAATTG 2 9 40 

GTCAAAAAAT TAAAGAGATT CGGATAGAAA AAGGAATTAG TCGTCCAGAT TTTTGTGGAG 3000 

ATGAGCAAGA ACTGACAGTT CGTCAACTGT CGCGAATTGA AAGTGG AG C T TCGCAACCGA 30 60 

GTTTGCCCAA GTT AG A CT AT ATTGCTCGCC GGCTAGGAGT TCCAGTTTAT AGCCTTATGC 3120 

CGGATTTTTC AGCTCTTCCT TCTGCTTATT TAGAATTGAA ATACCAGATT TTACGTGAAC 3180 

CAATCTATGG TAAAGAAGAG GAGTACGATA AGAAGGAAGC CTGTTTGGAA GAGATTTATA 3 2 40 
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(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 3964 base pairs 
(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi ) 


SEQUENCE DESCRIPTION: 




1 JO . 






TGGCAGCTCG 


TCGTCGTAAA 


GGACGCAAAG 


111 1 VjvjL 1 ^J^_ 


A I AATCCAAA 


CGAATTCTAT 


60 


CAAAAATCAG 


TAGGAACTCG 


AGTCTACTGA 


^T^I^T*' 1 A 


i L» 1 AAAAAAG 


TTCAGTAGAT 


120 


GCAAATGGAT 


TCGGAAGCGA 


TGTTACAGTA 


n att^iaa a*t 




ACCTCTGTTT 


180 


CTAAAACATT 


GTTAGAAATC 


GATTTGACTG 


. 1 1 LUrt 


I T I GTCCTGT 


TATTATTTTA 


240 


TTTTACTATA 


AAGTTGAAGT 


AGGTGGAGAT 




AA TV-GTCTT 


TAAAGATGGT 


3 00 


TCAGCTATTA 


CAATTCCAGG 


AAATCAATTG 


GTAGCAPAAC 


A Tf*P AAA A f /*" 


AC AAG AT AGC 


360 


ACTAAACTGA 


CTGCTGAAAA 


ATCAACTGTT 


AAA C1C A f* PTH 




AGATGT AAAA 


420 


GATATAACTC 


ATTTAACAGA 


TGAAGAAAAA 


vl 1 AftUU A * v.* 


IT A TTT'P ft r~ a 


ACiLAAATGG T 


4 80. 


T C AG C ATT AG 


ACGGAGCGAC 


AATCAATGTA 




vj I MV-AtiLAAt. 


AATCACATTC 


54 0 


CCAGATGGTT 


CAGTAGTGAC 


GATTCTAGGA 


AAAflATAPAr! 




TGCGAAACGT 


600 


GAATCTGTAA 


CTCAAGAAGC 


TACACCAGAG 


TATA AfTPTAfi 


AA A a *P A, /•* H 


AGGTGGAGAT 


660 


AAGGGAGGCA 


ATACTGGAAG 


CTCAGATGCT 


AATGCGAATf; 




1 AG<-V-AGG(.G 


7 2 0 


OGTGGATCAG 


CTCACACAGG 


TTCACAAAAC 


TCAGCTCAAT 


CACAACCTTC 


TAAGCAATTA 


780 


GCTACTGAAA 


AAGAATCAGC 


TAAAAATGCC 


ATTGAAAAAG 


CAGCCAAGGA 


CAAGCAGGAT 


840 


GAAATCAAAG 


GCGCACCGCT 


TTCTGATAAA 


GAAAAAGCAG 


AACTTTTACC 


AAGAGTGGAA 


900 


GCAGAAAAAC 


AAGCAGCTCT 


CAAAGAGATT 


GAAAATGCGA 


AAACTATGGA 


AGATGTGAAG 


960 


GAAGCAGAAA 


CGATTGGAGT 


GCAAGCCATT 


GCCATGGTTA 


CAGTTCCTAA 


GAGACCAGTG 


1020 


GCTCCTAATG 


CTCCTCCTAA 


GACAACAAGT 


GCACCGCAAG 


CAACTGCAGG 


AACAATGCAA 


1080 


GATGTTACCT 


ACCAGTCACC 


TGC T GGC AAA 


CAATTACCTA 


ACACAGGTTC 


AGCATCAAGT 


1140 


GCAGCACTTG 


CTAGTCTTGG 


TCTAGTGGTG 


GCAACAAGTG 


GTTTTGCTTT 


GCTAGGAAGA 


1200 


AAGACTAGAC 


GTAGAAAATA 


GAACAGCTAG 


AAAATTCTAT 


TCTCTACTTA 


AAGTTAGATT 


1260 


ATAAGGGGGA 


TTTTGAGAAG 


TCATCAATCC 


TAGTGATGGG 


TGAGAAAAGT 


GAGAACCCAA 


1320 


GATAATCACA 


TACTTTAGCT 


GAATAGGAAT 


ATTCTATCAA 


TGTAGCCAAT 


CTCTTCTGTC 


1380 


TCTAACTGTG 


GAATAGGAGA 


TGGGCAATAT 


CGGATAGAAA 


AGATAGCAGA 


ATAGCTCTCT 


14.4 0 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3 660 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCCATTTCA 3720 
TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3 7 80 

CAGGCAGCAA CTCCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3840 
GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 
TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 39 60 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4 0 20 

C C ATCCTG AT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 4080 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4X40 

TATCAGCATT GTTAAGATAG ACGAGGCTCA TTTAACCTGT GAAATTGGCT ATGTGTTAGG 4 200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4 2 60 

TTTTACTCAA GCAGGTTTTC AAAAG GTC AG AGCACCTTAT GCCAGTCTCA ACCCAGCTTC 4 3 20 

AGGTCGTGTC ATG G AAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4 380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 4 440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA AT AT AAT AAT TAGCATTCCA 4 500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA ACACAGATGT TCTAGTTCCT 4 560 

TCTTTAATCT GGTTTAGTGT TACTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4 620 

TAATCGACTA GATTCTCCAG CCGAACAGG1 GGTAATGTAC TTTTTATAGT GTAATCCTAG 4 680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4 74 0 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4 800 

CTATTCCCAA AT CAT TC AT A CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4 860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4 920 

TTTCCTCGCT AG ATTTCCT C AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4 980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT CGTTCATAAG 504 0 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 5 220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGT AGGCG AT ACAGCTCATC ATCATACGAA 5 2 80 

CTTCGTTTTT GATTAAGGTT GAACT 53 05 
(2) INFORMATION' FOR SEQ ID NO: 136: 
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CTTGGTACAG 


GTTTATGGCA 


ATGACTATGA 






LUALCCAlAA 


1920 


AAATTGTGTC 


TTTCTTGTGG 


ATACCTATGA 




A 1 LWj 1 V_» i At. 


(-AGCTGCCAT 


1 980 


TCAGGTGGCG 


CGTGAGCTGG 


GTGATCAGAT 


*p a a p' t ** i^p & nr p 




TT GALiCTUG 


2040 


GGATATTGCC 


TACATTTCTA 


AG AAAGT CCC 


T P AP.P AA PTP. 


^HTV ft /^»^/^»T«/*— 

^>rt 1 V^AvjOC- I Vj 


GAlii AC AG A 


2100 


GGCTAAGATT 


TATGCTTCTA 


A TG ATCT AG A 


TP. A AA AT A P P 


ATPPTT ft ft /"* /—» 


TCAAGATGCA 


2160 


AAAGGCCAAG 


ATTGATGTCT 


GGGGTGTGGG 




All ALAULU T 


ATGACCAGCC 


2220 


GGCTCTTGGG 


GCGGTTTACA 


AGATTGTTGC 




bAAAuTOGTC 


AGATGCGCAA 


22 80 


TACGATTAAG 


CTGTCTAATA 


ATGCTGAAAA 


AGTTTCTAPP. 


LLnuu 1 A>\vj A 


ALjCaUu ibHj 


23 40 


GCGCATTACC 


AGTCGTGAAA 


AAGGCAAGTC 


acaagp.pp.ap 


i A 1 A A V. Av. 1 1 


ATGATGGTGT 


24 00 


GGATATTAGC 


GACATGACAG 


AAAT C AAG AT 




A. f*fT & *P R /*• ft H 
Al_ 1 A I At_ A 1 


ACATCAAGAA 


2460 


GACGGTTCGT 


AATTTTGATG 


CCGTTCCTCT 


CTT GGTGfi AT 


ATPTTri »if 


AAUUAAi A l I 


2 52 0 


ACTTTACAAC 


TTGCCTAGTT 


TG AC TG AC AT 


TCAGGATTAT 


Pff^rvTa. b L f*f m> 


a a iiummi/^ ft /"^ ft x 
AA I I Tvj ACAA 


2 580 


GTTGTGGGAT 


GAGTATAAGC 


GTGTGCTCAA 




T A. T , f*l'*' R f r T^rT~ 
1 A 1 LLnU i 


ATTTGGCGCG 


2 640 


TGATGTATGG 


CAAGATAAGA 


TGG ACTTG AT 


TCATAAGATP. 


r^CLC \ arr air' 


Ct. CJTTGGTG A 


2700 


AGGAGAAGAA 


GAATGAGTTT 


GCAAGAAACG 


ATT AT PC* AA fi 


a ppTPrvny**?* 


CAAALCAGTG 


2 7 60 


ATTGATGCCC 


AGGAAGAAAT 


CCGTCGTTCT 


AT*PP A TTTP T 


1 ft>UUU\un i A 


I v„ iuAAAAM 


2 820 


CATCCCTTCC 


TAAAAACCTT 


TGTACTAGGG 


ATTTCTGGGG 






2 8 80 


GGACGTTTGG 


CGCAATTAGC 


TATGGAAGAA 


CTP.PP.AP.PTP 


A A a fr r^r* a f" a 


CGATAGCTAC 


2940 


AAATTT AT CG 


CTGTCCGCCT 


GCCATACGGA 


GTGCAAGPT'fi 


A « VjAAULALiA 


T»/-"/^»»r»(-» a a ft ft ft 
1 \»\- I LAAAAA 


3 000 


GCCCTAGCCT 


TCATCCAGCC 


AG AT GTC AG C 


TTGGTTGTGA 


ATATCAAflfiA 


A 1 v_/\Vjt_ 1 u A i 


t n c r\ 


GCCATGACAG 


CTGCAGTTGA 


AGC G AC AGG T 


AGTCPTGTTT 




LAAGULiGAA I 


312 0 


A* CAAGGCAC 


GTTGCCGTAT 


GATTGCTCAG 


TATGCCCTTG 


CTGGTTCCCA 


TAGCGGAGCG 


3180 


GTCATTGGAA 


CAGACCACGC 


CGCGGAAAAT 


ATCACAGGTT 


TCTTTACCAA 


GTTTGGTGAC 


3240 


GGCGGTGCCC 


ATATTCTCCC 


TCTTTACCGC 


CTCAATAAAC 


GCCAAGGAAA 


ACAGCTCTTG 


3300 


CAGAAACTTG 


GCGCAGAGCC 


AGCCCTTTAT 


GAAAAAATCC 


CAACGGCAGA 


CCTAGAAGAA 


3360 


GATAAACCAG 


GCCTAGCTGA 


CGAAGTCGCA 


CTTGGAGTCA 


CCTACGCAGA 


GATTGACGAC 


3420 


TACCTAGAAG 


GC AAAACAAT 


CAGCCCAGAA 


GCTCAAGCGA 


CCATTGAAAA 


CTGG TGGC AC 


3480 


AAAGGCCAAC 


ACAAACGC C A 


CTTACCCATC 


ACCGTATTTG 


ATCACTTTTG 


GGAGTAAAAA 


3540 


GGTCCGGGGG 


ACCTTTTTAG 


CTTCTTGCCC 


TG AAAT T AAA 


AAGCAAGAAA 


AACCTCCACT 


3600 
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CATTTTAGTT TCATTTAGTA TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120 

TTGGTATGCT CCTGCTTTCA CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA ISO 

AAAGCATTAT ATAATAGTGA TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 24 0 

AAATTCGTTT AAAAGATCTT ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 3 00 

GAAGTTTATT TCAAACAACC TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 360 

AATTAATGGG GTTATAATAA ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 4 20 • 

AGATGAGTAT TAAAACTATA AGGAGGACGA AGGTGGCTAA AAATTTAAAA TT AAAATT AG 4 80 

CTCGGGTAGA GCGTGATTTA ACACAAGGTC AACTGGCAGA GGCTGTCGGG GTGACACGCC 54 0 

AGACTATTGG TTTAATAGAG GCGGGAAAAT ACAATCCCAG TCTCTCGCTC TGCCAGTCTA 60 0 

TTTGCAGATG TTTAGGGAAA ACCCTAGACC AACTATTTTG GGAGGAAGAA GATCAAAAAT 6 60 

AGATTTTATT ATTCTC AATT ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 720 

TCTGAAAGTT TCTATATCTG CATTGCTTTG TCGCTCCTAT CTTATATCAT TTCAGT ATT A 780 

GCACCAAGCC TTTTTAATTC TAATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 84 0 

TTTTTCAATC GTGCCCGTTA TCTGGGAGTG AC CT ACT ATG GTCGTTTTCA TTTTACGATT 900 

TTGGGTTGTT TTTTCCTAAC CTTGGCTATT ACGGCTCTTT TGATGTTGCA G AATT AT C AA 9 60 

TTCAACATAG AAATTTATCA GCACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 1020 

ATTACTTATA TCATTTACCT TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 108 0 

GGCGAATGGG CTCAGAAAAA ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CTTGTTACTC TTTTCTCAAT CCAGCTAAAA TGTG AT AT AA TAGTACTAAT TTATTGGAAT 1200 

ACATGAAAGT TCTTGAAAAT TTTCATGGGT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 12 60 

TCCAGATGAT AGTTTGACAT TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 13 20 

CTTTGACCAA GGGATTCACA ATAAGAAGGC GGTCTTTGAG CTGTATTTCC GCCAACAGCC 1380 

TTTTAAGAAC GGCTATGCGG TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 14 4 0 

CTTGCGTTTT TCAGATAGTG ATATAGCCTA TTTGGAGTCC CTTGGTTATC ATCGGGCCTT 1500 

CTTGGATTAC CTTCGCAATT TCAAGTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 1560 

TTTGGTTTTT GCTAATGAAC CGATTGTGCA GGTGG AAGG A CCTCTAGCCC AATGTCAGTT 162 0 

GGTCGAAACG GCTCTTTTGA ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 168 0 

TCGTATTCGT TCGGTTATCG AAGATGAACC CTTGATGGAG TTT GGG AC AC GTCGGGCTCA 1740 

AGAAATGGAT GCGGCCATCT GGGGAACACG CCCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

CAGCAACGTG CGTGCGGGTA ACCTCTTTGA , CATTCCTGTT TTGGGAACCC ATGCCCATGC 18 60 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC 114 60 

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTT ATT C CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT AT GAT CT AAA AG A T AT ATT T AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCCTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTCGATTCA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACT G AG C C TCACCAGATA CAACTTTAGA 11760 

GTTTGCC GAT AAGCTATCAT C AG C AAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AG AAAAC AG A CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 1194 0 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA C AC AG AAGC T 120 60 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 1212 0 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC ma AATGC AT A 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC CGGCATGTCG 1224 0 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGG AAGGG AT 12300 

AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAC CGGATAAGGA 12 3 60 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12 420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 124 80 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12 600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12 660 

AGGGG 12 665 

(2) INFORMATION FOR SEQ ID NO: 13 5: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5305 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TT ATT GT ACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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TCAGCTTGGC 


AGATACTTTG 


TCCCTTCACT 


ATGCAATGCG 


CAATGAGTTG 


ACTCTTAGCC 


9660 


CAGTTGACTT 


CCTTCTTCGT 


CGTACCAATC 


ACATGCTCTT 


TATGCGTGAT 


AGCTTGGATA 


9720 


GTATCGTTGA 


GCCAATTTTG 


GATGAAATGG 


G ACG ATTCT A 


TGACTGGACA 


GAAGAAGAAA 


9780 


AAGCAACTTA 


CCGTGCTGAT 


GTCGAAGCAG 


CTCTCGCTAA 


CAACGATTTA 


GCAGAATTAA 


9840 


AAAATTAAGA 


AAAAATAAAA 


GAGGTGGAGG 


GCAGCATTCC 


TTGTCGCCCC 


TCCCTTCTTT 


9900 


TTAATGGAGA 


CAGAAAGATG 


ATGAATGAAT 


TATTTGGAGA 


AT TTCT AGGG 


ACTTT AAT CC 


9960 


TGATTCTTCT 


AGGAAATGGT 


GTTGTTG C AG 


GTGTGGTTCT 


TCCTAAAACC 


AAGAGCAATA 


10020 


GCTCAGGTTG 


GATTGTGATT 


ACTATGGGTT 


GGGGG ATTG C 


AGTTGCGGTT 


GCAGTCTTTG 


10080 


TATCTGGCAA 


GCTCAGTCCA 


GCTTATTTAA 


ACCCAGCTGT 


GACCATCGGT 


GTGGCCTTAA 


10140 


AAGGTGGTTT 


GCCTTGGGCT 


TCCGTTTTGC 


CTTATATCTT 


AGCCCAGTTC 


GCAGGGGCCA 


10200 


TGCTGGGTCA 


GATTTTGGTT 


TGGTTGC AAT 


TCAAACCTCA 


CTATGAGGCA 


GAAGAAAATG 


10260 


CAGGCAATAT 


CCTGGCAACC 


TTCAGTACTG 


GACCAGCCAT 


CAAGGATACT 


GTATCAAACT 


10320 


TGATTAGCGA 


AATCCTTGGA 


ACTTTTGTTT 


TGGTGTTG AC 


AATCTTTGCT 


TTGGGTCTTT 


10380 


ACG ACTTTC A 


GGCAGGTATC 


GGAACCTTTG 


CAGTGGGAAC 


TTTGATTGTC 


GGTATCGGTC 


10440 


TATCACTAGG 


TGGGACAACA 


GGTTATGCCT 


TGAACCCAGC 


TCGTGACCTT 


GGACCTCGTA 


10500 


TCATGCACAG 


CATCTTGCCA 


ATTCCAAACA 


AGGGAGACGG 


AGACTGGTCT 


TACGCTTGGA 


10560 


TTCCTGTTGT 


AGGCCCTGTT 


ATCGGAGCAG 


CCTTGGCAGT 


GCTTGT AT T C 


TCACTTTTCT 


10620 


AGTTTATACT 


CTTCGAAAAT 


CAAATTCAAA 


CCACGTCAGC 


GTCGCCTTAC 


CCTACTCAAG 


10680 


TACAGCTTGC 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


TAGAAAACAA 


10740 


TTATGTTGAT 


AGAGCTTGGG 


CAAGAGCCCA 


ATTTCAGCAA 


AAAATGAAGT 


AAATCTTCTC 


10800 


ATAATAAAAC 


GCATCATATC 


AAG CAC G AAA 


ATT CC ACG AG 


GTCAACTACA 


CTCAGAAAGC 


10860 


TGAACAACAA 


GCCAAAACGC 


CCAAAAAAGG 


CGGCAAAAAG 


CAAGCACCTG 


CAAGCAACGT 


10920 


GCCGAAATGG 


TCAAATCCTG 


ATTATGTCAA 


CGAATTAGAC 


CCAAAAATCG 


TTGATATCCT 


10980 


AGTAGAATTT 


CACAAGTCAC 


AAGGC AC TTT 


GGAAACTCCC 


GAGGCGCAAG 


CAGAAATCGC 


11040 


CCAAAAACGT 


GAAGAAATCG 


AGCAAAGGAG 


AGCTGAGCTT 


GAGGGTAAAA 


AAC AAG AG CT 


11100 


TTTGAACCGC 


TTGAACAAAT 


AGAGTTTCGC 


AAGTATTATG 


CTTACAAATT 


ACTTGAGCAA 


11160 


TTAACTAAAA 


TATAAACCCT 


GCCTTTATAT 


CTAGGCAGGG 


TTT AT ATT TT 


AGAAATTCAC 


11220 


GTAGGTTGTT 


ACGGTTTTTA 


CATACCCAGT 


ATAGTTTGAG 


TTTCTATAGT 


ATTCAGTGAT 


11280 


AAACTTC C AT 


TTTCTTTGAG 


CAACATGGAT 


ATAAGTACTT 


GTTATGTACT 


ATGGATATGG 


11340 


GCTTTGTGAA 


TCCAAGTAAG 


ACTCATAAGC 


TTGTATACCA 


AAATATGCTC 


CACCAATTAT 


11400 
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TTGAACGAGA 


CAGGAGAACT 


' CTTTGAGCCA 




rvilLiLvjCAA 


GGAACAACTC 


7920 


TACAAGGGCT 


GGAAGAAGGC 


TCTGAAACfA 






AGACGACTAA 


7 9 80 


TACTGGCAGA 


ATAAAGCGAT 


TTATTTAGAA 






TCAAAGAAAA 


804 0 


CACGTGAATT 


GTC AATT AAA 


AAA A TCXC A nr. 




oUALLTLliG 


ATTATCGGTG 


8 100 


GAGGAATCAC 


AGGAGCTGGT 


GTAGCCTPGr 




I AULOGTC T T 


GAGACTGGTT 


816 0 


TGATTGAAAT 


GCAAGACTTT 


GCAGAAfV^AA 




1 1 LAALAAAA 


TTGGTTCACG 


8220 


GAGGACTTCG 


TTACCTCAAA 


CAATTTGACG 


TAHAAflTfinT 


1 U A(jA i At. l»» 


G rTTCTGAAC 


82 90 


GTGCAGTGGT 


TCAACAAATC 


GCTCCACACA 




nUAI LLAA 1 Vj 


CTCTTACCAG 


8340 


TTTACGATGA 


AGATGGAGCA 


ACCTTTAGCC 




1 AAAU I A.k_jt_ L_ 


ATGG ACTTGT 


84 00 


ACGACCTCTT 


GGCAGGTGTT 


AGCAACACAC 


CACIPTflfY^ AA 


pal r^r , TT , *T w rr* 
UAAOOi 1Mb 


AGC.AAGGATC 


84 60 


AAGTCTTGGA 


ACGCCAGCCA 


AA CTTG AAG A 


AGGAAGGCTT 


wVj 1 nuunuu 1 


oOnb 1U1 AIL 


8520 


TTGACTTCCG 


TAACAACGAT 


GCGCGTCTCG 


TGATTGAAAA 


V— A 1 LAAA(_U I 


AAC C AAG 


8580 


ACGCTGCCCT 


CATTGCCAAC 


CACGTGAAGG 


CAfiAAftnr'TT 


1 I L»Al_ 


VjAAAGTG g c a 


8640 


AGATTACAGG 


TGTTGTAGCT 


CGTGATCTPT 




Avj lbi i 1 bAA 


ATCAAGGCCC 


8700 


GTCTGGTTAT 


TAATACAACA 


GGTCCTTGGA 


v> 1 \j A i nAnu 1 


ALblAAl I I Vj 


TCTAATAAGG 


8760 


GAACGCAATT 


CTCACAAATG 


CG C C CAACT A 




v_ I 1 L»Vj 1 AO I A 


Unl 1 LAAb(_A 


8820 


AAATCAAGGT 


TTCACAGCCA 


GTTTACTTCG 


ACAGAGGTTT 




LljIAl L»0 I V_ i 


8 880 


TTGTTCTCCC 


ACGTGAAAAC 


AAGACTTACT 




i wA 1 #i Unun(. 


1 ALALAUO i Vj 


8940 


ATTTGGAGCA 


TCCAAAAGTA 


ACTCAAGAAG 


ATGTAGATTA 


tit & rrTrrr 

i <— i Al- i i buL 


T^TLAACA 


9 000 


ACCGCTTCCC 


AGAATCCAAC 


ATCACCATTG 


ATGATATCGA 


AAGrAGPTfY" 


UCAvjIj 1L 1 IL 


9060 


GTCCATTGAT 


TGCAGGGAAC 


AGTGCCTCTG 


ACTATAATGG 


1 uOnnn 1 AAw 


v»V> 1 AH_A I \- A 


9120 


GTG ATG AAAG 


CTTTGACAAC 


TTGATTGCGA 


CTGTTGAATC 


TTATCTCTCC 


AAAGAAAAAA 


9180 


CACGTGAAGA 


TGTTGAGTCT 


GCTGTCAGCA 


AGCTTGAAAG 


TAGCACATCT 


GAGAAACATT 


9240 


TGGATCCATC 


TGCAGTTTCT 


CGTGGGTCTA 


GCTTGGACCG 


TGATGACAAT 


GGTCTCTTGA 


9300 


CTCTTGCTGG 


TGGTAAAATC 


ACAGACTACC 


GTAAGATGGC 


TGAAGGAGCT 


ATGGAGCGCG 


9360 


TGGTTGACAT 


CCTCAAAGCA 


GAATTTGACC 


GTAGCTTTAA 


ATTGAT CAAT 


TCTAAAACTT 


9420 


ACCCTGTTTC 


AGGTGGAGAA 


TTGAACCCAG 


CAAATGTGGA 


TTCAGAAATC 


GAAGCCTTTG 


9480 


C GC AACTTGG 


AGTATCACGT 


GGTTT GG AT A 


GCAAGGAAGC 


TCACTATCTG 


GCAAATCTTT 


9540 


ACGGTTCAAA 


TGCACCGAAA 


GTCTTTGCAC 


TTGCTCACAG 


CTTGGAACAA 


GCGCCAGGAC 


9600 
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TGGCCAGCAT 


' ATGCGGATTG GTCTGGATTT 


' GACATCTGGT 


' TTTCTTGTCT 


TTTCAAGGAT 


6120 


GGCAGCCATT 


1 TTGAAACGGT 


ATTTGGAATA 


. CAATCGTTTT 


ATTACCATTG 


AAGCTTATGA 


6180 


CCCTAGTCGG 


CATTATGATT 


TGCTGGTTAC 


CAATAACCCG 


ATTCATAAGA 


AGGAACAGAC 


6240 


ACCAGTCTAT 


TATTTAAAAA 


ATGACTTGGA 


TAT GG AGG AT 


TTGGTAGCGA 


TTCGCCAGTT 


6300 


ATTATTCACT 


TAAAAGGCTT 


GGTTAATCCA 


GGTCTTTTTT 


GTGAAATTCA 


CACAATCTCC 


6360 


TCACATTTTT 


TTAAAAATTA 


AAAAAAGTTG 


ATAAACAAGA 


AAGCGCTTTA 


TTTTGTATAC 


6420 


TAGTAAGTGT 


AAAGAGGAAA 


CACCTCAAGA 


TCTTTATCAG 


GAGGACAGTA 


CATGTCACAA 


6480 


GAAAAATACA 


TCATGGCCAT 


TGACCAGGGA 


ACTACAAGTT 


CTCGTGCCAT 


CATTTTCAAC 


6540 


AAAAAAGGGG 


AAAAGGTTAG 


CTCGAGTCAA 


AAAGAGTTTA 


CCCAGATTTT 


CCCTCAGGCA 


6600 


GGTTGGGTTG 


AGCACAATGC 


CAATGAAATT 


TGGAACTCTG 


TTCAGTCAGT 


TATTGCGGGT 


6660 


GCTTTCATCG 


AAAGTGGTGT 


CAAGCCAAAT 


CAAATCGAGG 


CAATCGGGAT 


TACCAACCAA 


6720 


CGTGAAACAA 


CGGTTGTCTG 


GGATAAGAAA 


ACAGGACTTC 


CTATCTACAA 


TGCTATCGTT 


6780 


TGGCAGTCAC 


GCCAGACAGC 


ACCTTTGGCT 


GAGCAACTAA 


AAAG CCAAGG 


TTATGTGGAA 


6840 


AAATTCCATG 


AAAAGACTGG 


TTTGATTATT 


GATGCTTACT 


TCTCTGCTAC 


CAAGGTTCGT 


6900 


TCGATTTTGG 


ATCATGTAGA 


AGGTGCTCAA 


GAG CGAG C AG 


AAAAAGGGGA 


ATTGCTCTTT 


6960 


GGTACTATCG 


ATACTTGGTT 


GGTTTGGAAA 


TTGACTGACG 


GTGCGGCTCA 


CGTGACTGAC 


7020 


TACTCAAATG 


CAGCTCGTAC 


CATGCTTTAT 


AAC ATT AAAG 


AACTCAAATG 


GGATCATGAG 


7080 


ATTTTGGAAA 


TCCTT AAC AT 


TCCGAAGgCT 


ATACTTCCAG 


AAGTTCCTTC 


TAACTCCGAA 


7140 


ATCTACGGCA 


AGACAGCTCC 


ATTCCATTTC 


TACGGTGGAG 


AGGTGCCAAT 


CTCAGGTATG 


7200 


GCTGGGGACC 


AACAAGCAGC 


CCTCTTTGGA 


CAGTTGGCTT 


TTGAGCCAGG 


TAT GGTT AAG 


7260 


AATACTTATG 


GAACAGGCTC 


TTT CATCATC 


ATGAATACTG 


GGGAAGAGAT 


GCAGTTCTCT 


7320 


GAAAACAACC 


TCTTGACAAC 


C ATTGGTT AC 


GGAATCAACG 


GTAAGGTTTA 


TTATGCCTTG 


7380 


GAAGGTTCTA 


TCTTCATCGC 


AGGAAGTGCT 


ATTCAGTGGC 


TTCGTGACGG 


TCTTCGCATG 


7440 


GTTGAAAATT 


CACCAGAATC 


TGAAAAATAC 


GCTCGTGATT 


CTCACAACAA 


CGATGAAGTT 


7500 


TATGTCGTTC 


CAGCCTTTAC 


AGGTCTAGGC 


GCTCCATACT 


GG AACCAAAA 


TGCTCGTGGT 


7560 


TCCGTCTTTG 


GTTTG ACT CG 


TGGAACAAGC 


AAAG AAG ACT 


TTATCAAGGC 


GACTTTGCAA 


7620 


TCTATTGCTT 


ATCAAGTCCG 


TG AT AT CAT C 


GACACCATGC 


AAG TGG AT AC 


TCAGACCGCC 


7680 


ATTCAAGTAC 


TGAAGGTGGA 


TGGTGGTGCA 


GCCATGAACA 


ACTTCCTCAT 


GCAGTTCCAG 


7740 


GCGGATATTT 


TAGGCATTGA 


CATTGCACGT 


GCTAAAAACC 


TGGAAACAAC 


AGCTCTAGGA 


7900 


GCGGCCTTCC 


TAG C AGG TTT 


GTCAGTAGCG 


TACTGGAAAG , 


AC TTGG A C G A < 


GTTGAAACTC 


7860 
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ATGAGGGGAG 


TTATAGAGTT 


GTAAGGATTT 






AACGAGGAAT 


4380 


TGACCATTTC 


CAACAAAAGG 


TCCGACTAGG 


ACT TP A CC AC. 




T i TGATGTCA 


4440 


AC AC C AGG AT 


TTTGAACATA 


G CCTTTG ACG 




I A I O AGC-GAC 


GGTGATAATA 


4500 


GCACCTAGAG 


AGCTAGATCC 


CAACACCTTA 




ITjoTAT 1 i CC 


TTTTTCATTG 


4560 


GCTGCGAGAA 


TCTGGCTAGC 


GATAAGAGTT 


CCACCn. arrr 


t~ i At_ AGTTGA 


GCTAGCTTGG 


4620 


GTTTGATGTT 


TTTCTTGAGC 






I ATCAAGu AC 


AAAAGCACGA 


4680 


AAGGcTCCGC 


TTTCTGATAT 


AGTTTTAATA 


f\ I I 1 1 rt 1 LLrt 


1 AO O TACT AT 


TTTAGCATAA 


4740 


AAATGCCCAA 


AGGGGGAGCC 


GTGTGTTTAC 


TGATTTTCAG 


OA 1 HA i LjUAL 


CAGCjAAATCA 


4800 


GCATGAAAAT 


AAAAAGAGAA 


ACAGATTATT 


TT AGCATTTf: 


1 L AliA I 1 1AI 


oCTATGCTTA 


4860 


AGGTAGAAAA 


TGAAAGGGAT 


AACAAATGTA 


TTTAGGAHAT 


Tvp/* » rp/— /— ft. 
* I OA i UbA(jA 


AAGCCGAGTG 


4920 


TGGTCAATTT 


TCAATACTTT 


CCTTTCTATT 


ACAAGACTr" 1 


Plf j r^r* i\ err* 

\- AO A O O At. O O 


TCAAGGCTGT 


4980 


AATGGAAGAA 


ACAGGATTTT 


CAAAAGCAAC 




I A roTCACCC 


TCCTCAATGA 


5040 


CAAGGCTTTG 


GATAGTGGCT 


TAGAGCTGGC 


TAT^CACTC & 


l"* A ft ft TV* ft A ft 

yAAo A 1 UAAA 


ATCTGCGTCT 


5100 


GTCTATCGGT 


GCAGCTACCA 


AGGGGAGAGA 


T ATTCGG AG C 


I 1 O i 1 1 L 1 OO 


aoaCjTGCTGT 


5160 


TAAATACCAG 


ATTTTGGTTT 


ATCTTCTCTA 




i i n iaoccc 


ATCAGCTGGC 


5220 


TCAAGAATTG 


GTGATTAGCG 


AGGC T ACGCT 




4 roooT&oTT 


TAAATCAGAT 


5280 


TTTGTCAGAA 


TTTGATTTAT 


CCATCCAAAA 


TGGC CCTTCfi 


^UAlilj 1 OOAO 


A(jCA 1 CAGAT 


5340 


TCACTATTTC 


TATTTCTGTC 


TTTTCCGAAA 


«VJ * V. 1 VJV 4 


AO L V_AOOAA L 


GGGAAGGTCA 


5400 


CATGCAGAAA 


CCAGAGAGAA 


AACAGGAGAT 


TG CCAAT T^A 


unUVjAAn ILL 


OCGGTGCAAG 


5460 


TTTGTCTCCG 


GGGCAGAAAT 


TGGACTTGCT 


TCTCGGGrT 

* W 4 4 WVJVJv 4 


A I Lnu i v_ 


AACAACGTCT 


5520 


TCGGGTCAAT 


GCTTGTCAGT 


TTCAAGTCAT 


AGAAGAGAAA 


A T ("^ f*CL A C* r* f* 1* 


ft T< ' 1 w T Irf f m « m 

a n : i\jACAA 


5 580 


T AT CTTTTAT 


CTTCGTTTG C 


TGAGAAAGGT 


TCCCTCCTTT 


TTTGCTGGGC 


AACATATTCC 


5640 


ACTAGGAGTT 


GAGGATGGTG 


AG ATG ATG AT 


ATTCTTCTCT 


TTTCTCCTAT 


CTCATCGCAT 


5700 


TCTTCCTCTT 


CATACTATGG 


AGTATATTCT 


TGGTTTTGGA 


GGGCAGTTGG 


CAGATTTACT 


5760 


GACGCAATTG 


ATTCAAGAAA 


TGAAGAAGGA 


GGAACTATTG 


GGGGATTATA 


CAGAGGACCA 


5820 


TGTCACCTAT 


GAACTCAGTC 


AGCTTTGTGC 


TCAAGTCTAT 


CTCTATAAGG 


GCTATATTTT 


5880 


ACAGGATCGC 


TACAAGTACC 


AGTTAGAGAA 


TCGTCATCCA 


TATTTACTGA 


TGGAACATGA 


5940 


TTTTAAAGAG 


ACAGCAGAGG 


AGATTTTTCA 


TGCTCTACCT 


GCTTTTCAAC 


AGGGGACAGA 


6000 


TTTAGATAAG 


AAGATTCTCT 


GGGAATGGCT 


CCAGTTAATC 


GAATATATGG 


CTGAAAACGG 


6060 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATCGTACTA TGTCAATGGC 258 0 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 264 0 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2 7 00 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 27 60 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TA8AGACTAC GGAGTTTTTT 2 82 0 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2 880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTCC CAGGGTGCTG GCTTGCGAAA 2940 

TGGCTCCACG G AGTTTGGC A GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3 000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TG AGGTT AAT CAATCGTTTC AAGTGTTCGT 3 060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTG GTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AAC TTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3 240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3 300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTC 3 3 60 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 34 2 0 

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 34 80 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTC TTGACAGGGC 3 54 0 

AG CCC ATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3 600 

GTGCTAGGCT CTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3 660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTCTATTC GATTCCCTTG TCAGAGACCA 3 7 20 

TTTCCATTAC AACGAGTCCA GCTCCG AG CT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3 7 80 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3 840 

AAGGTGT AT T AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3 900 

GT AGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3 960 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4 0 20 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4 080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 414 0 

ATCCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4 2 00 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4 2 60 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4 3 2 C. 
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CGAGATAAGG 


CAAGGAAAGA 


GGTCGAGGAA 


T ATG T AAAAA 


AAAT AG TGGG 




840 


GCAAAATCAA 


CTAAAAAGCG 


AC AT AC AATT 


ACTGTAGCTC 


TACT AAC C. A 






ATTAAGAACG 


AGT ATTT G AA 


TAAAATAGTT 


GAATCAACCT 


CAGAAAr.rra 




q f; a 


CTGATGATGC 


AGAGTCGATC 


AAAAG T AG AT 


GAAGCTGTGT 


CTAAGTTTGA 






TCTTCTTCGT 


CAAGTTCAGA 


CTCTTCCACT 


AAACCGGAAG 


CTTCAGATAP 


a a a n rr & 


i nan 

1UOU 


AACAAGCCGA 


CAGAACCAGG 


AG AAAAG GT A 


GCAGAACCTA 






X 1 4 (J 


GAGAAAAAAG 


CCAAGGATCA 


AAAAGAAGAA 


GATCGTCGTA 








AAAACGCTTG 


AACTTGAAAT 


TGCTGAGTCC 


GATGTGGAAG 


TTAAAAAAP.P 


Ownw^. 1 i u/in 


1 i □ u 


CTAGTAAAAG 


TGAAAGCTAA 


CGAACCTCGA 


GACGAGCAAA 


AAATTAAfirA 




1 lift 
1 j£ J 


GAAGTTGAGA 


GT AAACAAG C 


TGAGGCTACA 


AGGTT AAAAA 


AAATPAAr. A C* 


Aun OrV/K 


iian 
iJoU 


GAAGCAGAAG 


AAGAAGCTAA 


ACGAAGAGCA 


GATGCTAAAG 


AGCAAGGTAA 




1 a a n 


CGGGCAAAAC 


GAGCAGTTCC 


TGGAGAGCTA 


GCAACACCTG 


AT AAAA AAG A 


A A A TfZ A.T^fT' 


1 jUU 


AAGTCTTCAG 


ATTCTAGCGT 


AGGTGAAGAA 


ACTCTTCCAA 




\j /vrt-rt <k» A v» Art 


I joU 


AAAAAGGTAG 


CAGAAGCTGA 


GAAGAAGGTT 


GAAGAAGCTA 


AGAAAAAAfif 






AAAGAAGAAG 


ATCGCCGTAA 


CTACCCAACC 


AATACTTACA 


AAA CGCTTG A 


APTTA A ATT 


i can 


GCTGAGTCCG 


ATGTGG AAG T 


TAAAAAAGCG 


GAGCTTGAAC 




nfiAA^rTA Ar: 


1 *7 A Ci 


GAACCTCGAA 


ACGAGGAAAA 


AGTTAAGCAA 


GCAAAAGCGG 


AAGTTGAGAG 


TAAAAAA^PT 




GAGGCTACAA 


GGTTAGAAAA 


AAT C AAGAC A 


G AT CGT AAAA 


AAGCAGAAGA 


Afi AA f^f^T A A A 


130U 


CGAAAAGCAG 


CAGAAGAAGA 


TAAAGTTAAA 


GAAAAACCAG 


CTG AAC AAC C 


ACAACCAGrr. 




CCGGCTCCAA 


AAGCAGAAAA 


ACCAGCTCCA 


GCTCCAAAAC 


C AG AG AATf 


Anr^HAAPAA 




CCAAAAGCAG 


AAAAACCAGC 


TGATCAACAA 


GCTGAAGAAG 


ACTATGCTCG 


TAG ATP AG A A 




GAAGAATATA 


ATCGCTTGAC 


TCAACAGCAA 


CCGCCAAAAA 


CTGAAAAACC 


AGCACAACCA 


2100 


TCTACTCCAA 


AAACAGGCTG 


GAAACAAGAA 


AACGGTATGT 


GGTACTTCTA 


CAATACTGAT 


2160 


GGTTCAATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGCTCAT 


GGTACTACCT 


C AAC AG C AAT 


2220 


GGCGCTATGG 


CGACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTATCT 


AAACGCTAAT 


2280 


GGTTCAATGG 


CAACAGGATG 


GCTCCAAAAC 


AATGGTTCAT 


GGTACTACCT 


AAACGCTAAT 


2340 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2400 


GGTTCAATGG 


CGACAGGATG 


GCTCCAATAC 


AATGGCTCAT 


GGTACTACCT 


AAACGCTAAT 


2460 


GGTGATATGG 


CGACAGGTTG 


GGTGAAAGAT 


GGAGATACCT 


GGTACTATCT 


TGAAGCATCA 


2520 
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GTCCCAGTTT C ACT AG CTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2 940 

ATAAGCTTGC TC AAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG CACAAAGAGC 3000 

AACTTAGTTT TTTCTTG AT A GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3 060 

TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA 312 0 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 318 0 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 324 0 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3 300 

GCAGTTGCAT ACATGGTTTC CAT C ATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAC 3 3 60 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTCTGTT AATAGCTGGT 3 42 0 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 3 4 80 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

( D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TT CG AT AG AG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 12 0 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT G TTGGG AG AT 24 0 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 3 00 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 3 60 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTT ATTTTG 42 0 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 4 80 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 54 0 

GTTAAAGAAA AAATAT AG AA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAG AAAAGT A 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 72 0 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CCATTCAGAA 7 30 
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CTTGTCGTCA TACCAGTTTT TGATACCACA GAG TTGGGGT TTGGAAATGA TCAGACCAGT 1200 

TTCTTCTTTC ACTTCACGAA TGACAGCATC G AC AAAGG AT TCGCCACGTT CAACATGACC 12 60 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCGG TCTTGGACCA GGACCTTATC 1320 

TCC GTTTTT A ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 1380 

CAACCTTTAA TCTTTAATCA TAATGCAGAC TTCCCCCCAC CC AGC CGGT A CAGAGGGCAG 14 4 0 

AAGTGATGTT AAAGCCACCC GTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA 1500 

GGCCAGGTAC CACCTTACTT TCAAGCGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC 15 60 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT T AC AGG AATT TTAAGTTCTT 162 0 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1630 

ATCCTTGTAC AAAAAATTCG GCCAAGCCTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA 174 0 

AGGATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCT C AG AA AGTTGAGGCA 1800 

AAACATCGAG TGAGAGAACC TCCCCACCTT TG AC AAAGCT AGACATGCGT AGGGCAGCAG I860 

GACCTGACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT GATGACATGC TTACCATAAC 1920 

TTAGGGTCAC ATCGTCCAGA GAAATACCTT GTAAGGCTTT ATGTCGAAAA TCTGTTAATA 1980 

AAGCACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCGT 204 0 

GACCAAAACC AGTCG AAC C A GTCGAAGGAT AAGACTTACC ACCTCTTCTG ACAATGAGTT 2100 

TCTCACAAGT GAAGGTTTCA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGCTCATT TTCTTTTCCA 2220 

AAGCTTCGAT AATAGTCCGA CACTTGTCAC TGGCTGGAAA GACGCGTCCG TGGTCTTCGA 2 2 80 

CCTTAAGTTT AACACCATTT TCTGTAAAAA AGTTGATCAT GTCATGATTA TCGAACTGGG 2 3 40 

AGAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTAGCAGG TTGTCTAAGC 2 4 00 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT TTTCCAAG^T 2 4 60 

TCCGATTTTT TTCGATGAGG AGGGTTTTCT GTCCATAAAA GCTACTGGAA ATCCTAGCCA 2 520 

TCATACCAGC AGGTCCCCCA CCGATGACAA TAG T ATCAAA ATGTTTCATA GCTCTATTGT 2 580 

ACCACAAAAA AACAAGAGAT G AT GGTC AC C TCTTGTCAAG AATGCAATTA ATCAATTTCA 2 640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 2700 

ATTTTAATAT CCGCTTGTGG GAAGGTTTCA CGGATTCGCT CTGAGACCTG TTGACAACAT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CATATCCAGC TTTTACTAAC 2 820 

T CATC AT AG G CAGCTTGAAC TGATTTCTTT GATCCCCTTG CTTTTTGTAG CAATTCGAGA 2 880 
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TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AG CT AG AAAG 9 3 60 

CTTGATACAA TAGGCTTTTT AAAG AC TG AT TATTTAACAG CGTCTTTAAG AGCTTTACCA 94 2 0 

GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 94 80 

CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 9540 



T 9541 
(2) INFORMATION FOR SEQ ID NO: 133: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 



TTGACTATCC 


TATCATGCTT 


TCTAAGGTCT 


ACTCAAGAAA 


ATCATTTTCA 


AGTTTTCACA 


60 


CCTTTCTCAA 


AAAAGTTAAA 


AAATTTTCTC 


AAAAACGCTT 


GACTCTGACC 


TAAGGCGAAG 


120 


GGTTATACTA 


TCATTGTAAG 


GAGGAAATCA 


TGTACCATAT 


AAAAGAAGCT 


GCGCAGCTTT 


180 


CAGGTGTCTC 


TGTC AAG AC C 


CTGCATCACT 


ATGACAAGAT 


AGGACTCTTG 


GTCCCCTTAA 


240 


AGTCGGAAAA 


CGGCTATCGA 


ACCTACAGTC 


AAGAGGATTT 


GGAACGCCTT 


CAGGTCATTC 


300 


TTTACTACAA 


ATATCTAGGC 


T TTTCTTT AG 


AGAAAATAGC 


AGAGCTGTTA 


AAG G AAG AAA 


360 


GGACAGATTT 


ATTGCCCCAT 


TTGACTAGGC 


AGTTCGACTA 


TCTAACTCGC 


GAAAGGCAAC 


420 


ATCTGGATAC 


CTTGATTTCC 


ACCTTGCAAA 


AAACTATTCA 


AGAACAAAAA 


GGAGAAAGAA 


480 


AAATGACCAT 


TGAGGAAAAA 


TTCACGGGAT 


TTAGCTATCA 


AGACAATCAA 


AAATACCACC 


540 


AAGAAGCGGT 


AGAGAAATAT 


GGTCAAGAAG 


TCATGGGACA 


AGCGCTCGAA 


CGCCAAAAAG 


600 


GTCACGAAGA 


CGAGGCTACG 


GCCGCCTTTA 


ACCAACTCTT 


TCAAACTTTG 


GCACAAAATC 


660 


TTCAAGTTGG 


TTTACCTGCA 


ACAGCAACCG 


AAAACCAGGA 


GCAAGCAGCC 


AAGCTCTTGC 


720 


AAGCCATTCG 


CACTTATGGA 


TTTGACTGCT 


CTATTGAGGT 


ATTCGGTCAT 


AT CGGT AAAG 


780 


GTTACGTCTA 


CAACCCAGAG 


TTTAAGGAAA 


ACATTGACAA 


GTTTGGTTCT 


GAAACAGCCC 


840 


AGTACACGTC 


AG ATGCC AT T 


GCGGTTTACG 


TTCAGACAAA 


TGCAGAATAA 


ATAGGCTAGG 


900 


AATTTCCTAG 


CCTATTTTTT 


ACTTCAAATC 


AT AAAG CC AG 


TCGTCACCGT 


TTTTGTAGTA 


960 


AAAGAATTCA 


CTGAGATCTT 


CTTCTAGAAA 


CACACGAAGC 


ATATCAGACA 


TATCATCGGT 


1020 


TG C AAGTTTT 


AGATGAGAAA 


GATTTTCAAA 


GTCCTCCCAC 


CAAACTTTCC 


CTTCGTCTGA 


1080 


AGACTGGAGT 


TCACCAGTAA 


AGTGTTCTGT 


CTTGTAAAAA 


AGGACGACAT 


AACGATAATC 


1140 
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TGTTTTC CAT 


AGACAATTTA 


GCTAAAACCA 


AACG AGCTTT 


7620 




»jA 1 AULA 1 L»C 


CGACTGATI I 


TTTAACATCA 


TCTCCTGAGA 


AAAGGAAGGC 


7680 




l I uv.uuA 1 1 1 


LAAL 1 1 L IbO 


TGTCaGTTTG 


AAATCATTCC 


AGAGTTCATC 


7740 




1 1 AL i i VjO I Lj 


TCAGCTTGCT 


TTGGGTTTGG 


TCATAGTAAC 


CAACCTCAAC 


7800 




AAtjCL»CTTTT 


CTCCCTTGAT 


AAAAGGAATC 


TGGTCCACAA 


TAGACTTGAT 


7860 




I ILiCLGATAC 


CATTTGGACC 


AACGATAGCG 


ACAGCATTCA 


TCTTACGAAG 


7920 


A 1 v_ i A\jO i In 


A 1 Lbo 1 TVj i vj 


AC AAG AC TTC 


CCCGTCATAG 


CCAACAGCTG 


CATTTTCAAC 


7980 




a /** ft Ttrrprr 

ALA 1 ILiLLLu 


ACGTTTTTTC 


AGACTGGAAG 


GTCATGTTGG 


CTGATTTCTT 


8040 


GCCAGCTTCA 




ft »/»p7W«(-i^ ft m 

AALu I i LLA 1 


TTTTtCCAGT 


TGTTTACGGC 


GAGATTGAGC 


8100 


ArnTTTAnTP 
oV.vj ill nu L \_ 


IjI I bAAOL. AL 


GAACTAGATT 


GCGATTGACA 


AAGTCTTCCA 


GAG C AGC GAT 


8160 


l 1 ^_ ± 11. 1U1 


TGCTTTTCAT 


AGTTTTTTGC 


CTCAGTAACT 


AGCTTTTGCT 


CCTTCAATTC 


8220 


GAPAAAArr:A 


VxAIj 1 AA I 1 V_l_ 


F*f* ft ft t& ft 

CCACATAGCG 


ATCCAAGGAA 


TCCTTGGTCA 


AATCTAGCGT 


8280 




AV_ L 1 1 Lt I LLA 


ft ^ ft ft x ft ft r*f+ 
AGAAATAaCG 


GTCGTGGCTG 


ACGATAATGA 


GGGCACCGCT 


8340 


ATACiTTTAPf* 


AAu i AA 1 IV. 1 




GATGGTTTCA 


ATATCCAAGT 


GGTTAGTTGG 


8400 




ALUAAunuA i 


TGGCCTTTTC 


AAGGAGCATT 


TTGGCAAGTG 


CCAAACGAGT 


8460 




LLAIjAAALiL 1 


LALiLAAT I TT 


CATCTGCCAC 


ATAGACTCGT 


C AAA CTTG AA 


8520 


TCCATTPAAA 


ATrrrTrr a a 


TATCaGCTTC 


ATAGGTAAAG 


CCACCTGCTT 


GGCGAAAATT 


8580 


CTCAGATAAR 


v.%_»Vj 1 LA I AA 1 


L 1 VjALA 1 V-ALt 


rpnvp ft o ^ ft ft ft 


TCCTCACCAG 


ACTTTTCACC 


8640 




i LLAl C 1 LiAC 


GCAGTTGTCT 


CTC CGTCC G A 


CGCAAATCAT 


TAAAGACATG 


8700 


AAGCATTTrA 


1 LVj 1 ALtA i 


TATTTTCAGA 


CT C AAAACGG 


CTATCTTGGG 


CTAGGTAAGA 


8760 


CAGAGAAATA 


TCTT TTTT CTT* 


T ATTG A TTT C 


rn/T'fT^ rTy ft /^•T T« 

ILCGC * AGTT 


GGCTCCTCTT 


CTCCAACTAA 


8820 


AATCTTCAAA 


AGAGTACACT 


TACCTCCACC 


ATTTTTCCCA 


ACAAGAGCAA 


TCCGATCTCG 


8880 


TTCATCAACC 


TGCAGGTTGA 


TATTATCGAA 


AAGAACCTCT 


CCTGCAAAAG 


AACGTTCAAT 


8940 


TTTATTAGCT 


TGTAAAATAA 


TCATACAAGT 


AGTATAGCAT 


GTTTCCCTAA 


GGCATTCAAG 


9000 


ATAATCGTAA 


GTCTTTTAGT 


ACAACTTTTA 


TAACATAAAA 


TAAACTAAAT 


TATGTATATT 


9060 


TTATATTAGA 


TTACTTCACT 


ATCTTGTTGG 


ATTTTCTAAC 


CAGCTAATCT 


TGTTTCAAAT 


9120 


AGTTATCGCA 


CAAGTCTATT 


ATTTAATTCT 


TTTCATCATT 


TACGTACGTA 


TAGCAGATTG 


9180 


AAATAAGATG 


AGAACAAATC 


GATTGGGAAA 


GTAAAATTAA 


TTT CT AT AAA 


TGTTTTAGCA 


9240 


ATTGTTTCGT 


ACTATTTTAG 


ATTCAGTCTA 


CTATATACAA 


TATTTTCGGA 


ACATTCAACT 


9300 
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AGCTTTATTG 


GCTGTGCGAG 


AAAAGGCAAA 


AGCAATAGCC 


TGCACAATGT 


TAGCAACAAT 


5820 


CAAAGCGCCA 


ATCATCCAGC 


TATCATTCCT 


TATGAAAGAA 


ATAGCCAGAC 


AAAGAATCCC 


5880 


ACAAACAAGA 


TCTGCCGTCA 


TTAAAATCTT 


ACCACGAGAA 


AAACGGTCTG 


AAATAACTCC 


5940 


GCCAAAGGGA 


TTGACGAGAA 


TAGATGTGAC 


GAGCTCAGAA 


ATCTGATACA 


TTCCTAAAAC 


6000 


TGTCTGTCCT 


ATAGTCCCCA 


TAGAAGCCAA 


CC AG AC ACT A 


TTTCCATAAT 


CAT AG AG CAT 


6060 


ATTTCCCATT 


TTATTGATAG 


CCCCACGGCT 


AATCAACTGC 


ACTGCATAGC 


GATTCATATT 


6120 


AAAGCTCCTC 


TCAAATTTTG 


AAACTATTGT 


ATCAAAACCG 


AAAGG AG CTT 


TTTATTTTTT 


6180 


CCCTTATTTG 


GGAAAATTAA 


CTTTTGACAA 


ATTTTTCGTA 


GTCTTCCTGA 


TAATAGGCTA 


6240 


CTTGCTCTGG 


AAGACCTAAC 


ACATCAAAAA 


TATGCATGGC 


CTCTTGCATC 


TGCTTACAGC 


6300 


CTTCTTTACA 


CTGTCCTTTT 


TGATATAAGG 


CAAAACCTTT 


TAAATAATGG 


AAAACATTAC 


6360 


GCTCATAAAG 


CTTAATACCT 


TTGTCAATAA 


TCTTC TCTGT 


ATAAGCCTCA 


AAATAGTTGG 


6420 ' 


CATTATAAAA 


AGAAGAATGC 


TCTAAACAAT 


GCTGGTAACA 


ATTGAGGGCC 


AAAATCAACA 


6480 


CTAATCTCTT 


ATGGCGACTA 


ATCTCTTGGT 


AAAATTCCTC 


CCTCTCCATA 


ACTTCTCTAC 


6540 


CAATCCGAGT 


GACATAGTCT 


ACATCGTAGA 


AACTATAGAG 


GTT AC C G AAA 


AGAATCAACT 


6600 


CATACATGGT 


CCATTCTTCT 


GTTTTGAAGA 


GATAATCTGC 


TACCTTACCC 


AAATCATCCT 


6660 


GCTTCATATC 


ATAACTCGCA 


TCTCTTTGAC 


AAATCAGACC 


TTGTAGCAAA 


ATCCAGTTCA 


6720 


GCTCAAAATA 


AAGGGG AG T C 


GTCGAACTCT 


TAGACTTTTC 


AAGTTGTTCT 


CTTTGAAGCT 


6780 


TTTGAAAACC 


TGCAATATCG 


TTTGAATAGT 


AAAGTGGGAT 


AATCTGTGCC 


ATCATAGACA 


6840 


CATGTTCATG 


ATTATGAAAA 


TTCCTTGCCT 


TATCCATGAA 


ATTTTCGATT 


GTTACATGAA 


6900 


TGTTATCCAA 


AATCTCAAAG 


AAACGGGAGA 


CTGCCAGGTC 


AGACTCCCCA 


AGCTCAAAGC 


6960 


GAGATAACTG 


AGAGGTAGAG 


CAGGATTCGC 


CTGCTCCTTC 


CTTTAAAGAA 


TAATTTCCAC 


7020 


TTGTTCGAAA 


TTCACGAAAT 


ACTTTTCCAA 


GATGTTCCAT 


CTTTACACCT 


CCTCTGATAA 


7080 


TTCTTCCCAC 


TCAAGCATAG 


CTTCTTCCTG 


ACGATGGCTG 


ATTTTGTCCA 


GCTCACCCTG 


7140 


TAATTCCATG 


AGTTTGT CGG 


CATCGTTTGT 


TTCCAACATT 


TGTTCAGAAA 


TGGCTTGGCT 


7200 


TTGACTTTCT 


AGCTCTTCAA 


TTTCAGCTTC 


TAGACTTTCG 


ATTTGTCGCA 


TGAGTTTGCG 


7260 


AACTTCTTTT 


TGACTTTCTT 


TCTGGGCCTG 


ATACTCATTG 


ACTGGACTTG 


CTTCCTTTGC 


7320 


TTGATTGCTA 


GTTGAAGCTT 


CCTCAGTCTG 


ACTCATTTCT 


GCTGTTGCTT 


TCTTCTCAAC 


7380 


ATAGTACTCG 


TAATCTCCAA 


GGTAGAGAGT 


TGAACCATTC 


TCAGACAATT 


CCAAAACATG 


7440 


AGTTGCCACA 


CGATTGATAA 


AGTAACGATC 


ATGACTGACA 


AACAGCAAGG 


TTCCATCAAA 


7500 


GTCAATCAAG 


GCATTTTCTA 


GCACTTCCTT 


ACTATCAATA 


TCCAAGTGGT 


TGGTCGGCTC 


7560 
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CCACTCCATC AATATCGTAA GGCAGATTTT CCCGTTCCTG TCCTACTTCT TGGATAAAAT 
TCCAGATTTC ATCTATGTTT TCAGCCAAGA TTCGCTTAGG ATTCACCACA AAACCTAGTT 
GTTCT AG GT A CTTCAAACCC TTTTCTTGGC TATCACGAGT TGAAGGGCTG GCTTCTTGAT 
AGAGAAACGT TGCAAGATTA CGCTTGGCAA CTACTGCTGT ATCCAACTGA CGCAGAGTTC 
CTGCTGCCGC ATTACGAGGA TTAGCAAATT CAGGCTCTCC ATTTTCTTGG CGCGCTTGGT- 
TAACTTGGTC AAAGGAAGCG CGTGGCATGT AACATTCCCC ACGAACTGTG ATATCTAGTT 
CTTCTGGCAA AGTCAAAGGG ATGTCCTTAA CACGCTTGAG GTTTTCTGTG ATATTTTCAC 
CAATTGAACC ATCTCCACGT GTTACCCCAG CAACCAAAAT CCCCTTTTCA TAAGTCAGCG 
AGATAGATAA GCCATCGATT TTCAGCTCAC AAATATAGGT CGGATGAGCC ACTTCCTTAC 
GAACACGCGC ATCAAAAGCA TCTAGCTCCT CACATGAAAA AGCATCCTCC AAACTATAAA 
GAGGATACTG ATGACTGTAT TTTTCAAAAC CATCTAAAAC CTTGCCACCA ACACGATGAG 
TCGGACTGTC TGCTAGCACT TGCTCTGGAT AAGCAGTTTC TAACTCGACC AACTCACGGT 
AAAGGCGGTC ATACTCACTG TCTGAAACCG AGGGATTATC GCTGGTATAG TACTCAGTCC 
CATAGCGATT GAGCAAAGCG ACTAACTCAT TCATTCTTTT ATTCATAAGA CCATTTTACC 
ATAAAACAAG CCCTCCTCAC AAACGAGAAG GGCGGAAAAA AC ACTT AG TT TGAAATTATT 
TTTGAAACTC AAGCAACCTT AT AT C AATTT TTCAAAATCA GTTCGAACAT ATCCGAGAGC 
TAAGAAATAT AAGGCTACAA CTCCAACTCC AATAATCAAG AAAGAATAAA GATGGACACT 
TGGCAAGACT GTCATAAATC CTTTTGCAAT AGGCATAAAT AGAATAGCTA AGGTAAAAAT 
TGTACTCAGT ACTCTTCCAA GAAATTCGCT CTCAACCTTG GTTTGTACTT CACTAAAAAA 
GTGAATATTA AAAATCGTCA TAAACAATTC ACAAACTAAA TTTCCAGAAA AGGAAAGAAA 
AGTTGG AAG T GGTAATCCCA TCATAAAAAC TCCGACACCT GTCAAAGCCA GTAAAATCAA 
AAGATTATAA ATATTACCTT TAATTTTACT AG CT AG AAG A GCCCCAATGA TGGAACCAAT 
AGCCCCCATA GTTAAAATAC TTGCATAGGC TCCTTCTGAC CCGTAAAGCT GATTCGAAAA 
GGGAAGTAGA AATTCAAAAG CTGCAAAAAA GAAATTAACG CTGGAAGCTA CCAGCAAAAG 
GAAGAAAATT TCTTGCTGAT GCCAGATATA GTGTAACCCA TCCTTGATAT CTACAAAAAT 
ATCTCTCCCA GTAAAAGCCT TTTTCTCTTG AACTTTTGCT TCCTCTTTTG GAAGGAAAGC 
CACTAGAACA AAAGCAATGA AAAAAGTCAG CGAGTCTAGC AGTACCGTCA TATGGAGACT 
TGCAAACTGT AAAACAAGGA AGGAAAGAAC AGGAGAGCTA ACACCTACAA CCTGCAAAAC 
CAGCTCTAAG CGAGAATTAT AGATCACAAT CTCATCTTTC TCCACCACTT CAGTTATGAT 



4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 
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TGTCCTTATG ATTATAAGAG TAGCTATTGC CTCTTTTCAT CTCAAAAGTC TTCCAAACGG 2280 

GTCCATCATT AGCAGCTCA7 TCATAAACGA CAACTTCCAC TTCTGTCGCT GTAGGTGACC 2 3 40 

AGAGAGAAAA ATGAGCCTGA TTGTCCTCTA CACGGCAACC CAATTCTCCT TGGTAACCCC 2 400 

AATGATGATC AAAACTAGCA CTGTTAATGG CCTTATCAAA GGCAAAAGGA TTTTGATTTT 24 60 

TATAGAAAGG ACTGGCAATA ; GCAGGATTTT CAGAGTAATA AATCCT AT C A TCGCCTTCCA 2 520 

AAATCCAGAC CTCTGTTAAT AGGGGATAGT GATTAAAACG GATAGAATAT TCTTTACTAG 2 580 

TTTGACCTGT ATGAACCACA AAATTCAAGC TTTCTATAAC ATGTGAACTT GGGTGTTCAA 2 640 

AGCTAAATAA AGCTCCAAAA TAATCTTCTT TGTAGGTTAG CAAATCAATT CGTTGATCCT 2 700 

GACTTTTTAC AAAGGAGCAA GTGTCATATT CTCCATTCTT ACGATGGTAA TGAATGCGCA 2 760 

TAGGGTAGTT ATACATTTTT TATTTTTCCT TTTTACTTTG TTTCTATTTC ACTAATAAAT 2 820 

TTTTGTCAAT CTCGTCTCAA TTAACAGACA TAGTCATATT CTCTAAACTC TGT TTTT AAA 28 80 

CGATCCATTA CAAACTTTCT AGCCATGCCT CATCTCTGAC CTGGATACCA AGTTCTTCTG 2940 

CTTTTTGCAG TTTACTTCCA GCGTCTGCAC CTACCACGAC GAGGTCGGTC TTTTTAGAAA 3 000 

TACTACCTGT CACTTTGCCA CCCAGACTTT CG ACT TT ACT TTTAGCTTCT GAGCGCTTGA 3 060 

GTCGTTCCAA TTTTCCTGTC AATACCACCG TCAAACCTGA CAAGGCCGCA TCCGCTACTA 3120 

CCGTCTGTCC TTTATAGTCC AGATTGACCC CAGTTTCTTT CAATTCTCTG AG CAG AATTT 3180 

CAGAGCCTTC TGTCGCAAAA TAAGTCTGAA GACTTTTCCC AATCACGCCA CCTAGACTTT 3 24 0 

CAATACTAGC CACTTCCTCT GAATCTGCCT GAGACAGATT TTCAATTGAA TGGAAATATT 3 3 00 

GAAGTAAAAG CTG ACT AAC C TTGCTTCCGA CATGACGAAT TCCCAAACCA AATAAGAGCT 3 3 60 

TCTCGGCAGA ATTTTCCTTT GATGCTTGGA TAGCCTGATA CAGTTTAGCA GCGGACTTTT 3 4 20 

CCTTAACTCC CTCTAAAAGG AGGAAATCCT CTTCTTGCAA ACGATAAATA TCCGCCACAT 3 4 80 

CCTTGACTAA ATTAGCAGCA AAAAGCTTCT CAACAATACA TGGACCAAGG CCTGTAATAT 3 540 

TCATAGCATC ACGAGAAGCA AAGTGAATCA ACCCTTCCAT GATTTGAGCA GGGCAACGCG 3 600 

GATTGATACA ACGTAGGGCC ACTTCATCTT CAAAGTGCAA CAAGTCAGAG TTACAACTTG 3 6 60 

GACAGTTTGT AGGGATATCT AGTTTTTCTT CAGAAACCCG TTTGGACTCT ACCACACGTA 3 72 0 

AAACGGCAGG GATGATGTCA CCAGCCTTAT ATACAATGAC CGTATCCTCT TTTCGGATAT 37 80 

CTTTTTCAGC AATATAATCT ACATTGTGCA GGGTCGCACG GCTAACAGTC GTACCGGCAA 384 0 

GTTGTACTGG TGTTAGATTA GCAGTTGGAG TTACAACACC GCTACGGCCA ACTGTCCAGT 3900 

CAACTGATAA GAGTTGAGCT TCTTTTTCTT CGGCAGGGAA CTTGTAGGCT ACTCCCCACT 3 960 

TTGGAGCCTT AACTGTAAAA CCAAGTTCTT CTTGACTTGC TAGGTCGTTG ACCTTGATTA 4020 
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GCGAGATAGA TGGTATTTAT AAAACAC T C A AG AC AG CT AG ACTAATATCA TTTAAAACAT 
TAT CTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 
GATAGGGTTC TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 660 
AGATAAGGTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATGATGG TAAATTTCAT 7 20 

CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 
ACT C AAT ACT GTCTTGACGC TCATTAATCA AGTTCC AGTT CACTTGGTTC ACACTGTCAG 
GAGCATTATA GCT ATT CATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTT C AC 
CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 
GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 
GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG CCTTCTACAT 1080 
AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 1140 
TAGCTAGAAT TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 
CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGT CATT AAA GAAACCAATA TTTGGCATCT 
GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 
CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 
TGGTCTTGAC ATCATGAATC CCCATCAAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 
GCACCCAGTA TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 
TTTCATTTCC AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CG AT AAT ACT 
AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAACGTATGG TTATAGACTA 
CATC CAT AAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 
GAATGACCTG AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 
TTTGTGGATC ATAACCCCAG TTGTAGGTTA C ATTTC CATC CTCATCGTAT TCTTTATCAC 
GGTCTGCAAT TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG I860 
CAGTTGACTG GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 1920 
GAAGATGTTC ATCTACACCC GATGTAGCTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 
AGATAACTGC CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 
CGAAGTTTTC AACTTGCTTT TCTACATGGC TC AG AAT AG C TGAACGTTTG CCATCAGGGC 
TGGTCGCGAT TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATG AGGG AAT TGGACTTGAT 2160 
ACTGATAAGT CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 2220 



540 
600 



780 
840 
900 
960 
1020 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 



1980 
2040 
2100 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5 3 40 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAAT GGC AAA AATCTCTCCT AAATTCTGAC 54 60 

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5 520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGT ATG AT CTTCTTTCTA GGTGCTTCTT 5 580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 564 0 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGCA CGCCAGAAAC TTCAAGTTCC T CTGCATT AT TTCCAAGACG 5880 

ATCAAAGGTC TT ATG CGT AT CGTAATTTTC CAAATAGACA G C AAGCTGAA CCTGTGAAGA 5 94 0 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG T G TTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CAT AAAT ATC 6180 

CATCAA 6 * 86 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAAT C AC A ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 12 0 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 2 40 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 3 00 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 3 60 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 4 20 

ATTTCAATGA ATGGATTTTT TACT AAT ACT CATAACTGGG AATTTGT CTG TGTAAAAATA 480 
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TAAATCAAAT CAATGGAGAT ATTGTCAAAA CCACCCACTT TCAGGCGATC GATATTTTCA 3 600 

TAAATATCCT TCTCCAAATC ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3 6 60 

TCGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 3720 

TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 3780 

TTAG7CAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CCgTGTTCCA 3 840 

CCACCGATAT AAAGG GTTG A CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3 900 

TGCTCTAAAT AGCTGTCGAC TGGCTCATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3 960 

TAACAAATCT GGGTACAAAA TGCGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 4 020 

GTAATTATTA TACCACAAAG ACTAGATTCC AGATAAAAAT CACCATCCCC AGATACATAG 4 0 80 

TCCGTCCGGA GATGGTGATG GTTTATTCTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4 2 00 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4 2 60 

TTTCACCTGA CTCAACCTCT TCTT TTGTTT TCAGAACAAA ATCTGTACCC TGCTCCTTAA 43 2 0 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 4 3 80 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTCTT AAAAACAAGG 4 44 0 

CAAGAGCTGC ACCTGAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC ATAAGGATTC 4 500 

TCCTTTTTTA TTTTTTCAAA AATTTACTTG CAAGACGAAG AGCTGACAGA CTTGCACCAG 4 5 60 

TCTTGAGTGT TTTTGAACCA GCTGATGAAG CTTTCTTGCT CAAGACACGC GCATGGTCAT 4 62 0 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTGGA TCAATCGTAG 4 6 80 

CCACCTTGAC ATTGATATCA TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4 740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4 800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4 8 60 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAC 4 920 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAG AC CG AT A AG G AC AACTG 4 9 80 

ACAGCCATTG GGACACTCGA AAGCCGAAGA AC AT GAG A CT ATCTGTTCGC ATACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCGTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAGTCGGT TGACGGTAGC TCCCCTCAAT ATACATCTGG TCACGGATAA 5 2 20 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TTCCACCATA AGCTTCTTGG TTAAAGAAAT 52 80 
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TTTATTCATA GAAATCAACT TGGGTATCCA ACAATTTATC 
CTGAAAAGAA GGGTTTATCC TCTAAAAGCC ACTCAACAAA 
TCGGCTTGCT CAAAACCTCA TCATAGGGAA CCCATTCTAG 
TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT 
TAAATTCAGG AAAAGTGATG ACACCTTTTA GAACTGGCTT 
CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCGTCTCTCC 
CACCAATCCA TTTCCCTTCA TCGACATCAT TGGGTTTCTT 
GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG 
TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA 
GCT TGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT 
TCAAGTGATG AGGATTGGTA ACTAATCCCT TCTTCTTACG 
CAAAAACGAC TGGTCTATCT AGATGATTGA CCACCTGAAT 
CCCAG ATT*1"G ACCAGCATGA CCATTGGCCT GTCCAGCAGG 
GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG TCAAATCATG 
CATCTGACAA TTCTTTCAAG ATATTTTGCA AGCATGGTGG 
CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA 
CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG 
GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG 
CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA 
(2) INFORMATION FOR SEQ ID NO: 14 8: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
<D> TOPOLOGY; linear 



CCCATCATAA 
GGTCTGGTCA 
CGTCCCCTCA 
GTACCAGTCT 
GGCTTTCACC 
TCTCTCTAGC 
ATTACGATGG 
AGGCATATTT 
TGCATTGGCC 
TGGATGGGCT 
TGCATAACCT 
CACAGCATCA 
AACAGTCAAA 
AGATTTCTTA 
ACCTGGGATA 
GGGGTCTTGC 
AAAAACCTTT 
ATTGATTTTC 
GTGTTC CAT A 



ACAAACTTGG 
CCTTCCCAAC 
TTGCAGTCAA 
AAATCTGGTG 
CCTGTTTCTT 
TTACCACCCA 
AGCATGAGCA 
TCTCCTTATC 
TTGGAAAAGG 
GATTCGATAA 
CCCCAGAGTA 
GTAAAAGGCT 
CAAGCATTAA 
ACTCCGATAT 
GAGTCAGGTA 
CCTAGAATTA 
TCCTTGGGTG 
CCGAAATAAC 
GCCGACTCGG 



10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 



<xi) SEQUENCE DESCRIPTION: SE0 ID NO: 148: 

AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA. TACATAGAGA 60 

TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 120 

GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 180 

CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 24 0 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTCACAA C ACTGC C AAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAACG GAATGCTAAT GCTATCGGCT 36 0 

ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AG AG ATTG AT GGTGTCAAGG 420 

CTACTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACCTCCCTTC AACATTGTTT 480 

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 54 0 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGCAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTACGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 7 20 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTG CTTT AG 840 

ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGG C AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTT CTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAC 1080 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG U4 0 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 132 0 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 13 80 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 144 0 

TCGGCCTACA ATTATTGGTG CCTTGGATTA CAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCACT TTGTCACAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGT AT T ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGCTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 17 40 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG I860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 19 20 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 19 80 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2040 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 6 72 0 

TTGCATTTGC TG ACTG AG GG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC $7 80 

ATTGGC AC C A AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 684 0 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 
AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 69 60 

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 702 0 

TTTCCCTTTC TT T ACT AATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 
ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 714 0 

TTCTTCTTAC CTTACTATTT TACCGCTTTT G GC AAAT ACT GTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7 260 

TTTTTCACAA ACTTTTT AG C ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7 3 20 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT CACAATCAAA CTAATAATCA AGCCGATCCA 7 3 80 

GTAAGAATAA GCTCCAAAAT CTCTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7 4 40 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTG TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AG AAAAG ACT ATAAGTTAAA 7 560 

AAA CGC ACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 76 20 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7 680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 77 4 0 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7 860 

GAAAAGATAG CCACTTCCGC GAACACAGTT CCCCCAATAG GC AG AC CT AA ACGAACTCCT 792 0 

TCCTT AATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAG CATG A GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

AT AG AT AAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 8280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC CAATGATAGC C AC C AAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 8460 



8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 



GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT CCATACTGTT GTATTGACCT 
GTCATTGCAG TATCAACAAA AGAGGCAGAA TAATTGGCAA ATTGCTAGAT CAGGATTGGG 
AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT ATACATACTT 
CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 
CGGAGGTTTA TTAGATTTTG AAGTAGTATC CCAACACGCA CATGTACGAC AATAATAGCT 
TCTAACTAAA CCTCCGTTAT CAT ATT GAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 
TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA CT ATT AG ATT TCCCTTGTCT 
CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCT A TCAAGCTTGA TAGACGATTT 
GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 
TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA T C AAAAAT AG 
T G AT ATCTGC TGGACCATTC TCAGCCAAGT AACCTG CTTC AAAGTTGTAA AGCTTGGCTG 
GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 
AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 924 0 

TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA ATAACTGTGA 9300 
TGACACCTGA TTTGAGACCT TCGATAACGG CACGACCGTC TGATTCCAAA CGAAGCGGTG 9360 
GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGAAGTGC TTCTGTCTTA GAGAAATGCT 9420 
GTGGCGCTAC TTCTGCTGTG ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 9480 
CACTTTCTTC CTT AG AC AAA TGCTGGATGT GAAC ATGGG C TTT AGTTG C A TAGGCAATCA 9540 
TGACATCACG CGCCATCATA GCGTACTCAG CCACCCCAGT AGCACCCCAG ATATGGAAAT 9600 
GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTC CTCAT 96 60 

GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 9720 
TGCTCTCAAC CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTT CT AAGAGTGCCT * 9780 
TAAAGTCAGT CAAGTTTTTA CCATTAAACT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 9840 
TAATCTTCT C TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 9900 
TTGGACTGGT ATT AGC CATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 9960 
CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC ACGGAAATGA ACATGAATAT 10020 
CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 10080 
TGATCTCAGA CGCAATTTTG ATAATTTTCC CATCTTGAAC T AAG A CAT CA . CAAACTTGAT 10140 
CCAAACCAGA CTTGGGATCC ATTACACGAC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 10200 
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CTCTTTTCTT CGACTTATAC TTCTGAGAAC AT TTCC CTT A TGCCAGCGAT TATTTCCACC 2100 

GTTATTCTGG TCTTTGGTCC TCTTCTTTTA GCCTTGCCCA TAGGGATTTT TGCTGGTTTT 2160 

TATCTTGTGG AATATACAAA AAAAGATTCC CTTTGTGTTA AAATCATGCG ATTCCCCTCA 2 220 

GATACCTTAT CTGGGATTCC TTCCATTGTT TTTGGTCTGT TTGGCATGCT CTTCTTTGTA 2 280 

GT CTT CTT AG GTTTTCAATA CTCTCTGTTA TCAGGAATCT TAACCTCAGT TATCATGGTG 2 3 40 

TTGCCAGTCA TTATTCGCTC AACAGAAGAA GCCCTTTTAT CTGTTAGTGA TAGCATGCGT 2 400 

CAAGCAAGTT ATGGACTTGG GGCAGGTAAG TTACGGACTG TTTTTAGAAT TGTTCTACCA 2 4 60 

GTTGCCATGC CAGGTATTTT AGCTGGAGTG ATACTAGCTA TTGGCCGTAT CGTTGGTGAA 2 52 0 

ACAGCTGCCC TCATGTATAC ATTAGGTACC TCTACCAATA CGCCAAGTAG TCTCATGTCT 2 5 80 

TCAGGCCGTT CTCTAGCCCT ACATATGTAT ATGCTGTCAA GTGAGGGGCT ACATGTCAAT 264 0 

GAAGCCTATG CTACCGGCGT GATTTTGATT ATTACTGTTT TAATGATAAA TACTCTATCA 2 700 

AGCTTATTAT CTCGAAAACT TGTGAAAGGA GCTTCCTAGT ATGCCAACAT TTTCAGTCAG 2760 

ACACCTAGAC TTATTTTACG GGGATTTTCA AGCCTTAAAA AATATTTCGA TTCAATTACC 2820 

AGAAAGACAG ATT ACTGCC T TGATAGGCCC ATCTGGTTGT GGCAAATCAA CTTTTCTAAA 2 880 

AACCCTTAAC CGGATGAACG ATTTGGTTCC TTCTTGC CAT ATTGAAGGCC AAGTCCTCTT 2940 

AGATGAGCAA GATATTTATA GTAGCAAATT CAACCTTAAT CAGCTACGTA AGCGTGTAGG 3000 

GATGGTTTTT CAACAGCCTA ATCCCTTTGC CATGTCTATC TATGATAACG TGGCTT ATG G 3060 

CCCAAGGACA CATGGTATTC GAGACAAAAA ACAATTAGAT GCCTTAG TGG AGAAATCTTT 3120 

AAAAGGGGCA GC CATTTGGG AAGAAGTCAA AGATGATCTT AAAAAGAGTG CCATGTCCTT 3180 

ATCTGGCGGT CAGCAGCAAC GCCTTTGCAT TGCGCGAGCT TTAGCAGTAG AACCTGATAT 32 4 0 

TCTGTTAATG GATGAGCCGA CTTCAGCCTT AGACCCTATC TCCACTTTAA AAATTGAAGA 3 3 00 

CCTCATTCAG CAACTAAAAA AGGATTATAC GATTATCATT GTTACCCATA ACATGCAACA 3 3 60 

AGCTTCACGT ATTTCAGATA AAACTGCTTT TTTCTTAACA GGAGAAATTT GCGAATTTGG 3420 

AGATACCGTT GACGTGTTTA CCAATCCAAA AGATCAGCGC ACAGAAGACT ATATTTCAGG 3 4 80 

ACGGTT CGG A TAAGGAAGGA AAAACCTATG AGAAATCAAT TTGACTTAGA ATTGCATGAA 3 540 

TTAGAACAAT CCTTTTTAGG ACTAGGGCAA CTTGTCCTTG AAACAGCTTC AAAAGCCTTA 3600 

CTGGCCTTAG CCTCCAAAGA CAAGGAGATG GCAGAGCTAA TTATCAATAA GGATCATGCT 3660 

ATCAACCAAG GTCAAAGCGC TATCGAATTG ACCTGTGCCC GTTTGTTGGC CTTGCAGCAG 3720 

CCACAAGTGT CTGACCTTCG ATTTGTGATT AGCATCATGT CTTCTTGTTC AG A C CTT G AA 3780 
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CG T ATGGG AG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3 84 0 

CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900 

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CT ATT ACT AT TGCTCAAAAA 3 960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4080 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA AC AGG AG AA C 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTAGATAG TGACGAGTTG 42 60 

TTTCAATTCT ATCGTGAACG AGGGAATGCT CAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4 320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4 380 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 44 40 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC T TCCTTC AT A TTGCCGGAAA ATATGTCTCT 4 500 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG C CT ATT C AAA ACAGTTTCAA 4 560 

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAG AG GT AG A 4 620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4 680 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740 

TCGAACATCA GAAATTAAGC AAAATC AC C A GAAGGACAGT ATTTCAACTA G C TTT T CTGG 4800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4 860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 4920 

CTTTTGGGCG ATAGTTT CAT CTTCGTATGT AGG AGTCCTC ATCAAGAAAT ACTTCAATTC 49 80 

TACGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTCTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATTT V 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC :,220 

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 52 80 

CACTTTAACT AAAG ACT AAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAG AAT ATG A TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 54 60 

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT S520 

TCCCGAGCAA ATCAATGAAA TT AAT A C AC A CCATACTAAT AAGCACTACT TT AAAG ATG T 5580 
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CGTTCTAGAC GAAAATATCA TTCCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 564 0 

TGCGATTTTG TTTGTTGTCC C AACAAAAG T GACACGACTT GTTGCCCAGC AAGTTGCACA 5700 
AACCTTGGAC CATAAGGTTA TCATCATGCA CG CATC AAAG G G ATT AG AAC CTGATAGCCA 5760 
TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 
CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CCTGACCTAA CTTTAATAAC 5880 
TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5940 
CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 
TATTATTGCT GTCGG TGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 
AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 
TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 
CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 6240 
AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTATGAACT 6 300 

AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 63 60 

CGGAACCAAT AT C AAAG AT G CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 64 20 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTGGACTAG GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCT C AAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

G CTAGTTG AT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 68 40 

GC TTGGTG AT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 7080 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 7260 

ATCCATCGAC TACGCCCTCA AACACCCACA AGT CAAAG AT GATTTGAAGA ATTACCTCAT 73 20 
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CCAACTTGGA AAAGAATTCA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7 380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 744 0 

CGCTATACTT GTATTTGTTT TTTCATTAAA AT AAGAGT AG AATAAATTAG TATAGTAAAA 7500 
CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACCGACAAA GCCTTATTTT 7560 
AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAOTTTTAAA TTTGACTTAA 7620 
CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7 680 

GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GCTCTAATTT TAATTTTAAA 7740 
GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7 800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7 860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 
AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7 980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TAT CTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 82 80 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAACCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCT AATG A 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AG ATT AT AGT TCATCTGAAC AAACTAATGG 84 60 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 864 0 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 882 0 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTAC CGT AG GTATGGTTAC TGACT TCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 894 0 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
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GAAAGATAAC TGCTAGAAAT GCGCCACCAA CTGCACCACC GATATGGCCT GCTAGGCTGA 9180 

TTCCTGGAAT CAGAACACTT CCAATAATGT TAACCACAAA AAGTGTCAGA TAGGATTGCC 9240 

CTAGCTGTTG GATATAAGGA TTGCGAGTTG CATAGCGAAG AACAATAATC GCGGCAAATA 9 300 

GCCCATAAAG AGACGTAGAC GCGCCTGCTG CTAAGGATTT AGGACTAAAT ACAAAAACAA 93 60 

AGAGATTGCC CATCATTCCT GATAAAAGAT AGAGAAAGAA AAACTGCTTA GAACCCAAAA 9420 

TCTCCTCTAC CTGCCTTCCA AGATAATAAA GTGAAACCAT ATTAACAATG AAATGTTCCC 94 80 

ACCCAATATG AACAAAAATG GCAGACAAGA GACGCCAAAC CTGCTCGGGA AAGAGGCGAA 9S40 

TACCTGGCCC ATACATGCCT CCAAATCGAA ATAATCTATC TGCCCTGTCA AAGTTTCCGC 9600 

CTGCAGTGAC CAACATTAGT AAAAATACCA ACGCCGTCAC TAAGAGGAAG AAACTCGTCA 9660 

CAGGGTAACG TCTATCAAAG ATTTCCTTCA TCAATTAATA CCTCCTGAAC AGGAATATCA 97 20 

TCCTTTTCAG GTATAAAGTC CTGAATTTGA CAAGGATATA TCGTACTCAA AGTACGACCA 9780 

GAAAAATGTT CCAGATAGCG GTCATAATAG CCTCCACCGT ATCCTATCCG ATATCCTTTC 9840 

GTCGTAAAAG CCAGACCAGG AACATGAATC AAATCAATCT GAGATGCATC CACCACTTCC 9900 

AAATCTCCCT GTAGCTCCAG TAAGGCAAAG AAAGTTTTTA CCAACTGTTG CGGATCATAG 9960 

ACCACAAAGT CCATGCGCCC CTTGGGATAA GTTTTGGGTA TTAAAACCTT CTTGCCGTCC 10020 

TTCAGCGCCT GCTCAATCAG TTCCTGCGTT TGAAACTCAT GAGAAAAAGA GAGGTAGGTT 10080 

GCGATGACCT TGGCTTCTTG ATAAAAGGGG TGTTGTAAAA GCCGCTCGGT TAAACCTTGG 10140 

TCTATAGCCT GTTTTTCCTC TTGAGATATA GCCTTCATTT CATGCAAGAC TTGCTTGCGT 10200 

AATTCCGATT TCATAGACAA GCCCTCTATT CTGCTGCCTT CTTTTTCAGG AAACTAGACA 102 60 

CCGCAGCCAC CCCAATAGCT AAGACTTCTT CCTTAGGACT CATTTGAGGG TGATGAAGAG 10320 

CGTAGGGACT ATCGATACCT AGCCAAAACA TCACGCCATC AACCTTTGAA AGGAGATAAC 10380 

CAAAGTCCTC GCCTGTCATA GCAGGTTCGA TATCAATCAA CTCCATTCCG TCTTTTTCGT 10440 

CAAAGAAGTC CATCAGTTCA CGCGCCAAGG CTGGATTGTT CTCAACAGGT AGGTATCCAC 10500 

CTTGTTTGAG TTCCACTTCG ACTTCCATAT CAAAGGCAGC TGCAACCCCT TCTGCAACTG 10560 

TTTTTACCCT CTTTTGCACC AAGAGACTCA TGTCCTGTGT CAAGGCACGA ATAGTTCCAT 10620 

GTAAAAAAGC TGTGTCTGTG ATGACATTGT TGGTGGTTCC AGCTTGAAAA ACGCCGAAGG 10680 

TCACCACTGC TCCCTCGATT GGGTTGACAT TGCGGCTAAC AACTGACTGC ACTTGGGTCA 10740 

CAAAGTAACT AGCCGCCACC AAGGCGTCAT TGGCTTCATG AGCAAAAGCT GCGTGGCCAC 10800 

CTTTGCCTTT GAAACGGATC TTCACCTCGC AAGTTCCTGC AAAGAGTGTA TGAGTATTAG 10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG 
ACCAATCTCC AAAAGCACCG TCCT CAT AC A TGAGCATACC 
CAGGCTGAAA TAGAAAGAGC AGATTATTCT TCGGTTGCTC 
AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGGC 
GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG 
AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC 
TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC 
CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG 
GAATCAAATC TAACATCTAT CTGTCCTCCG AT AT AG CAG A 
CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG 
TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG 
TTTTCTGGGA CATCTTGGGT AACAATAGCT CCTGCTGCGA 
TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA 
GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG 
TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA 
CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA 
TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG 
AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA 
AAACCTTCAA G AAGCGG AG C CACATCCTTC CAGTCTCCGA 
ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA 
TTCTTTTCAG CATTGGCGAT AAATTGGAT A ATTTCTTGAG 
ATAGGTG 

(2) INFORMATION FOR SEQ ID NO: 149: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12S66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



ACCATAGAAT 
ACC AG CTTC A 
CTCAAGGGCG 
ATGCATGCGA 
GCCATCAATA 
CAAAATCCCT 
AATCACATCC 
GTGTAAATCT 
AAGAGGCTGG 
CATCCTCTAG 
GAACACCTGC 
CAACTGAACC 
CATTGTCTCC 
CACCTGCACC 
TGTCAATCAT 
TAACAGCATT 
CGTTGATAGC 
CATAATCTTG 
ATAGGACATT 
AGGTTACTTT 
CGTTCATTTT 



TGATCTGGCA 
TTTTCTTCAG 
CGCTCAAGAC 
CCTTGGTGTT 
TCTGTCCGCC 
GTCCGCCAAG 
AGCAAATAAG 
CCTCTAGTCT 
AAAAAGGGTT 
CGCTGTTTTT 
TACTACCACG 
ACTACCGATT 
GACACGGACT 
AACGTGGCTA 
GCTTCCAGCA 
GTCACCAATT 
ACGCTTATCT 
ATTTTCTACC 
TCCTAGTTTG 
GACACTGGTT 
TGTAGCAGTC 



10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12127 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA CTCGCGAAGA 
GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 



60 
120 
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CGTTGAGCCA AATGAAGCTG GAGAAACACG C TTT ACCT AT GCCACTTATG GTGAGGGAAA 180 

GCTTCCAGAA GGTCTGACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG ATTTATTAGC 240 

GTCTTACTTG A TTGTAT C AG GAAGTTTGGA TGGAGTGAGC TTACAGACCA CCTTGAAAGA 3 00 

GCTTGGTTAT CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCGATAG TCTTACTATT 360 

GACGGCCACC OCT ATGGTG C TACTGAGTTT AGCTATTTTT CTGCTGACCT TTATGAGTCT 4 20 

GACCCTGATT TATCGGATCA AATCCCTTCG TCAGGCAGGG ATTCGCTTAA TAGCTGGTGA 480 

GAGCTTGTTT GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTG AG AC AG C TTATCTCCTC 540 

AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG CCTTGTTTAT 600 

GGCAACGGTG CAACTGGTCA TCATTGCTCT TCTACTTT AT GGATTGACCT TGGCAGGGAT 660 

TTCTACCTTA CTAAGTGTCG TCT ATCT ACT TGGTTT AC AG GAAAATAGTC TGCTGGATCT 720 

ATTGAAAGGG AAACTCCCTC TCAAACGTAT GATGACATTG ATGATCGTGG GGCAACTCTT 780 

AGCTGTATTG GTGGTCGGAT CGAGTGCGAC AGCTCT CCT A CCCCACTACC GTGAAATGCA 840 

GGAAATGGAG AGAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC GTCTATCCTT 900 

TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC GTGAGTGGCA 9 60 

GACATTTACT GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA GCAATGTTGA 1020 

CAATTTCTCA GATGGAGCAG AAGTGGACCT AGATGGCAAT CCTCTCACTC ACT AC AC AC C 1080 

GTCAGGGAAT GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA TT AC CGTTTC U4 0 

TTCAGAGTTT ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC TGATCTTGCC 1200 

TG AG AG CTTG CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTT AC AG ATT ACCTCCAAAA 1260 

CTTTTCATCT GAAAGTGTAG AAG TGACG AG TCAGAAACAC TACCTCCCAC AGGTAAGGCT 1320 

AGCTT TTAC A GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA ACACAACACG 1380 

CCAGTACCTA AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGACC 1440. 

TCTTGCAGGG ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG ATCGATATGG 1500 

AGACAGCATC ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT ACTTGGTAAA 1560 

AAGCCAGCTA TTTTTTGCCA AG GT ACT AAA TGACAAACGG GTGGAGTTTT ACTCTCTCCT 1620 

TATTGGGACG ATTTTGAC CC TGTCT AC GGC T ATCTTG TT A TTTGATTCCA TGAATCTTCT 1680 

CTATTTTGAG CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA TGACAATCTA 1740 

TGAGCTTCAT GG CAAGT ATT T A CTGGCGC A AGGAGGAGTT CTCTTGCTTG GCCTAGTCCT 1800 

ATCT AGT ATT TTGACAAGAG ATGGTTTGAT TAGCGCTCTA GTTGT AGCTT TGTTTACGCT I860 
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TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTGGT A GCATGGCAGT 1920 

ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 

CGATTTTCTC TGGTTTCAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 

AG AG TGGAAG CGGAAAGACG ACGCTGCTGA ATATCTTCCG AAAGCT AG AA AAGATAGATG 2100 

GTGGAAGG GT TCTCTATCAG GGGAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 

GAGACCAGAT GGGCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 22 20 

AAAATTTGGA TTTGGGTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGCC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2 340 

CTTTATCTGG GGGAGAGGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2 400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 24 60 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATC ATTGCG ACCCATAATC 2 520 

CCCTAGTCTG GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2 640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGG CT AT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760 

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820 

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAACGCA TGTTTCGAGA GAACTGCTAA 2 880 

GGC CACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940 

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAATAG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACGG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTGCCTTG ACAGAATTTT TGGCTGAGCG TGCAGCCCAA GTCATGGCTT TTGAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3 360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TT ACT AC AT C ACGACGCCTA TTCTCATGCA 34 2 0 

CTTGATTGAG AGTGGCATTC CTTTTTGTGA GTTTGTGGTC ATGATGCAGA AAGAAGTAGC 34 80 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3 540 
GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC • 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCGTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660 
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3720 
3780 
3840 



4020 
4080 
4140 
4200 
4260 



4440 

4500 



1001 

AGATCACAAC TTTTTCTTTA AGCTTTCCAA GGCTAGTTTT ACCCATCGCC GCAAGACCTT 
GTGGAATAAC TTGACAGGTT ACTTTGCTAA GACTGAAGAG GTCAAGGACA AGCTGACCAA 
GGCTTTGGAC CAGGCAGGCT TGTCACCAAG TCTCCGTGGG GAACCTCTCA GCTTGGCACA 
ATTTGCCGGT CTAGCAGACG CACTTAAAGG GCAAGGACTC TAAGATGCAG GGACAAATCA 3900 
TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGCTTTAT CAAACACGCG 3960 
CGCGTGGGAA TTTCCCTAAA AAAGGCCATA CCCCTTATGT TGGGGACTGC GTAGATTTCT 
CTGCCGAGGA AAATTCAGAA GGCTATATCC TCAAAATTCA CGAACGGAAA AACAGTCTGG 
TTCGTCCGCC TATTGTCAAT ATCGATCAAG CTGTAGTAAT CATGTCCGTC AAGGAACCTG 
ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG GGCATCCATC 
CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG GATTTTTACC 
AGCAGACCTA TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA CTCCTGTCTT 4 3 20 

TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG TCAACTCTTC 4380 
TCAATAAAAT CGCACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC AGTCTAGGTC 
GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGGGT AAAATCGCAG 
ATACACCAGG ATTTTCATCC TTGGACTATG AAGTATCAAG GCCTGAAGAC CTCAATCAGG 4 560 

CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTCT ACCCATACCC 4 620 

ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTCAAGAGCG TGTTATTGCA ACCTTCCGTT 4 680 

TTGACAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC TATAAAAAAG 4740 
TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATGTCTC AATACAAGAT TGCTCCGTCA 4800 
ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA AGCAACTGCG 4860 
GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT CAGTTTTGGT 4920 
GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG CCACTTGATG 4980 
GTGTCAAACC CTGAGCATCA TCTGGAAGAT TTTGCGCGTG CAGGTGCAGA CATCATCAGT 5040 
ATCCATGTAG AAGCAACGCC TCATATTCAT CGCGCCCTCC AAAAAATTCG TTCACTCGGA 5100 
GTTAAGCCTT CAGTCGTTAT CAATCCTGGC ACATCAGTTG AAGCCATCAA GCACGTCCTT 5160 
CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG GTTTTGGTGG GCAAGCCTTT 
CTGCCAGAAA CCATGGATAA GGTCCGTGAC TTGGTTGCTC TTCGTGAGGA AAAAGGTTTG 
AACTTTGAAA TCGAAGTGGA TGGTGGGATT GATGACCAAA CTATTGCTCA AGCCAAAGAA 5340 
GCCGGTGCGA CTGTTTTTGT AGCAGGTTCC TATGTCTTTA AGGCAGAACT CAATGAGCGA 5400 



5220 
5280 
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GTACAAACTC 


TCAGAAAACA 


ACTGGACTAG 


GGTTGCAGTT 


TTTGCAGGCG 


GAAACCGCGG 


5460 


TCATTATCGG 


ACAGATTTTG 


ATGCTTTTGT 


TGGGGTGGAT 


CGAGGCTCGC 


TCTGGGTCTT 


5520 


GGAAGAAGAC 


TTACCTCTTG 


CTCTAGCAGT 


CGGAGATTTT 


GATTCTGTGA 


CGGAAGAAGA 


5580 


GCGACACGTG 


ATTCAAAAAG 


GTGCCCAGTA 


TTTTGTCCAA 


GCACGACCAG 


AAAAGGATGA 


5640 


TACAGATCTG 


GAATTGGCTC 


TCTTAACCAT 


CTTTGAACAA 


AATCCTCAGG 


CTCAGGTCAC 


5700 


TATTTTCGGT 


GCCTTGGGTG 


GCCGTATTGA 


CC AT AT GTTG 


GCCAATGTCT 


TTCTGCCTAG 


5760 


CAATCCTAAG 


TTGGCACCCT 


ATATGCATCA 


AAT AG AAATT 


GAGGATGGGC 


AAAACTTGAT 


5820 


TACTTATTGT 


CCAGAAGGAA 


TCAGTCAGCT 


AGAACCTCGT 


TCAGACTACG 


ACTATCTAGC 


5880 


CTTTATGCCA 


GTTCGGGATA 


GCCAGCTGAC 


TATTCTTGGA 


GCCAAGTATG 


AGTTGACAGA 


5940 


GGAAAATTTT 


TTCTTTAAAA 


AAGTGTACGC 


TTCTAACGAA 


TATATAGATA 


GGGAAGTGTC 


6000 


GGTAACTTGC 


CCAGATGGTT 


ATGTGGTCGT 


ACTGCATAGC 


AAGGACAGGA 


GGTAGGATGG 


6060 


AAAGTTTACT 


T ATT CT ATT A 


TTAATTGCCA 


ATCTAGCTGG 


TCTCTTT CT G 


ATTTGGCAAA 


6120 


GGCAGGATAG 


GCAGGAGAAA 


CACTTAAGTA 


AGAG CTTGG A 


GGATCAGGCA 


GATCATTTGT 


6180 


CAGACCAGTT 


GGATTACCGC 


TTTGACCAAG 


CCAGACAAGC 


CAGCCAGTTA 


GACCAAAAAG 


6240 


ATTTGGAAGT 


GGTTGTCAGC 


G AC CGTTTGC 


AAGAAGTGCG 


GATTGAATTG 


CACCAAGGTC 


6300 


TGACCCAAGT 


CCGTCAAGAA 


ATGACAGATA 


ATCTCCTCCA 


AACTAGAGAC 


AAGACAGACC 


6360 


AACGTCTCCA 


AGCCTTGCAG 


GAATCAAATG 


AGCAACGTTT 


GGAACAAATG 


CGCCAGACGG 


6420 


TCGAGGAAAA 


A CT AGAAAAG 


ACCTTGCAGA 


CACGCTTACA 


GGCTTCCTTT 


GAGACAGTTT 


6480 


CTAAACAACT 


GGAGTCTGTC 


AATCGTGGCC 


TTGGAGAAAT 


GCAGACAGTT 


GCCCGTGATG 


6540 


TCGGAGCTCT 


TAACAAGGTT 


CTCTCTGGAA 


CCAAGACGCG 


AGGGATTCTG 


GGAGAATTGC 


6600 


AACTGGGGCA 


AATTATTGAA 


GACATCATGA 


CACCTGCCCA 


GTACGAACGA 


GAATACGCAA 


6660 


CGGTTGAAAA 


CTCTAGTGAA 


CGAGTGGAGT 


ATGCCATCAA 


GTTACCCGGA 


CAAGGCGACC 


6720 


AAGAATACGT 


CT ATCTGC C A 


ATTGACTCTA 


AGTTTCCACT 


GGCAGATTAT 


TACCGCTTU3 


6780 


AAGAAGCCTA 


TGAGACAGGT 


GACAAGGATG 


AGATTGAACG 


CTGTCGTAAG 


TCACTCCTAG 


6840 


CAAGCGTCAA 


GCGCTTTGCT 


AGGGATATTA 


GGAACAAGTA 


CATAGCACCA 


CCTCGGACGA 


6900 


CCAATTTTGG 


AGTTTTGTTT 


GTTCCGACAG 


AAGGTCTCTA 


CTCAGAAATC 


GTCCGCAATC 


6960 


CGGTCTTCTT 


TGATGATTTG 


AGACGGGAAG 


AACAGATTAT 


TGTTGCAGGA 


CCAAGTACCC 


7020 


TATCAGCCCT 


TCTTAACTCC 


CTATCAGTTG 


GTTTCAAGAC 


CCTTAATATC 


CAAAAGAGTG 


7080 


CCGACCATAT 


CAGCAAG A CT 


CTTGCCAGTG 


TCAAGACCGA 


GTTTGGCAAG 


TTTGGTGGTA 


7140 


TTCTGGTCAA 


GGCACAAAAA 


CATCTCCAAC 


ATGCCTCTGG 


CAATATTGAT 


GAATTATTAA 


7200 
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ACCGTCGTAC CATAGCTATC GAGCGCACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 
CTGCGCTTGA TCTACTCCAT TTTCAAGAAA ATGACGAAGA ATATGAAGAT TAGTCACATG 
AAAAAAGATG AGTTATTTGA AG GCTTTT AC CTAATCAAAT CAGCTGACCT GAGGCAAACT 
CGAGCTGGGA AAAACTACCT AGCCTTTACC TTCCAAGATG ATAGTGGCGA GATTGATGCG 
AAGCTCTGGG ATGCCCAACC TCATAACATT GACGCCTTTA CCGCAGGTAA GGTTGTCCAC 
ATGAAAGGAC GCCGAGAAGT TTATAACAAT ACCCCTCAAG TCAATCAAAT TACTCTCCGC 
CTGCCTCAAG CTGGTGAACC CAATGACCCA GCTGATTTCA AGGTCAAGTC ACCAGTTGAT 
GTCAAGGAAA TTCGTGACTA CATGTCGCAA ATGATTTTCA AAATTGAAAA TCCTGTCTGG 
CAACGGATTG TCCGAAATCT CTACACCAAG TATGATAAGG AATTCTACTC CTATCCAGCT 
GCCAAGACCA ACCACCATGC CTTTGAAACG GGCTTGGCCT ATCATACGCC GACCATGGTG 
CGTTTGGCAG ACGCTATTAG CGAAGTTTAT CCTCAGCTCA ATAAGAGCCT GCTCTATGCG 
GGGATTATGT TGCATGACTT AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA CCAGACACAG 
TACACAGTGC CAGGTAATCT TCTTGGACAT ATCGCTCTCA TTGATAGCGA AATTACCAAG 
ACAGTTATGG AACTCGCCAT CGATGATACC AAGGAAGAAG TCGTTTTGCT TCGTCATGTC 
ATCCTCAGTC ACCACGGCTT GCTTGAGTAT GGAAGCCCAG TCCGTCCACG CATTATGGAA 
GCAGAGATTA TCCATATGAT TGACAATCTG G ATG CAAGC A TGATGATCAT GTCAACAGCT 
CTTGCTTTGG TGGATAAAGG AGAGATGACC AATAAAATCT TCGCTATCCA T AATC GTTCC 
TTCTATAAAC C AG ATTT AG A TTAATAATTT AAGAAAAATG AGCATTTTTT AGGATAAGAA 
TGTTCGTTTT TTTATCTCAA TATGGTATAA TAAGTAAAAG ACAAAAATGA ATACTCTTCG 
AAAATCTCTT CAAACTAGGG TAGTATCGCC TTGTCGTATG TATATATGCA GGTATATTAC 
AGGGTTTGTC AGTTCTATTG ACAATCTCAA AACAGTGTTT TGAACCACCA GCGACCAGCT 
TTCTAGTTTG CTTTTTGATT TTTTGAATAA AAATGGAATA GGAAATAGAA ATGAAATTAA 
GAAGAAGTGA TCGGATGGTT GTCATTTCCA ACTATTTGAT TAATAATCCT TATAAACTAA 
CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 
TCGTCATTAT CAAACGCGCC TTTGAGGAAA TTGAAATCGG TC AT ATC C AG ACAGTGACTC 
GGGCTGGCGG AGGTGTCATC TTCACACCGT CTATTTCGAG TCAGGATGCT AAGGAAATGG 
TTGAAGACTT GCGTACCAAG TTGTCAGAAA GTGACCGTAT CTTGCCAGGT GGTTATATCT 
ATCTGTCTGA TTTGCTTAGC ACACCAGCCA TCTTGAAAAA TATTGGTCGT ATTATTGCCA 
AAAGCTTTAT GGACCAAAAA ATTGACGCGG TTATGACCGT AGCAACTAAG GGTGTGCCAC 



7260 
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TTGCAAATGC 


AGT TGCCAAT 


GTCCTCAATG 


TCTCTTTTGT 


CATTGTGCGC 


CGTGAC CTG A 


9000 


AAATTACCGA 


AGGTTCAACT 


GTTAGCGTCA 


ACTATGTTTC 


AGGTTCAAGT 


GGTGAC CGT A 


9060 


TCGAGAAAAT 


GTTCCTTTCA 


AAACGTAGTC 


TTAAGGCAGG 


CAGCCGTGTC 


TTGATTGTGG 


9120 


ATGACTTCTT 


GAAAGGTGGC 


GGAACGGTCA 


ATCGTATGAT 


TAGTCTCTTG 


CGCGAGTTCG 


9180 


ACTCAGAACT 


GGCAGGTCTA 


GCGGTCTTTG 


CGGACAATGC 


CCAAGAAGAA 


CGTGAAAAGC 


9240 


AGTTTGACTA 


C AAGTC ACT C 


TTGAAGGTAA 


CCAATATTGA 


TGTCAAGAAC 


CAAGCCATCG 


9300 


ATGTTGAGGT 


TGGCAATATC 


TTTGACGAAG 


ATAAATAAGA 


GATAGAACTA 


AAGGTTGGAA 


9360 


CGATTGTCCC 


AGCCTTTCTT 


TGCAAACAGA 


ATAGAAGGAA 


GCTTATGAAA 


ACACCATTTA 


9420 


TCAATAGAGA 


AGAGTTAGAA 


GCGATTGTTG 


CCGAGTTCCC 


GACTCCCTTT 


CACTTGTATG 


9480 


ATGAGAAGGG 


GATTCGTGAG 


AAGGCAAGAG 


CCGTCAACCA 


AGCTTTTTCG 


TGGAACAAGG 


9540 


GCTTTAAGGA 


ATATTTTGCA 


GTTAAGGCTA 


CTCCAACTCC 


AGCTATTTTG 


AAAATTCTCC 


9600 


AAGAAGAAGG 


TTGTGGTGTG 


GACTGCTCTA 


GTTATGTAGA 


GCTTTTGATG 


AGCCATAAAC 


9660 


TGGACTTTCT 


GGGTTCTGAG 


ATTATGTTCT 


CTTCCAACAA 


CACGCCAGAC 


AAGGAAT AC G 


9720 


CCTATGCACG 


TGAATTGGGT 


GCGACCATTA 


ACTTGGATGC 


CTTTGAAGAT 


ATTGAACATC 


9780 


TGGAGAGAGT 


AGCAGGCATT 


CCAGAAATCA 


TCTCTTGTCG 


TTATAATCCT 


GGAGGCGTTT 


9840 


TTGAACTGGG 


GACAGACATT 


ATGGACAATC 


CTGGGGAGGC 


TAAGTTTGGC 


ATGACCAAGG 


9900 


AC C AGCTCTT 


TGAAGCCTTT 


GCTATCTTGA 


AGGAAAAAGG 


AGCCAAGACT 


TTTGGGATTC 


9960 


ACTCCTTCCT 


AGCGTCCAAT 


ACCGTGACCC 


ATCTCTATTA 


TCCAGAGTTG 


GCTCGTCAGC 


10020 


TCTTTGAACT 


GGCTGTTGAA 


ATCAAGGAAA 


AGTTGGGCAT 


TTCG CT AG AC 


TTTATCAATC 


10080 


TTTCTGGCGG 


TATTGGTGTT 


AATT AT CATC 


CAG AC CAGGA 


GCCGAACGAT 


ATCGCCTTGA 


10140 


TTGGTGAGGG 


AGTTCGTAAG 


GTGTATGAAG 


AGGTTCTTAC 


GTCAGCAGGT 


CTTGGTCAGG 


10200 


TCAAGATTTT 


CACCGAATTG 


GGTCGTTTTA 


TGCTGGCACC 


TCACGGTGCT 


CTAGTCACAA 


10260 


GAGTCACTCA 


TAAGAAGGAA 


ACCTACCGTA 


CCTATCTAGG 


TGTGGATGCC 


TCAGCAGTCA 


10320 


ACCTCATGCG 


TCCAGCTATG 


TACGGAGCTT 


ACCATCATAT 


TAGCAACGTG 


ACCCATCCAG 


10380 


ATGGACCAGC 


TGAAGTGGTA 


GATGTGGTCG 


GTTCACTCTG 


TGAAAACAAT 


GATAAATTTG 


10440 


CAGTTAATCG 


CGAACTGCCT 


CATACAGAAA 


TCCGTGATTT 


GCTGGTCATT 


CATGATACAG 


10500 


GTGCCCACGG 


ATTTTCAATG 


GGCTACCAGT 


ATAATGCCAA 


ATTACGTTCT 


GCGGAAATCC 


10560 


TCTATACCGA 


AGAAGGTAAA 


GCCCGTCAAA 


TCCGCCGTGC 


AGAGCGCCCT 


GAGGACTATT 


10620 


TTGCAACCTT 


ATATGGCTTC 


GATTTTGAAG 


AATAATCTGA 


TAATAGATTG 


AAAATGAAAT 


10680 


TGAAAAACAG 


ATTGCTTTCT 


AAAAAATAGG 


CAAAAATCTT 


GTTTTTCCTT 


CAAGTCGTGA 


10740 
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TATAATAAAA CTATAAAACG TTTTCAAGGA AGGTAACGAT ATGTCTGAAG AAACAATTGA 10800 
TTATGGACAA GTGACAGGAA TGGTGCATTC GACAGAAAGC TTTGGGTCAC TAGATGGGCC 10860 
TGGTATTCGC TTTATTGTCT TTTTGCAGGG CTGTCACATG CGTTGCCAGT ATTGCCACAA 10920 
CCCAGACACT TGGGCTATGG AGTCCAATAA GTCACGTGAA CGGACGGTAG ATGATGTCTT 109 80 

GACAGAGGCC TTGCGCTACC GTGGTTTCTG GGGAAATAAG GGTGGGATTA CAGTCAGTGG 11040 
AGGAGAAGCT CTCTTGCAGA TTGATTTCCT GATTGCTCTC TTCACCAAGG CTAAGGAACA 11100 
AGGAATCCAC TGTACCTTGG ACACCTGTGC TCTTCCTTTC CGTAATAAAC CACGTTACCT 11160 
TGAGAAGTTT GACAAACTCA TGGCTGTCAC TGACTTGGTT CTTTTGGATA TCAAGGAAAT 11220 
CAACGAAGAA CAGCACAAGA TTGTCACTAG CCAAACCAAT AAAAATATCT TGGCTTCTCC 11280 
CCAGTATCTA TCAGATATTG GAAAACCTGT CTGGATTCCC CACGTGCTAG TTCCAGGATT 11340 

GACAGACAGA GATGATGACT TGATTGAACT TGGTAAGTTC GTCAAGACCC TCAAAAATGT U400 

TGATAAGTTT GAAATTCTAC CTTATCACAC CATGGGTGAG TTCAAGTGGC GTGAACTTGG 11460 

AATTCCATAT TCCCTCGAAG GACTCAAACC ACCAACAGCA GATCGCGTCA AGAACGCTAA 11520 

ACAACTCATG GATACCGAAA GTTATCAAGA TTATATGAAA CGTGTACATG GATAGAAAAG 11580 

AAGCCTGATG GAAACATCGG GCTTTTGACT TGCAAAAAGA CTTAGCAAAT CAGCTAAGCC 11640 

TTTTTCTTCT TATCTCGAAC GTTGTTTTCC AGCGTTGCGA TTTTTGTGTT TTTTCTTGCT 11700 

TGTGATAGCA GTTGGTTGTT CAGGGGTAAC GTCTTTTCGT CCACTTCCTT TAGAGAAAGC 117 60 

ACTTGCTTTT GGTGGGTTCT TGCCTAGTTC TTCACGGACT TTTTTGCGAA GTTTTGGACG 11820 

AACGATATAG TTGACGATAA ACTGTTGGAG AATCATCATG AAACCACCGA CAACCCAGTA 11880 

AAGTGTGACA CTAGCTGGTG ACAAGAGGGA GAAGACGACG ATCATGAGTG GGCTCATGTA 11940 

AATCATTTTC TTGATTTGTT CTCTTTGCAT TT CAT CTTCT ACTCCGTGAA GTGAAAGGAG 120O0 

CGATTGAAGA TAGTAAAGGA CACCAGCACA GGCAACCAAA ATCATACTTG GAGAACCTAG 12 060 

AGGAATGCCT AGGTAGCTTG CTTGAGCAAC CCCTTCAGTA TGTTGGGCAG CAAAGTAGAT 12120 

AGCAGAGAAG AAAGGCATTT GAAGGAGGAT AGGGAAACAT CCTACACCGC CAAACATGCT 12180 

GATACCGTGC TCTTTTTGAG CAGCAAAGAG AGCTTGTTGG GCTTCGAGTT TTTCTTCTTG 12240 

AGTAGTCGCT TCTTTGAGAC GCGTTTGGTG TGGCTCAAGG ACGTGCTTGA GGGCGTTCAT 12 3 00 

CTTTTCAGAG TGAAGCGTTG CCTTCCATGA TTGGTAGATA CGAAG TGGT A AGATAATCAA 12360 

GCGTACGATA ATGGTTACGA TAATGATAGC GACACCAAAG CCTAGACCTT TATCAGTAGC 124 20 

GAAGTACTTG ATGGCTTCAG CCATAGGCGC TCCGATCGTA TTCCAAATAA ATCCTGTTGG 12480 
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CTGACCTGTG GTTTTATCGA CATTGACACA GCCAGTCAAG ACAAGCAACA TACCCACTCC 

CATAGCCGAG AGTGCAAAAT CGGGGT 

(2) INFORMATION FOR SEQ ID NO: 150: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



fxi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


150: 






TGACACTCTG 


: TAGGATTGTC GTTAATTGAT 


' TGCTCCTACT 


' CTCTACAATA 


ACC AC C AAAG 


60 


TAAAAACGAC 


ATAGAAAGAT AGCATCAGCT 


GTAGCCATAG 


CGCCTTTGAC 


ACCTT CTGG A 


120 


TGATTATGAG 


TTACCTCTGC AG AAAG ACT C 


GTAAGTCCTC 


TAGATGATGG 


CCATATACCA 


180 


GTTTTCGCAT 


AAAAACCACA GTCCATGATC 


CAAGCACATG 


GAG AAAT AC G 


CATAGCTGAT 


240 


CCATTCCCAA 


AG CT ATTAT A AGGCTCACGG 


TTATCGCTGT 


TTAGCCATGC 


ATTAAACCGA 


300 


GCACCGTAAT 


CAGCATTCGG ATACATTCTG 


CCATATTTCT 


TCATCGCGTC 


AATGAAGTCA 


360 


TCTTTTTGTC 


CACCATTCAT AATTGC TTCT 


GCAACAGCAC 


AGGTCATAAC 


CGTGTCATCT 


420 


GTAAAAAAGC 


AGTCCTTCCG AAATAAAGGA 


AAGTCCTTTG 


TTTTGATATT 


GTTCCATTCG 


480 


TAAACAGAAC 


CGACAATATC TCCAATAATT 


GCTCCAAGCA 


TCAGATTCCT 


CCTTGTTCAT 


540 


TTTGATGCTT 


TTTATATTGG TTATCTACCA 


TATTTATTTT 


AGAAAATAAC 


ATCCTGTTGG 


600 


ATTTTAAAAA 


TTTCATTTTT TTCAAAATAG 


GGTTTTACCA 


TTTCTTTCCA 


CCTAGCTCTA 


660 


TGAAAATTGA 


TTGATTTTAA AGGAGATAGG 


CCATAATTTC 


CCAATGCATA 


ACCATCATTT 


720 


ACTTCAACAA 


CAAGTGTTCT GCCATCGCGA 


GTAACACCGA 


TATCTAGTCC 


ATAAGCTATT 


780 


GGCGCATCTT 


TCCAACATGA TATCGCTTCA 


TCAATTACAC 


TTGCATCAAA 


TTGTGCATGA 


840 


TAATCACCTG 


TATAGGGTCG AACATCTAAT 


ACGCGACCAT 


CTAACACAAA 


AC AACGCCA T 


900 


TCAGCTATGA 


ATTCTACAAC CTCACTAATC 


CATATAGGAT 


AGT CGAAAGG 


TAGACCAATA 


960 


CCTATTAAAT 


CATGGGTTCC ATTAACAACT 


CTTCCAGTAA 


AGACTTTTGA 


ACCAGCTTTA 


1020 


GGCTTAATAA 


ATTTTCCCCA ATTATCAGGT 


ATATTCACAA 


TCTCTCCTAA 


AATACCAGCA 


1080 


TAAATCTTTC 


GACCATAAAA CTCTTTAAGC 


TCAATAGGAT 


AGTCATGAAC 


CGGAACGTTT 


1140 


AAGCCCATCA 


TTTTTAGTAA TGCTCTAGTC 


TCCATTATAT 


AATCTACAAC 


TATATCTTCA 


1200 


CTTGTTAACT 


CTTTTATTTC AGAAAAAGAT 


TGATATAAAA 


TAACTTCTTC 


TCCTTGTAAG 


1260 


TAGGCACCTA 


CTTGAGCATT GTATTTATTA 


ATTGAAACCT 


CACTTGGTAA 


TTTACTTTGT 


1320 
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CT AAT AT AAA CAACCATTTC ATCACTCCTA TATCACTAGT GTTACACCAA TTTCTAAAAA 1380 
ATAATAGCAA TTTTGCTCTT ATTTTTTTGA GTAAATAGCC CCCATAATAT CATCGAAATA 144 0 

ATCAACGGTA TTTAGGAGTA ATTCAATAAC CTGGGACTTT GTTAGTCGCA TTCCCCTTCT 1500 
ATCTCTAGCA TCTTCTACTA AATTTTCAAG TTTCTCTAGA TTTTT AT CAT CCAAGCTAAT 1560 
CATTATTCTA TTTTT ATCGG TTGCCATTTT CATCACCTCA AGTTAATTCT ATCACAGGTG 1620 
TAACACTAGT GTCAACTGGC TTTTATAATA CATTAGTTTA AAAGTGGAGA GGATTTTTAA 1680 
CACAGTAACT TTAAATCTTT GGTATTAAAA AATTTTCACA ATATTTATAG AAATAAAATC 1740 
TGTCTCAAAT CAGTT AT CAA ATCTAGTATA AATTATGAGC GGCTACTCTA ATACTTTCCC 1800 
TCTAAACAAG AAAAAGACTT ACACTCAAGG GTTTTCTTCC CCCCCTTCGT TATAACGTTT I860 
TGACTCTTTT ACTAGCAAAG GTATATACTC ACAAGGAACT TTGGTTGACT ATTGAATCTC 192 0 

TCCAACTTCT TCTTTAACAT ATCCTTCTAC AT CTT CAATC TCTACAAACA TTGGGTCTAA 1980 
GTGACACAAG AAATGCCAAA CTTCGATCCC TTTTTTTCTG TAAAGAATCG CTTCACCGTC 204 0 

TTCACTTCCG AAAAAGCTTC TGTCGATTTC ATATCCGCGG CTTTCTAAGA AGTCTTTTGC 2100 
TTTACGATAG TT CGTTTCTC TTGTTTCGAC ATAGGCTTTA ACTTCATGGT TGTTAACGAC 2160 

ATATGCATCA ATTTTTGAAT ATCCTTCGAT CACTCTATCA TTTTTGAGGG ATAAATTTGA 2220 

AATCTCTTTC CAAATAATGT TTACATTTTC CTCAGGATCG AACATAAATT TAGATAAAGG 2280 

AACAATATTT CCGTTAAAAA TAATTTCCAT AT AATCCGGT ATGTTTTTAG GATTAAAATA 2 340 

CTCCACTTCA AAACCATCTT CTGTTTCCAG AGTGTATCCC GGGATTTGAG CTACAAAGCC 2400 

TTTCCCATCT TCTATGGAAT C AAATG CT AC TAAATCTTTA GAATAATCAT TTTGGTACAA 2 4 60 

TTCCAATATA ACCATCGATA ATCTCTCCAT TTTCATTATC AGGCTAATGT AAATAAGCAC 2 52 0 

GTCACCTGAC CAATTCAGGC TCTCTGTATC ATCTCATCAT ATTTCCTACT TACTTTACGA 2580 

GTCTTATACC CAGAACACAC CTTATCGACC TTCGGTCTCA CCTCGTCGCA TTGGCTGAAC 264 0 

ATCTACTTTT ACTTTG CTG A TGCTTCAACT CGTACAAGCA GTGATACCGC CTCAGCGTGA 2700 

TGCGTCAGTG GGACTCAAAA GGTTCGCGGA ACCTTTTGAG GATTAACTAC GTTTCTCTAA 27 60 

TAAACTTACA CATTCAACTT GTTCATCATT GTCCAAACCT ATGTTGAGAT TTTCTTCTAT 2820 

AATTGGTAGC TTAAAAGTAA TGGATTTTAG CCATTGTCCG TTAGATTGTT TTTCTTCATA 2880 

AACTTGAATT TCAGAAATCA AAGCTGAAAT T AACTGC CT A CGCTCTACAT CATTCATGAC 294 0 

TTTATAGAGC TTATCAAAAT AGATCAGAAC CTTATATATG TTATCTCCTG TAAGCTTTTC 3000 

AGCTTCAATA GTCTGTTTCT TTGCTTTCGC ATCAATTAGT GATGATTCTA ATTCATCTAG 3060 
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TTTCTCATAC ATACGATATA GTCTATCATC TAAATCCTGT TTCCTTCTCT TATAATGCTT 3120 

ATCTTCAACA TCTAAATTAT CTATTTCCTC AATTAGCTTA AACTTTGTAG AATGACTCTT 3180 

TCTCAATTCC TTTTGGTAAT TATCTATTTC TTTTTCTATT TCAGAGGTAT CCACCTTCAT 3240 

GTTGATTTTT TCTTGCATCA TAGAAGCAAA TTTCGGATTA CTTACTATCT TGACAATCAC 3300 

CTCTGCAACA GCATCATCTA ACAATTCTTC TCTAATTTGC TTACTGAATG TACACTTATT 3360 

ACCTCTTATC ATCTGCCTAT GGTTACAACC ATAGTAATAA AAATCTTTAT ACTTTGTGCC 3420 

ATCTTTCTTT TTCTTGATAC ACTTGTTCCC AAACATTCCC ACTCCACATA TCGGGCATTT 34 80 

TACAATTCCA GAAAGCAAGT GTGTGCGTGT ATCTTTTCCT TTATTCACAT GCTCATATTT 3 540 

CTTTGCTTGA GATTTTAGCT TAACCTGAGC AG CTTG CCAA ACTTCATCCG AAACTATAGC 3 600 

TTCATGTATC CCTTCAGATA TTAGATATTC ATCTTGTTCA ACCTGCTTAT ATTCATTTCT 3 6 60 

TGTACCATGA ACTTTTTCTA AAGTTCTTCT TCCAAATGCT ATTTTCCCAT TATATACAGG 3720 

ATTCTTTAAT ATCTTTCTTA TAAGACCTGC ATCAAACAAA GGATTCTTAC C ATTCTGT CT 3780 

TGGGATTTTT CTAATTCCAT GATTCTCTAA GTATTTAGAT ATCCCATTGG CTCCTATCGT 384 0 

AGTATTTACA TACTGGTCGA AAATCGTTCT TATTGCAACT GCCTCTTCCT CATTTATAAA 3900 

CAGCTTGCCG TCTTCAAGTT TATATCCATA CGGAGCAAAG CCACCATTCC ATTTTCCTTC 3960 

CCCTGCTTTT TGAATGCGAC CTTCCATTGT TTGAATACTG ATGTTTTCTC TTTCTATTTC 4020 

AGCCACAGCT GATAAAACAG AAATCATTAG TTTCCCAGCA TCTTTAGATG AATCAATGCC 4080 

ATCTTCAACG CAGATAAGAT TAACTCCATA ATCCTGCATT ATATGAAGTG TAGAAAGAAC 414 0 

ATCAGCGGCA TTTCTTGCAA ATCTTGATAA CTTAAACACA AGAACAAAAG ATACTCCATC 4200 

TTTTCCAGAT TTTATATCTT CCATCATTCG ATTGAACTGT ATTCTACCTT CAATAGACTT 4 2 60 

GTCAGACTTC CCGGCATCTT CATACTCTCC AACAATTTCA TAATCGTTGT AAATAGCAAA 4 320 

AG CTTTCAT T CGTGATTTTT GTGCCTCTAA CGAATACCCC TCTATCTGTA TTGACGTAGA 4380 

TACTCGTGTA TAGAGGTATA CTTTTATTTT TTCTTTTGAC ATAGTATTAA CCTCAATATA 4440 

ATTTTTCTAT ATCATATATA ATTTTTTTAA TTTAAGTTTG G ACT AT C ATT TCAAGTATAT 4 500 

TATAACACTT TTATTAGTCC GTCTCAATTT GTGTTTTTGC CATGTCAAAA CTATTTTTCA 4 560 

TCTCTTGATT TTTTGCTGGC GTTGGATCGG GTAGATTATC T AAATCT AAA GCACCAGCAT 4 620 

ATTTTGCAAT CAGATTTGCT ATTAAATCAG CCAATCCATT CCAGTCATTG TCCAATATAT 4 680 

ACCTCCTCTA AAGTTTTATA TCTAATAATT ATTTGTTTAA TTAAGTTTTT TGACATTGAC 4740 

AAGTGCTTTG GATTAGCAAC ATAGGAATCT CACTTCCGCC TCTATTCCGG ATGAGCCGGC 4800 

TTCAACCTTA GAAGTATCAT TACCCTCATT TTCTTCATAG CGGATAGGGT ATCCCTCCCT .4860 
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ATATTCAAAC TCTTACTTAT CGCTCACTTT CTTTTTGCTT AGCAGAACTT TTTTTGCCGA 4 9 20 

ATTATTCAGC CGAAAGATCT TGACGGATAG GTTATTACGC TCCAAAAATA ATTAACGTCT 4 9 80 

TGTCTTCGTC TATTCAATTG TTAAGGTTCA AAATTTATCG AGAGTTATTA ATCTTTTTAA 504 0 

AATTTGACCA TCAGAAAATA TTTATCTTGA TGTAACAAAA TTCTATAAAT TACCCTCTTA 5100 

TACTTAACAG TGAAAAGAAG TCTTTCTTGG TAACCAATTT TGAAATAGAA TTTGCTTATA Si 60 

TAAAAAGGTC CAATTCCCAC TGCATAAATA GCAGTGAAAA TTACACCCTC TTGGTAACTG 5220 

TCATCTAAAA GTCTTCTA 52 38 



(2) INFORMATION FOR SEQ ID NO: 151: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



GACGATTTAC 


GAAGAATCGA 


AC AAGAACC T 


GCTCCTATCA 


ATTCCCAACC 


TCTATCTCTA 


60 


AAATCTTGCA GTTCATGCTT 


ATACTTTTTT 


AAGAAATCTA 


GAATCATAGA 


TACGGTAGAT 


120 


GACATCGTCT 


GGTTGACATT 


GGTCAAAATA 


GAACAAACCA 


AAACGACTCG 


TTCTATACCT 


180 


CCAACCTTTC 


AAATGCATCT 


CATGTAAATG 


TTCTTCTTCC 


TTGTCCAAAT 


CAACAATGGT 


240 


GAAAATCCGA 


AATTCTACTC 


TGCTATTCAT 


TGTCTTACCC 


CAAAATTAGA 


AAACATGCCT 


300 


GGCGTTATTT 


ATTAGATAAT 


TCTTTCCACT 


TTTGACTCAA 


TCTCCAAAAA 


ATATAAGAAA 


360 


TCTGAATCGC 


AAAAACTATC 


AATAAAACCC 


AATCTATTAT 


GAAAATCAAA 


AACACTTTCC 


420 


AACTGAAAGA 


ACTACCTCCA 


GTGACAAACT 


TTGAGAAAAA 


CGGTAGTAGA 


GCTAAAAAGA 


480 


GAAATAAAAT 


AGGAAGCATC 


CGCATTGTTA 


AAATCCGTTT 


GGCATAAAAA 


AATCTTTATT 


540 


TAAACGAAAA 


TATTATGGCA 


AAATTTACGC 


CAGTTTTTGA 


ACGG CTG ATG 


TAGATATTTT 


600 


ATACTTTCAA 


AATGTTTAAA 


TGTGATTATT 


TATTTTTGAA 


AAATAGATCA 


CCAGCCCGAC 


660 


TGAAAGTGCT 


TATAGAATGA 


TAATAAGTCG 


CCTGCCGAAA 


ACAGCGAAAA 


ATAGCGGTGT 


720 


T ATG CGGAG A 


TAATCTGACG 


CGATGCGAAA 


GTATATTGCA 


TACTTATTTT 


CAACAATTTA 


780 


GCAGAGTATT 


TTTATAAGTG 


TGATATAATA 


GAAGTATAAT 


TTGTTCTGAT 


AGTTTATTTT 


840 


ATGGAGAAGT 


AG ATTTTT AG 


AATGCGGAGG 


GTTCAATATG 


GTTGAGTTTA 


TAAAGTCTAA 


900 


GAAAGAAATG 


AGTGAGGAGG 


ATATTAAAGC 


AAATTTCATC 


ACTCCTGCTA 


TTCTATCCAA 


960 
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AGGATGGAAA 


AATGGTGAGC 


ATATCGCTTA 


CGAAGAATAC 


TTCACTCATC 


GTCGAATTGA 


1020 


AGTTAGAGGA 


GATAAGGCTC 


GTCGTAAAGA 


AGGAAAAAAA 


T CAG ACT ATT 


CACTGTATTA 


1080 


CCAATTTGGA 


ACTCGAATTG 


CAATTGTTGA 


GGC AAAGGAT 


AAT AAAC AC A 


GCGTTCG AG C 


1140 


AGGATTACAA 


CAAGCTATTG 


AATATGGAGA 


GATTTTAGAT 


GTTCCATTTG 


TTTATTCTTC 


1200 


GAATGGTGAT 


GGCTTTATTG 


AACACGACCC 


TATCACGAGA 


GAAGAACGTG 


AGCTGGAGTT 


1260 


AGACGAATTC 


CCTACTCGTG 


AAGAATTATT 


TTCTCGTATG 


ACGAAGGAAA 


AAGGATTGAC 


1320 


GTACGAAATT 


ACAGAAGCT A 


TCTCAACTCC 


ATACTATACA 


GACGCCTTCT 


CAATGAAAAC 


1380 


GCCACGCTAT 


TATCAGCAAA 


TAGCTATCAA 


CCGTACTATT 


GAAACAGTTG 


CCAGAGGACA 


1440 


AAAACGAGTA 


ATGTTTGTGA 


TGGCAACAGG 


AACGGGGAAA 


ACGTTCATGG 


CTTTTCAAAT 


1500 


TATTCATCGC 


CTTCGAAAAG 


CTGGTTTGGC 


TAAACGAGTT 


TTATTCTTAG 


CAGATAGAAA 


1560 


CATCTTAGTA 


GACCAAACGA 


TGGCTGAAGA 


CTTTAGGCCA 


TTCG AAAAG G 


TAATGACGAA 


1620 


AATTACACCA 


AAACTTTTGA 


CTGCTCCTGA 


AAAATTAAAT 


TCTTTTGAAA 


TTTATCTAGG 


1680 


GCTTTATCAG 


CAACTAACTG 


GTGAAGATGG 


AACTGAAACA 


CATTATCAAA 


AATTTGACAA 


1740 


AGACTTCTTT 


GATTTAATCG 


TAATTGATGA 


AGCGCAC CGT 


GGTTCAGCTA 


AGGAAAACAG 


1800 


TAACTGGCGT 


AAGGTAATTG 


ATTATTTCAG 


TTCTGCGACA 


CAG ATTGGG A 


TGACCGCTAC 


1860 


TCTTAAAGAA 


ACCAAGAATG 


CTTCCAATAC 


GGAATACTTT 


GGTGAGCCAA 


TCTATACTTA 


1920 


TAGTTTAAAA 


CAGGGAATCG 


AGG ATGGTTT 


TTTGGCTCCA 


TATCGTGTTA 


TGAGGGTTAA 


1980 


TTTAGATGTG 


GATGTGGATG 


GTTATCGTCC 


AGAAACTGGA 


AAAGTTGATG 


CTAACGGACA 


2040 


ATTAATAGAA 


GATAGGTACT 


ACGGCAGGAA 


AGATTTTGAT 


AAAACCATTG 


TCATTGATGA 


2100 


TAGAACGCAA 


AGAGTTGCCA 


AGTTTGTTTC 


TGATTATATG 


AAGCAAAACA 


ATGCACGATT 


2160 


TGATAAAACA 


ATTGTTTTTT 


GTGTTGATAT 


TGACCATGCC 


GAGCGAATGC 


GTGCTGCACT 


2220 


TGTAAAAGAG 


AATCTAGACT 


TAGTCCAAGA 


AGACTATCGT 


TATGTCATGC 


AAGTAACTGG 


2280 


TGACAACGCT 


GAAGGAAAAG 


CTCAACTGGA 


TAACTTTATG 


GATGTCAATT 


CTAATTTTC C 


2340 


CGCTATTGTA 


ACAACGTCTA 


AATTATTAAC 


GACAGGAGTT 


AATGCTAAAA 


CATGTCGTTT 


^400 


GATTGTTTTA 


GACTCTAATA 


TCCAATCCAT 


GACTGAATTT 


AAACAAATTA 


TTGGTCGTGG 


2460 


CACACGTCTT 


TATCCTCAAA 


AGGGGAAAGA 


ATTTTTTACG 


ATTATTGATT 


TTCGAAATGT 


2520 


TACCAATTTG 


TTTGCTGACC 


CTGATTTTGA 


TGGTGATCCA 


GTGAAGGTGC 


TAGAAACACG 


2580 


TGCGAAAACA 


GTCAGTGGTT 


CTACGCCCGG 


TTTCGTAGAT 


GAGGAAGGTG 


ACCCAGTAGA 


2640 


AAAATATATC 


GTTACAGACA 


AGCAGGTTAC 


CATTCTTAAT 


TCTACTGTTC 


AAGTATTGGA 


2700 


TGAAAACGGG 


AAACTGATTA 


CCGAAAGCCT 


GACCGACTAC 


ACTCGAAAGA 


ATATCTTAGG 


2760 
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TAGCTACGCC ACTTTGAACG ATTTTATCAC AGTTTGGCAT ACGGCAGATA AGAAGAAGCT 
TAT CTTAG AC GAACTTTATA AAAAAGGAGT TTATCTAGAT GCTATTCGAG AGTCGGAGGG 
AATATCAGAA CAAGAAATCG ATGATTTTGA TTTACTCCTA AAACTTGCCT ATGGTCAAAA 
AGAATTAACC AAAACGGAAC GTATCAATAA ACTCAAACAA AGCGGATATT TATATAAATA 
TAGTGAGGAA GCGCGTGCTG TTTTGGAAAT TTTACTGAAC AAATACATGG ATAAACCTAT 
TGGAGAACTC GAAAGCATTG AAACATTAAA ACTTCCAGAA TTTCAGATAT ATGGTGGAAC 
CTTCAAAATC ATC AAT ACT T ATTTTGGAGA TAAAAAACGA TATTTACAAG CAATTAAAGA 
ATTGGAGCAA GAGCTATTTA CAGTAGCTTA ATGAAAGGAA AGTATGTCAA TTACATCATT 
TGTAAAAAGA ATTCAAGATA TCACTCGAAA CGATGCTGGT GTTAATGCTG ATGCTCAACG 
TATTGAGCAA ATGTCTTGGT TATTATTCTT AAAAATTTAT GATAGCCGTG AAATGGTTTG 
GGAATTAGAA GAAGACGACT ATGAGTCAAT TATCCCAGAG GAATTAAAAT GGCGAAATTG 
GCCTCATGCT CAAAATGGGG AACGGGTATT GACAGGCGAT GAATTACTTG ATTTTCTCAA 
TAACAAGTTA TTCAAAGAGT TGAAAGAGCT TGAAATAACT TCAAATATGC CT ATT CG AAA 3 540 

AACGATTGTT AAATC AG CTT TTCAACATGC GAACAACTAT ATGAAAAATG GCGTCTTGTT 3 600 

ACGCCAAGTC ATCAATGTTA TTGATGAAGT TGATTTCAAT AGCCCTGAAG ATCGTCATTC 3660 
GTTTAATGAT ATTTACGAAA AAATTCTTAA AGATATTCAA AATGCTGGGA ACTCAGGAGA 372 0 

ATTTTATACG CCACGTGCAG CGACTGATTT TATTGCCGAA GTTCTTGACC CAAAACTTGG 
AGAATCAATG GCAGACCTTG CTTGCGG AA C AGGAGGCTTC TTGACTTCGA CTCTGAACCG 
TTTAAGTAGT CAACGTAAAA CTAGTGAAGA TACCAAAAAA TATAATACAG CTGTTTTTGG 
TATTGAAAAG AAAGCATTTC CTCATCTTTT AGCAGTTACA AATCTGTTTC TTCACGAAAT 
TGATGACCCT AAAATTGTTC ATGGAAATAC TTTGGAGAAA AATGTTCGTG AATATACGGA 4 020 

TGATGAAAAA TTTGACATTA TTATGATGAA TCCACCTTTT GGAGGGTCAG AATTAGAAAC 4080 
AATAAAAAAT AACTTTCC AG CAGAATTACG GAGTTCTGAA ACAGCTGATT TATTTATGGC 
TGTCATTATG TATCGTTTGA AAGAAAATG G TCGTGTTGGA GTTATTTTAC CTGATGGTTT 
TCTATTTGGT GAAGGTGTAA AAACTCGCTT GAAACAAAAA CTGGTAGATG AGTTCAACTT 
G CAT ACGATT ATTAGGTTGC CTCATAGTGT CTTTGCACCG TATACAGGAA TCCATACGAA 
CATTCTTTTC TTTGATAAAA CAAAGAAAAC AGAAGAAACT TGGTTTTATC GTTTAGATAT 
CCCAGATGGT TATAAAAATT TCTCGAAAAC TAAGCCGATG AAGTCAGAAC ACTTCAATCC 
TGTTCGTGAC TGGTGGGAAA ATCGTGAAGA GATTCTGGAA GGTAAGTTCT ACAAATCTAA 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 



3780 
3840 
3900 
3960 



4140 
4200 
4 2 60 
4320 
4380 
4440 
4500 
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ATCATTTACA CCTAGTGAAT TGCCTGAGTT GAATTATAAT TTAGACCAGT CTCACTTTCC 4 5 60 

AAAAGAGGAA GAGGAAATCT TAAATCCCTT TGAGTTGATT CAGAATTATC AAGCGGAAAG 4 620 

AGCAACTTTA AATCATAAGA TTGATAATGT ATT AG CTG AT ATTTTGCAGT TGTTGGAGGA 4 680 

CAAATAATGA CACCAGAACA ACTTAAAGCA AGTATTCTCC AAAGAGCGAT GGAAGGGAAA 4740 

TTAGTGCCGC AAAATCCCAA TGAC GAACCT GCAAGTGAAT TATTAAAGAG AATTAAAGCT 4800 

GAAAAAGAAA AACTTATCAG TGAAGGAAAA ATCAAACGAG ATAAAAAGGA AACTGAGATA 4860 

TTTCGTGGTG ATGATGGGAA ACATTATGGG AAGTTTCCTG ATGGAAGCAC TCAAGAAATT 4920 

GATGTTCCTT ATGATATTCC TGATACTTGG GAGTGGGTGA GGTTTTCTAC ATTGGTTGAA 4 980 

ATTCTCAGAG GTGGCTCTCC ACGACCAATC AAaGATTATC TTACTTCTGA AGTAGATGGA 504 0 

ATAAATTGGA TAAAAATAGG TGATACTGAA AAGGGTGAAA AGTATATAAA TAATGTTAAA 5100 

GAAAAAATCA AAAAATCAGG GCTTAACAAA ACTAGATTTG TAAAAAAAGG TACATTTTTG 5160 

TTAACTAATT CTATGAGTTT TGGTAGACCT TATATTTTGA ATGTT GATGG TGCAATACAC 522 0 

GATGGATGGT TGGCTATTTC GAACTATGAA AACTCATTAA ATAAAGATTA CCTATTCTAT 52 80 

ATTCTTTCAT CAAATGTAGT TTATTCTCAA TTTCTATCTC TAATTAGTGG AGCTGTTGTG 534 0 

AAAAACTTGA ATAGTGATAA AGTTGCTTCT ATTCTTATCC CTCTCCCCCC ACTATCCGAA 54 00 

CAACAACGAA TAGTAGAAGC AATCGAATCA GCTTTAGAAA AAGTAGATGA ATATGCTGAA 54 60 

AGTTATAATA GACTAGAACA GCTAGATAAA GAATTT CC AG ATAAACTAAA AAAATCTATT 5520 

CTTCAATATG CTATGCAAGG AAAATTAGTT GAACAAGACC CAAATGATGA ATCAGTCGAA 5580 

GTTTTACTTG AAAAAATACG AGCAGAAAAA CAAAAACTCT TTGAAGAAGG C AAG ATT AAA 5640 

AAGAAAGATT TGGACATTTC TATTGTTTCC CAAGGAGATG ATAACTCTTA TTATGGGAAT 5700 

ATACCTATGA ATTGGGTTGT TATAAAAATA AAAGATATTT TTTCAATAAA TACAGGTCTT 57 60 

TCTTACAAGA AGGGCGATTT AAGCATTAAT AATAAAGGTG TTAGAATTAT ACGTGGTGGT 582 0 

AATATTAAGC CTTTAGAATT TTCTCTGTTG GATAATGATT ACTACATTGA TACACAATTC 5880 

ATCTCC T CTG AGCAAGTTTA TTTAAAACAT AATCAGCTAA TAACACCTGT ATCAACCTCT 5 940 

TT AG AACAT A TTGGAAAGTT TGCAAGAATC GATAAAGACT ATGATGGTGT TGTGGCTGGT 6000 

GGATTTATTT TCCAATTAAC AC CATTCGAA AGTTCAGAGA TTATTTCAAA ATTTCTATTA 6060 

TTTAACTTGT CCTCTCCGTT ATTTTATAAA CAATTGAAAG CAATAACTAA ACTATCAGGT 6120 

CAAGCTTTAT ATAATATTCC TAAAACTACA CTGAGCGAGC TATTAATTCC GTTAGCTCCT 6180 

TTTGAGGAAC AGGAACTTAT TACTCAAAAA GTTGAGAAAC TTTTTGAAAA AGTAAATCAA 6240 

CTTTGAAAAT GATTCTTTTC ATCTCTTCAT GATTAGAAAT AGGGATTAAT AATTCGGAGA 6300 
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TACTGGTACT ATTTAATCTT TTCCCT TTG A TAGCATCTTT TGAATCACCT AAAGTAGAGA 
TAAGTGGCAA AAATATCATT AAGTAATCTC TGATAATATT TTCTTTATTA GCATAGGCGA 
ATATCGATAT AATG G CTTCA TTATGAGTGG CAGGAATATC CAATATGGCA ACTTTTCCAA 
TAGATAATTT AAAACTCATT AATAAAGTTC CTTTAGGTGA AATGTCTATT TTCTTTGATT 
TTAATGCTAA TTTAGAAATA GATTCTCTCG CAT TAG TTAC ATAACCAGAT ATAGGCATAT 
CTGATATAGA TACCCAAGGT ATTTCAGTTC CCCAAAAAGT AGCTTCACTG CGTGGAGGAG 
TTTTTCCTAT TCTGAAGTTA ACT AGGCT AG CAAATTTAAT ATATCTCCAT GCTTCTGCGA 
TTTCATATAT AGGATAAGAG GTTGTTTCGT CTTTGTTCCC ATAATAAGAG CCATAATCAC 
AAAAATAGCA GGTAGTCAGT TTGACCACCT GTTATTTTTT ACCAATTAAC AATTTTATCT 
ACAATATTTT GTTGTTCAGT AGCTGTTTTC CTTAGATAAA TTCGAGTAGT TTCTATACTT 
TCGTGTCCCA TCAAATCTCC AAGCAAGGCA ATATCATTAT ACTTCGCTAA AAAATTCTTA 
GCAAATAAAT GCCTAAAAGA ATGAGGGTAA ATTACGTTAG GATTCATTTT GTATTTATCA 
GCATAATTTT TTAACTGTTG AGCAACTCCT CTTGCTGTAA TTGGTTCGTT AAATTTATTC 
AAAAATAAAT AACCACTTCG GCGATTTTCT GATTCTAACC AACTAAGACA ACTATTTCTT 
AATTTTTTAG GAATGTACAG TCTACGAATT TTACCACCTT TTGAGTAAAT GTCAAAATAA 
CCGATTTCTA CATGCTCTAC TTTTAGTTTA ATAAGTTCAC TTACACGAGC CCCAGTTGCA 
CCTAAAAACC AAACGACAAA ATGCCATTTT AAAATACCAT CTTTTTTCAA ACTACGTTTA 
AGAAAAAGGT AATCAGCATG G CT AATG AC A TCTTCTAAAA ACGGTTTTTG CTGTACTTTG 
ACAAATTTTA ATTTCAAATC ATCATGACCA ATAAAAGCCA GATATTTATT TACTCCTTGT 
AGTCGCAAAT TGACAGTTTT AGGTTTAAAA TTGTCTAATA AATATCCTTT GT ATT C AAA T 
AAATCTTCCA TTTTGAGTTC GTAATTCTCC AAGAAAAATC GAACACCATA AAGGTACGAA 
CGCACAGTAT TTTCAGCTAA ACCAGCTTTC TTCAAATGTA ATTC AAAAT C TTTCAACGTA 
AAACTCCTAT CTTATGTTTG ATAGAAATTC CACCGCACGT AAAACTATTA TACTAAATTA 7680 
GTGCGTCAAT ATGGGCGAAA AATTGTTCGA TTTTATCAAC GATTCTGGAT TGTTCAGGAA 7740 
GGGGTGGGAG GGGGATTAAA TATTCTTTTA TAGTTTTCGT TAATAATTCT TTTTGTTTTG 
TACTACCCGA CGCTTTTTCT TCAATAACTG ACTGAACAAT AGGAGAGGAA AGAAAATTAT 
AGATGAAATG GCAATTAATA ACCCCCGATA AGACTCTTAT AACTGTAACA TGGCTATCTG 
CAACAGCCCA GCCATAAGGA TTTTTATTTT . CATGGTAAAT AGCTAATCGT CCTAAC GT AC 
CTAGACCTGT TGAATTCCAC ATTAAATCAC CATCTCTTAG TAATCTTTCT TTCTGGTAAC 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 



7800 
7860 
7920 
7980 
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TATGAACTGT 


TTCGGGATCA 


, ATAAATCTTG 


CTAAGTCAAT 


AGAAAAGCCA 


GACCATTGAT 


8100 


TACATTTCTG 


AGCAATCACA 


GGGTATATAG 


GAATATTTGA 


ATATTTTGGA 


GACTTCCCTC 


8160 


TTTGAATGTA 


GGAGGTTATA 


TCGTTTAACC 


TCACCCATTC 


CCAACTTTCT 


GGTATTTCAC 


8220 


AAGGTACTTC 


CTCATAATAA 


G AGTT AT CAT 


CTCCTTGGGA 


AACAATAGAA 


ATGTCCAAAT 


8260 


CTTTCTTTTT 


AATCTTGCCT 


TCTTCAAAGA 


GTTTTTGTTT 


TTCTGCTCGT 


ATTTTTTCAA 


8340 


GTAAAACTTC 


GACTGATTCA 


TCATTTGGGT 


CTTGTTCAAC 


TAATTTTCCT 


TGCATAGCAT 


8400 


ATTGAAGAAT 


AGATTTTTTT 


AGTTTATCTG 


GAAATTCTTT 


ATCTAGCTGT 


TCTAGTCTAT 


8460 


TATAACTTTC 


AG CAT ATTCA 


TCTACTTTTT 


CTAAAGCTGA 


TTCGATTGCT 


TCTACTATTC 


8520 


GTTGTTGTTC 


GGATAGTGGG 


GG GAG AG C AA 


TTAATAATAG 


ATTAAAATTA 


TAATCATTGA 


8580 


TTGCAGGATA 


ACTTGTTCCA 


GTAGATTTAT 


TATTAACACG 


ATTGATAAAA 


TTATCTGATA 


8640 


ATAAATAATA 


TTTCAAATAT 


GTTTCGTTAA 


GTAAAGTATC 


CAAAACAATA 


AATGCTGTAC 


8700 


TAGCTATCAA 


ATACTCTTTA 


AGTTCTCTAA 


CTACAGCAAT 


ATTTTTTAGA 


TATGGTCTAA 


8760 


CTGTTGAAAA 


TAAGACACTA 


TTCTGCGAAA 


CTAATTTTCT 


AGCACGGGAA 


GGCGCTTGTT 


8820 


CAGGTGAAAG 


ATATTGTAGA 


TTTTTGTAGT 


TGATTATGTT 


CTTTTTTCTA 


TCAATACTAG 


8880 


ACGTATCTAT 


ATACCTAAAG 


GATTTCTCTG 


GCTTATTTTG 


CC CAAAATT C 


CAATAAATTG 


8940 


ATTTTATCCT 


CACCCACTCC 


CAAGTATCAG 


GAATATCATA 


AGGAACATCA 


ATTTCTTGAG 


9000 


TGCTTCCATC 


AGCAAACTTC 


CCATAATGTT 


TCTTATGTGC 


TTCAAGTATA 


TAAAAAGGCG 


9060 


TAAAAATACG 


CCTATAGATA 


ATGGGGTTGA 


AATAGGTTTA 


TTGTTGATGA 


GATTGTAGAT 


9120 


AATTCAATTT 


TTTACTTCCA 


ATCGAATATT 


CAAATCCTCC 


ACCTTTTCTG 


CCTGTAATTG 


9180 


TTCATCATAA 


AATTCAATAT 


CTTCAGGATT 


TTCCCCTTGG 


CAACCTCGGC 


AGAAATATTC 


9240 


TTCCGCTCGA 


TCAGG ATTCA 


AAAATCGACA 


AGCACAAACA 


AAACAGTCGC 


CATCATCATT 


9300 


TATTGAGATA 


ATATAGTAGA 


TTGAAATAAG 


ATGTAAACAA 


ATCGATTAGG 


AAAGTTAAAT 


9360 


T AGTTTCT AG 


AAATTTTTAG 


CAGATGTAGT 


GTACTATTCT 


AGTCTCAATT 


TACTATGGCT 


9420 


TCAAATATAT 


CTTTCGAAAA 


AATATTTACA 


GATGTGTAAT 


TTTGAAGCTT 


GCAAAAGTTA 


9480 


GTAAACTTGT 


AGATTTCGAT 


TTGAAGTAAC 


TTGTTTTCTT 


GCCCGATATT 


GTTTTTGAAA 


9540 


TTGAATTTTT 


CCATAGTGAC 


TCCTTAATTT 


TCTTCTACAC 


GTCTGATGAT 


AAATCTAATT 


9600 


CGCAAAAGAG 


TCAAGAGGAT 


TTTTCGAAAA 


ATAAATAGCG 


ACCGAAATCG 


CTATTTTAAG 


9660 


GGTTATAGGT 


ATTTGATGGC 


TTAGACTGCT 


GTGTGACTGT 


TTACCCACAG 


GC AATCTTTC 


9720 


TTCTATATTA 


GTATTAGTAA 


AGGTCTAAAT 


AATTATCAAT 


TTCCCATTGT 


GAAACGAAGG 


9780 


TTGCATAACT 


TGCCCATTCG 


ATTCGTTTGG 


CTTCAAGGAA 


GCTAGTATAG 


ATGTGATCTC 


9840 
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CGAGAGCAGC TTTAACCACT TCATCTTCTG TCAAAGCTTT CAAAGCGTTG TGAAGAGTTG 9900 
ATGGAAGGTC TGTAATACCA GCTTCCTTGC GCTCTTCTGC TGTCATGATG TAGATATTTT 9960 

CTTCGATAGG AGCTGGTGCT TCGATTTTAT TTTCAATACC ATACAAACCA ACTTCCAAAA 10020 

G AAC AG C CAT AGCAACGTAA GGGTTCGCCA TTGGATCCAC TGAACGCAAC TCAAGACGAG 10080 

TTCCCATACC ACGTGAAGCA GGTACGCGCA CAAGTGGCGA ACGGTTACGA CCAGCCCAAG 10140 

CAATGTAAAC AGGCGCTTCA TAACCTGGAA CCAAACGTTT GTATGAGTTA ACTGTTGGGT 10200 

TCATGATGGC AGTATAGTTG TAAGCATGCT TGATCAAACC GCCTAGGAAA TGGTAAG CTG 10260 

TTTCTGACAA CTGCATTCCT TTTGGAT CAT TTGGATCAAA GAAGGCGTTA TTTCCTTCTG 103 20 

CATCAAACAA GGACATATTA CAGTGCATAC CTGATCCAGC AATACCAAAT TTTGGCTTC G 10380 

CCATAAATGT TGCGTAAAGT CCGTGTTTGC GAGCAATGGT TTTAACAACA AGCTTAAAGA 10440 

TTTGAATCTT ATCACAAGCA CGGAGAACTT CATCGTACTT AAAGTCAATC TCATGCTGTC 10500 

CAACCGCAAC CTCGTGGTGA CTCGCTTCTA CTTCAAATCC CATTTTGGTC AAGACATTCA 10560 

CAATCTCACG ACGTGTGTTG TCCGCAAGGT CAGTAGGTGC CAAGTCAAAG TAGCCACCCT 10620 

TGTCATTCAC TTCAAGTGTT GGGTCCCCAT TTTCATCCAA CTTAAATAGG AAGAATTCTG 10680 

GCTCTGGACC AAGGTTGAAG GATTTGAATC CAACTTCTTC CATGTGACGA AGAGCTCGTT 10740 

TCAAATTACC ACGAGGGTCA CCCGCAAATG GTTCACCTTC TGTTGTATAG ACATCACAGA 10800 

TCAGACCTGC AACACTTCCA TTTTCATCTC CCCAAGGGAA GACTGTCCAT GTATCCAAGT 10860 

CCGGGTACAA GTACATATCC G ACT CATTG A TACGTACAAA ACCTTCAATA GAAGATCCAT 10920 

CAAACATAAC CTTGTTCGAC AAGACCTTAT CTAACTGTTC ATCTGTAGCA GG AATTTCG A 10980 

CGTTTTTCAT GGTTCCCAAA ATATCTGAGA ACATAAGACG AATAAAGGTA ACATTTTTTT 11040 

CCTTGACTTC ACGACGAATA TCTGCAGCTG TGATTGGCAT AAGTTTTCTC CTTAATCTAT 11100 

GACTACTTGC GGTTGCCTAA CCGCGACCAA AAGGTGACTG TACTGAAGCA AAACGCCCCT 11160 

GTTGGAGGAG TTCATTGTGA AGTGCACGAC GTACTTCAGT CTGACTAACC GCTTTCTTGG 11220 

ATTTCGCTTC ACGTTCAGCA TATTTTTTCT TAATGGCAGC GAT ATT AT AA CCTTCAGAGA 11280 

TATAATCTTT GATTTCAAGC AGACGATCCA TCTCATTCAA GGAATACATG CGACGATTTC 113 40 

CTTCGTTTCG ATCGGGCTTG ATCAACTCTT GATCTTCATA ATAACGAATC TGACGCGCCG 114 00 

ATAGATCGGT CAACTTCATA ACACTGCCGA TAGGAAAAAC AGCCATATTT CGGCGAAATT 114 60 

CTTTTTCCTT CATTTACAAT TTC C TT C TTT CTGTCTATTA TAGTCTAAAA AAAGACAAAC 11520 

GTCAATTGAT AATGTTATAA AATGTAACAT TATTTTTCTT TTTTCTCTAA AAAGAGACGA 11580 
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ATACGATCAA TATCGTAATT TACGATAATT GCGACAAAAA CTCCCATAAA CGTTTCTAAT 11640 

ACACGCACAA ACACGTACAA AATTGTCTCA CCACTTGGAA TTGATAGGGT AATGATTAAC 11700 

ATAGCTGCTA CACCACCAAT AACCCCTGCT TTGTT ATT C A TGGCTACATT TGTCATAATG 11760 

GTTAACATGG TGCAGATTGG AACAACTACC AAGGTCACCC AAAAGGCTTC GTGGAAAAAG 11820 

GTATTTAATA AGAAGAAGAC CAAGGCATAG AGTCCACCGA TACTATTTCC TAGAATACGC 11880 

GAAGTCCCAA AATGAACACT CTCATCAAAA CTCTCCCTCA GGCTAAAAAC GGCTGTCAAA 11940 

GCACCAATTT GAAGACCTTT CCAGCCAAAA AAGCCAAAAA TCAAGAGAAC TAGAAAAACA 12 000 

GCAATACCTG TTTTAAAGGT TCGC AT AC C A AGTTTGAACT GGGATTTATC GAATTTATAT 12060 

TTTTTAAAAT AACTCATAAT CTCAACTTTC TATTTCCATT TTATCATAAA TCGGTGATTT 12120 

TTATGAGTAA TAGTTGAGAG GAAGCGTTTT TATTTTAAGC AAAAGAAAAG AGGAACTTTC 12180 

ATCCCTCTCT TCTTTGATTT ATTTATAAAA TCTTATTTTT CTGTCAAGGC TGCAAGTCCT 12240 

GGAAGAACCT TACCTTCAAG AAGTTCCATT GATGCTCCAC CACCCGTACT AATCCATGAG 12 300 

AACTTGTCTG CACGGCCAAG GTTAATCGCT GCGGCAGCTG AGTCACCACC ACCGATGATT 12 360 

GATTTAACTC CTGGTTGTTT CACGATAGCG TCCATCACAC CGATTGTACC AGCTTGGAAA 12 4 20 

TCTGGGTTTT CAAATACACC CATAGGTCCG TTCCATACGA CTGTTTTGGC ACCAG TC AAA 12480 

GCTTCGTCAA ATTTGGCGAT AGATTTTGGA CCGATGTCAA GACCAAGGAA GCCTTCAGAA 12 540 

ACTGCTTCAC CTTCAGTGTC ACGCACTTCA GTGTAACCAG CAAATGCGTT AGCTTCTTTT 12 600 

GAGTCAACTG GCAAGATCAA TTTACCATTT GCTTTTTCAA GAAGACCTTT CGCAACATCC 12660 

AATTTGTCTT CTTCTACAAG TGAGTTACCG ATTTCGATAC CTTGTGCTTT GTAGAATGTG 12720 

TAAGTCATCC CACCACCGAT AAGGACGTTA T C AG CTTTTT CAAGCAAGTT TTCGATAACA 12780 

CCGATCTTGT CTGAAACTTT TGAACCACCA AGGATAGCCA CGAATGGACG TTCTGGAGTT 12 840 

TCAACTGCTT CTTGGATGTA GGCAATTTCG TTTTCAAGAA GGAAACCAGC AACTGCTTTT 12 900 

TCAACGTTTG CTGAGATACC AACGTTAGAT GCGTGTGCAC GGTGAGCTGT ACCGAATGCA 12960 

TCGTTTACGA AGATACCATC TCCAAGTGAT GCCCAGTATT TACCAAGTTC AGGATCGTTT 13020 

TTAGATTCTT TCTTGCCGTC AACATCTTCG TAACGAGTGT TTTCAACCAA GAGAACTTGT 13 080 

C CAT CTTC AA GAGCGTTGAT TGCCGCTTCT AATTCAGCAC CACGAGTGAC ACCTGGGAAA 13140 

ACAACATCTT GACCAAGTTT TGCTGCCAAG TCAGCTGCTA CAGGAGCAAG TGATTTACCA 13200 

GCTTTATCAG C TT CTTCn T CACACGTCCA AGGTGAGAGA AAAGAATTGC ACGTCCACCT 13260 

TGTT CGATGA TGTACTTAAT AGTTGGAAGA GCTGCTGTGA TACGGTTATC GTTAGTGATT 13320 

ACGC CATCTT TCAATGGTAC GTTGAAGTCA ACACGAACGA GGACTT TT TT ACCTTTCAAG 13 380 
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TCAACGTCTT TAACAGTAAG TTTTGCCATG TTACAAAAAC TCCGG 
(2) INFORMATION FOR SEQ ID NO: 152: 

<i> SEQUENCE CHARACTERISTICS: 

{A) LENGTH; 905 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
GATTTATCCT ACCGGnGAAT TTCCGGAGGG GTTCTAGCAG CAATCTTAGG AATCTATGAA 60 

CGAATGATTG GCTTTCTGGC CCATCCCTTT AAAGACTTTA AAGAAAATGT TTTGTACTTT 120 

ATTCCAGTTG CCATCGGTAT GCTTCTGGGA ATCGGCTTAT TTTCCTACCC GATTGAATAC 180 

CTGCTTGAAA ATTATCAGGT TTTTGTATTA TGGAGCTTTG CGGGAGCTAT TATCGGTACA 24 0 

GTTCCTAGCC TCCTCAAAGA ATCAACTCGA GAATCTGACC GAG ACAAG AT TGATTTAGCT 300 

TGGTTATGGA C AACCTTT AT CATTTCTGGA TTAGGACTCT ATGCCTTAAA TTTTGTCGTT 3 60 

GGAACCTTAA GCGCCAGCTT TCTTAACTTC GTCCTAGCAG GCGCACTATT GGCCCTTGGC 4 20 

GTCTTGGTTC CTGGCCTCAG CCCATCAAAT TTACTTTTGA TTTTGGGACT CTATGCTCCT 4 80 

ATGTTGACTG CTTTTAAAAC TTTTGATTTC TTGGGAACCT TCTTTCC GAT TGGAATTGGT 540 

GCAGGTGCAA CTCTCATCGT TTTTTCAAAA TTGATAGATT ATGCCTTAAA CAACTACCAC 600 

TCACGCGTCT ATCATTTCAT CATCGGTATC GTCCTATCAA GTACCCTTTT GATCTTAATT 660 

C C AAATGC AG GAAACGCTGA AAGTATCCAA TACACAGCAC TTTCACTTGT CGGTTATGTC 720 

ATCATCGCCT TCTTCTTTGC GCTGGGAATC TGG CTTGGTA TTTGGATGAG TCAATTGGAG 780 

GATAAATATA AATAATGGCA AAAAAAGTTA AAATCAAAAA AACATTGGTG GAACAAATCC 84 0 

TATCTAAAGC AGCTATCCCT CATCAGGGGA TTCAAATCAA TGCCCTAGAA GGAGAGCTTC 900 
CTCAA 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4278 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



905 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
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CTTGAATTAA ATAAAAAACG TCATGCGACT 


' AAG C ATTTT A 


. CTGATAAGCT 


TGTTGATCCC 


60 




GTACGGCTAT 


' CGAAATTGCA 


, ACCTTAGCGC 


CAAGCGCCCA 


CAAC AG CC AG 


120 




' TTGTGGTGGT 


ACGTGAGAAA 


. AATGCTGAAC 


TGGCAAAGTT 


AG CTT ATGGT 


180 


4\.CAAl 1 i 1\J 


ft It ft >^ M ***** ft «**J*-1 

AACAGGTATC 


ATCAGCGCCT 


GTAACCATTG 


CCTTGTTTAC 


AGATACGGAC 


240 


1 1 AtaLLAAAC 


G TGCTCGT AA 


GATTGCCCGT 


GTTGGTGGTG 


CTAATAACTT 


TTCTGAAGAG 


300 


UVAL I 1T-AAT 


ATT TT ATG AA 


AAATCTGCCA 


GCTGAGTTTG 


CCCGTTACAG 


TGAGCAACAA 


360 


<j 1 LAbL U ACT 


ACCTAGCTCT 


CAATGCAGGT 


TTGGTTGCCA 


TGAACTTGGT 


TCTTGCATTG 


420 




GAATTGGTTC 


TAACATTATT 


CTTGGTTTTG 


ACAAATCAAA 


AGTTAATGAA 


480 


tjTTTuiGAAA 


TCG AAG AC CG 


TTTCCGCCCA 


GAACTCTTGA 


TCACAGTGGG 


TTATACAGAC 


540 


LrAAAAAl 1 GG 


AACCAAGCTA 


CCGCTTGCCA 


GTAGATGAAA 


TCATCGAGAA 


AAGATAGAAA 


600 




TGACAGCAAT 


TGATTTTACA 


GCAGAAGTAG 


AAAAACGCAA 


AGAAGACCTC 


660 


TTGGCTGACT 


TGTTTAGCCT 


TTTGGAAATC 


AATTCAGAAC 


GTGATGACAG 


CAAGGCTGAT 


720 


W_v. LAGCATC 


CATTTGGGCC 


TGGTCCAGTA 


AAAGCCTTGG 


AGAAATTCCT 


TGAAATCGCA 


780 


oACCGCGATG 


GCTACCCAAC 


TAAGAATGTT 


GATAACTATG 


CAGGACATTT 


TGAGTTTGGT 


840 


UA TCjCjAGAAG 


AAGTTCTCGG 


AATCTTTGCC 


CATATGGATG 


TGGTGCCTGC 


TGGTAGCGGT 


900 


iVKJoACACAG 


ACCCTTACAC 


ACCAACTATC 


AAAGATGGTC 


GCCTTTATGC 


GCGCGCGGCT 


960 


TCGGACGATA 


AGGGTCCTAC 


AACAGCTTGT 


TACTATGGTT 


TGAAAATCAT 


CAAAGAATTG 


1020 


GOTCI I CCAA 


CTTCTAAGAA 


AGTTCGCTTC 


ATCGTTGGAA 


CAGACGAAGA 


ATCAGGCTGG 


1080 


*jC AJL»AC A 1 


A CT ACT ACT T 


TGAGCACGTA 


GGACTTGCC A 


AACCAGATTT 


CGGTTTCTCA 


1140 


C C ALi ATG CTG 


AATTTCCAAT 


CATCAATGGT 


GAAAAAGGAA 


ATATCACGGA 


ATACCTCCAC 


1200 


I 1 TGCAGGAG 


AAAATACAGG 


TGTTGCCCGT 


CTTCACAGCT 


TTACAGGTGG 


TTTACGTGAA 


1260 


AATATGGTAC 


CAGAATCAGC 


AACAGCAGTC 


GTTTCAGGTG 


ACTTGGCTGA 


CTTGCAAGCT 


1320 


AAACTAGATG 


CCTTTGTTGC 


AGAACACAAA 


CT TAG AGGAG 


AACTCCAAGA 


AGAAGCTGGC 


1380 


AAATACAAGG 


TGACGATCAT 


TGGTAAATCA 


GCCCACGGTG 


CTATGCCTGC 


TTCAGGTGTC 


:.440 


AATGGCGCAA 


CTTACCTTGC 


CCTCTTCCTC 


AGCCAGTTTG 


GCTTTGCTGG 


TCCAGCCAAA 


1500 


GACTACCTTG 


ACATCGCAGG 


TAAAATTCTC 


TTGAACGATC 


ATGAGGGTGA 


AAATCTTAAG 


1560 


ATTGCTCATG 


TGGATGAAAA 


GATGGGTGCT 


CTTTCTATGA 


ATGCCGGCGT 


CTTCCACTTC 


1620 


GATGAAACAA 


GTGCTGATAA 


TACCATTGCC 


CTCAACATCC 


G CT ATCCAAA 


AGGAACAAGT 


1680 


CCAGAACAAA 


TCAAGTCAAT 


CCTTGAAAAC 


TTGCCAGTTG 


TTTCTGTTAG 


CCTGTCTGAA 


1740 


CACGGTCACA 


CGCCTCACTA 


TGTGCCAATG 


GAAGATCCAC 


TTGTGCAAAC 


CTTGTTGAAT 


1800 
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ATCTATGAAA AACAAACTGG CTTTAAAGGT CATGAACAAG TCATCGGTGG TGGAACCTTT I860 
GGTCGCTTGC TAGAACGCGG AGTTGCCTAC GGTGCTATGT TCCCAGACTC GATTGATACC 192 0 

ATGCACCAAG CCAATGAATT TATCGCCTTG GATGATCTTT TCCGAGCAGC AGCAATTTAT 1980 
GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC TTATGCTTGG 204 0 

ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG GCTATCATGG 2100 

CGGATTCGCA GAATGCCAGC TATACAGAGC GTGGCATTGA GCCTCTCTTT GCAGCGCCAA 2160 

AAACTGCTCG CATCAATATC ATCGGTCAGG CTCCGGGACT TAAAACTCAA GAAGCAGGCC 2 220 

TTTACTGGAA AG AT AAAAGT GGTGACCGCT TGCGGGACTG GCTAGGTCTG GATGAAGATA 2 2 80 

CCTTTTACAA TTCAGGTTAT TTTGCTGTTT TGCCTATGGA TTTCTACTTT CCAGGACATG 2 3 40 

GCAAGTCGGG TGATCTTCCG CCTCGTACAG GTTTTGCAGA AAAATGGCAT CCGCAGGTCT 24 00 

TACAGGAATT GCCTGATATT CAGTTAACCC TCTTGATTGG GCAATATGCC C AAGC CT ACT 2 4 60 

ATTTACAGGA GAAAATCAGT GGGAAGGTAA CGGAGAGGGT GAAACACTAT AAAGACTATC 2520 

TGCCAGCCTA TTTTCCGCTA GTTCACCCAT CACCACGAAA TCAAATCTGG ATGGCCAAAA 2 580 

AT CCTTGG TT TGAGGCAGAA GTAGTGCCAG ATTTGAAAAA AAGAATTAAA AC CATTTTAT 2 640 

AGTCAATGAA AATCAAAGAG CAAACTAGGA AGCTAGTCGT AGGCTGCTCA AAGTACAGCT 2700 

TTGAAGTTGC AGATAAAACT GACGAAGTCG GTAACATACG CACGGTAAGG CGACGCTGAC 27 60 

GTGGTTTGAA GAGATTTTCG AAGAGTATTA GAAGAAAAAG AATGAAAGAA ATAGCCTTTG 2 820 

ACGCATTTTA CCAGCTTTAC CAAAACGACC AGCTTTCTTT AGTGGATGTG AGAGAAGTGG 2 8 80 

ATGAGTTTGC AGCTCTTCAT TTAGAAGGTG CCCACAACCT ACCGCTTAGT CAATTGGCTG 2 940 

ATAGTTATGA TTAATTGGAC AAAGATCGCT TGCATTATAT TATTTGCAAA TCTGGAATGA 3000 

GATCGGCGCG TGCTTGC CAA TTCCTATTAG AACAAGGTTA TAATGTTATC AATGTCCAGG 3 060 

GTGGCATGTT AGCCTTTGAA GAACTTTAAA ATTTTGCATT TCTCCTACTT GGTGTGGACT 3120 

GGGTAGGAGA GTTTTATTTT TAGATAATTC TTATTTTTAA GAAAATTGAA AACATTTAAT 3180 

ATTTGCCTCG TGATGCTTTT TTCAGACTCC TAATCGTGGT ATACTAGGTC AGTATTTTAT 3240 

AAATATGAAG GAGATTTTTA TGGCTAAAAA AGGTACCCTA ACAGGTTTGC TCCTGTTTGG 3300 

AATATTTTTT GGTGCGGGGA ACTTGATTTT TCCGCCTTCT CTAGGTGCTC TATCTGGAGA 3360 

ACATTTTCTT CCTGCCATCG CAGGTTTTGT CTTTTCAGGC GTTGGTATCG CCGTCTTGAC 3420 

CCTTATTATT GGAACGCTAA AT CCTAAAGG ATATATCTAC GAGATTTCAA CGAAGATAGC 3 480 

GCCTTGGTTT GCGACTCTTT ACCTCTCAGT TCTTTACTTG TCAATCGGTC CATTCTTTGC 3 540 
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TACCCCACGT ACTGCTACAA CAGCTTACGA AGTAGGGATT AGCCCCCTTT TGTCGGATGC 3 600 

AAATAAAGGA CTTGGCTTGA TTGTATTTAC GGTTCTGTAT TTTGCGGCAG CCTATTTGAT 3 660 

TTCGCTTAAT CCATCAAAAA TCTTAGACCG CATTGGACGT ATTTTAACGC CAGTCTTTGC 3720 

AATTTTGATT GTTATCTTGG TCGTTCTGGG AGCT ATC AAA TATGGTGGAA CAAGTCCTCA 3 7 BO 

AGCTGCTTCA CTCCTTATCA ACCTTCTGCC TTTGGTACAG GTTTCCTAGA AGGTTACAAT 3 840 

ACCTTGGACG CCCTTGCCTC AGTGGCCTTT AGCGTAATCG CAGTTCAAAC CTTGAAACAA 3 900 

CTTGGATTTT CAAGTAAGAA AGAATACATT TCAACTATTT GGGTTGTTGG TATCGTTGTT 3 960 

GCCCTTGCCT TCAGCGCTCT TTACATCGGT TTAGGTTTTC TTGGAAATCA TTTCCCAGTA 4020 

CCAGCTGAAG CGATGAAGGG TGGAACACCA GGTGTTTACA TCTTGTCACA AG CC ACT CAA 4080 

GAAATCTTTG GCTCAACAGC TCAACTCTTC CTTGCAGCTA TGGTTACCGT AACCTGCTTC 414 0 

ACAACGACTG TTGGTTTGAT TGTGTCAACA G CTGAGTTCT TTAATG AGC G CTTCCCACAA 4 200 

ATCAGCTACA AGGTTTATGC GACAGCCTTT ACCTTGATTG GATTTGCTAT TGCCAATTTG 42 60 

GGTCTTGATG CGATTATC 42 7 8 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACCCGATCAA ATGACAAAAG CTAACTTTGG TGTCGTAGGT ATGGCCGTAA TGGGTCGTAA 60 

CCTTGCCCTT AATATTGAAT CTCGTGGTTA CACAGTTGCT ATCTACAACC GTAGTAAAGA 120 

AAAAACGGAA GATGTGATTG CTTGCCATCC TGAAAAGAAC TTTGTACCAA GCTATGACGT 180 

TGAAAGTTTT GTAAACTCAA TCGAAAAACC TCGTCGTATC ATGCTGATGG TTCAAGCTGG 240 

AC CTGGT AC A GATGCTACTA TCCAAGCCCT TCTTCCACAC CTTGACAAGG GTGATATCTT 3 00 

GATTGACGGA GGAAATACTT TCTACAAAGA TACCATCCGT CGTAATGAAG AATTGGCAAA 360 

CTCTGGTATC AACTTTATCG GTACTGGGGT TTCTGGTGGT GAAAAAGGTG CCCTTGAAGG 4 20 

TCCTTCTATC ATGCCTGGTG GACAAAAAGA AGCCTACGAA TTGGTTGCGG ATGTTCTTGA 4 80 

AGAAATCTCA GCTAAAGCAC CAGAAGATGG CAAACCATGT GTGACTTACA TCGGTCCTGA 54 0 

TGGAGCTGGT CACTATGTGA AAATGGTTCA CAATGGTATT GAGTACGGTG ATATGCAATT 600 

GATCGCAGAA AGCTATGACT TGATGCAACA CTTGCTAGGC CTTTCTGCAG AAGATATGGC 660 
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TGAAATCTTT 


ACTGAGTGGA 


ACAAGGGTCA 


ATT AG AC AG C 


TACTTGATTG 


AAATCACAGC 


720 


TCATATCTTG 


AGCCGTAAAG 


ACGATGAAGG 


CCAAGATGCA 


CCAATCGTAG 


ACTACATCCT 


780 


TGATGCTGCA 


GGTAACAAGG 


GAACTGGTAA 


ATGGACTAGC 


CAATCATCTC 


TTGACCTTGG 


840 


TCTACCATTG 


TCACTGATTA 


CTGAGTCAGT 


GTTTGCACGC 


TACATTTCAA 


CTTACAAAGA 


900 


AGAACGTGTA 


CATGCTAGCA 


AGGTGCTTCC 


AAAAC CAGCT 


GCCTTCAACT 


TTGAAGGAGA 


960 


CAAGGCTGAA 


TTGATTGAAA 


AGATCCGTCA 


AGCCCTTTAC 


TTCTCAAAAA 


TCATTTCATA 


1020 


CGCACAAGGA 


TTTGCTCAAT 


TGCGTGTAGC 


CTCTAAAGAA 


AACAACTGGA 


ACTTGCCATT 


1080 


TGCAGATATC 


GCATCTATCT 


GGCGTGATGG 


CTGTATCATC 


CGTTCTCGTT 


TCTTGCAAAA 


1140 


GATTACAGAT 


GCTTACAACC 


GCGATGCAGA 


TCTTGCCAAC 


CTTCTTTTGG 


ACGAGTACTT 


1200 


CTTGGATGTT 


ACTGCTAAGT 


ACCAACAAGC 


AGTACGTGAT 


ATCGT AG CTC 


TTGCGGTTCA 


1260 


AGCAGGTGTG 


CCAGTGCCAA 


CTTTCTCAGC 


AG CT ATT ACT 


TACTTTGATA 


GCTACCGTTC 


1320 


AGCTGACCTT 


CCAGCTAACT 


TGATCCAAGC 


ACAACGTGAC 


TACTTTGGTG 


CTCACACTTA 


1380 


CCAACGTAAA 


GACAAAGAAG 


GAACCTTCCA 


CTACTCTTGG 


TATGACGAAA 


AATAAGTAGG 


1440 


TCAGCCATGG 


GGAAACGGAT 


TTTATTACTT 


GAGAAAGAAC 


CAAATCTACC 


T C ATTTTTT A 


1500 


AGTTTGGAAC 


TC CAGAAAG A 


GCAGTATCGG 


GTTGATCTGG 


TAGAGGAGGG 


GCAAAAAGCC 


1560 


CTCTCCATGG 


CTCTTCAGAC 


AGACTATGAT 


TTGATGTTAT 


TGAACGTTAA 


TCTGGGAGAT 


1620 


ATGATGGCTC 


AGGATTTTGC 


AGAAAAATTG 


AGCCGAACTA 


AACCTGCCTC 


AGTCATCATG 


1680 


ATTTTAGATC 


ATTGGGAAGA 


CTTGCAAGAA 


GAGCTGGAAG 


TTGTTCAGCG 


TTTTGCAGTT 


1740 


TCATACATCT 


ATAAGCCAGT 


CCTTATCGAA 


AATCTGGTAG 


CGCGTATTTC 


GGCGATCTTC 


1800 


CGAGGTCGGG 


ACTTCATTGA 


TCAACACTGC 


AGTCTGATGA 


AAGTTCCAAG 


GACCTACCGC 


1860 


AATCT T AGG A 


TAGATGTTGA 


ACATCACACG 


GTTTATCGTG 


GTGAAGAGAT 


GATTGCTCTG 


1920 


ACACGCCGTG 


AGTATGACCT 


TTTGGCGACA 


CGG 






1953 


(2) INFORMATION FOR SEQ ID NO: 155: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6474 base pairs 
<B> TYPE: nucleic acid 
(C) STRAND EDNESS : double 
(DJ TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
CCGGCAGTAC ACGAGCTTGG GGAACAGCCA CTGGAACGAT G AGGTGTG AG CTCAAAATAT 



60 
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CCTCCAGTTA 


TGTTTTTCCT 


AATAGTATAC 


CGGAAGAGTG 


AAAGGATTTT 


ATAATGGACC 


120 


GGTTACAAAG 


AACCTACTTT 


CTATTAAACA 


GTATACTATG 


AAAATGTGAA 


AATTTAACAT 


180 


TTTTTTGTAC 


AAATTTTATA 


AATTATTGCC 


TTTTTAATAT 


CAATAGTTAA 


TCTCTTATCC 


240 


AGATCCCCCT 


TGTGTAAACT 


TTATCTTTAT 


AAGCTTCAAG 


GCCCCTATCC 


CATCTATTTG 


300 


CAACAATTAG 


ATCACTTTGT 


TTTGTAAATA 


GTTCAAAATT 


CTTTTCAATA 


ATTACGTTAT 


360 


CTATACTAAC 


GTTTAAATTT 


GGTTCATATA 


CTAAAATTTT 


TATACCGACA 


ATCAATAGTT 


420 


CATTAATTAT 


ACTTAAAATA 


GCTGAC TCTT 


TGTAATTATC 


TG AATT AT AT 


TTCATCCCCA 


4S0 


ATTTATATAT 


TCCTACTATC 


TTTGGCTTTC 


GTTCCAATAT 


TTCTTTAACT 


ATGAACTGTT 


540 


TTCTATTTGT 


GTTTGAAATA 


TCAATCGCTT 


CTATCACTGG 


GGCATTTATT 


TCTATAAATT 


600 


CTTTTTTTAA 


TTGTTTAGTA 


TCTTTGGGAA 


GACAATATCC 


TCCAAATCCA 


AAAGAAGGAT 


660 


TATTATAAAA 


ATTTCCAATT 


CTTGG AT CT A 


AACAAACACC 


TTTTATTACA 


ACTTCAGCAT 


720 


TTAAGCTTCT 


CCTCTCAGCA 


AAAGAAT CT A 


GTTCATTAAA 


AAAGCAACAC 


GGAGAGCTAA 


780 


GAATGTGTTA 


GAAAAAAGCT 


TAATTGCTTC 


TGCTTCAGTA 


GGAGAAACTA 


ACATAACATT 


840 


TTTAATATTG 


GCAGTACTAT 


GAGTACTAAT 


CGAAAGGAAC 


AACTCTGCAA 


TTTTTCTTCC 


900 


TTCAACTGTC 


TCATCTCCAA 


CAACTATGCG 


ACTTGGATAT 


AAATTATCAT 


ATATAGAACA 


960 


ACCTTCTCTC 


AAAAATTCAG 


GGACAAAAAT 


GATATTTTTT 


CTATCAAACA 


GCCTTTTTAA 


1020 


TTTGTTTGAA 


AAGCCGATCG 


GAACTGTTGA 


CTTTAAAATA 


ATCTTTCCAT 


TAGGTTTTAC 


1080 


CCTCAGAATC 


TTCGATACCG 


TTTGTTCGAT 


TTCATATGTA 


TTAAAACTAC 


CAATTTTCTC 


1140 


ATCATAATCT 


GTCGGAAGCG 


CAATAATATA 


ATAATCAATA 


TTATTTTTAA 


TTTCAGAAAA 


1200 


TGTATCAAAA 


AAAGTAATAT 


TTAAGTTATT 


CTCGCAAAAA 


AACTTCATAA 


GCTCTTCATT 


1260 


TTTAGATGGA 


AGAATGCCCT 


TTTTTAAATT 


ATTTATTTTT 


ACAGAATCTA 


TATCATATGC 


1320 


AACAACTTTA 


TATTTAGATG 


CAAATAGTAA 


CGCGTAGGCC 


AGCCCAACAT 


GCCCCAAACC 


1380 


AATT ACT GCT 


ATATTCATAA 


AACTACTTCC 


TTATTTCTTA 


ATCCAAAATC 


TAATAGAAVA 


1440 


AGCTGCCCCA 


TTCCTTAAAT 


ACAACTCTTT 


AATATTGTTT 


AAAAGTTTTT 


CAACTGATTT 


1500 


CCAGATTATC 


AAAATCTGAG 


ATTTATAGCA 


CAATATTGAT 


GATATTCTAT 


CAATATAATT 


1560 


TTTTTCATCA 


AGTTCCTCTT 


GATACATTTT 


TAATTCTTTA 


GTTTTTCCCA 


TATAACTAAC 


1620 


CATACTACTA 


TCACTTACAT 


ATGGGAAGTC 


CTCATAATAT 


ATTACTTTAT 


AACGCATAAA 


1680 


TTCAAGCGCC 


CTTCCAATAC 


TATTCACAAA 


AACATGAGCA 


ACATGGTCAC 


CAAGTGAAAG 


1740 


CGGACAATAT 


ACGACACATT 


TGTCGTCTAA 


ATGCATTAAC . 


AGCTCTTTTA 


TGATATCATT 


1800 


CTTTAATGTG 


TCCTCATTTT 


TTAATTCACT 


ATAGATATGA 


CGGTATAGAA 


AATTGCCATT 


1860 
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TCTATCTTTC 


CTATAGAGAC 


ATTCAT AG T A 


CGATAAGTGT 


CTAAAATCAC 


ATTGTACACC 


1920 


TTCACAACCT 


AACCTGTCTT 


CTTTCTTCCT 


TTCTTCAATC 


GGATATTTCC 


CAAGGTTACA 


1980 


CAACTTATGA 


AATTGCTTAG 


CAGAGGGCTG 


TAGCTGTTCG 


CTCAAAGGGT 


AACCAGAAAA 


2040 


TATAGTAATA 


ACAAGTACAA 


TTTCTCCTTC 


TGAAGTTAAT 


TTTGAAATAT 


AATCACCACA 


2100 


GGAAAAAATT 


GCGTCATCTA 


AATGTGGAGA 


TAAAAAGATA 


TACTTAGTAT. 


TGTTACTCAT 


2160 


AACCATTCCC 


TCTACAATTT 


ATCTAAAAAC 


TCACTAAGTG 


TCTGATTAAA 


TTCCACATCA 


2220 


TCAAAAAAAT 


TCACCTTATT 


CTTAATAATG 


AATATTTCGT 


TAAATAAACA 


TATATATAAA 


2280 


TATTTCAATA 


TCCTTTCAAT 


ATCATCCTCT 


AAATTCTCCT 


CAATATTTTC 


TATCAGCCCA 


2340 


TTTACAATCT 


TATTAAAAAA 


GATAAGCTCT 


TTATCTCTAA 


AATTAAATAT 


TTTCATACAA 


2400 


CTGTTGTATC 


GAAAAATATA 


TAAAATAATT 


TTTACTAATG 


TTTGAATATT 


TAAACAACTA 


2460 


AATAAATGAG 


TTGTACCCGG 


GACACTATTT 


ATGTTATCAA 


GAACACTATC 


TTGAAACCTC 


2520 


AACTC AC AGT 


TCTTTTTGTG 


AAATTCTTTT 


TTATCGTTTA 


GATCTGATAT 


TTTTTTAGAC 


2S80 


ATTTCAACAA 


TCTCAGACAT 


TTTATATGGA 


TATCTAGGAT 


GAATGC CAAA 


ACT ATGC AAA 


2640 


ATGAACTGCA 


CCCCAAAAGT 


T AGACAG AAT 


AAATCTAACT 


TTTGGGGTGC 


AGTTCATAAG 


2700 


ATTGGGATAT 


TTTTTTTTAG 


CTAGAACTAG 


TAGAAATATA 


TAGTCAAATA 


ACAGATACCT 


2760 


TAAGGGTTTC 


TCATCTACAT 


AAAAAAATGA 


TACTTTTTTC 


T CTTC AGTAA 


TTACCTCATA 


2820 


AGCTTCACAA 


TAGAATCTCA 


TGTTTCCCTC 


CCCTATATTC 


TTAAATAAAA 


TCCTTTGGAA 


2880 


ATTGATATAT 


CTTAGTAAAA 


TATTGTTTAA 


GTTCCGGATG 


CGGAGCATGG 


GTAACAATAA 


2940 


TGACAGTCAA 


ATCCTCTCTA 


TCTAATATCT 


TACGTTCAAT 


CGCTAACGAA 


GTTCTCCTAT 


3000 


CGATAGCAGA 


AGTTCCCTCG 


TCAATTAATA 


CTATTTTCTT 


ATTTCTAATT 


AGCCCTCTAG 


3060 


CTAAAGTAAT 


TTTTTGTTTC 


TGCCCTCCTG 


ACAGTAATCT 


CCCATCATCA 


CC AAC AT AAT 


3120 


AATCTAAAAT 


GTTATTAGGA 


AAATCTTTTA 


CACTCAAACC 


AACTTGCTCT 


AAAGACTGTA 


3180 


GTATTTCTTC 


ATCAGTATAA 


TTTTCTTCCA 


ATAAAATATT 


ATCTCTAATC 


GTACCTTCAA 


3240 


ACAAATAAGC 


TTTTTGATCT 


ACATATAGAA 


CATTCGAAAC 


CATATTTAAA 


TAGGAGGTTT 


3300 


TTTTTATATC 


ATCCCCGCAG 


AATCGCAATT 


CTCCACTATA 


ATCTCTCAAA 


AAGCCATTCA 


3360 


ATAATTTTAA 


TAATGTAGAT 


TTCCCGCTTC 


CACTTTCACC 


TAAAATTAAA 


TACTTTTCAT 


3420 


TACGTTGAAA 


ACAAAAATTT 


AAGTTTTTTA 


ATATTTCTTT 


ATCTCCATAC 


TTATAGCAAA 


3480 


TATTTTTTGC 


TTCATATAAC 


GGAAAATCTC 


TATTCACCTC 


ATTTGGTTCG 


ATATCATTCA 


3540 


TTTTATTTGA 


CTCAATTGGA 


TTAATTGAAT 


ACAATTTTAA 


AAAAATAGGC 


TTCGTACCAA 


3600 
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TAATAGAGGA 


TAATTGACCT 


CCTAATTCAC 


CTAGCGCTGT 


AAAAATAACA 


CCTGTTAGTG 


3660 


CTCCTATTGC 


TTCAATAGTA 


CCAATTTTCA 


CTATTCCTTT 


TATTGCAAGA 


TAGCCTGTTA 


3720 


AAAAAACGAG 


AGAT ATCTG A 


AAAAAAATAT 


TGAGAAAGAA 


GCTAATAGCG 


CCTGCTAACG 


3780 


TTTCTACAGT 


TGTCTTTCTT 


TGTATAACCA 


TCTTTAATAA 


AATTCCTGCT 


TCTTTAATTT 


3840 


TCTTAGGCAA 


TACATATAAA 


AGATTCAAGG 


ACGCTAACAC 


ATCAAATCCA 


TTCAATATAG 


3900 


TCTCACTAGA 


TTTTAAAAAA 


GCTTCATTTT 


GGTTAGTTAA 


ATTTAGACTA 


ACTTCTCGCA 


3960 


TTTTCGATGC 


AAAGATTTTT 


GGTACAAGTA 


GCATAATCAT 


TAATGAAAAC 


AAGGTGGCTA 


4020 


CAGTCAATGA 


CCAATGATAG 


TGATTAAGAG 


TCACAACTGC 


AAAT AT AGT A 


CCAGAAATTC 


4080 


CTTTTATTAC 


TAAAAAAAGT 


TGTTTAAACG 


CCTGATCATT 


TAAAGTCTGA 


ACATCATTAT 


4140 


TTAGCCACGA 


AAGATATGTT 


CCTGATGATT 


TACT ATG AAA 


TTCTTGATAG 


GTAGAGTTAG 


4200 


AGATGTCTGT 


GGCAACTCTA 


TTTCGAATCT 


CTAGATTAAA 


CTCTTGGATC 


ACTTCAACCT 


4260 


GATAATTTTT 


CACTACCCAG 


TCAAGGAATA 


TTATCCCACA 


CCAGACAATC 


ATTTGGTAGA 


4320 


TTGACAATTT 


CAAAAACCGC 


TCTAAATTCA 


TCGCAATTAA 


TTCATTCAAC 


ACCAGAGCAT 


4380 


TAATAGTTGC 


TGCATAAATT 


AGCAATAATT 


GACCAGCAAC 


AATAAATATC 


GTTAATAAAC 


4440 


TAAATTTTTT 


TATATTTGAT 


TTTATAATAG 


TATACACAAT 


AGTTTCTCAC 


TTTCTAAATT 


4500 


TTAATTGAAC 


ATAGTTTTCA 


TATATACAAT 


AGAAAAAACC 


AAAATGATAT 


AATAACATAT 


4560 


ATTTCAAAAA 


AGAAATTCGT 


TAAAAATTTT 


TTCTTCTCTT 


GCCTTCTTGA 


TTACTTTTAA 


4620 


AGCCTTGCAT 


TTGTCTCCTA 


TTAATAGTAA 


CCGCTTTATG 


TTTAAAGAAT 


AATATTTCTT 


4680 


TGTAACCAAT 


ATTCTCTCGT 


TGAAACTCAA 


TAAATTAAAA 


TATTTCCTAC 


AGTAATTATA 


4740 


ATATTCTTCA 


TCTGCATTAA 


TTGTTTTTTG 


TGTCACTCCA 


GTGATACCGT 


TTTCTTTACT 


4800 


GTGAGCGTAG 


TAATTCACCA 


AG AATTCT C G 


CACTATATCA 


ATTTGGTATC 


CTTGAACAAG 


4860 


TAGTTTTAAT 


AAAACAACAC 


CGTCCTGATG 


TGAATCTATT 


TTCTCAAAAC 


CATTAATTAA 


4920 


TTCTAGCACC 


TCTTTTTTAC 


ACAACCAAAA 


TGACGTACCT 


GCTATATTGT 


GAACCATTTG 


4980 


AACAAACAAG 


GGATTTCCAA 


CAAAATCGGT 


CTTCTCCTCT 


TCTCGTGTAC 


CATTTGGATA 


5040 


AATTATTATT 


CCATAACTAC 


AAACTAAAGC 


TAAATTCTTC 


ATTCTACTCT 


TTTTAAAACA 


5100 


AGCCATCAAC 


TTTAAAATTC 


GATCTGGCAT 


ATATTCATCA 


TCATCGTCTA 


AAAATGATAT 


5160 


ATACTTACCT 


CTAGAATTTT 


TGATACCTAT 


GTTTCTGGCA 


TTAGTTGCAC 


CTAAATCTTC 


5220 


ATTACTTAAA 


ATTAACTTAA 


TTCTATGATT 


GGTATAGCCA 


AATTGATGGA 


TAATTTTATT 


5280 


TCTTAAATTT 


ACATTACTAT 


AATTATCATC 


AATAATTATA 


ACTTCGATAT 


TTTTATAACT 


5340 


TTGATGTAAA 


CAACTTTTCA 


CAGCTCTAAT 


CAGAGATTCA 


TACCTATTAT 


GTGTTGGTAT 


5400 
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TATAATACTT ACTAATTCTT GATCTATATT CCTATCCATG ACTACTCTTC TCTAATAATT 54 60 

CATCATATAC TCTCATGGTT TCTACAAACA TTTTTTGCAC AGAAAAATGT TTTCTTATTT 5520 

TTGATTTACT ATTCTCACCT ATATATTTCA AATACTCAGA ATCATTGAGT AAAAAATTAG 5580 

CACAAGCACA CACTCCCTCA ACATCTTCCT TCTCAAATAA AAATCCATCA ACCCTATGTT 5640 

CAATAATTTC ACTTAACCCG CCAACATTAC TACCTAAAAC CGGAGTTCCT TGTGACATTG 5700 

ACTCTAAAAC ACACATAGGT ATTCCTTCTG TATCAGAAGG AAT AT ACAAT AAATC CGAT A 5760 

TTTGGTAAAC TATAGTAGCT GGATAGATTT CACCAAGTAA CCTGAAATTA TCTCTACATT 5820 

TCAAATGGCA AATTTTTTCT TTCAAAGCAG CCCACATACT ACCATTTCCA GCCATAATAA 5880 

AAATCACATC TTCTCTGACT AAAAATAATT TTTCTGCAAA TTCAAGGAAT CTATCCGGCC 5940 

TTTTTTCTGG ATCCAACCTT CCAACATAAC AAATGATTTT TTGTTATTTG GAATACAAAA 6000 

T TCTTTTTT A AAGTCTTGAA CACCTACTAC ATCTAAATCC CTATTTGATA CATTAATTCC 6060 

GTTATTTATT GCAACTATCT TCTTATTTTT TATTATACTC TCCAATCTTT TTTTTCATAG 612 0 

TTT C AG AT AC ACAAATAAAA GCATCTCCCA TAGAATATGT CCAAAAATCA AAATAAGTCA 6180 

AGAATTTCTT TTTTAAGTTA TATTCAACCC ATCCATGGCA TGTTATCACT GTCTTAACCT 624 0 

TTCCAAATCC ATTCTTGTCA AGTTTTTTTA ACATATATAA AAAATAATTA GTTGAGTACC 6300 

CATGACAGTG TATAAGTTGG ATTTTTAATA ATTTTAAAAT ATTTTTAACG TGTAAGGCAG 6360 

TTTCAAAATT ATTTGAACAT TGAGTACAAT CAACATAGGC AATATCTAAA TTTTTATAAT 6420 

CATCAATAAC CTTTGAATCT CTAGATACAA TT AT C AAAAT AGGGAATAGA GACA 6474. 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4792 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

TATTTAACGA TTTTTTTCAT GTCATTTCCT CCAAAATAGA ATACCTTATA ATCTTAACAG 60 

AAAAAGAGCA TTTACGCCAT TATATGATAT CTATCTCTGT GATAAGTTTT TTTTATGGGT 120 

AATTTAAAAG ACCAAACGCA AGATGGCAAT CAAGACCACT CCAAAGAGAA CTGTTCCGAC 180 

TAGATTGCGG TAGCGAAAGG CTACCCAAGC TGTTGGAAAG ACGGCTAAGA AGTCCAGTCA 240 

TTTGATTTGA GGAAGACTGC CAACCTTACC TGTCACTACG CTTGAAAGAA T C AGGGC AAA 300 
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GATAATGGAA 


ACAGGCAAAA 


ACTTCAAAAA 


ACGCTCAACA 


ATCGCAGGCA 


GGCCCTTATA 


360 


CTTGACCAAG 


ATGAAGGGAA 


TCATACGGGG 


AATCCAAGTC 


ACCAAGCCAG 


AGAAAATAAC 


420 


TGCTAATAAA 


AGATACTTAC 


TGACCATCTA 


AAACCACCCC 


CATGCTACAA 


CCAAGTAGCG 


480 


TCGCAAACAG 


AACAGCTAGT 


GACTGAGACA 


TCACTGTCAA 


GAG CAAAAAG 


AAGGACACCG 


S40 


CAACAACTGC 


TAGGATAATG 


AGCAGATTGC 


GGACAGGAAT 


CCCTCTTTGC 


ATAATCTGAA 


600 


ATTGCGAAGC 


AAAATACCAA 


TAAACATCCC 


AACCAGGGCA 


AAATCCAAGC 


CAAAGATTTC 


660 


TCCATTTGCT 


AGCAGGCCAC 


CCAGAGCCGT 


TCCGACTACT 


GTCCCCACAA 


ACCAAGCCAC 


720 


ATAGCTGTTA 


AG ATTGTTTC 


CGTGCATCCA 


CATAGGATTT 


ACCTTGTCTG 


TATGGGCCAA 


780 


TTCACCCATC 


AAAACGCCAT 


AGGTCTCATC 


TGTCAAGATA 


CT AG AC AT AC 


CGATATTGTA 


840 


CCAAAGACTG 


GTATGACGGA 


AATAAGTCGA 


TGCGTGTAAA 


CTCAACAAAA 


AGAGACGCAA 


900 


GTTGATTAGA 


AAAACCGTCA 


TAGCAATAGC 


TGCCACAGGA 


GCTTGAACCA 


CAATCAGTGC 


960 


CAACATGGCA 


AACTGGGCAC 


TCCCACCATA 


AACAAAGAGA 


CTCATCAAGC 


CCATCTCAAC 


1020 


AGGTGTCACA 


TAGGGCGCAC 


C GAT AATTCC 


ACAGGCCAGG 


CCGATACTGA 


CATAGCCAAG 


1080 


AG CCGTTGGC 


ATGGCTGCCT 


GCGCCCCCTC 


CTAAAATCCT 


TTTTCTTTCA 


TCTTTCTCCT 


1140 


CATATTGTCT 


TAATAATACT 


CAATGAAAAT 


CAAAGAGCAA 


ACTAGGAAAC 


TAGCCGCAGG 


1200 


TTGCTCAAAA 


CACTGTTTTG 


AGCTTGCAGA 


TAGAACTGAT 


GAAGTCAGCT 


CAAAACACTG 


1260 


TTTTGAGGTT 


GTGGATAGAA 


CTGACGAAGT 


CAGCTCAAAA 


CACCGTTTTG 


AGGTTGTGGA 


1320 


TAGAACTGAC 


GAAGTCAGTA 


ACCATACCTA 


CGGCAAAGTG 


AAGCTGACGT 


GGTTTGAAGA 


1380 


GAGTTTCGAA 


GAGTACAAGT 


AGGCTGAAAA 


GAATCCAACC 


ACAGCATGGA 


CTATTATATA 


1440 


GCAGATTGAA 


ATAAGATGAG 


AACAAATCGA 


TTGGGAAAGT 


AAAATTAATT 


TCTATAAATG 


1500 


TTTTAGCAAT 


TGTTTCGTAC 


TATTTTAGAT 


TCAGTCTATT 


ATAACACATT 


CAGAAAAGAG 


1560 


AAAAAAGTCT 


GTTGATTTTG 


ACCATCATAA 


AAAGACTGGC 


AATCCAGTCT 


CAAACATATA 


1620 


TTATAGAAAT 


TCTCCACTAA 


AT ACTTT C AC 


GAATATTCAG 


AAGCATAACA 


AACGCAACTA 


1680 


GAAGAAATAG 


CAATAAAACA 


AAGCTAACTG 


CCAGAGTTCC 


AAAGCTAGTA 


GCAATGGTTA 


1740 


CCAAAGCTAT 


TGTAAATAAG 


CTAGGTAAAA 


CAACCGTAAT 


GGCACCGATA 


GAGGATTGAA 


1800 


CTGCTCCCAT 


TGACTCCTCA 


GGTATTTGTT 


TAAAAACGAG 


TTCTTG C AAT 


CTAGGAGAGA 


1860 


GAACACCTGC 


GAAAAAGGCA 


TCCAAGGTAC 


TAAAGATGAG 


AATCCAGTCA 


AAACGAACTG 


1920 


TGGCAAATCC 


TACTAGAAGA 


AGCAACTGGA 


TGACAAGTGA 


GGCATAGAGA 


GCTGTTTTTA 


1980 


TGGAAATGGT 


ATGTTGCAGA 


TAGCCACTTA 


CAAGGCTTCC 


GACAATCAGG 


GCTGATAATT 


2040 


CTAGTGTGGC 


TAACAAGGCA 


AGAGATTGAC 


CAGTTTGTAA 


ATTCAAAAAG 


GGCTGGTTCC 


2100 
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TTAAAAATAG AGTGCAAATA GGAACCGTAA CATTTATCAC TGCTTGACTA GTAGAGATAA 2160 

TAAACAAAAC CAAGAGCACC TTATTCATAT TCCATATCAA TTTCGATGAT TGGAGCAAAT 2 2 20 

GCTGGCAAAA GGATTTTACA GAGAGTCCTT CTTGATAGCT AATCGTTTTT TCTACTTTCA 22 80 

AGAGGTCAGT TTTTATGAAG AGGATACCTA AAAATGCGAT TAAAAAGGTA AGAGCGTTCA 2 3 40 

GTAAGGAAAT AAACTGGATG GATAGAATGC CTAGTAAGAC TCCTCCTAGG ATATTACTGA- 2400 

TTGTTTTCAC TAAACTAACA GTTGACTGTT TAAAGCCAAT AGCTTCTGCC AGATGGTCTT 2460 

GCCCAATAAT TCTAATGAAA ATCGGAGTGA GCATGGCGCC TGAAAAATAA CTCAATGTGT 2 520 

CAGACAAGAG GTTAATCAGA CAAATAAATG CTACTAGCAA CAAGGAGAAA GACTGCCCTG 2580 

AAAGTGATAA AGACACTATA GAGTAAAGCA AAAATTTTGC AAAACTAATG ACTGTGTATT 2 64 0 

TCAAGACACG ATGATGTTGA AAATCCGCCA AAACTCCCAG AAAGATTTGT AGAACTTGGG 2700 

GCAGGGTTTC TGAAATCGTG ATGAGTAAAA TCGCCAAAGG GGCAAAAGAT GCATCTGCCA 27 60 

C ATAATT C AG GAAGGCCAGA TAAAAAATCG TATCCCCAAG CGTTGAAATC CACTGGTTGA 2820 

TAGTTAATTG CCTAAAATCT CTATTTTGAA GAAATACTTT CATCACAACT CCTTCTTAAG 2880 

TTCAAATGGG AATCTTTCCC CAAGGATAGA CC GCG AT ACT ACTAACAACC AAAATTACAG 294 0 

TAACATCAAA AGCTGACCAA TGCCATTGTA GACTATATGC AGTCCAATAG GCCAATAAAT 3000 

TGACTTTGTC ATTCTAAATA AGACTGCAAA TATAAGACCT CCACCCATAT AGAAGACAAA 3060 

GTCTGTCAAG ACCCAACCGT GATTACTAAT GTGCGAGACC CCAAATAAAA CAGCGGAACC 3120 

AAGTACATCT AGCCCCCATT TCTTTCCTTT TTCCAGAGCA GTCATCACTA ATCCACGATA 3180 

AATCATGTCT TCAAAAATGG GACCTGCAAT CACAGGATAA AAAAAATACA TCAAAAATGC 32 40 

TGTAGCCCCT GTAAAAGTCG GAGCAGCATG TTGATAAGAA ATTTCATTTC GAGTAGGTGG 3 3 00 

GAAAAGAAAA AAGGTAACGA AATTCCAAAC AACAAAAGCA AG C AG AGCT A GGAAGGAATA 33 60 

GAAAAGATAG GATCCTTTAA ACTTTCTACT ATTGATTTTC TGCCATTTCC CCGACCAAAT 3 4 20 

CATAGCAATA AGAGCAAATA AAACCACAAG AAAATTCAAC ATCATATCCG ACAGATAATA 34 80 

GGCAAAGTCA GATAGCCCAG TAACAAGGTC GCTGCGTAAA ACT AG AAC AC TGAACTTCTG 3540 

GTCAGCAATA ACTAGTAGAA AAACTATAAT AAAGTAGCGG TGTGAGATTA TCTTTTTCAT 3 600 

ATATCACCTT TCTAATATCC AAATACCAAT AAAGTAACAA TGAGTAAGAA ACTATTCCAT 3 6 60 

GAAGCATGCA GAGCTATAGC CCAATAGATG GATCGGGTGT AGCGAAACAT CATACAAAAT 3720 

AT CAAGCCC A TTCCAAAATA CTTTATGAAA TCTGTCGTTA TCCAACCATA CTGCAAAACA 3780 

TGCATAGCGC CAAATATGGC AGCGGAAACA AGAAC AT CAA GATAGTATCT CTTAACTTTA 3840 
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GATAAACTTC TCATCAAAAG ACCACGACAA AC AAC CTCTT CTGATACAGG TGCGATAATA 3900 

CTAGTATAAA GTATTCGCGT AACAAAATAG CTAATTCCTG TTAAATTGGT GGCTACTTCT 3960 

ACGACTGTAC TTCCATTCTG GGTACGAGGA AAGATATAGG TTGTTAGATT TGCCCACACG 4020 

AACAATAAGA AAAAAGAAAG AAGGAAAACA CCCAGGTAAG ACCAACGAAA CTGGAAACGA 4080 

CCACACTCTT TCCAATGTTC ACTTTTGACA AAAGCAATTG TAGCTATAGT TCCCAGAATA 4140 

AGTACCAATA AAACTTGGAA CACATAGTAC AT ATT ATCAG ACAAAGCAAC CATAAAATCT 4200 

AAGTCTG AT G TGACATTAAA AATGAGGTAA TAAGTCAAAA TCAACAAGCC AGTTGCTAGG 4260 

TGAAATTTCA CTTCTTTCAT TTTCTTCATC CTATTATCTC CTATAAGAGC CTATCTTCTA 4320 

CGGCGGCCAA ACAATCCATC TGCTAAATCT ATAGTCCAAT CAAAAGCTCC ACGATTAGGA 4 3 80 

CTCATCCCTT GATTGCCCCA ACCAGGGTAA ATTCCTGGGA CGCCCCAACC AGATATACCA 4440 

CTTCTTCCAC CACCTCCCAT AGAATTTACG AGGTTGCCTC CTCTAACATC TTGCAACTCA 4 500 

GCTTCTGTCA ATTCCATTGT TTCTGCAAAT TGTAAATTTA ACATCTTTTA CACTCCTTCA 4560 

ATT ATCTTC A TTTGTAAACC ACTTCTGCGA CCTAGGATTT GCTTCAAGTG CTTTACAAGT 4 620 

ACAGTATAAC ACGAACATTG GCTTATTTTA G AAAATCG C A TATTTGATAT TTTTTCTTAT 4680 

AG AAATTT CA GATTTGCGAT TTTGGTGAAT TTGATTACTT CTCTGGTATA AT AAAGTT AC 47 40 

TACTAATGAG GAGTGGAGAA ATATGAAGAA ACAAATTTTA ACATTATTGA AA 4792 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

CCGTTCTCGG CGACGGCCAT CTGATGAAGC TATTTATGAG GGAAACTGGC AAGCTGGAGA 60 

GTCAGAGTAT CTAGTCTTTC ACCGATTGCT GTGGCAGCAG ATGTGCAGGG AAAAGGAGTT 120 

GCTCAAACCT TCTTAGAGGG CTTGATTGAA GGTTTTGATT ATCTTGATTT TCGCT CAG AT 180 

ACGCATGCTG AAAACAAGGT TATGCAACAT ATTTTTGAAA AACTTGGTTT TAAACAAGTC 240 

GGTAAGATGC CAGTAGATGG CGAACGCTTG GCCTATCAAG AATTAAAGAA ATAATGCAAA 300 

AGAAGTATGT AAAAATCCTC TACTCCTCAC CAATTGGTAT TCTATCACTT GTAGCTGATG 360 

ACCATTATTT GTATGGAATT TGGGTTCAGG AGCAGAAGCA TTTTGAG AG G GGACTAGGAG 420 

ATGAAACGAT AGAAGAAGTT GTTAGTCATC CTATTTTAGA CCCAGTTATT GCTTGCTTAG 480 
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ATGATTACTT TAAAGGCAAG CCTCAGGATT TATCCAACTT GCTCTTGGCG CCAATCGGAA 54 0 

CGAATTTTGA AAAGAGAGTT TGGGACTATT TACAGGGCAT TCCTTATGGT CAGACAGTGA 600 

CCTATGGACA AATTGCTCAA GACCTGCAAG TGGCTTCTGC TCAAGCAATT GGTGGAGCAG 660 

TGGGACGCAA TCCTTGGTCT ATCCTAGTAC CTTGTCATCG TGTGTTGGGA GCAGCCAACC 720 

GTCTGACAGG TTATGCTGCA GGAGTGGAAA AG AAAGCT T G GCTCTTGGAG CATGAAGGAG 780 

TAGATTTTAA AGATAGAAGC AATAGAAGGA GAAGCACATG TTAGAATTTA TCGAATACCC 840 

CAAATGTTCA ACTTGTAAAA AAGCAAAACA AGAATTAAAT CAATTAGGTG TGGACTATAA 9 00 

AGCCGTCCAT ATCGTGGAAG AAACACCTAG CCAAGAAGT C ATTTTGAATT GGCTAGAAAC 960 

CTCAGGATTT GAATTGAAGC AATTTTTCAA CACCAGTGGT AT C AAAT AC C GTGAATTAGG 1020 

GCTAAAAGAT AAGGTAGGAA GTTTGTCAAA CCAAGAAGCG GCTGAGTTGC TAGCAAGTGA 1080 

CGGTATGTTG TTAAAACGGC CCATTTTAGT AGAAAATGGA ACTGTTAAGC AAATCGGTTA 1140 

TCGAAAATCT TATGAGGAAC TGGGACTGAA ATAGTTTTTA TCTATCTCTT T GAT AG AT AA 1200 

AATATATAAC TTCCCTGTTT CAAAGTATGA TAAACTAGTA GGTAGACAAA GTCTGTATCT 12 60 

GACCGTAGCA AATAATTTCA TTGACGGCAG AAGCATGGTA GCATGAATCA TTATCAGAAG 1320 

AGGATGTTTT TATGAATGTT ACAACGATTT TAGCATCAGA TTGGTACCAA AACTTGATGC 1380 

AATTGATTCC GGATGGCAAG CTGTTTAGCC TACGTTCGGT CTTTGATGGA ATCCCTAGAA 144 0 

TTGTCCAACA ACTTCCAACA ACAATTATGT TGACAATTGG TGGTGCCCTT TTTGGCTTGG 1500 

TTTTGGCGCT TCTTTTTGCC ATTGTGAAGA TCAATCGTGT CAAGATTTTA TATCCCTTGC 1560 

AGGCCTTCTT TGTTAGTTTC TTAAAAGGGA CACcGATTTT GGTGCAACTC ATGTTGACCT 1620 

ACTACGGAAT CCCTTTGGCT TTGAAAGCCC TCAATCAGCA ATGGGGAACT GGTCTCAATA 1680 

TCAATGCGAT TCCAGCTGCA GCTTTTGCGA TTGTCGCCTT TGCCTTTAAT GAGGCAGCTT 174 0 

ATGCTAGTGA AACCATTCGT GCAGCCATTC TCTCAGTTAA TCC TGGTG AG ATTGAGGCGG 1800 

CACGCAGTCT GGGTATGACC CGAGCGCAAG TTTATCGACG AGTGATTATT CCTAATGCAG 18 60 

CGGTGCTAGC TACTCCAACC TTGATTAATT CCCTCATCGG TTTGACCAAG GGAACATCTC 1920 

TAGCTTTTAG TGCGGGTGTT GTGGAAGTCT TTGCCCAAGC TCAGATTCTA GGTGGAGCTG 19 80 

ATTATCGCTA TOTTGAACGC TTCATCTCCG TTGCCCTTGT TTATTGGGTA GTCAATATCG 2 04 0 

GAATTGAAAG CCTCGGTCGT TTCATCGAGA GAAAAATGGC TATTTCTGCA CCTGATACAG 2100 

TGCAACAGAT GTGAAAGGAG ACCTTCGTTA ATGATTAAGA TTTCGAATTT AAGCAA 215 6 
(2) INFORMATION FOR SEQ ID NO: 158: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


158: 






GTATCTCTAC 


AC AT GTOTT 


rvii\.UAi ill 


GTTGTCCTCC 


AATTTAATTC 


CTTATATGCT 


60 


TTGTCTGCAT 


TTGCATAACA 


AtTTTCrAAr^ 


TCTCCTGAAC 


GTCTTGGAAC 


TATTTTATAA 


120 


GGAAT AGGG A 


TCTTATTAAr 


t\\~ I I 1 UAAft 1 


GTATTTACAA 


GTTGTAATAC 


ACTAGTGCCT 


180 


TCTCCCGAGC 


CT AfinTT ATA 




TCTGTTTTTT 


CAGATACTTT 


TTCTAAAGCT 


240 


TTTATATGTC 


V. 1 f\ i. 1 \J^_ 1 AA 


*\ 1 V- A rVl_ i /\v,M 


TGGATATAAT 


CACGCACACC 


AGTACCATCA 


300 


AGCGTATCAT 




nAArArAfTT 


AGCTCTGATA 


GCTTACCTAC 


CGCTACTTGT 


360 


GCAATATAAG 


G CATC AAGTT 


ul 1 AVAlfin 1 1 


CCTGAGGGAT 


CTTCCCCAAT 


CAAACCAGAC 


420 


TCATGAGCAC 


CAATTGGATT 


HAAATAAr^A 


AGCAACGCAA 


TACTCCATTC 


TGAATCTGCC 


480 


ACATGAACAT 


CTTTTAAAAT 


i i vjv. i v^rtywjv^ 


ATCACTTTCG 


TATACCCATA 


AGGATTTGTC 


540 


GCACTTGTTT 






GACTGATTGT 


TAATTCCATA 


TACAGTCGCA 


600 


CTTGAAGAAA 


AGACAATCPT* 


TTTA Af*lV 
111 t\x\\^J\ 1 In 


AATTCTGACA 


TCACTTCAAC 


AAGTGCCAAT 


660 


GTACTCATAA 


TATTATTTTT 




ACAGGCTTTT 


GCACGGATTC 


TCCGACAGCT 


720 


TTATAACCTG 


CAAAATGAAT 


TfiPAfifATVA 


ATCGATTCTT 


GTTCAAATAC 


CTTTCTCAAT 


780 


GCTTGTTTAT 


CACAAACATC 


TAATTCGTAA 


AACACGGGAC 


GTATTCCTGT 


AATTGCTTCA 


840 


ATACGGTCTA 


G CACCAAG AT 


GCTAGAGTTC 


GAAAGGTTGT 


CGACAATGAT 


AACTTCCTTT 


900 


CCTAAATTTA 


GTAATTCTAC 


TACGGTATGG 


CTACCAATAT 


AACCAGCTCC 


GCCTGTTACC 


960 


AATATTGCCA 


TCTGGGTTTC 


CTCCTAATTA 


ATTCCAACCG 


ACTTAACAAA 


TCTCATAAAC 


1020 


GCTTCATGCC 


CAGACGGTGT 


ATTCTTATAA 


ACTCCTGCAT 


CTTCCAGAAC 


TCTCGCAAAC 


1080 


ACTTGTCCTG 


CTTCGTGTTG 


AACTACGCTA 


TTAACCTCTT 


CTTTATTAAT 


GCGAGGATAT 


1140 


TTTTCTTTCA 


ATTGGTCGGC 


CCATTCTAAA 


TGATAATCCG 


CAATTGCATT 


ATCCTCTCCT 


1200 


AAAAGATATT 


TTCCAACTTC 


TTCTAACTCT 


GGTTTCAAAC 


GAGGTGGTAA 


TATCGCAAGT 


1260 


CCCATCACTT 


CGATTAACCC 


GATATTTTCC 


TTTTTAATAT 


GTTGTACATC 


TTGATGAGGA 


1320 


TGGAAAACAC 


CATCTGGGTA 


TTGTTCAGTA 


GTATGATTAT 


CTCTTAGAAC 


AATATCTAAT 


1380 


TCGTATCTCC 


CGTCCACTTT 


ACGAGCAATA 


GGACTCACCG 


TATGGTGTGG 


GACATCTTCA 


1440 


GTCATAGCAA 


TGATGTCTAC 


TTCTAAATCT 


GAATATTCTC 


TCCACTT ATT 


TAGAATTTTA 


1500 
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GTAGCTAAAT 


CTAACAAGCG 


ATTTTTATTT 


TCACTTTGTA 


AC CTAATT AC 


TGACATTGGC 


1560 


CATTTTACAA 


TACCAGCATT 


AACATCCTCA 


AAGTCTTTAA 


AACAAAATTC 


ACTCTCAAAT 


1620 


TTTGCTTTTT 


CCATTGGGAA 


AATATGTTTC 


CCTCCCTGGT 


AGTGGTTATG 


ACTAAGAATG 


1680 


GAGCCTCCTG 


AGATAGGAAG 


ATCAGAATTT 


GAACCAGCAA 


AATATCCTGG 


CAAAATATCA 


1740 


ACAATCTCCA 


ATAATTCTTC 


AAATGTTTTA 


GAGGTAATAG 


CCATTGGTAC 


ATGTTGACTA 


1800 


TTCAAAAATA 


TCGCATGCTC 


ATTAAAGTAT 


GAGTAGGGAG 


AATACTGGAA 


TCCCCATACT 


1860 


TCGTCACCAA 


GTTTCAACCG 


AATAATTCTA 


TGATTCGAAC 


GTGCTGGATA 


ATTTATTCGC 


1920 


CCCTGATATC 


CTTCATTTTC 


CATACATAGT 


AAACATTTGG 


GATAATTAGT 


TGCTTTTACT 


1980 


AATTTTTCAG 


CAGCAATTCT 


TTTTGGATCT 


TTTTCGGGTT 


TTGACAAATT 


TATCGTAATC 


2040 


TCTAGCTCTC 


CGTATTTAGT 


TGATGCTCGA 


AACTCAATAT 


TCTTAGCAAT 


AGCAGAAGTT 


2100 


TTAATATAAT 


CACTATCTTT 


ACTTAACTTA 


TAAAACTCTT 


CAACTGCTTC 


TTGAGGTGAT 


2160 


ATATCATATG 


AACTCCAAAA 


AATATCATTT 


AATCGACTAG 


GTAAAGGAAC 


TATGAAATTC 


2220 


ATTAACTCTG 


CTCCTAAACA 


TTCCTTTTCC 


TCGATTAAAT 


CTTTAATTTT 


ACCGTTTTTT 


2280 


AAGGCGATTT 


CCACTAAGTA 


ATCTTTTATT 


TGTTTCAGGT 


CATTTTCATC 


GGAAATGCGA 


2340 


TCAATTCCCT 


CCTCACCTAT 


TAACGCTAGT 


ACTCTATTTT 


TCACATATAT 


TTTGTCAATT 


2400 


TCATTATACA 


TTCCGTATTC 


AATTACTCTA 


TCAACAAAAT 


TATCAATAAT 


TGTTTTCATA 


2460 


TATTTTTCTT 


TCTAATTTAT 


GTTCCCATAT 


TTTCTATACA 


TTATCCATTT 


ATAAATTGCT 


2520 


TGCGTAGTAT 


GAGCAATTTT 


ATCAAGGTGA 


TGAATAATAT 


CTAAAGCACT 


AATTACTTCA 


2580 


GAAACGTTCC 


CATCATCTTC 


AAATATGTAA 


TTCATTATTT 


TCTTTTCCAT 


ATTTATACTA 


2640 


AGCTCTT CT A 


TCTCATTCTG 


TTTTT GTAT A 


ACAACCATAT 


CTAAACATCC 


AGATTGTTCC 


2700 


TCTCTATAAC 


AAGATATAGC 


CCTATTCATA 


TGCAGTCCGA 


TAACTTCATG 


AAGTATTTTT 


2760 


ATTTTTGAAA 


TAATTTTCTT 


CAAAATTTCA 


TTATTTTGAA 


bAAT CT GT AG 


ATTTTTTAAA 


2 820 


ATTTCAACAA 


TTCTATCCCC 


AATACGTTCA 


ATGTCAGTTG 


ATATTTTTAT 


TACACTAATA 


2880 


ATTCTTCTTA 


AGTCATATGA 


AACAGGATGT 


TGTAAACAAA 


TT AACT CAT A 


TCCrTTTTTA 


2940 


TCAATATTTA 


GAACTGACTC 


ATTTATGATT 


AAATCTT CTT 


TAATCAATTC 


TACTCGTTCT 


3000 


TCATTTGATA 


AATATTCAAA 


T AACTTCT C A 


TATTTATCAA 


GCACAGATAC 


CCAAATGGTC 


3060 


TCTAAATTAT 


TTGATAATTC 


TATAATTTCA 


TTTTCTAAAT 


ATAACCTTAA 


CATTTAGGTA 


3120 


CCTCTTCTTA 


ACAAAGTTCG 










3140 


(2) INFORMATION FOR SEQ ID NO : 159: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9048 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi ) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


159: 






CCGGATGATT 


1 »- iuVi L Unlj 


AlMiuwiuAn 


AGTGACTTCC 


TCAGCAATCG 


CGCGTAGAGT 


60 


AGG ATTCCC T 






ATCAATTAAG 


TGAGCAGCTT 


TTGTAATACG 


120 


TTCTATTGCA 


*J**V— A A i 1 1L1 


LlLUl 1A1A1 


TATGTTTAGT 


GCAGTTAGCT 


ACTGCCAAAG 


180 


CCCAAGTGGT 




TAAGCv_ACTG 


TGGATTAGTT 


CATTTTCTTT 


CATTACCTCT 


240 


ACATGATATC 


AC A A A ATT! & 


AAusAni 1\iAA 


AGCATTATGG 


CATTTAGGAT 


TTATAGAAAA 


300 


TAG A T AGG AA 


flTTfAATTr a 

<>J 4 IV-AAl 11. A 


AI luluAAAb 


AAATACTTAT 


CTGTGATATA 


ATAAAAAGAA 


360 


AAGGCTTGCA 


T A Art a a irra 


lj<j<jAUnA LuA 


AGATACAAAG 


AAGACAAAAT 


CGAAATCAGG 


420 


GTGGTTTAGC 


* 1 «. 1 *-\J I 1 11 




TGGTAAACTT 


TTTAGGAGTT 


ATCGCAAGTG 


480 


GAGCAATAAG 


V»vjn 1 1 1 1 L»Lj 


Lyn X AV» i\~ I L 


GCTAGCAGTT 


GGTTTATCAA 


TGGGCTTGGT 


540 


CTTGTTGTTT 




a a nr* a a ivw 


TTGACTAGTC 


AAAAACGACA 


TACTATTTCT 


600 


CAAGAGGGGA 


fTAArtfA a a a 


1 A_ 1U_Ao*jAL» 


TAGGAAGAGG 


AAAAAACTGC 


CAGAATTATG 


660 


GCCCACGGGG 


i. 1 I ILIA 


LLALvjA iVKiA 


CTTTTCTTTT 


CAGCTAAAAA 


AGAAGACGGT 


720 


ACCTATGACT 


T*TV* ATY" a a a a 


TTTTGAGTAT 


GTGACTCCTT 


GGCTCAAGCA 


AGGGGACTAA 


780 


G CAGCAG ATT 


TAGCTATTGG 


TGATTTTGAA 


GGAACCATTA 


ATAAGGATCA 


TTATTTAGCG 


840 


GGTTATCTTC 


TCTTTAATGC 


TCCTGTTGAA 


GTTATGGATG 


CTATTAAGGA 


GGCAGGTTAT 


900 


CATGTGCTGG 


ATTTAGCTCA 


T AAT CAT ATT 


TTGGATTCGC 


AAATTGAGGG 


AGTTATTTCA 


960 


ACGGCCGATA 


TTATTGAGAA 


AGCTGGAATC 


ACTCCAATCG 


GAGTTTATAC 


GCACGAACCA 


1020 


CGTGATCAGG 


CTCCGCTGGT 


CATTAAGGAA 


GTGAATGGTA 


TCAAGGTTCC 


ATTGTTAGCC 


1080 


TATTCCTATG 


GTTTCAATGG 


AATTGAGCAG 


TATATTTCTC 


AGGAAGACTA 


TAATCGTTAT 


1140 


CTTTCAGATT 


TAAACGAAGA 


TAAGATGAAG 


GTTGAAATTG 


AACGGGCAGA 


GAAGGAAGCA 


1200 


GATATCACCA 


TTATCATCCT 


TCAGATGGGT 


GTTGAGTATC 


GATTGGAACC 


AACTGAAGAA 


1260 


CAAAAAGCTC 


TTTATCACAA 


GATGATCGAT 


TTGGGAGCGG 


ATATTATCTT 


TGGAGGGCAT 


1320 


CCTCACGTTG 


TTGAACCATC 


TGAAACGGTT 


GAAAAAGATG 


GAGATAAGAA 


ACTCATTATC 


1380 


TATTAAATGG 


GGAACTTCAT 


TTCCAATCAA 


CGAATTGAAT 


CT A TGGG AG A 


TGAAGAGAAT 


1440 


GCTAAGTGGA 


CTGAACGTGG 


TGTTCTCATG 


GATGTCACCA 


TCAAGAAGAA 


GGATGGAAAA 


1500 
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ACAACTATCG 


GAACAGCTAA 


AGCTCATCCT 


ACTTGCGTCA 


ATCGAACACC 


AAAGGGAACC 


1560 


TTTTCACCAG 


AAGGATATCC 


CTTGTATCAT 


TACCAAACTT 


ATATTTTGGA 


AGATTTTATA 


1620 


GAGGATGGCA 


GTCATCGTGA 


CCAGTTAGAT 


GAAGCGACTA 


AGGAACGAAT 


TGATACAGCC 


1680 


TATAAAGAAA 


TGAATGAACA 


TGTGGGATTG 


AAGTGGTATT 


AGCTTGAATC 


CAGAGGAAAG 


1740 


TAAATGATGA 


TTAAGG T AAT 


TGCGACAGAT 


ATGGATGGGA 


CCTTGCTGGA 


TGCTAGAGGT 


1800 


CAGCTTGATC 


TCCCACGATT 


GGAAAAGATT 


TTAGATCAGT 


TGGATCAAAG 


GGGCATTCGT 


1860 


TTTGTCATTG 


CGACGGGCAA 


TGAAATTCAC 


CGCATGAGAC 


AACTACTGAG 


TCCCTTGGTG 


1920 


GATCGAGTGG 


TTCTGGTTGT 


TGCTAATGGC 


GCTCGTATTT 


TTGAAAACAA 


TGAATTGATT 


1980 


CAGGCTCAGA 


CATGGGATGA 


CGC CATTGTC 


AACAAGGCTT 


T G ACT C ATTT 


CAAGGGTCGA 


2040 


GCGTGTCAGG 


ACCAGTTTGT 


TGTAACGGGG 


ATGAAGGGTG 


ATTTTGTCAA 


GGAAGGTACG 


2100 


ATTTTTACAG 


ATCTTGAAAG 


TTTTATGACT 


CCAGAAATGA 


TTGAAAAATT 


CTACCAACGG 


2160 


ATGCAATTTG 


TGGATGAATT 


AACATCTGAC 


CTCTTTGGTG 


GTGTGCTCAA 


GATGAGCATG 


2220 


GTTGTTGGTG 


AGCAACGTTT 


GAGTTCGGTT 


TTGGAAGAAA 


TCAATGCTCT 


CTTTGATGGC 


2280 


CGTGTCCGAG 


CTGTATCCAG 


TGGCTATGGT 


TGCATTGATA 


TCCTCCAAGC 


TGGGATTCAT 


2340 


AAAGCATGGG 


GCTTGGAGGA 


ATTACTCAAG 


CGCTGGGACT 


TGAAATCCCA 


AGAAATCATG 


2400 


GCTTTTGGTG 


ATAGTGAAAA 


TGATGTTGAA 


ATGCTTGAAA 


TGGCTGGAAT 


TGCCTATGCG 


2460 


ATGGAAAATG 


CTGATGAGAA 


AG CCAAAGCT 


GTGGCGACTG 


CTCTAGCACC 


AGCCAACAGC 


2520 


CAAGGAGGAG 


TTT AT CAAGT 


CTTGGAAAAC 


TGGTTAGAAA 


AAGGAGAATC 


AAGTGGCAGT 


2580 


ACAGTTATTA 


GAAAATTGGC 


TCCTAAAGGA 


ACAAGAAAAA 


ATTCAAACTA 


AGTATCGTCA 


2640 


CCTAAATCAC 


ATTTCTGTTG 


TAGAACCAAA 


CATTCTTTTT 


ATTGGGGATT 


CCATTGTCGA 


2700 


GTATTATCCT 


CTACAGGAGC 


T ATTTGG G AC 


TTCAAAGACG 


ATTGTCAATC 


GAGGAATTCG 


2760 


TGGCTATCAG 


ACAGGACTGT 


TACTAGAGAA 


CCTTGATGCT 






£. O £> \J 


AGATAAAATT 


TTTCTTCTGA 


TTGGGACAAA 


TGATATCGGA 


AAGGATQTTC 


CTGTGAATGA 


2880 


GGCTCTCAAT 


AATCTCGAAG 


CTATCATTCA 


ATCCGTTGCT 


CGCGATTATC 


CATTGACAGA 


2940 


GATTAAATTG 


CTTTCCATTT 


TGCCTGTCAA 


TGAGAGAGAG 


G AGTACC AG C 


AGGCAGTCTA 


3000 


TATCCGCTCG 


AATGAAAAAA 


TTCAGAACTG 


GAATCAAGCC 


TATCAAGAGC 


TTGCATCTGC 


3060 


CTATATGCAG 


GTGGAATTTG 


TGCCAGTATT 


TGATTGTTTG 


ACAGACCAAG 


CAGGCCAACT 


3120 


CAAAAAAGAA 


TATACAACTG 


ATGGACTGCA 


CCTCAGTATT 


GCTGGTTATC 


AGGCTTTGTC 


3180 


AAAATCCTTG 


AAAGACTATC 


TTTACTAAAT 


AGCTAAATAA 


TGTTAAATTT 


GAGCATAATA 


3240 
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TCTTGTAAAA AATTCTAAAA TCCTTTAAAA TAAAAAGTGA CGGAGGAATT TATGAATGTA 3 300 

AATCAGATTG TACGGATTAT TCCTACTTTA AAAGCTAATA ATAGAAAATT AAATGAAACA 3 3 60 

TTTTATATTG AAACCCTTGG AATGAAGGCC TTGTTAGAAG AATCGG CCTT TCTGTCACTA 3 4 20 

GGTGACCAAA CGGGTCTTGA AAAGCTGGTT TTAGAAGAAG CTCCCAGTAT GCGTACTCGT 34 80 

AAGGTAGAGG GAAGAAAAAA ACTAGCTAGA TTGATTGTCA AOGTG G AAAA TCCCTTAGAA 3 54 0 

ATTGAAGGAA TCTTATCTAA AACAGATTCG ATTCATCGAT TATATAAAGG TCAAAATGGC 3 600 

TACGCTTTTG AAATTTTCTC ACCAGAAGAT GATTTGATTT TGATTCATGC GGAAGATGAC 3 660 

ATAGCAAGTC TAGTAGAAGT AGGAGAAAAG CCTGAATTTC AAACAGATTT GGCATCAATT 3 720 

TCTTTAACTA AATTTG AG AT TTCTATGGAA TTACATCTCC CAACTGATAT CGAAAGTTTC 3 7 80 

TTGGAATCAT CTGAAATTGG GG CATC CCTT GATTTTATTC CAGCTCAGGG GCAGGATTTG 384 0 

ACTGTGGACA AT AC GGTT AC CTGGGACTTA TCTATCCTCA AGTTCTTGGT CAATGAATTA 3900 

GACATAGCAA GTCTTCGCCA GAAGTTTGAG TCTACTGAAT ATTTT ATTC C TAAGTCTGAA 39 60 

AAATTCTTCC TTGGTAAAGA TAGAAATAAT GTTGAATTGT GGTTTGAAGA ACT ATG AAG T 4 020 

GGACCAAGAT TATTAAAAAA ATAGAAGAAC AAATCGAGGC AGGGATTTAT CCCGGAGCCT 4 080 

CTTTTGCGTA TTTTAAGGAC AATCAATGGA CAGAGTTCTA TTTAGGCCAG AGTGACCCAC 4140 

AGCATGGCTT GCAGACTGAG GCAGGACTAG TTTATGACCT AGCTAGTGTC AGCAAGGTTG 4 200 

TTGGGGTTGG CACAGTTTGT ACCTTCTTGT GGGAAATAGG T C AATT AG AT ATTGATAGAC 4 2 60 

TGGTAATAGA TTTTTTACCT GAGAGTGATT ATCCAGACAT CACTATTCGC C AG CTCTTGA 4 320 

CTCATGCAAC AGACCTTGAT CCTTTTATTC CTAATCGTGA TCTTTTAACA GCCCCTGAAT 4 3 80 

TAAAGGAAGC GATGTTTCAT CTCAACAGAC GAAGTCAGCC AGCCTTTCTT TATTCGGATG 444 0 

TCCATTTTTT GCTGTTGGGC TTTATTTTGG AAAGAATTTT TAATCAAGAT TTGGATGTGA 4 500 

TTTTAAAGGA TCAAGTCTGG AAACCTTGGG GAATGACGGA AACTAAGTTT GGGCCAGTTG 4 560 

ACCTTGCTGT TCCAACAGTT AGAGGTGTAG AGGCAGGCAT AGTGCATGAT CCCAAGGC TC 4 62 0 

GTCTCCTGGG TAGACATGCT GGGAGTGCTG GTTT ATTTT C GACTATAAAG GATTTACAAA 4 680 

TCTTTTTAGA ACACTATTTA GCAGATGATT TT GC AAG AG A CTTAAATCAA AATTTTTCTC 4740 

CTTTGGATGA CAAGGAACGT T CTTT AG CAT GGAATTTGGA AGGAGATTGG CTAGACCATA 4 80 0 

CGGGCTATAC AGGTACCTTT ATCATGTGGA ATCGTCAGAA GCAAGAAGCC ACT ATTTT CC 4 860 

TATCGAATCG TACCTATGAA AAGGACGAGA GAGCTCAATG GATATTAGAC C GCAATC AAG 4 920 

TGATGAACTT GATTCGCAAA GAAGAGTAAG GAGAGACATG TCAAATAGTT TAAAAGGGAC 4 980 

TTTACTAACA GTTGTGGCTG GTATTGCTTG GGGGTTGTCA GGAACGAGTG GCCAATACCT 5040 
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AATGGCACAC GCAATTTCGG CTCTGGTCTT GACTAACTTG CGTCTTTTAA TCCCTTCTGG 5100 

AATTCTCATG CTCTTGGCTT ATGCTACTGC AAAGGATAAA ATACTGGTCT TTTTAAAGGA 5160 

TAGAAAGAGT TTGCTGTCTC TTCTTATTTT TGCTCTGATT GGTCTTTTTC TCAACCAATT 52 20 

CGCCTATCTG TCTGCTATTC AGGAGACCAA TGCGGGAACA GCGACGGTGC TTCAGTATGT 52 80 

TTCTCCTGTC GGAATTTTAA TTTATAGCTG TATCAAGGAT AGGGTGGCAC CGACACTGGG 53 40 

AGAGATAGTT TCCATCATAT TCGCCATCGG AGGAACCTTC CTGATCGCAA CACATGGGCA 5400 

GTTGG AC C AG TTATCCATGA CACCTGCTGG TCTGTTCTGG GGTCTCTTTT CTGCCTTGAC 54 60 

TTATGCTCTG TATATCATTT TACCCATAGC CTTGATTAAA AAGTGGGGGA GCAGCTTGGT 552 0 

CATTGGTGTG GGAATGGTCA TAGCAGGTTT GGTCG CCCTT CCTTTTACAG GGGTTCTACA 5580 

GGCCGATATC CCGACTAGTC TTGATTTTCT CCTTGCGTTT GCAGGC ATT A TCCTT AT CGG 564 0 

GACTGTCTTT GCCTATACAG CTTTCCTTAA AGGAGCCAGT CTGATAGGAC CGGTCAAGTC 57 00 

AAGCTTG TTG GCTTCAATTG AGCCAATATC GGCGATTTTC TTTGCCTTCT TAATAATGAA 57 60 

TGAACAATTT TATCCCATTG ATTTTCTTGG TATGGCAATG ATATTGTTTG CTGTAACTTT 582 0 

GATTTCTTTG AAAGATTTAT TCTTAGAAAA ATAAAAAAGA CTCTTTGTCC GTGACAGAGA 5880 

GTTTTTGCGT GGTAATCTAA TTATTTTCAA GATAAAATTC AAAGCGTTCG CCTACATATT 594 0 

GACTTTTTAC GTATTCAAAA GCAGTACCAT CTTCTAGGTA GGAAACCTGG GTCAATCCAA 6000 

GAATAGCATG TCCTTTTTCA ACTTCCAAAT AGTGGGCAAT CTTTTCTTTA GCAAGGCGAG 6060 
CATACATGGT CTGTTGAGAT TTGCCGATAC GATAGCCATG TTTTTGCAAG GTTTGGAAGA . 612 0 

AATGACTGGT GATTTCTTCT TTTTTAAAGT CCTTAATGAA TTTTTCAGGA A T AG AAGCAA 6180 

CTTCATAAAC TAGGGGAACT T GGTCGGC AT AGCGGACCCG CTCCATTCGG ATAATATTGT 6240 

CCGTTGGAAA AATTCCTAGC TTGGCAACTT CTTGCTCATT GGGAATGGTT TTTTTGTAGG 6 300 

AAATGAGCTG GCTAGAGGGA ACTTTACCTT GGGATTTGAC AATTTC AG T A AAACTGGTTG 6 3 60 

TCCCTCGCAT CTTTTCTTGT ACTCGAGTAC TGGAAACAAA GGTGCCGCTT CCTACACGGC 6420 

G CTCTAAG AC GCCTTCTTCG ACTAATAGAG ATACGGCTTG GCGGAG GGTC ATGCGACTGA 64 80 

CCGCAAACTG CTCAGCTAAA TCTCTTTCAC TGGGAAGCCT CTCACCAATA GCCCAACGGT 6540 

ACTCGTCAAT ATC C T T TTTT ATCTGATCAT GGATTTTTAT ATAAGCAGGT AGCATATTTT 6600 

TCACTTCATT TCTATCTTTT CTCTATTGTA CCCCAATAAA CTAGAAAAAG TCAAACTTCG 66 60 

CCTTGTTTAG TTGGTAATTC GCCCTTATTT GTGATAGAAT ATTGAGAAAA GATATTTCTT 6720 

TTGAGAAAGG AAAAAGATGA GCAACATTTC AACTGATTTG CAAGATGTAG AAAAAATCAT 67 80 
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CGTATTGCAC 


TATGGTAGCC 


AGTACAACCA 


GCTGATTTCA 


CGCCGTATCC 


GTGAGATTGG 


6840 


tgtittitca 


GAACTAAAAA 


GCCATAAAAT 


TTCAGCTGCT 


GAAGTTCCTG 


AAGTCAATCC 


6900 


TGTAGGAATT 


ATTCTATCAG 


GTGGTCCAAA 


TTCTGTATAT 


GAAGATGGTT 


CATTTGATAT 


6960 


TGACCCAGAA 


ATCTTCGAAC 


TCGGAATTCC 


AATTTTGGGA 


ATCTGTT AT G 


GTATGCAGTT 


7020 


ATTGACCCAT 


AAACTTGGAG 


GAAAAGTTGT 


TCCTGCAGGT 


GATGCTGGAA 


ATCGTCAATA 


7080 


CGGTCAATCA 


ACCCTAACTC 


ACACACCATC 


AGCGCTTTTT 


GAATCAACAC 


CTGATGAACA 


7140 


GACTGTTTTG 


ATGAGCCATG 


GTGATGCGGT 


TACTGAGATT 


CCTGCTGACT 


TTGTTCGTAC 


7200 


AGGTACATCA 


GCTGACTGCC 


CATACGCAGC 


CATCGAAAAC 


C C AG AT AAAC 


ACATTTACGG 


7260 


TATCCAATTC 


CACCCAGAAG 


TTCGTCATTC 


TGTATACGGA 


AATGATATCC 


TTCGTAACTT 


7320 


TGCCCTTAAC 


ATTTGTAAGG 


CTAAAGGTGA 


CTGGTCAATG 


GATAATTTCA 


TTGACATGCA 


7380 


GATCAAAAAA 


ATTCGTGAAA 


CCGTCGGTGA 


TAAACGTGTC 


CTTCTTCGTC 


TATCAGGTGG 


7440 


TGTTGACTCA 


TCTGTCGTTG 


GGGTTCTTCT 


CCAAAAAGCG 


ATTGGCGATC 


AATTGATCTG 


7500 


TATCTTCGTA 


GACCACGGTC 


TTCTTCGTAA 


AGGCGAAGCT 


GATCAAGTTA 


TGGACATGCT 


7560 


CGGTGGTAAG 


TTTGGTTTGA 


ATATCGTCAA 


AGCAGACGCT 


GCTAAACGTT 


TCCTTGACAA 


7620 


ACTTGCTGGC 


GTTTCTGACC 


CTGAACAAAA 


ACGTAAAATC 


ATCGG T AACG 


AGTTTGTCTA 


7680 


TGTATTCGAT 


GACGAAGCAA 


GCAAGCTCAA 


AGATGTGAAA 


TTCCTTGCTC 


AAGGTACTTT 


7740 


ATATACAGAT 


GTTATCGAGT 


CTGGTACGGA 


TACAGCTCAA 


ACTATCAAGT 


CACACCACAA 


7800 


CGTGGtGGTC 


TTCCAGAAGA 


TATGCAGTTT 


GAATTGATTG 


AACCACTCAA 


TACTCTTTAC 


7860 


AAGGATGAAG 


TTCGTGCTCT 


TGGTACAGAG 


CTTGGTATGC 


CAGACCATAT 


CGTATGGCGC 


7920 


CAACCATTCC 


CAGGACCAGG 


ACTTGCTATC 


CGTGTCATGG 


GTGAAATCAC 


TGAAGAGAAA 


7980 


CTTGAAACCG 


TTCGTGAATC 


AGACGCTATT 


CTTCGTGAAG 


AAATCGCTAA 


AGCTGGACTT 


6040 


GACCGCGATA 


TTTGGCAATA 


CTTCACTGTT 


AACACAGGCG 


TTCGTTCAGT 


CGGTGTTATG 


8100 


GGTGACGGTC 


GTACGTATGA 


CTACACGATT 


GCAATCCGTG 


CTATCACTTC 


TATCGATGGT 


8160 


ATGACTGCTG 


ATTTTGCCAA 


AATTCCATGG 


GAAGTACTTC 


AAAAAATCTC 


AGTACGTATC 


8220 


GTAAATGAAG 


TGGATCATGT 


TAACCGTATC 


GTCTACGATA 


TTACAAGTAA 


ACCACCTGCA 


8280 


ACAGTTGAGT 


GGGAATAATC 


GCAAAAAAAT 


TAAAAGCTTT 


GTAAAATCAA 


CGGTTACAGA 


8340 


GGATTAAAAA 


CTGTAACTGG 


GATTAAAACG 


GGAACATTTG 


CTAAAAAGAA 


TAAATTGAAT 


8400 


AATAGTTCCA 


AGTGGTTTAC 


ATTTGGACAA 


AAAATTAGAC 


CGTAGTTTTC 


AAGCTGCGGT 


8460 


CTTTTGATAT 


ATATAATGAG 


AATTAATGGC 


TCTTTGTCAA 


CTGTAGTGGG 


TTGAAGTCAG 


8520 


CTAAGCTCGA 


GAAAGGACAA 


ATTTTGTCCT 


TTCTTTTTTG 


ATATTCAGAG 


CGATAAAAAT 


8580 



WO 98/18931 



PCT/US97/19588 



1037 

CCGTTTTTTG AAGTTTTCAA AGTTCCGAAA ACCAAAGGCA TTCCCCTTGA TAAGTTTGAT 8 640 

GAG ATT ATT G GTCCCTTCCA ATTTGGCGTT AGAATAGTGT AGTTGAAGGG CGTTGACGAT 8700 

TTTCTCTTTG TCCTTTAGAA ACCTTTTAAA GACAGTCTGA AAAAGAGGAT GAACCTGCTT 8760 

TAGATTGTCC TCAATCAGTC CGAAAAATTT CTCCGGTTCC TTATTCTGAA AGTGAAACAG 882 0 

CAAGAGTTGA TAGAGCTGAT AGTGATGTTT CAAGTCTTGT GAATAGCTCA AAAGCTTGTT 8880 

TAAAATCTCT TTATTGGTTA AATGCATACG AAAAGTAGGG CGATAAAAAT GTTTATCGCT 8940 

GAGTTTACGA CTATCCTGTT GTATGAGCTT CCAGTAGCGC TTG AT AGCC T TGTATTCATG 9000 

AGACTTTCGA TCCAATTGAT TCATGATTTG AACACGCACA CGACTCGG 904 8 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GTACCTTTAT TGATGAATGG ACTGTTTAAA TCAGTAGCAC GCCAACCAGA TATGCTTTCT 60 

GAGTTTCGTA GTTTGATGTT TTTAGGTGTT GCCTTTATTG AAGGAACTTT CTTTGTAACT 120 

CTTGTCTTCT CATTTATTAT CAAATAAATA CATGGAACGA GAAGAAAAGG GAGGATTTTA 180 

GATGGAAGAA AGTATTAATC CAATCATCTC TATTGGTCCT GTTATCTTCA ATCTGACTAT 2 40 

GTTAGCCATG ACTTTGTTGA TTGTGGGAGT TATTTTTGTC TTTATTTATT GGGCAAGCCG 300 

CAATATGACC TTGAAACCCA AAGGAAAGCA AAATGTACTT GAGTATGTCT ATGACTTTGT 3 60 

TATTGGATTT ACAGAACCTA ACATTGGTTC GCGCTACATG AAAGATTACT CACTCTTTTT 420 

CCTTTGTTTA TTCCTTTTCA TGGTGATTGC CAATAACCTT GGCTTAATGA CAAAGCTTCA 480 

AACGATCGAT GGGACTAACT GGTGGAGTTC GCCAACCGCT AATTTACAGT ATGACTTAAC 54 0 

CTTATCTTTT CTTGTCATTT TGTTGACACA TATAGAAACC GTTCGTCGTC GTGGATTTAA 600 

AAAAAGTATA AAATCTTTTA TGAGTCCTGT TTTTGTCATA CCGATGAATA TCTTGGAAGA 660 

ATTTACAAAC TTCTTATCTT TGGCTTTGCG GATTTTTGGG AATATCTTTG CAGGAGAGGT 720 

CATGACGAGT TTGTTACTTC TTCTTTCCCA C CAAGCT ATT TATTGGTATC CAGTAGCCTT 780 

TGGAGCTAAT TTGGCTTGGA CTGCATTTTC TGTCTTTATT TCCTGCATCC AAGCTTATGT 840 

TTTTACTCTT TTGACATCTG TGTATTTAGG GAATAAGATT AATATTGAAG AGGAATAGAA 900 
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AGGAGTAACT 


GATGCACGTA 


ACAGTAGGTG 


AATTAATTGG 


TAATTTTATT 


TTAATCACTG 


960 


GCTCTTTTAT 


TC TTTTGCT A 


GTCTTGATTA 


AAAAATTTGC 


ATGGTCTAAT 


ATTACAGGCA 


1020 


TTTTCGAAGA 


AAGAGCTGAA 


AAAATTGCTT 


CAGATATTGA 


CAGAGCTGAA 


GAAGCCCGTC 


1080 


AAAAAGCAGA 


AGTATTGGCT 


CAAAAACGCG 


AAGATGAATT 


GGCTGGTAGC 


CGTAAAGAAG 


1140 


CTAAGACAAT 


CATTGAAAAT 


GCAAAGGAAA 


CAGCTGAGCA 


AAGTAAGGCT 


AATATCTTAG 


1200 


CAGATGCTAA 


AC T AG AAGC A 


GGACACTTAA 


AAGAAAAAGC 


CAATCAAGAA 


ATTGCTCAAA 


1260 


ATAAAGTAGA 


AGCTTTACAG 


AGTGTTAAGG 


GTGAGGTCGC 


AGATTTGACC 


ATCAGCTTAG 


1320 


CTGGTAAAAT 


CAT CTCAC AA 


AACCTTGACA 


GTCATGCCCA 


TAAAGCACTC 


ATTGATCAGT 


1380 


AT AT CG ATCA 


GCTAGGAGAA 


GCTTAATGGA 


CAAGAAAACA 


GTAAAGGTAA 


TTGAAAAATA 


1440 


CAGCATGCCT 


TTTGTCCAAT 


TGGTACTTGA 


AAAAGGAGAA 


GAAGACCGTA 


TCTTTTCAGA 


1500 


CTTGACTCAA 


ATCAAGCAAG 


TTGTTGAAAA 


AACAGGTCTG 


CCTTCTTTTT 


TAAAACAAGT 


1560 


GGCAGTAGAC 


GAGTCGGATA 


AGGAAAAAAC 


AATTGCTTTT 


TTCCAAGATT 


CTGTGTCGCC 


1620 


TTTATTACAA 


AACTTTATCC 


AGGTTCTGGC 


CTACAATCAC 


AGAGCAAATC 


TTTTTTATGA 


1680 


TGTGCTTGTA 


GATTGCTTGA 


ACCGACTTGA 


AAAAGAAACA 


AATCGATTTG 


AAGTGACGAT 


1740 


TACGTCTGCT 


CATCCTCTAA 


CTGATGAACA 


GAAGACTCGT 


TTGCTCCCTT 


TGATTGAGAA 


1800 


AAAAATGTCT 


CTGAAAGTAA 


GGAGTGTAAA 


AGAACAAATC 


GATGAAAGTC 


TCATTGGTGG 


1860 


TTTTGTCATT 


TTTGCCAATC 


ACAAGACAAT 


TGATGTGAGT 


ATTAAACAAC 


AACTTAAAGT 


1920 


TGTTAAAGAA 


AATTTGAAAT 


AGAAAGTGGT 


GTTCTTTTGG 


CAATTAACGC 


ACAAG AAAT C 


1980 


AGCGCTTTAA 


TTAAGCAACA 


AATTGAAAAT 


TTCAAACCCA 


ATTTTGATGT 


GACTGAAACA 


2040 


GGTGTTGTAA 


CCTATATCGG 


GGACGGTATC 


GCGCGTGCTC 


ACGGCCTTGA 


AAATGTCATG 


2100 


AGTGGAGAGT 


TGTTGAATTT 


TGAAAACGGC 


TCTTATGGTA 


TGGCTCAAAA 


CTTGG AGT C A 


2160 


ACAGACGTTG 


GTATTATCAT 


CCTAGGTGAC 


TTTACAGATA 


TCCGTGAAGG 


CGATACAATC 


2220 


CGCCGTACAG 


GGAAAATCAT 


GGAAGTCCCT 


GTAGGTGAAA 


GTCTGATTGG 


TCGTGTTG'i'G 


2280 


GATCCGCTTG 


GTCGTC C AGT 


TGACGGTCTT 


GGAGAAATCC 


ACACTGATAA 


AACTCGTCCA 


2340 


GTAGAAGCAC 


CAGCTCCTGG 


TGTTATGCAA 


CGTAAGTCTG 


TTTCAGAACC 


ATTGCAAACT 


2400 


GGTTTGAAAG 


CTATTGACGC 


CCTTGTACCG 


ATTGGTCGTG 


GTCAACGTGA 


GTTG ATT AT C 


2460 


GGTGACCGTC 


AGACAGGGAA 


AACAACCATT 


GCGATTGATA 


CAATCTTGAA 


CCAAAAAGAT 


2520 


CAAGATATGA 


TCTGTATCTA 


CGTCGCGATT 


GGACAAAAAG 


AATCAACAGT 


TCGTACGCAA 


2580 


GTAGAAACAC 


TTCGTCAGTA 


CGGTGCCTTG 


GACTACACAA 


TCGTTGTGAC 


AGCCTCTGCT 


2640 


TCACAACCAT 


CTCCATTGCT 


CTTCCTAGCT 


CCTTATGCTG 


GGGTTGCTAT 


GGCGG AAGAA 


2700 



WO 98/18931 



PCT/US97/19588 



1039 



TTTATGTATC 


AAGG TAAGC A 


TGTTTTGATT 


GTATACGATG 


ATCTATCAAA 


ACAAGCGGTA 


2760 


GCTTATCGTG 


AACTGTCGCT 


CTTGCTTCGT 


CGTCCTCCAG 


GTCGTGAAGC 


CTTCCCAGGG 


2820 


GATGTTTTCT 


ATCTCCACAG 


CCGTTTGCTT 


GAGCGCTCAG 


CTAAAGTTTC 


TGATGAACTT 


2880 


GGTGGTGGAT 


CAATTACAGC 


C CT AC CATTT 


ATCGAGACAC 


AAGCAGGAGA 


TATCTCAGCC 


2940 


TATATCGCAA 


CCAACGTGAT 


TTCTATCACT 


GATGGACAAA 


TCTTCCTTGG 


CGATGGCCTC - 


3000 


TTCAATGCAG 


GTATTCGTCC 


AGCCATCGAT 


GCGGGTTCAT 


CTGTATCTCG 


TGTAGGTGGT 


3060 


TCTG C AC AAA 


TCAAAGC CAT 


GAAGAAGGTT 


G CTGGT AC AC 


TTCGTATCGA 


CCTTGCTTCA 


3120 


TACCGTGAGT 


TGGAAGCCTT 


TACTAAGTTT 


GGTTCTGACT 


TGGACGCAGC 


AACACAGGCT 


3180 


AAGTTGAACC 


GTGGACGTCG 


TACCGTTGAG 


GTCTTGAAAC 


AACCTGTTCA 


C AAACC ATT A 


3240 


CCTGTTGAGA 


AACAAGTAAC 


CATTCTTTAT 


GCTTTGACAC 


ATGGTTTCTT 


GGATACTGTT 


3300 


CCAGTAGATG 


ATATTGTTCG 


TTTCGAGGAA 


GAGTTCCATG 


CCTTCTTTGA 


TGCTCAACAT 


3360 


CCAGAGATTT 


TGGAAACCAT 


TCGTGATACA 


AAAGACTTGC 


CAGAAGAAGC 


AGTCTTGGAT 


3420 


GCTGCGATTA 


CAGAGTTTCT 


CAATCAATCT 


AGCTTCCAAT 


AAGAATAGAG 


GTGTCAGATG 


3480 


GCAGTATCTC 


TAAATGATAT 


TAAAACAAAA 


ATCCCCTCAA 


CAAAAAATAC 


GAGTCAAATC 


3540 


ACTAATGCCA 


TGCAAATGGT 


ATCGGCTGCT 


AAGCTAGGTC 


GTTCTGAAGA 


AGCTGCTCGC 


3600 


AACTTCCAAG 


TTTACGCTCA 


GAAAGTGCGT 


AAACTTTTGA 


CAGATATCCT 


TCATGGTAAT 


3660 


GGAGCTGGTG 


CTTCAACTAA 


TCCGATGTTG 


ATTAGCCGTT 


CTGTGAAGAA 


GACAGGCTAT 


3720 


ATCGTTATCA 


CTTCAGACCG 


CGG TTTGGTT 


GGAGGTTATA 


ATTCCTCTAT 


TTTGAAAGCT 


3780 


GTTATGGAGT 


TGAAAGAAGA 


ATACCACCCA 


GACGGTAAAG 


GTTTTGAAAT 


GATCTGTATC 


3840 


GGTGGGATGG 


GAGCTGATTT 


CTTTAAGGCT 


CGCGGTATTC 


AACCACTTTA 


TGAATTACGT 


3900 


GGCTTGTCAG 


ACCAACCTAG 


CTTTGATCAA 


GTTCGTAAGA 


TTATTTCAAA 


AACTGTTGAA 


3960 


ATGTACCAAA 






i n i. i i v.* v- i 






*» \) z u 


CTAACCAGTC 


AAATGCGTGT 


GG AAC AAATG 


CTTCCGATTG 


TTGACTTGGA 


TCCAAATGAA 


4080 


GCGGATGAAG 


AGTACAGCTT 


GACTTTTGAA 


TTGGAAACCA 


GCCGAGAAGA 


AATTCTGGAG 


4140 


CAGTTGTTGC 


CTCAGTTTGC 


AGAAAGTATG 


ATTTACGGTG 


CCATTATCGA 


TGCCAAGACA 


4200 


GCT GAGAATG 


CTGCGGGCAT 


GACAGCCATG 


CAAACAGCGA 


CAGATAATGC 


T AAGAAAGT C 


4260 


ATCAATGATT 


TGACAATTCA 


GT AT AAC C GT 


GCCAGACAGG 


CGGCGATTAC 


ACAAGAAATT 


4320 


ACAGAAATCG 


TAGCAGGTGC 


TAGTGCCTTA 


GAATAGGCTC 


TAGTCCAGCT 


CGTATGAAAA 


4380 


TGAACTTAGG 


ACCTAGTTGA 


GCTAGGAACC 


GACAGTATCT 


TATATAGAAT 


AGGAGAAGGA 


4440 
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GATGAGTTCA 


. CGTAAAATTG 


CTCAGGTTAT 


CGGTCCCGTT 


GTAGACCTTT 


TGTTTGC AG C 


4500 


AGGGGAAAAA 


CTTCCTGAGA 


TTAACAATGC 


ACTTGTCCTC 


TACAAAAATG 


ACGAAAGAAA 


4560 


AACAAAAATC 


GTCCTTGAAG 


TAGCCTTGGA 


GTTAGGAGAT 


GGTATGGTTC 


GTACTATCGC 


4620 


CATGGAATCA 


ACAGATGGGT 


TGACTCGTGG 


AATGGAAGTA 


TTGGACACAG 


GTCGTCCAAT 


4660 


CTCTGTACCA 


GTAGGTAAAG 


AAACTTTGGG 


ACCTGTCTTC 


AACGTTTTGG 


GAGATACCAT 


4740 


TGACTTGGAA 


GCTCCTTTTA 


CAGAAGACGC 


AGAGCGTCAG 


CCAATTCATA 


AAAAAGCTCC 


4800 


AACTTTTGAT 


GAGTTGTCTA 


CCTCTTCTGA 


AATCCTTGAA 


ACAGGGATCA 


AGGTTATTGA 


4860 


CCTTCTTGCC 


CCTTACCTTA 


AAGGTGGTAA 


AGTTGGACTT 


TTCGGTGG TG 


CCGGACTTCG 


4920 


TAAAACTGTC 


TTAATCCAAG 


AATTGATTCA 


CAACATTGCC 


CAAGAGCACG 


GTGGTATTTC 


4960 


AGTATTTGCT 


GGTGTTGGGG 


AACGTACTCG 


TGAGGGGAAT 


GACCTTTACT 


GGGAAATGAA 


5040 


AGAATCAGGC 


GTTATCGAGA 


AAACAGCCAT 


GGTCTTTGGT 


CAGATGAATG 


AGCCACCAGG 


5100 


AGCACGTATG 


CGTGTTGCCC 


TTACTGGTTT 


GACAATCGCT 


GAATACTTCC 


GTGATGTGGA 


5160 


AGGCCAAGAC 


GTGCTTCTCT 


TTATCGATAA 


TATCTTCCGT 


TTCACTCAGG 


CTGGTTCAGA 


5220 


AGTATCTGCC 


CTTTTGGGTC 


GTATGCCATC 


AG CCGTTGGT 


TACCAACCAA 


CACTTGCTAC 


5280 


GGAAATGGGT 


CAATTGCAAG 


AACGTATCAC 


AT CAACC AAG 


AAGGGTTCTG 


TAACCTCTAT 


5340 


CCAGGCTATC 


TATGTGCCAG 


CGGATGACTA 


TACTGACCCA 


GCGCCAGCAA 


CAGCCTTCGC 


5400 


TCACTTGGAT 


TCAACAACAA 


ACTTGGAACG 


TAAGTTGGTA 


CAATTGGGTA 


TCTACCCAGC 


5460 


C GTTGACCC A 


CTTCCTTCAA 


GCTCACGTGC 


CTTGGCACCT 


GAAATCGTTG 


GAGAAGAGCA 


5520 


CTATGCAGTT 


GCTGCTGAAG 


TAAAACGTGT 


CCTTCAACGT 


TACCATGAAT 


TGCAAGATAT 


5580 


CATTGCTATC 


CTTGGTATGG 


ATGAGCTTTC 


TGATGAAGAA 


AAGACCTTGG 


TTCCTCGCGC 


5640 


CCGTCGTATC 


CAGTTCTTCT 


TGTCACAAAA 


CTTCAACGTT 


GCGGAACAAT 


TTACTGGTCA 


5700 


GCCAGGTTCT 


TATGTTCCAG 


TTGCTGAAAC 


TGTACGTGGC 


TTTAAGGAAA 


TCCTTGATGG 


5760 


TAAATACGAC 


CACTTGCCAG 


AAGATGCCTT 


CCGTGGTGTA 


GGTTCTATCG 


AAGATGTGAT 


5820 


TGCAAAAGCT 


GAAAAAATGG 


GATTTTAAGA 


GGTGATCTAT 


GGCTCAGTTA 


ACTGT CC AG A 


5880 


TCGTGACACC 


AGATGGTCTC 


GTCTATGATC 


ACCATGCCAG 


CTATGTATCG 


GTTCGAACTC 


5940 


TGGATGGTGA 


GATGGGGATC 


TTGCCACGAC 


ATGAAAATAT 


GATTGCGGTT 


TTAGCAGTTG 


6000 


ATGAAGTAAA 


GGTAAAACGT 


ATCG ATGAT A 


AAGATCACGT 


GAACTGGATT 


GCAGTAAACG 


6060 


GTGGCGTTAT 


TGAAATTGCC 


AATGATATGA 


TCACAATCGT 


CGCTGACTCT 


GCAGAACGTG 


6120 


CTCGTGATAT 


CGATATCAGT 


CGTGCAGAAC 


GTGCCAAACT 


TCGTGCAGAA 


CGTGCAATTG 


6180 


AAGAAG C AC A 


AGACAAACAT 


TTGATTGACC 


AAGAACGTCG 


TGCTAAGATT 


GCTTTGCAAC 


6240 
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GTGCTATTAA 


CCGTATTAAT 


GTCGGAAATA 


. GACTATAAGA 


AAAAATGAAC 


TTGAAAATAC 


6300 


CAAGTTCATT 


TTTTATGGTG 


TTTTAAGGAG 


CAAAACGGAT 


GCAGACTGCT 


TCGGGAACAT 


6360 


GGAAGTCGTT 


GGAGAGTTCT 


GCTAGACGAC 


CATTGTCACA 


ATTACCTTTA 


AAGACAGTTG 


6420 


CATTGTCAGA 


GTCTTCATGG 


ACAACAATGA 


GAAATTTTTG 


GTCGGGTGTC 


AAATCAAAAT 


6480 


CACGTGGAGT 


CTGACCATGC 


GTTGGAACGA 


TTTCTAATAA 


CTCTAAGCTA 


CCGTCCGCAA 


6540 


GGATGGTATA 


TACTGCGATA 


GAATCATGCC 


CACGGTTAGA 


AGCGTAGAGG 


TATTTACCGT 


6600 


CTTTAGAGAG 


ATGAATAGCA 


GCGGTTCCAT 


TAAAGCCTTC 


GTAAGCTTCC 


GGTAAAGTTG 


6660 


AAATGACCTG 


CATACGTTCA 


AATTCGCCAA 


CGCCATCGTA 


GATTAAAACT 


TCGATAGTAC 


6720 


TATTGAGTTC 


ACAAATGAGA 


TAAGCGATTT 


TATAGTGGTT 


ATGGAAAATG 


ATATGGCCTG 


6780 


AGCCTGCTCC 


TGGCTTGCTG 


TGATAGGTAT 


AGAGCTTAGA 


TAATTTTCCT 


TCTTGATCGA 


6840 


GGTCATAGGT 


GATGACTTGG 


T C AGTTCCC A 


AGTCGCAGGT 


CACTAGATAG 


TGGTCAGGTG 


6900 


TTAAATCTGT 


ATAGTGAACA 


TGGGGGGAAG 


CTTGATTTTC 


ATGTGGACCT 


TGGCCACTCT 


6960 


GTTG ATC CAT 


ATCACTAAGT 


AGAAGACTAC 


CATCTTCCTG 


GCGTTTATAA 


ACAAGGACTT 


7020 


GTCCCTTGTG 


ATAGTTAGCT 


GCGTAAACCA 


AATCACGCTT 


TTCATCGACA 


GCAACATAAC 


7080 


AGTGGGGAGC 


TCCTTCTTCA 


ACAACATGAT 


TTAACACAGT 


CCCGTCAGTT 


TGATAGGCTG 


7140 


CAATTCCCCC 


CTTATCGTCT 


TGGCTACCAA 


CAGTGTATAA 


ATGTTGGTGC 


TGGTCAAAGC 


7200 


CAAGGTAGGT 


TGGACTTGGC 


TCAGCTGCAA 


AAAGTTCTAG 


ATTTGAAAGC 


TGACCACTTT 


7260 


CTGTATCAAA 


GTCTGCCTTG 


TAAATCCCTT 


GAGAAGTACG 


ACGTGTATAA 


GTTCCAAAAT 


7320 


AAACAGTTTC 


TTTCATTACT 


ATACCTCTGT 


GTAAAGATAA 


GACTATTATA 


TCACAAAAAC 


7380 


AAGTAAATTA 


AAGATATCCA 


ATT AG AT GTA 


AGCACTTTAA 


AAAAG AG TT A 


TTTTGTTTCA 


7440 


AAAATGGTAT 


AATGAGAGAA 


CAATAGAAAG 


GAAGT ATTT A 


TGGAGCAAAA 


AGAGAAACAT 


7500 


TTTAGCCTAT 


CTTGGTTTTT 


CAAGTGGTTT 


TTAGATAACA 


AGGCAATTAC 


GGTATTTTTA 


7560 


GTAACCTTAT 


TATTGGGACT 


GAATCTTTTT 


ATTTTAAGTA 


AGATTAGTTT 


TCTATTTTCA 


7620 


CCTGTTTTAG 


ACTTTTTAGC 


AGTTGTGATG 


TTGCCAGTCA 


TTTTGTCTGG 


TTTGTTATAT 


7680 


TATTTGTTGA 


ATCCTATTGT 


TGATTGGATG 


GAGAAGCATA 


AGGTTAATCG 


TGTTATAGCT 


7740 


AT CACT ATTG 


TCTTTGTTAT 


CATCGCTCTC 


TTTATCATTT 


GGGGCTTGGC 


AGTCGCCATT 


7800 


CCAAATCTGC 


AACGTCAGGT 


TTTGACCTTT 


GCAAGAAACG 


TTCCTGTTTA 


CTTAGAAGAT 


7860 


ATAGATAGGA 


TTGTTAATGG 


ATTGGTAGCC 


CAGCACCTGC 


CAGATGATTT 


CAGACCTCAA 


7920 


TTAGAGCAAG 


TTTTGACCAA 


TTTTTCTAGC 


CAGG CT AC AG 


TTTTGGCAAG 


TAAGGTTTCA 


7980 
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TCTCAGGCAG 


TCAACTGGGT 


GAGTGCCTTT 


ATTAGCGGGG 


CTTCTCAAGT 


GATTGTTGCC 


8040 


TTGATTATCG 


TTCCTTTCAT 


GCTCTTTTAT 


CTCTTGCGTG 


ATGGGAAAGG 


CTTGCGTAAC 


8100 


TATTTGACCC 


AATTCATTCC 


AAGAAAATTG 


AAGGAACCTG 


TTGGACAAGT 


TTTATCAGAT 


8160 


GTGAATCAAC 


AGTTGTCCAA 


CTATGTTCGA 


GGGCAAGTGA 


CAGTGGCTAT 


TATTGT AGC A 


8220 


GTAATGTTTA 


TCATCTTCTT 


CAAGATTATT 


GGTCTACGCT 


ATGCGGTTAC 


GCTGGGGGTT 


8280 


ACTGCTGGTA 


TTTTAAATCT 


GGTCCCTTAT 


CTTCGTAGCT 


TTCTAGCCAT 


GCTTCCTGCT 


8340 


CTAGTATTGG 


GTTTGATTGC 


TGGTCCAGTC 


ATGCTTTTGA 


AAGTAGTGAT 


TGTCTTTATC 


8400 


GTAGAACAAA 


CTATTGAAGG 


CCGTTTTGTC 


TCTCCATTGA 


TTTTGGGAAG 


TCAATTAAAC 


8460 


ATCCACCCTA 


TTAATGTTCT 


CTTTGTTTTG 


TTAACTTCAG 


GATCTATGTT 


TGGTATCTGG 


8520 


GGAGTTTTAC 


TTGGTATTCC 


GGTTTATGCC 


TCTGCTAAGG 


TTCTCATTTC 


AGCCATTTTC 


8580 


GAATGGTATA 


AGGTAGTCAG 


TGGTCTATAT 


GAATTAGAGG 


GTGAGGAAGT 


CAAGAGTGAA 


8640 


CAATAGTCAA 


CAGATGTTAC 


AGGCTTTGGA 


GGAGCAAGAT 


TTAACTAAGG 


CTGAGCATTA 


8700 


TTTCGC C AAA 


GCTTTAGAAA 


ATGATTCAAG 


TGATCTTCTG 


TATGAATTGG 


CAACTTATCT 


8760 


T GAAGGG ATT 


GGTTTCTATC 


CTCAGGCCAA 


GGAAATTTAC 


CTGAAAATTG 


TAGAGGATTT 


8820 


TCCAGAGGTT 


CATCTTAATC 


TAGCTGCAAT 


TGCTAGCGAG 


GATGGTCAAA 


TAGAAGAAGC 


8880 


CTTTACCTAT 


CTTGAGGAAA 


TCCAAGCTGA 


CAGTGACTGG 


TATGTCTCGT 


CTTTGGCTCT 


8940 


GAAGGCAGAC 


CTTTACCAGC 


TGGAAGGTTT 


GACAGATGTG 


GCACGTGAGA 


AATTATTGGA 


9000 


GGCCTTGACC 


TACTCAGAGG 


ATTCTCTCTT 


GATATTGGGT 


TTGGCAGAGT 


TGGATAGTGA 


9060 


GTTGGAAAAT 


TACCAAGCGG 


CTATTCAAGC 


CTATGCCCAG 


TTAGATAATC 


GCTCGATTTA 


9120 


TGAGCAAACG 


GGCATTTCCA 


CCTATCAACG 


AATTGCCTTT 


GCCTATGCTC 


AGTTAGGGAA 


9180 


ATTTGAAACG 


GCTACTGAGT 


TTTTAGAAAA 


AGCCCTGGAG 


TTAGAATACG 


ATGACTTAAC 


9240 


AGCTTTTGAG 


TTGGCCAGTC 


TTTATTTTGA 


TCAAGAAGAA 


TATCAAAAAG 


CCACCCTCTA 


9300 


CTTTAAGCAG 


CTTGATACCA 


TTTCTCCTGA 


CTTTGAAGGC 


TATGAGTATG 


GGTAC AGT C A 


9360 


GGCTTTACAT 


AAGGAACATC 


AAGTTCAAGA 


AGCCCTGCGT 


ATCGCTAAGC 


AAGGATTAGA 


9420 


GAAAAATCCC 


TTTGAAACTC 


GCCTCTTGCT 


AGCTGCTTCA 


CAATTTTCTT 


ATGAATTGCA 


9480 


TGATGCTAGT 


GGTGCAGAAA 


ATTATCTCCT 


TACTGCAAAA 


GAAGACGCTG 


AGGATACAGA 


9540 


AGAAATCTTG 


CTTCGTTTAG 


CCACTATTTA 


TCTGGAGCAG 


GAGCGTTATG 


AGGATATTCT 


9600 


AGAATTGCAG 


AGTGAGGAGC 


CAGAAAATCT 


TTTGACCAAG 


TGGATGATTG 


CTCCTTCTTA 


9660 


TCAAGAAATG 


GACGATTTGG 


ATACTGCTTA 


TGAGTATTAT 


CAAGAGTTGA 


CAGGAGATTT 


9720 


GAAGGACAAT 


CCAGAATTTC 


TGGAACACTA 


TATCTATCTC 


TTGCGTGAAT 


TGGGACATTT 


9780 
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TGAAGAAGCA AAAGTCCATG CTCACACTTA CTTAAAACTC GTTCCAGATG ATCTGCAAAT 984 0 

GCAAGAACTG TTTGAGAGAT TGTAAGAATG TTTAACCCAA ATCATTCATA CCTCTCTCAA 9900 

CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT TTCTTCCTCC TCATGAGGTC 9960 

AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT AGATTTCCTC AAAAGGGCAG 10020 

ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC TGTTCTTT AA TGCATCATTA 10080 

ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG ATTCAGGTTG ACTTTTCTAA 1014 0 

TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA GAGACTAGAC AATTTGAGGA 10200 

GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG AAGAAAAAGA TGGCGGAAGC 10260 

GTTTGATTGT TAAAGTTTGC AAGTCACCTC CAGCTAGATG TTTGAGAAAA AGATAGAGAT 10320 

TGTAGGCGAT ACAGCTCATC ATCATACGAA TTCGTTTTTG AT T AAGGTTG AACTATCCGT 103 80 

TTTATCGCCA AAAAATCGG 10399 



(2) INFORMATION FOR SEQ ID NO : 161: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 9409 base pairs 
( 8 ) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
CD] TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GATAAGATTA AGTTAGAAAA GAAAGAACTA GG AC AT AT CT ACCAGATTCA GGTTTTTAAT 60 

AGCT ATGGG C AGGAAGAAAT CTATCGTGTG ATTTTGATGG AGACCAATAT TAGTTCGGTT 120 

TCAACCAATA TCAAGTATGC TGCTGTCTTG ATTAATACCA GTCAGTTGGA ACAGGCTAGT 180 

CAAAAGCATG AG CAATTG AT TGTGGTCGTG ATGGCTAGTT TCTGGATTTT GTCTTTACTT 2 40 

GCCAGTCTCT ATCTAGCTAG GGTCAGTGTT AGGCCCCTGC TTGAGAGTAT GCAGAAGCAA 3 00 

CAGT CTTTTG TGGAAAATGC CAGTCATGAG TTACGAACTC CACTCGCAGT TTTGCAAAAT 360 

CGCTTAGAGA CCCTTTTTCG TAAGCCAGAA GCTACCATTA TGGATGTGAG CGAAAGCATT 4 20 

G CAT CGAGTT TGGAAGAAGT CCGAAATATG CGTTTTTTAA CGACAAGCTT GCTGAACTTA 480 

GCTCGGAGAG ATGATGGGAT TAAGCCGGAG CTTGCAGAAG TTCCAACTAG CTTTTTTAAT 54 0 

ACAACTTTCA CAAACTACGA GATGATTGCT TCGGAAAATA ATCGTGTCTT CCGTTTTGAA 600 

AATCGTATCC ATCGAACAAT TGTCACAGAT CAGCTTCTTC TGAAACAACT GATGACCATT 660 

CTTTTCGATA ATGCCGTCAA GTATACTGAG GAGGATGGTG AAATTGATTT TCTTATCTCG 720 
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GCGACCGATC 


GCAATCTTTA 


TTTACTTGTT 


TCTGATAATG 


GAATCGGTAT 


TTCGACAGAA 


780 


GATAAAAAGA 


AAATTTTTGA 


CCGTTTTTAT 


CGACTACACA 


AGGCTAGAAC 


CCGGCAAAAA 


840 


GGTGGTTTTG 


GTTTAGGATT 


ATCCCTAGCC 


AAGCAAATTG 


TAGATGCTCT 


AAAAGGAACT 


900 


GTTACTGTCA 


AAGATAATAA 


ACCCAAGGGA 


ACAATCTTTG 


AAGTGAAGAT 


TGCCATTCAG 


960 


ACACCATCTA 


AAAAGAAAAA 


ATAAAAATAT 


CGCTCCAATT 


GGGGCGATAT 


TTTGGATTTA 


1020 


TCTTCTACGT 


TTTCGTTTGA 


TAATAGACCG 


TTGAACTTTT 


AAAACAAGTA 


AGCTGAATCC 


1080 


GATTGCTGCG 


GCAAAGGCAA 


GACCAGTTGA 


TAATTTTAAT 


GCTAAAAAGA 


TAAAACTAAA 


1140 


GAT AGCAAT A 


CAGATACAAA 


AAACAGCGAT 


ATTAATAAAA 


AATAGGATTT 


CCTTGAGATT 


1200 


GGCATCAGAT 


TGCGCTTCAG 


GTGTATAAGC 


TTGGTAATGA 


GGAAGCTGCT 


GGTTTAATTC 


1260 


TTCTTGATAG 


TCTACCTCAT 


AGGATTGTAA 


TTTTCTTACG 


GCCATGATTC 


TCTCCTTAAC 


1320 


AGTACATACC 


TATTTTATCA 


TTTTTTCGGC 


AGAGAATTAT 


TACAGAAAGG 


TTACAAAAAG 


1380 


AATAAAGTCC 


CTTTTCATTT 


TCAAAGCATG 


GCTGATTTTG 


GAGAAATGTG 


GTATAATTTT 


1440 


TCTTATGGAA 


AAGATTGTCA 


TTACAGCAAC 


TGCTGAAAGT 


ATTGAACAAG 


TTGAACAACT 


1500 


ACTCGAAGCT 


GGCGTAGACC 


GTATCTATGT 


C GGTG AG AAA 


GATTTTGGTC 


TTCGTCTGCC 


1560 


AACGACCTTT 


AGTTATGACC 


AATTACGTGA 


AATCGCTAAG 


TTGGTTCATG 


ATGCTGGTAA 


1620 


GGAATTGATC 


GTTGCGGTCA 


ATGCTCTCAT 


GCACCAAGAT 


ATGATGGACC 


GTATCAAGCC 


1660 


TTTCTTAAAC 


TTCTTGGAAG 


AAATCAAGAC 


AGACTATATT 


ACGATTGGGG 


ATGCAGGCGT 


1740 


CTTTTACGTA 


CTTAACCGCG 


ATGGTTATTC 


ATTTAAGACC 


ATCTACGATG 


CTTCAACCAT 


1800 


GGTAACTAGC 


AGTCGT CAGA 


TTAACTTCTG 


GGGACAAAAG 


GCTGGCGCAT 


CTGAGGCTGT 


1860 


TTTGGCGCGT 


GAAATTCCAT 


CAGCTGAACT 


TTTCAAAATG 


CCAGAGATTT 


TGGAAATTCC 


1920 


TGCTGAAGTT 


TTGGTTTACG 


GTCCTAGCGT 


CATCCATCAT 


TCTAAACGTC 


CACTCTTGCA 


1980 


AAACTACTAT 


AACTTTACAC 


ATATCGATGA 


TGAAAAGACG 


CATAAACGTG 


ACCTCTTCTT 


2040 


GGCTGAGCCA 


AGTGATCCAG 


AGAGCCACTA 


TTCCATTTTT 


GAAGATAATC 


ATGGGACCCA 


2100 


TATCTTTGCC 


AACAATGACC 


TTGATTTGAT 


GATCAAATTA 


ACAGAATTGG 


TGGAGCATGG 


2160 


CTTTACTCGC 


TGGAAACTAG 


AAGGGCTCTA 


CACTCCTGGT 


CAGAACTTTG 


TTGAGATTGC 


2220 


AAAACTCTTT 


ATCCAAGCGC 


GTAGCTTGAT 


TCAAGAGGGC 


AACTTTAGTC 


ATG CTCAAGC 


2280 


CTTCTTGCTG 


GATGAAGAAG 


TTCGTAAACT 


TCACCCTAAA 


AACCGTTTCC 


TTGATACAGG 


2340 


ATTTTATGAC 


TACGATCCTG 


A CAT GGTT AG 


ATAAAATACA 


TGATTCGTTG 


AGAGAAGGAA 


2400 


GATGCAAACA 


TTTCTTCTCT 


CAATTTTTCG 


TATTTCTTCA 


CTATTTTACA 


AAAATCAGCA 


2460 


GGCTAGAATG 


CTCTATTCGA 


TGGGATTTTT 


AAGAAAAGTA 


GTGTTCTTGA 


GTTTGAAAAT 


2520 
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TATCCTATCT 


TTGCAGGTGC 


CAAATGGCCC 


TTTTTTT GGT 


ATAATTTTTT 


ATAATGAAAA 


2580 


CGATTGGTAA 


TCGCTATGTT 


GTGGTGGATT 


TAGAGGCAAC 


TAGCACAGGT 


AGTAAGGCTA 


2640 


AAATTATCCA 


AGTGGGAATT 


GTCGTGATTG 


AGGACGGAGA 


AATCGTCGAT 


CACTATACGA 


2700 


CGGATGTCAA 


TCCACATGAA 


CCCTTGGATG 


CTCATATCAA 


AGAACTGACA 


GGATTGACAG 


2760 


ACCAACGTCT 


GGCG CAAGC A 


CCTGATTTTT 


CGCAAGTTGC 


CAGAAAAATA 


TTTGACTTGG 


2820 


TGGAGGATGG 


GATTTTTGTA 


GCCCATAATG 


TTCAGTTTGA 


TGCTAATCTC 


TTGGCGGAAA 


2880 


ATTTATTTTT 


TGAAGGCTAT 


GAGCTAAGAA 


ACCCTCGTGT 


TGATACGCTC 


GAATTGCCCC 


2940 


AGGTCTTTTT 


CCCTGAACTG 


GAAAAATATA 


GCTTGCCGAT 


TTTGTGTCCA 


GAATTAGGAA 


3000 


TTCCTCTTAA 


ACACGCACAC 


ACAGCCCTTT 


CAGATGCCCA 


AGCTACAGCA 


GAATTACTTC 


3060 


TTTTTTT A CG 


GAAAAAGATG 


ACCCAGCTTC 


CTAAAGGTCT 


CTTGGAACGC 


TTGCTGGAAA 


3120 


TGGCTGACGC 


TCTCCTATAT 


GAGTCCTACC 


TGGTT ATTG A 


GGAAACTTAT 


CGCAACCAAT 


3180 


CTATCCTGAG 


TTCTCCAGAC 


TTGGTCCAAG 


TTCAAGGTCT 


ATATTTTAAG 


AAAACGGAAG 


3240 


CTTCTCTGGA 


GCCACGAAAA 


CTATCTCAAG 


ACTTTTCTAA 


AAATATTTCT 


CTGTTGAACC 


3300 


TTGAAGTGAG 


GGAGGAACAA 


GAAAGTTTTG 


CTAAAGAGGT 


TGGCTTGCTA 


TTGAAAGATG 


3360 


AACCTGTCTC 


TCTGATTCAA 


GCGCCGACAG 


GGATTGGGAA 


AACCTATGGC 


TATCTCTTAC 


3420 


CCGCTTTATC 


TCAATCCAAA 


GAGCGACAAA 


TTGTTCTTAG 


TGTTCCGACA 


AAGATTCTTC 


3480 


AAAATCAAAT 


CATGGAAGAA 


GAAGGTAAAC 


GCCTCAAGGA 


AGTGTTC C AT 


ACAGATATTC 


3540 


ATAGCTTAAA 


GGGACCACAA 


AATTATCTGA 


AGTTGGATGC 


CTTTTATCAT 


TCCTTGCAGG 


3600 • 


AAAATGATGA 


AAATCGCTTA 


TTTAGACGCT 


TTAAAATGCA 


AGTCTTGGTC 


TGGCTTACTG 


3660 


AGACAGAGAC 


AGGAGATTTG 


GATGAAATCG 


GGCAACTCTA 


CCGTTACCAA 


CATTTTCTAG 


3720 


CAGACCTTCG 


TCATGATGGG 


AATTTATCAT 


CCCAGAGCTT 


ATTTGTGACG 


GAAGATTTTT 


3780 


GGAAACGTAG 


TCAAGAAAGG 


GCAGAGACTT 


GCAAGCTTTT 


AGTGACTAAT 


CATGCCTATC 


3840 


TCGTAACCAG 


ACTTGAAGAT 


AATCCTGAAT 


TTGTCAGTGA 


CCGTTT ACTG 


ATTATTGATG 


3900 


AAGTCCAAAA 


GATTTTGTTA 


GCTCTAGAAA 


ATCTGCTTCA 


AGAGACCTAC 


GATATACAAT 


3960 


CTATTATCGA 


TTTAATTGAT 


AAGGCTTTAG 


TAGGAGAAGA 


AAACAGGGTT 


CAACAACGGA 


4020 


TACTAGAAAG 


TATTCGCTTT 


GAGTGTCTCT 


ACTTGATAGA 


ACAATTTCAG 


TCTGGCAAAT 


4080 


CTAGGAAAAA 


TATCTTAGAT 


TCTCTGGACA 


ATCTCCATCA 


GTATTTTTCA 


GAATTGGAAG 


4140 


TAGAAGACTT 


TGATGAGCTG 


GTTCGCTATT 


TTACAGCTGA 


AGGTGATTAC 


TGGCTTGAAG 


4200 


TAACTGAAAC 


GAG TCAAAAG 


AAAATTCAGA 


TTTCTTCT AC 


AAAATCAGGC 


CGTACTCTTC 


4260 
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TGTCCTCTTT 


' ACTTCCTGAG 


; AGTTGCCAAG 


1 TCTTGGGAGT 


ATCGGCTACT 


CTTGAGATTA 


4320 


GTCAGAGGGT 


1 TTCTTTGGCA 


. GACCTTTTAG 


i GCTATCCTGA 


AGCTAAATTT 


GTCAAGATTG 


4380 


AATCTCGGGG 


AAAACAGGAA 


CAAGAAGTGG 


TCATCGTCAA 


AGATTTCCCT 


CTGGTAACAG 


4440 


AAACCTCCTT 


AGAAGTCTAT 


GCCAGAGAGG 


T AGCTG CTTT 


ACT AG TGG AA 


ATTCAAGCTT 


4500 


TCCAGCAACC 


GATTTTGGTT 


CTCTTTACCG 


CTAAAGACAT 


GCTTCTAGCA 


GTATCGGATT 


4560 


TACTTACAGT 


TAGCCACTTG 


GCCCAGTATA 


AAAATGGGGA 


TGTTCATCAG 


CTAAAGAAAC 


4620 


GCTTTGAAAA 


AGGTGAACAA 


CAAATCTTGC 


TTGGTGCAGC 


AAGTTTCTGG 


GAGGGAGTTG 


4680 


ATTTTTCAAG 


CCATCCTTCT 


GTGATTCAAG 


TTGTACCGAG 


GCTTCCTTTC 


CAAAATCCTC 


4740 


AAGAACCCTT 


GACGAAAAAG 


ATTAATCAAG 


AACTGAATCA 


AGAAGGGAAA 


AATGCCTTTT 


4800 


ATGATTATCA 


ATTGCCAATG 


GCCATTATTC 


GTTTAAAACA 


GGCTTTGGGA 


AGAAGTATGA 


4860 


GACGTGAATA 


CCAACGTTCC 


TTAACTCTTA 


TTTTGGATAG 


GAGAATCGTC 


GGAAAACGAT 


4920 


ACGGCAAACA 


AATAGTAGCA 


TCTCTAGCAG 


AAGAAGCGAC 


TGTTAAAACC 


ATCTCTCGAT 


4980 


CCGAAGTTGA 


CGAGGCTATT 


GATAGATTTT 


TTAATGAGCT 


TTGATAAATA 


GTATTGTATG 


5040 


AAAGTATAAG 


GTTAGTATAT 


ATGAAACGTT 


CTCTCGACTC 


AAG AGTCG AT 


TACAGTTTGC 


5100 


TCTTGCCAGT 


ATTTTTTCTA 


CTGGTCATCG 


GTGTGGTGGC 


TATCTATATA 


GCCGTTAGTC 


5160 


ATGATTATCC 


CAATAATATT 


CTGCCCATTT 


TAGGGCAGCA 


GGTCGCCTGG 


ATTGCCTTGG 


5220 


GGCTTGTGAT 


TGGTTTTGTG 


GTCATGCTCT 


TTAATACAGA 


ATTTCTTTGG 


AACGTGACCC 


5280 


C CTTTCT AT A 


TATTTTAGGC 


TTGGGACTTA 


TGATCTTGCC 


GATTGTATTT 


TATAATCCAA 


5340 


GCTTAGTTGC 


ATCAACGGGT 


GCCAAAAACT 


GGGTATCAAT 


AAATGGAATT 


ACCCTATTCC 


5400 


AACCGTCAGA 


ATTTATGAAG 


ATATCCTATA 


TCCTCATGTT 


GGCTCGTGTC 


ATTGTCCAAT 


5460 


TTACAAAGAA 


ACATAAGGAA 


TGGAGACGCA 


CGGTTCCGCT 


GGACTTTTTG 


TTAATTTTCT 


5520 


GGATGATTCT 


CTTTACCATT 


CCAGTCCTAG 


TTCTTTTAGC 


ACTTCAAAGT 


GACTTGGGGA 


5580 


CGGCTTTGGT 


TTTTGTAGCC 


ATTTTCTCAG 


GAATCGTTTT 


ATTATCAGGG 


GTTTCTTGGA 


5640 


AAATT AT T AT 


CCCAGTATTT 


GTGACTGCTG 


TAACAGGAGT 


TGCTGGTTTC 


TTAGCTATCT 


'.-.700 


TT ATT AG C AA 


GGACGGACGA 


GCTTTTCTTC 


ACCAGATTGG 


AATGCCGACC 


TACCAAATTA 


5760 


ATCGGATTTT 


GGCTTGGCTC 


AATCCCTTTG 


AGTTTGCCCA 


AACAACGACT 


T AC CAGCAGG 


5820 


CTCAAGGGCA 


GATTGCCATT 


GGGAGTGGTG 


GCTTATTTGG 


TCAGGGATTT 


AATGCTTCGA 


5880 


ATCTGCTTAT 


CCCAGTTCGA 


GAGTCAGATA 


TGATTTTTAC 


GGTTATTGCA 


GAAGATTTTG 


5940 


GCTTTATTGG 


CTCTGTCCTG 


GTTATTGCCC 


TCTATCTCAT 


GTTGATTTAC 


CGTATGTTGA 


6000 


AGATTACTCT 


TAAATCAAAT 


AACCAGTTCT 


ACACTTATAT 


TTCCACAGGT 


TTGATTATGA 


6060 
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TGTTGCTCTT 


CCACATCTTT 


GAGAATATCG 


GTGCTGTGAC 


TGGACTACTT 


CCTTTGACGG 


6120 


GCATTCCCTT 


CCCTTTCATT 


TCGCAAGGGG 


GATCAGCTAT 


TATCAGTAAT 


CTGATTGGTG 


6180 


TTGGTTTCCT 


TTTATCGATG 


AGTTACCAGA 


CTAATCTAGC 


TGAAGAAAAG 


AGCGG AAAAG 


6240 


TCCCATTCAA 


ACGGAAAAAG 


GTTGTATTAA 


AACAAATTAA 


AT AAGG AG AA 


AATCATGGTA 


6300 


AAAGTAGCAG 


TTATATTAGC 


TCAGGGCTTT 


GAAGAAATTG 


AAGCCTTGAC 


AGTTGTAGAT 


6360 


GTCTTGCGTC 


GAGCCAATAT 


CACATGTGAT 


ATGGTTGGTT 


TTGAAGAGCA 


AGTAACGGGT 


6420 


TCGCATGCAA 


TCCAAGTAAG 


AGCAGATCAT 


GTCTTTGATG 


GAGATTTATC 


AGACTATGAT 


6480 


ATGATTGTTC 


TTCCTGGAGG 


TATGCCTGGT 


TCTGCACATT 


TACGTGATAA 


TCAGACCTTG 


6540 


ATTCAAGAAT 


TGCAAAGCTT 


CGAGCAAGAA 


GGGAAGAAAC 


TAG C AG C CAT 


TTGTGCGGCA 


6600 


CCAATTGCCC 


TCAATCAAGC 


AGAGATATTG 


AAAAATAAGC 


GATACACTTG 


TTATGACGGC 


6660 


GTTCAAGAGC 


AAATCCTTGA 


TGGTCACTAC 


G TC AAGG AAA 


CAGTAGTGGT 


AGATGGTCAG 


6720 


TTGACAACCA 


GTCGGGGTCC 


TTCAACAGCC 


CTTGCCTTTG 


CCTACGAGTT 


GGTGGAGCAA 


6780 


CTAGGAGGGG 


ACGCAGAGAG 


TTTACGAACA 


GGAATGCTCT 


ATCGAGATGT 


CTTTGGTAAA 


6840 


AAT C ACT AAA 


ACGGGAGTTA 


TTCTCTCGTT 


TTTTATGTGG 


AAAACTCAGG 


GAAATCATCG 


6900 


CTTTTTTCAT 


AAAAAAATGC 


TATAATGAAG 


GGTATGAAAT 


ATCACGATTA 


CATCTGGGAT 


6960 


TTAGGTGGAA 


CTTTACTGGA 


TAATTATGAA 


ACTTC AAC AG 


CTGCATTTGT 


TGAAACATTG 


7020 


GCACTGTATG 


GTATCACACA 


AGACCATGAC 


AGTGTCTATC 


AAGCTTTAAA 


GGTTTCTACT 


7080 


CCTTTTGCGA 


TTGAGACATT 


CGCTCCCAAT 


TTAG AG AATT 


TTTTAGAAAA 


GTACAAGGAA 


7140 


AATGAAGCCA 


GAGAGCTTGA 


ACACCCGATT 


TTATTTGAAG 


GAGTTTCTGA 


CCTATTGGAA 


7200 


GACATTTCAA 


ATCAAGGTGG 


CCGTCATTTT 


TTGGTCTCTC 


ATCGAAATGA 


TCAGGTTTTG 


7260 


GAAATTTTAG 


AAAAAACCTC 


TATAGCAGCT 


TATTTTACAG 


AAGTGGTGAC 


TTCTAGCTCA 


7320 


GGCTTTAAGA 


GAAAGCCAAA 


TCCCGAATCC 


ATGCTTTATT 


TAAGAGAAAA 


GTATCAGATT 


7380 


AGCTCTGGTC 


TTGTCATTGG 


TGATCGGCCG 


ATTGATATCG 


AAGCAGGTCA 


AGCTGCAGGA 


7440 


CTTGATACCC 


ACTTGTTTAC 


CAGTATCGTG 


AATTTAAGAC 


AAGTATTAGA 


CATATAAGAA 


7500 


AAAGGAATAA 


GATGACAGAA 


GAAATCAAAA 


ATCTGCAGGC 


ACAGGATTAT 


GATGCCAGTC 


7560 


AAATTCAAGT 


TTTAGAGGGC 


TTAGAGGCTG 


TTCGTATGCG 


TCCAGGGATG 


TACATTGGAT 


7620 


CAACCTCAAA 


AGAAGGTCTT 


CACCATCTAG 


TCTGGGAAAT 


TGTTGATAAC 


TCAATTGACG 


7680 


AGGCCTTGGC 


AGGATTTGCC 


AGCCATATTC 


AAGTTTTTAT 


TGAGCCAGAT 


GATTCGATTA 


7740 


CTGTTGTGGA 


TGATGGGCGT 


GGTATCCCAG 


TCGATATTCA 


GGAAAAAACA 


GGCCCTCCTG 


7800 
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CTGTTCAGAC 


CGTCTTTACA 


GTCCTTCACG 


CTGGAGGAAA 


GTTCCGCGGT 


GGTGGATACA 


7860 


ACGTTTCAGC 


TGGTCTTCAC 


GGGGTCGGGT 


CGTCAGTAGT 


TAATGCCCTT 


TCCACTCAAT 


7920 


TAGACGTTCA 


TGTTCACAAA 


AATGGTAAGA 


TTCATTACCA 


AGAATACCGT 


CGTGGTCATG 


7980 


TTGTCGCAGA 


TCTTGAAATA 


GTTGGAGATA 


CGGATAAAAC 


AGGAACAACT 


GTTCACTTCA 


8040 


CACCGGACCC 


AAAAATCTTC 


ACTGAAACAA 


CAATCTTTGA 


TTTTGATAAA 


TTAAATAAAC 


8100 


GGATTCAAGA 


GTTGGCCTTT 


CTAAATCGCG 


GTCTTCAAAT 


TTCAATTACA 


CATAAGCGCC 


8160 


AAGGTTTGGA 


ACAAACCAAG 


CATTATCATT 


ATGAAGGTGG 


GATTGCTAGT 


TACGTTGAAT 


8220 


ATATCAACGA 


GAACAAGGAT 


GTAATCTTTG 


ATACACCAAT 


CTATACAGAC 


GGTGAGATGG 


8280 


ATGATATCAC 


AGTTGAGGTA 


GC CAT GCAGT 


ACACAACTGG 


TTACCATGAA 


AATGTCATGA 


8340 


GTTTCGCCAA 


TAATATTCAT 


ACCCATGAAG 


GTGGAACACA 


TGAACAAGGT 


TTCCGTACAG 


8400 


CCTTGACACG 


TGTTATCAAC 


GATTATGCTC 


GTAAAAATAA 


GTTACTGAAA 


GACAATGAAG 


8460 


ATAATTTAAC 


AGGGGAAGAT 


GTTCGCGAAG 


GCTTAACTGC 


AGTTATCTCA 


GTTAAACACC 


8520 


CAAATCCACA 


GTTTGAAGGA 


CAAACCAAGA 


CCAAATTGGG 


AAATAGCGAA 


GTGG TCAAG A 


8580 


TT AC CAATCG 


CCTCTTCAGT 


GAAGCTTTCT 


CCGATTTCCT 


CATGGAAAAT 


CCACAGATTG 


8640 


CCAAACGTAT 


CGTAGAAAAA 


GGAATTTTGG 


CTGCCAAGGC 


TCGTGTGGCT 


GCCAAGCGTG 


8700 


CGCGTGAAGT 


CACACGTAAA 


AAATCTGGTT 


TGGAAATTTC 


CAACCTTCCA 


GGGAAACTAG 


8760 


CAGACTGTTC 


TTCTAATAAC 


CCTGCTGAAA 


CAGAACTCTT 


CATCGTCGAA 


GGAGACTCAG 


8820 


CTGGTGGATC 


AGCCAAATCT 


GGTCGTAACC 


GTGAGTTTCA 


GGCTATCCTT 


CCAATTCGCG 


8880 


GTAAGATTTT 


GAACGTTGAA 


AAAGCAAGTA 


TGGATAAGAT 


TCTAGCCAAC 


GAAGAAATTC 


8940 


GTAGTCTTTT 


CACAGCCATG 


GGAACAGGAT 


TTGGCGCAGA 


ATTTGATGTT 


TCGAAAGCCC 


9000 


GTTACCAAAA 


ACTCGTTTTG 


ATGACCGATG 


CCGATGTCGA 


TGGAGCCCAC 


ATTCGTACCC 


9060 


TTCTTTTAAC 


CTTGATTTAT 


CGTTATATGA 


AAC CAATCCT 


AGAAGCTGGT 


TATGTTTATA 


9120 


TTGCCCAACC 


ACCAATCTAT 


GGTGTCAAGG 


TTGGAAGCGA 


GATTAAAGAA 


TATATCCAGC 


9180 


CGGGTGCAGA 


TCAAGAAATC 


AAACTCCAAG 


AAGCTTTAGC 


CCGTTATAGT 


GAAGGTCGTA 


9240 


CCAAACCGAC 


TATTCAGCGT 


TATAAGGGGC 


TAGGTGAAAT 


GGACGATCAT 


CAGCTGTGGG 


9300 


AAACAACCAT 


GGATCCCGAA 


CATCGCTTGA 


TGGCTAGAGT 


TTCTGTAGAT 


GATGTGCAGA 


9360 


AGCAGATAAA 


ATCTTTGATA 


TGTTGATGGG 


GATCGAGTTG 


TCCTCGTCG 




9409 


(2) INFORMATION FOR SEQ ID NO: 162: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6415 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


162 : 






CCTGGGAAAG 


TCTTGAAAAT 


TATGATAGAA 


TGGTGGAAGG 


AAAAATTCAG 


GAGAGTAGTA 


60 


GTGACTCAAA 


ATGTTGAAAG 


TCTTCTCGTA 


TCCATTGTAA 


TCAGTGCATA 


CAATGAAGAA 


120 


AAATATCTGC 


CTGGTCTAAT 


TGAAGACTTA 


AAAAATCAAA 


CCTATCCTAA 


AGAGGATATT 


180 


GAAATTCTAT 


TTATAAATGC 


TATGTCCACA 


GATGGGACCA 


CAGCTATCAT 


TCAGCAATTT 


240 


ATAAAGGAAG 


ATACAGAGTT 


TAACTCAATT 


AG ATT GT AT A 


ACAATCCTAA 


GAAAAATCAA 


300 


GCTAGTGGTT 


TTAACCTGGG 


AGTTAAACAT 


TCTGTAGGGG 


ACCTTATTTT 


AAAAATTG A T 


360 


GCTCATTCAA 


AAGTTACTGA 


GACTTTTGTA 


ATGAACAATG 


TGGC T ATT AT 


TCAACAAGGT 


420 


GAATTTGTCT 


GTGGGGGGCC 


TAGACCGACG 


ATTGTCGAAG 


GAAAAGGAAA 


ATGGGCAGAG 


480 


ACCTTGCATC 


TTGTTGAGGA 


AAATATGTTT 


GGCAGTAGCA 


TTGCCAATTA 


TCGAAATAGT 


540 


TCTGAGGATA 


GATATGTTTC 


TTCTATTTTT 


CATGGAATGT 


ATAAACGAGA 


GGTTTTCCAG 


600 


AAGGTTGGTT 


T ACT AAATG A 


GCAACTTGGC 


CGAACTGAAG 


ATAATGATAT 


TCATTATAGA 


660 


ATTCGAGAAT 


ATGGTTATAA 


AATCCGCTAT 


AGCCCAAGTA 


TTCTATCTTA 


TCAGTATATT 


720 


CGAC CAACAT 


TCAAGAAAAT 


GCTGCATCAA 


AAGTATTCAA 


ATGGTTTGTG 


GATTGGCTTG 


780 


ACAAGTCATG 


TTCAGTTTAA 


GTG TTT ATC A 


TTATTTCACT 


ATGTTCCTTG 


TTTATTTGTT 


840 


TTGAGTCTTG 


TGTTTAGTCT 


AGCATTGTTA 


CCGATCACAT 


TCGTATTCAT 


AACTTTACTA 


900 


TTAGGTGCCT 


ATTTTCTACT 


TTTGTCATTA 


CTCACTTTGC 


TGACTTTATT 


AAAACATAAA 


960 


AATGGATTTC 


TAATTGTGAT 


GCCCTTTATT 


TTATTTTCCA 


TTCACTTTCC 


TTATGGCCTT 


1020 


GGGACGATTG 


TAGGTTTAAT 


TAGAGGATTT 


AAATGGAAGA 


AGGAGTACAA 


GAGAACAATA 


ioao 


ATTTATTTGG 


ATAAAATAAG 


CCAAATAAAT 


CAAAATATGC 


TATAATAACA 


AT AT AGT AAA 


1140 


ACTCTTTTAA 


GGAGGAGTAG 


ATTTCT ATGA 


ATAAAAAACT 


AACAGATTAT 


GTGATTGATC 


1200 


TGGTGGAAAT 


TTTAAATAAA 


CAACAAAAGC 


AGGTTTTCTG 


GGGAATATTT 


GATATTTTCA 


1260 


GTATGGTGGT 


TTCCATCATT 


GTATCTTATA 


TTTTATTTTA 


TGGGCTGATT 


AATCCAGCAC 


1320 


CTGTTGACTA 


CATTATCTAT 


ACGAGTTTGG 


CCTTCCTGTT 


CT AT CAATTG 


ATGATTGGTT 


1380 


TTTGGGGGTT 


GAACGCGAGC 


ATTAGTCGTT 


ACAGCAAGAT 


TACGGATTTC 


ATGAAAATCT 


1440 


TTTTTGGTGT 


GACTGCTAGC 


AGTGTCTTGT 


CATATAGTAT 


CTGTTATGCC 


TTCTTGCCAC 


1500 


TCTTCTCCAT 


CCGTTTCATC 


ATTCTCTTTA 


TCTTGTTGAG 


TACCTTCTTG 


ATTTTATTGC 


1560 
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CACGGATTAC TTGGCAGTTA ATCTACTCCA GACGCAAAAA AGGTAGTGGT GATGGAGAAC 1620 

ACCGTCGGAC CTTCTTGATT GGTGCCGGTG ATGGTGGGGC TCTTTTTATG GATAGTTACC 1680 

AACATCCAAC CAGTGAATTA GAACTGGTCG GTATTTTGGA TAAGGATTCT AAGAAAAAGG 1740 

GTCAAAAACT TGGTGGTATT CCTGTTTTGG GCTCTTATGA CAATCTGCCT GAATTAGCCA 1800 

AACGCCATCA AATCGAGCGT GTCATCGTTG CGATTCCGTC GCTGGATCCG TCAGAATATG 1860 

AGCGTATCTT CCAGATGTCT AATAAGCTGG GTGTCAAATG TTACAAGATG CCTAAGGTTG 1920 

AAACTGTTGT TCAGGGCCTT CACCAAGCAG GTACTGGCTT CCAAAAAATT GATATTACGG 1980 

ACCTTTTGGG TCGTCAGGAA ATCCGTCTTG ACGAATCGCG TCTGGGTGCA GAACTG AC AG 2040 

GTAAGACCAT CTTAGTCACA GGAGCTGGAG GTTCAATCGG TTCTGAAATC TGTCGTCAAG 2100 

TTAGTCGCTT CAATCCTGAA CGCATTGTCT TGCTCCGTCA TGGGGAAAAC TCAATCTACC 2160 

TTCTTTATCA TGAATTGATT CGTAAGTTCC AAGGGATTGA TTATGTACCT CTCATTGCCG 22 20 

ACATTCAAGA CTATGATCGT TTGTTGCAAG TCTTTG AG C A GTACAAACCT GCT AT TGTTT 2280 

ATCATGCGGC AGCCCACAAG CATGTTCCTA TGATGGAGCG CAATCCAAAA GAAGCCTTCA 2 340 

AAAACAATAT CCGTGGAACT TACAATGTTG CTAAGGCTGT TGATGAAGCT AAAGTGTCTA 2400 

AGATGGTTAT GATTTCGACA GATAAGGCAG TCAATCCACC AAATGTTATG GGAGCAACCA 24 60 

AGCGCGTGGC GGAGTTGATT GTCACTGGCT TTAACCAACG TAGCCAATCA ACCTACTGTG 2520 

CAGTTCGTTT TGGGAATGTT CTTGGTAGCC GTGGTAGTGT CATTCCAGTC TTTGAACGTC 2580 

AGATTGCTGA AGGTGGGCCT GTAACGGTGA CAGACTTCCG TATGACCCGT TACTTTATGA 2640 

CC ATT CC AG A AGCTAGCCGT CTGGTTATCC ATGCTGGTGC TTATGCCAAA GATGGGGAAG 2700 

TCTTTATCCT TGATATGGGC AAACCAGTCA AGATTT AT G A CTTGGCCAAG AAGATGGTGC 2760 

TTCTAAGTGG CCACACTGAA AGTGAAATTC CAATCGTTGA AGTTGGAATC CGCCCAGGTG 2820 

AAAAACTCTA CGAAGAACTC TTGGTATCAA CCGAACTCGT TGATAATCAA GTTATGGATA 2880 

AG ATTTT CGT TGGTAAGGTT AATGTCATGC CTTTAGAATC CATCAATCAA AAGATTGGAG 2940 

AGTTCCGCAC TCTCAGTGGA GATGAGTTGA AGCAAGCTAT TATCGCCTTT GCTAATCAAA 3000 

CAACCCACAT TGAATAAAAA AGAAAAACGC ATAGTATCAA GTTACACAAC CTTGGTAATA 3060 

TGCGTTTTAT TATGTAGAGA CTTATACTCT TCGAAAATCT CTTCAAACCA CGTCAACGTC 3120 

GC CTTGCCGT ATATGGTTAC TGACTtCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT 3180 

TGAGytGACT TCGTCAGTTC TATCCACAAC CTCAAAAC AG TGTTTTGAGc TGACtTCGTC 3240 

AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGAcT TCGTCAGTTC CATCCACAAC 3 300 

CTTAAAACAG TGTTTTGAGy TGACnTTCGT CAGTTCCATC TACAACCTTA AAACAGTGTT 3 360 
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TTGAGCTGCC 


CGCAGCTAGT 


TTCCTAGTTT 


GCTCTTTGAT 


TTTCAT TG AG 


TATTACTTCA 


3420 


TTTTCTTCTG 


AAATGGAATT 


GTTACCCAGT 


CTATGCTATT 


GAAAATACGC 


CAAAACTTCT 


3480 


AAGGGTTTGT 


GAGCGATATA 


ATCAGGTTGA 


TAGTTTAGTA 


GATCTGCTTG 


CTCTCCAAAT 


3540 


CCCCAAGTGA 


TGCCCAATTT 


CTGAATACCT 


GTTTCTCGAG 


CTCCCACCAT 


ATCAAACTTG 


3600 


GTATCTCCGA 


TGATGATGGC 


TTGTTCTGGT 


GCTAGTTGAT 


GTGTCTGCAA 


GGCTTGGTGA 


3660 


ATGACATCTG 


CCTTATGGGG 


TGCTTCAGGG 


CTAGAACCAT 


AAATGCCATC 


AAAGAAATGA 


3720 


TGGATTTCCA 


AGTTTTTTGC 


CATGTCTTGA 


GCAGTAGATG 


TATCCTTTGT 


CGTGGTGATG 


3780 


TAGAGTGGAT 


AACTGCTCGA 


TAACTCCTCA 


AGCAAGTCTA 


TAATCTGAGG 


AAAGAGTTGA 


3840 


GCTTCATAGA 


TGCCITTTGC 


CTTATAGTAA 


GAACGATATA 


TCTGCACGGC 


TTCAGAAATT 


3900 


TGGTCTTTGG 


ACAGGCAGGT 


CGCAAAACTA 


CTTTCGAGAG 


GTGGTCCCAT 


AAAAC C AC G A 


3960 


ATAGTTTTGG 


CATCAGGGCT 


AGGCACCCCC 


AGCTCTTTAA 


AGGTATAGGT 


AAACCCATTG 


4020 


TGAATCCCGA 


TAGAACTATC 


AACGAGGGTT 


CCATCCAAAT 


CGAAAAAAAT 


CGCTGTGATA 


4080 


GAGGTCATGG 


TTTCTCCTAT 


TTGATAAGCT 


TATTCTCCGA 


AAATTTCTTT 


TTGGAGGCGA 


4140 


CGACCAGTAG 


GGGTGGTAGC 


GAGTCCACCT 


TCAGCTGTTT 


CACGAAAGGC 


AGTTGGCATG 


4200 


CTTGCTCCTA 


CTTGGTACAT 


GGCATCGATC 


ACTTCATCCA 


CAGGGATTTT 


AGATTCGATA 


4260 


CCTGCCAAGG 


CCATGTCTGC 


T GCG ATGAAA 


GCAAAGCTAG 


CTCCCATCGC 


ATTACGTTTG 


4320 


ACACAGGGAA 


CTTCGACCAA 


ACCTGCAACA 


GGGTCACAGA 


TGAGGCCTAG 


CATATTTTTA 


4380 


ATGACAAAGG 


CAATAGCTTG 


ACTGGCCTGA 


TAAGCTGTTC 


CACCTGCAGC 


CAGAGTCAAG 


4440 


GCGGCAGCAC 


TCATAGCAGA 


GGCTGAACCA 


ACTTCAGCTT 


GACACCCACC 


CTCAGCACCT 


4500 


GAGATGGAGG 


CATTGTTTGC 


GATGACTAGT 


CCAAAGGCAC 


CAGCAGCAAA 


GAGGAAATCC 


4560 


AATTGTTGCT 


CGTGGCTGAG 


GTCTAATTTT 


TCAAT AGC AG 


CAGTGAGAAC 


GGATGGCAGA 


4620 


CAGCCAGCAC 


TTCCAGCGGT 


TGGAGTGGCA 


CAGACCAAGC 


CCATTTTCGC 


ATTGTGTT C A 


4680 


TTGACTGCGA 


TGGCATTTCG 


GGCAGCCGAG 


AGAATCGTAT 


AATCTGACAG 


AGTTTTTCCC 


4740 


TTTTCGATGT 


AGTGATCCAA 


TTTGCCAGCA 


TCTCCACCTG 


TCAGGCCACT 


ACGAGATTTA 


4800 


TTTTCATTGA 


GGCCAAGTTG 


GACAGAGGCT 


TTCATAACTT 


CCAGATTGCG 


TTCCATGAGA 


4860 


AGGAAGACTT 


CTTCACGTTC 


GCGACCGGTC 


AATTCAAACT 


CTGTTGTAAT 


CATGAGTTCT 


4920 


GCGACATTTC 


CTTGAAAGTC 


CAGATCTGCT 


TGCTCGACCA 


ATTCTTTGAT 


AG AAT AAAAC 


4980 


ATGCTTCCTC 


CTATTTAAAG 


AAATTGACAT 


TGTGGAGATG 


AGGGATTTTT 


CGAATTTCTT 


5040 


CGATAGCCTC 


ATCACAGTTG 


CGACTGTCAA 


CTTCGATAAT 


CATAATGGCT 


TTTTCACCAG 


5100 
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CTTTTTCACG AGTGACATTC 


ATCTGGGCGA 


TATTGATACC 


ATAGCGGGAA 


AGCGCCTCTG 


5160 


TAACAAGGGC AATCATACCT 


GGAATATCTT 


GATGAACGAT 


GATGATAGTC 


GGTGTATTCA 


5220 


TATTGAGAGA GACGGCAAAA 


CCATTGAGTT 


CGGTTACCTG 


AATATTTCCT 


C C ACCG AT AG 


5280 


AAATACCAGT CACGCTGATG 


GTCTTGTGGG 


CATTTTTAAC 


AGTAATTTTA 


GTGGTGTTAG 


5340 


GGTGAGGGGC ATTGCTGTCT 


TTCTGAATGG 


TCCAGACAAT 


CTTGATACCA 


CGCTTGTGGG 


5400 


CAATTTCCAG A CT ATTTGG A 


ATTTCAGGAT 


CATCTGTATC 


CATTCCTAAA 


ATACCTGCAA 


5460 


C AAGGGCT AG GTCTGTTCCG 


TGACCACGAT 


AGGTCTTGGC 


AAATGAGTTA 


AAAAGTTGGA 


5520 


ATTCAACTTC TGTCGGAGTA 


TCATCAAAAA 


TGGAAGAGAC 


AATCTTCCCA 


ATACGAACAG 


5580 


CACCAGCGGT ATGGCTACTA 


GATGGGCCAA 


T CAT AACTGG 


TCCGATGATA 


TCAAAGACAG 


5640 


ATTGAAAACG AAGTGATTTC 


ATCAGTTTCC 


CCTTATAAAA 


ATTCTTATCT 


CTATTATATC 


5700 


AAAGAATGAG GGGCTTGGCT 


TTAATTGTGG 


ATGAAAACCT 


TTCTAATACC 


TCAAATAGCA 


5760 


TAAAAATAGT ATCTTTTATG 


ACAAAAAACA 


CCTTATTTAG 


GGAAATAAAA 


AATAATTTTG 


5820 


TAATATTTCT ACATAAAAGT 


GTCAAGAAAC 


GGTAATATTT 


AAAGGGTATG 


ATAGAACTAT 


5880 


AGAAAGAAGG AGAATTTTCG 


AATATGAAAT 


CAATAACTAA 


AAAGATTAAA 


GCAACTCTTG 


5940 


CAGGAGTAGC TGCCTTGTTT 


GCAGTATTTG 


CTCCATCATT 
# 


TGTATCTGCT 


CAAGAATCAT 


6000 


CAACTTACAC TGTTAAAGAA 


GGTGATACAC 


TTTCAGAAAT 


CGCTGAAACT 


CACAACACAA 


6060 


CAGTTGAAAA ATTGGCAGAA 


AACAACCACA 


TTGATAACAT 


TCATTTGATT 


TATGTTGATC 


6120 


AAGAGTT GGT TATCGATGGC 


CCTGTAGCGC 


CTGTTGCAAC 


ACCAGCGCCA 


GCTACTTATG 


6180 


CGGCACCAGC CGCTCAAGAT 


GAAACTGTTT 


CAGCTCCAGT 


AGCAGAAACT 


CCAGTAGTAA 


6240 


GTGAAACAGT TGTTTCAACT 


GTAAGCGGAT 


CTGAAGCAGA 


AGCCAAAGAA 


TGGATCGCTC 


6300 


AAAAAGAATC AGGTGGTAGT 


ATACAGCTAC 


AAATGGACGT 


TATATCGGAC 


GTTACCAATT 


6360 


AACAGATTCA TACCTGAACG 


GTGACTACTC 


AGCTGAAAAC 


CAAGAACGGG 


TACCG 


6415 


(2) INFORMATION FOR SEQ ID NO: 163: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8494 base pairs 
(B> TYPE: nucleic acid 
( C > STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACCCCTTTC GAATTTTGGC AAAAATTCGG TAAGGCTTTG ATGCTAGTTA TCGCGGTTAT 
GCCGGCTGCT GGTTTGATGA TTTCAATCGG T AAGTCT AT C GTGATGATTA ACCCAACCTT 



60 



120 
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TGCACCACTT GTCATCACAG CTGGAATTCT TGAGCAAATC GGTTGGGGGG TTATCGGTAA 180 
CCTTCACATT TTGTTTGCCC TAGCCATTGG AGGAAGCTGG GCTAAAGAAC GTGCTGGTGG 240 
TGCTTTCGCC GCTGGT CTTG CCTTCATCTT GATTAACCGT ATCACTCGTA CAATCTTTGG 3 00 

TGTATCAGGC G AT ATGTTG A AAAATCCAGA TGCTATGGTA ACTACTTTCT TTGGTGGTTC 3 60 

AATCAAAGTT CCTGATTACT TTATCAGTGT TCTTGAAGCT CCAGCCTTGA ACATGGGGGT 4 20 

ATTCGTACGG ATTATCTCAG GTTTTGTAGG GGCAACTGCT TACAACAAAT ACTACAACTT 4 80 

CCGTAAACTT CCTGATGCAC TTTCATTCTT CAACGGGAAA CCTTTCGTAC CATTTGTACT 54 0 

TATTCTTCGT TCAGCAATCG CTGCAATTCT ACTTGCTGCT TTCTGGCCAG TAGTTCAAAC 600 
AGGTATCAAT AACTTCGGTA TCTGGATTGC CAACTCACAA GAAACTGCTC CAATTCTTGC 660 
ACCATTCTTG TATGGTACTT TGGAACGTTT GCTCTTGCCA TTTGGTCTTC AC C AC ATGTT 720 
G ACT ATCC C A ATGAACTACA CAGCTCTTGG TGGTACTTAT GACATTTTAA CTGGTGCAGC 780 
TAAAGGTACT CAAGTATTCG GTCAAGACCC ACTATGGCTT GCATGGGTAA CAGACCTTGT 84 0 

AAACCTTAAA GGTACTGATG CTAGTCAATA TCAACACTTC TTAGATACAG TACATCCAGC 900 
T CGTTTC AAA GTTGGACAAA TGATCGGTTC ATTCGGTATC TTGATGGGTG TGATTGTTGC 9 60 

TATCTACCGT AATGTTGATG CTGACAAGAA ACATAAATAC AAAGGTATGA TGATTGCAAC 1020 

AGCTCTTGCA ACATTCTTGA CAGGGGTTAC TGAACCAATC GAATACATGT TCATGTTCAT 1080 

CGCAACACCT ATGTATCTTG TTTACTCACT TGTTCAAGGT GCTGCCTTCG CTATGGCTGA 1140 

CGTCGTAAAC CTACGTATGC ACTCATTCGG TTCAATCCAC TTCTTGACTC GTACACCTAT 1200 

TGCAATCAGT GCTGGT ATTG GTATGGATAT CGTTAACTTC GTTTGGGTAA CTGTTCTCTT 12 60 

TGCTGTAATC ATGTACTTTA TCGCAAACTT CATGATTCAA AAATTCAACT ACGCAACTCC 13 20 

AGGGCGCAAC GGAAACTACG AAACTGCTGA AGGTTCAGAA GAAACCAGCA GCGAAGTGAA 13 80 

AGTTGCAGCA GGCTCTCAAG CTGTAAACAT TATCAACCTT CTTGGTGGAC GTGT AAA CAT 14 4 0 

CGTTGATGTT GATGCATGTA TGACTCGTCT TCGTGTAACT GTTAAAGATG CAGATAAAGT 1500 

AGGAAATGCA GACCAATGGA AAGCAGAAGG AGCTATGGGT CTTGTCATGA AAGGACAAGG 1560 

GGTTCAAGCT ATCTACGGTC CAAAAGCTGA C ATTTTG AAA TCTGATATC C AAGATATCCT 1620 

TGATTCAGGT GAAATCATTC CTGAAACTCT TCCAAGCCAA ATGACTGAAG CACAACAAAA 1680 

CACTGTTCAc TTCAAAGATC TTACTGAGGA AGTTTACTCA GTAGCAGACG GTCAAGTTGT 1740 

TGCTTTGGAA CAAGTAAAGG ATCCAGTATT TGCTCAAAAA ATGATGGGTG ATGGATTTGC 1800 
AGTAGAACCT GCAAATGGAA ACATTGTATC TCCAGTTTCA GGTACTGTGT CAAGCATCTT . 1860 
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CCCAACAAAA CATGCTTTTG GTATTGTGAC GGAAGCAGGT CTTGAAGTAT TGGTTCACAT 1920 

TGGTTTGGAC ACAGTAAGTC TTGAAGGTAA ACCATTTACA GTTCATGTTG CTGAAGGACA 1980 

AAAAGTTGCA GCAGGAGATC TCCTTGTCAC AGCTGACTTG GATGCTATCC GTGCAGCAGG 2040 

ACGTGAAACT TCAACAGTAG TTGTCTTCAC AAATGGTGAT GCAATTAAAT CAGTTAAGTT 2100 

AGAAAAAACA GGTTCTCTTG CAGCTAAAAC AGCAGTTGCT AAAGTAGAAT TGTAATATAC 2160 

TTGAGGTTGG AAGCTGTATT CCAACCTCTT ATTTTGGGAG AAAAGAATGA AATTTTTAAC 2220 

ACTCAATACT CACAGTTGGA TGGAGAAAGA AGCAGAGGAA AAATTCCAGA TTTTGCTTGA 2280 

AGAT ATTCT T GAAAAGGACT ATGATTTGAT TTGTTTTCAA GAAATCAATC AGGAGATGAC 2340 

CTCGTCAGAG GTGGAGGTTA ATGACCTTTA TCAAGCTTTG CCAGCAGCTG AGCCTATTCA 2400 

CCAAGAC CAT TATGTTAGAC TCTTGGTTGA AAAGTTGTCT GAGCAAGGGA AAAATTACTA 2460 

CTGGACCTGG GCCTATAACC ATATCGGCTA TAACCGCTAC CACGAAGGTG TGGCTATCTT 2 520 

GTCTAAAACA CCTATTGAAG CCAGAGAAAT TTTGGTTTCA GATGTGGATG ATCCAACAGA 2580 

CTATCATACT CGCCGTGTTG CCCTAGCTGA AACTGTAGTC GATGGCAAGG AGCTAGCAGT 264 0 

TGCCAGTGTT CATCTCTCTT GGTGGGATAA AGGTTTCCAA GAAGAATGGG CACGATTTGA 2700 

GGCTGTCTTG AAAAAATTGA ACAAG CCACT TTTACTAGCT GGAGATTTCA ACAATCCGGC 2760 

TGGACAGGAA GGTTACCAAG CTATTTTAGC TAGTCCATTA GGCTT ACAAG ACGCATTTGA 2820 

AGTTGCTCAA GAGAAAAGTG GTAGCTATAC TGTTCCGCCT GAAATTGATG GCTGGAAAGG 2880 

GAACACTGAA CCCCTTCGAA TCGATTATGT CTTTACTACC AAAGAGTTAG CGGTGGAAAA 2940 

TTTACATGTC GTATTTGATG GTAACAAGAG TCCACAAGTG AGTGATCACT ATGGCTTGAA 3000 

TGCTATATTA AACTGGAAAT AATAACTGAA AAG AGGTTGG AACTATAAAA TTCCAGCCTT 3060 

TTCTTACTAG AG AAG CT ACT GGAAATAGCC TAAATAAGTG AG ACT ACTGT AATGGAATAA 3120 

AATATGGTAT AATTGATAAG GT AGAT AG AA TCGAGGATGT TATGTCATTT ACGAAATTTC 3180 

AATTTAAAAA CT AT ATT AG A GAAGCCTTGA AGGAGTTAAA ATTTACAACT CCAACAGAGG 324 0 

TGCAAGACAA GTTGATTCCT ATTGTTTTGG CAGGTCGTGA CCTAGTAGGA GAATCAAAAA 3300 

CAGGTTCAGG TAAGACTCAT ACTTTCTTGT TACCGATTTT CCAGCAATTA GATGAAGCTA 3360 

GCGATAGTGT ACAAGCAGTG ATTACTGCAC CGAGTCGTGA GTTGGCTACT CAAATTTACC 3420 

AAGTAGCGCG TCAGATTTCA GCTCACTCAG ATGTCGAAGT TCGTGTGGTT AATTATGTGG 34 80 

GTGGTACGGA TAAGGCTCGC CAGATTGAGA AATTGGCAAG CAATCAGCCT CAT ATTGTT A 3540 

TTGGAACACC AGGC CGTATC TACGACTTGG TTAAATCTGG TGATTTAGCT ATTCATAAAG 3600 

CCAAGACATT TGTTGTTGAT GAAGCAGATA TGACCTTGGA TATGGGATTC TTGGAAACTG 3660 
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TTGATAAGAT TGCTGGCAGT CTTCCAAAAG ACTTGCAATT CATGGTCTTC TCAGCGACTA 3720 

TCCCACAAAA ACTGCAACCA TTCTTGAAAA AATACTTATC AAATCCTGTT ATGGAGAAAA 3 7 80 

TTAAGACCAA AACGGTTATT TCTGACACCA TTGATAATTG GTTGATTTCG ACC AAGGG AC 3 84 0 

ATG AT AAGAA TGCTCAAATT TACCAGTTGA CTCAGTTGAT GCAGCCGTAT TTGGCAATGA 3900 

TTTTTGTTAA CACTAAAACG CGTGCTGATG AATTGCATTC ATATCTGACT GCTCAAGGCT 3 9 60 

TGAAGGTTGC AAAAATC C AT GGCGATATTG CCCCTCGTGA ACGCAAGCGA ATCATGAATC 402 0 

AGGTGCAAAA TCTGGATTTT GAGTATATTG TCGCAACAGA TTTGGCAGCG CGTGGGATTG 4080 

ACATTGAAGG TGTCAGCCAT GTCATCAATG ATGCCATTCC GCAAGACTTA TCTTTTTTTG 4140 

TTCATCGTGT TGGTCGTACT GGACGAAATG GCCTACCAGG T AC AG CT ATT ACCCTTTATC 4200 

AGCCAAGTGA TGACTCGGAT ATCCGTGAGT TGGAGAAATT GGGAATCAAG TTTAGTCCTA 4 2 60 

AGATGGTCAA AGACGGGGAA TTTCAAGATA CCTATGACCG TGATCGTCGT GCCAACCGTG 432 0 

AGAAAAAACA AGAT AAA CTT GATATCGAAA TGATTGGTTT GGTTAAAAAG AAAAAGAAAA 43 8 0 

AAGTCAAACC GGGTTATAAG AAGAAAATTC AATGGGCGGT TGATGAAAAG CGCCGTAAAA 444 0 

CCAAGCGTGC TGAAAATCGC GCTCGCGGTC GTGCAGAGCG TAAAGCTAAA CGCCAAACAT 4 500 

TTTAATAGAA ATTGTTGGAG TATTGAGCTC CAACTTTTTT ATTTATGAGA ACGAACTATC 4560 

TAAACCGAAA CACTACATTA AAGACTGCAA ATTGCGATTA AAAATGGTAT AATGATAAAG 4620 

TTATATAGTC CCGATAAGAT GGTAGGTATT TATTACGAAG AGTTTTCCTA TCAGTACTTT 4680 

GTAACTCTAT AACAATATTT TTTAAGGGGG GACATTTTTA TGTCAGAGCG TAAATTATTC 4 740 

ACGTCTGAAT CTGTATCTGA GCCGCATCCG GATAAGATTG CAGACCAAAT TTCAGATGCG 4800 

ATTTTGGATG CTATTTTAGC AAAGGATCCA GAGGCGCACG TTGCTGC TG A AACAGCTGTA 4 8 60 

TATACTGGTT CTGTCCACGT TTTTGGTGAA ATTTCTACAA ATGCCTATGT GGATATTAAC 4 920 

CGTGTGGTTC GTGATACCAT TGCAGAGATT GGTTATACCA ATACAGAATA TGGATTTTCT 4 980 

GCTGAGACGG TGGGAGTACA CCCATCTTTG GTGGAACAAT CTCCTGACAT CGCTCAAGGT 5040 

GTTAACGAAG CCTTGGACGT TCGTGGAAAT GCTGATCAAG ATCCACTGGA CTTGATTGGA 5100 

GCAGCTGACC AAGGG CT CAT GTTTGGATTT GC AGT AGATG AAAC AGAAG A GCTTATGCCA 5160 

TTGCCAATTG CACTCAGTCA TAAATTGGTT CGTCGTCTGG CAGAACTTCG TAAGTCTGGA 5220 

GAAATTAGCT ATCTCCGTCC AGATGCAAAA TCACAAGTTA CAGTTGAGTA CGATGAAAAT 5280 

GACCGTCCGG TACGTGTAGA TACAGTCGTT ATTTCTACTC AGCATG ATCC AGAGGCCACT .5340 

AATGAACAAA TCCATCAAGA TGTGATTGAC AAGGTCATCA AAGAAGTTAT TCCATCTTCT 5400 



WO 98/18931 



PCT/US97/19588 



1056 



TATCTTGATG 


ATAAGACAAA 


ATTCTTTATC 


AATCCGACAG 


CTCGTTTTGT 


AATCGGTGGT 


5460 


CCTCAAGGGG 


ACTCAGGTTT 


GACTGGTCGT 


AACATTATTG 


TAGATACTTA 


TGGTGG CT AC 


5520 


TCTCGTCATG 


GTGGTGGTGC 


CTTCTCTGGT 


AAAGATGCGA 


CTAAGGTGGA 


TCGTTCAGCC 


5580 


TCTTATGCGG 


CTCGCTATAT 


TGCCAAGAAT 


ATCGTTGCAG 


CAGACCTTGC 


TAAGAAGGCA 


5640 


GAAGTGCAGT 


TGGCCTATGC 


TATCGGTGTT 


GCGCAACCTG 


TTTCTGTTCG 


TATCGATACT 


5700 


TTCGGTACAG 


GAACAGTAGC 


TGAAAGTCAA 


CTTGAAAAAG 


CGGCTCCTCA 


AATCTTTGAC 


5760 


CTTCGCCCTG 


CAGGGATTAT 


CCAAATGCTG 


GACCTCAAGC 


GT CCAATTT A 


CCGTCAAACA 


5820 


TCGGCTTACG 


GTCACATGGG 


ACGTACAGAT 


ATTGATCTTC 


CATGGGAACG 


TTTGGATAAG 


5880 


GTAGATGCTT 


TGAAAGAAGC 


AGTAAAATAA 


GATTTTAAGA 


GGGGAACGTC 


CTCTCTTTTT 


5940 


TATAGTTTTT 


AACTATACTG 


GGATACTGTT 


CTGAAAATCC 


ATTTTGCGAA 


AGTAGAGATT 


6000 


TACATGTATA 


GTAGATTGAA 


ACTAGAATAG 


TACACCTCAA 


CTTCTAAAAC 


ATTGTTAGCA 


6060 


AT C AATTTG A 


CTGTCCTGAT 


CGATTTCTCC 


TGTTCTTGTT 


TCATTTTACT 


ATATTTCTTT 


6120 


AAAAATGATA 


AAGGTTAAGA 


TTTCTCCTCG 


TAATAGATAA 


TCTTGGGGAT 


ATTTCAATCC 


6180 


AAAGTTTTAT 


TCGTTATCAC 


TTGACTATTG 


C AAGGTTTT C 


TAGAGCAACA 


GAGTCATGGA 


6240 


ATGGACTCAT 


GGTTGAGATT 


TCTCCTTGTT 


GCTTGGACTT 


CATTCAAAAG 


TCTGTTACCC 


6300 


aagccttgtt 


CAAACTTCTA 


ATACACTAGC 


T GTTTCCAT A 


GCATGACTTC 


TGTACTAGAC 


6360 


TTTCTTTTCC 


GAATAAATAG 


ATAGAACCAC 


AGAATCTAGT 


AAACCTAGAA 


TTAAAATTAT 


6420 


GGTATAATAT 


TAGCAATAAA 


AGAAATCTGG 


AGGATTAGAA 


TCATGGTATC 


AACGAAAACA 


6480 


CAAATTGCTG 


GTTTTGAGTT 


TGACAATTGC 


TTGATGAATG 


CAGCAGGTGT 


GGCTTGTATG 


6540 


ACGATAGAGG 


AGTTAGAAGA 


GGTCAAAAAC 


TCAGCGGCAG 


GAACCTTTGT 


TACTAAGACA 


6600 


GCGACCTTGG 


ACTTCCGTCA 


GGGGAATCCT 


GAGCCACGCT 


ACCAAGATGT 


TCCACTTGGT 


6660 


TCCATCAACT 


CTATGGGCTT 


GCCAAATAAT 


GGCTTAGACT 


ATTATTTGGA 


TTATCTTTTA 


6720 


GATTTGCAGG 


AAAAAGAGTC 


GAACCGAACT 


TTCTTCTTAT 


CTCTGGTCGG 


CATGTCTCCA 


6780 


GAGGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


CAAGAGAGTG 


ATTTTCGTGG 


TCTGACTGAG 


6840 


CTAAATCTTT 


CCTGTCCAAA 


TGTTCCAGGT 


AAAC CTCAGA 


TTGCCTATGA 


TTTTGAGACA 


6900 


ACAGACCGGA 


TTTTGGCAGA 


AGTGTTTGCT 


TACTTCACCA 


AACCTCTTGG 


AATTAAATTG 


6960 


CCACCTTATT 


TTGATATTGT 


TCACTTTGAC 


CAAGCGGCAG 


CTATTTTCAA 


CAAATATCCG 


7020 


CTCAAGTTTG 


TCAACTGCGT 


TAACTCTATC 


GGAAACGGCC 


TCTATATAGA 


AGACGAATCT 


7080 


GTCGTTATTC 


GGCCTAAGAA 


TGGTTTTGGT 


GGAATTGGTG 


GAGAATACAT 


CAAACCGACT 


7140 


GCTTTAGCCA 


ATGTTCACGC 


CTTTTATCAA 


CGTTTAAATC 


CTCAAATCCA 


AATTATCGGA 


7200 
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ACAGGTGGCG TTCTGACTGG TCGAGATGCC TTTGAACACA TCCTCTGTCG AGCAAGTATG 7 260 

GTGCAGGTCG GAACGACCCT TCACAAAGAA GGCGTCAGTG CTTTTGACCG CATTACCAAT 73 2 0 

GAACTGAAAG CAATCATGGT GGAAAAAGGC TACGAGAGCT TAGAAGATTT CCGTGGGAAA 7 3 80 

TTGCGCTATA TTGACTAAAT TAAATCGAAA AATCTGAAGA AAGGAGAGAC GATGCTAGCC 7440 

ATTGAAGAAA GTCAGAAGTT GACTTTATCA AATTTACCGA GCCTGAGCCT ATTTACAGGG 7 500 

ACAGATCAGG GTCAGTTTGA AGTGATGAAG AGTCAAATGT TGAAACAGAT TGGGTATGAT 7 560 

TCTGCTGACC TCAACTTTGC CTACTTTGAT ATGAAAGAAG TAGTTTACAA GGATGTGGAA 7620 

CTG GAGTTGG TCAGCCTTCC TTTCTTTGCG GATGAGAAAA TCGTGATATT AGATTATTTT 7 680 

ATGGATATCA CGACTGCTAA GAAACGCTTT TTGACAGATG ATGACCTTAA GTCATTTGAG 774 0 

GAATACCTTG ACAATCCTTC TCCAACAACC AAGTTGATAA TCTTTGCAGA AGGAAAGCTG 7 800 

GATAGCAAAA GACGGTTAGT CAAATTACTT AAGCGTGATG CCAAGGCCTT CGATGCAGTA 7860 

GAAGTAAAAG AACAAGAATT GCGCCAGTAC TTCCAAAAGT GGAGTCAGAA ACAAGGTCTG 7920 

CAGTTTACCA ATCATTCTTT TGAAAATCTC CTCATCAAGT CGGGGTTTCA ATTTAGCGAA 7980 

ATCCAGAAAA ATCTTCTCTT TTTACAGTCC TATAAGGCGA ATTCTGTTAT TGAGGAAGAG 8040 

GATATTGTTA ACGCAATTCC CAAGACTTGC AGGACAATAT TTTTGATTTA ACTCAGTTTA 8100 

TTCTGACTAA AAAGATGGAT CAGGCGCGCG ATTTGGTGAG AGACTTG AC C TTGCAAGGGG 8160 

AAGATGAAAT CAAACTGATT GCAGTCATGC TGGGACAATT TCGGACTTTT ACTCAGGT G A 8220 

AGATTTTGGC GGAGTCTGGC CAAACAGAAT CGCAGATTGC AAGTAGTTTA GGTAGTTATC 8280 

TGGGACGTAA CCCAAATCCT TATCAAATCA AGTTTGCATT AAGAGATTCG AG AGGACTT T 8340 

CTTTGAGCTT TTTGAAGCAA GCTATTTCCT ATTTG ATT G A G AC AG ACT AT CAGATTAAGA 84 00 

CAGGTCTTTA TGAAAAAGGT TTCCTTTTTG AAAAGGCACT CTTACAGATT GCTAGTCAGG 84 60 

TCAATTGACA TTTGTTGAAA CTACTAACCC GCGG 84 94 

(2) INFORMATION FOR SEQ ID NO; 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9707 base pairs 
<B> TYPE : nucleic acid 
(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
CCGGTCAGTT CGTTCAGTAC AAGGAATCAT AATGAACGAT CAATCAGAAA AAAAG ACT AG 



60 
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AAAGAAGACT GTATGGATAA TCGACCAATT GGTTTTTTGG ATTCGGGTGT CGGGGGCTTG 120 

ACCGTTGTGC GCGAGCTCAT GCGCCAGCTT CCCC ATGAAG AAA TCGT CT A TATTGGAGAT 1B0 

TCGGC GCGGG CGCCCTATGG CCCCCGTCCT GCTGAGCAAA TTCGTGAATA TACTTGGCAG 240 

CTGGTCAACT TTCTCTTGAC CAAGGATGTC AAAATGATTG TCATTGCTTG TAACACTGCG 300 

ACTGCGGTCG TCTGGGAAGA AATCAAGGCT CAACTAGATA TTCCTGTCTT GGGTGTAATT 360 

TTGCCAGGAG CTTCGGCAGC CATCAAGTCC AGTCAAGGTG GGAAAATCGG AGTGATTGGA 420 

ACGCCCATGA CGGTACAATC AGACATATAC CGTCAGAAAA TCCATGATCT GGATCCCGAC 480 

TTACAGGTGG AGAGCTTGGC CTGTCCCAAG TTTGCTCCCT TGGTTGAGTC AGGTGCCCTG 540 

TCAACCAGTG TTACCAAGAA GGTGGTCTAT GAAACCCTGC GTCC CTTGGT TGGAAAGGTG 600 

GATAGCCTGA TTTTGGGCTG TACTCATTAT CCACTCCTTC GCCCTATTAT CCAAAATGTG 660 

ATGGGGCCAA AGGTTCAGCT CATCGATAGT GGGGCAGAGT GCGTACGGGA TATCTCAGTC 7 20 

TTACTCAATT ATTTTGAAAT CAATCGTGGT CGCGATGCTG GACCACTCCA TCACC G TTTT 780 

TACACAACAG CCAGTAGCCA AAGTTTTGCA CAAATTGGTG AAGAATGGCT GGAAAAAGAG 84 0 

ATTCATGTGG AGCATGTAGA ATTATGACAA ATAAAATTTA TGAATATAAG GATGACCAGG 900 

ACTGGTATGT TGGGTCTTAT AGTATTTTTG GTGGCGTTAA CAGTTTGAGC GACTATAAGA 960 

CAGATTTTCC TCTGTTTGAA TTCTCCAAAA TATTTGGAGA TGAAGAGTAT GGTTTCCCGC 1020 

TTTCAGTTAC TGTTTTACGC TATGGTTCTA TCTACCGTTT GTTCTCCTTT GTGGTAGACA 1080 

TGCTTAATCA AGAAATGGGA CGAAACTTCG AAGTTATTCA ACGTCATGGG GCCCTGCTCT 1140 

TGGTTGAAAA TGGGCAACTC TTGTATGTAG AATTGCCTAA AGAAGGGGTC AATGTTCATG 1200 

ATTTCTTTGA GACAAGCAAG GTCAGAGAAA CCTTGTTGAT TGCGACTCGT AACGAAGGTA 1260 

AAACCAAGGA ATTCCGAGCT ATCTTTGATA AGTTAGGCTA CGATGTGGAA AATCTTAATG 1320 

ACTACCCTGA CCTGCCTGAA GTAGCAGAAA CAGGTATGAC CTTTGAAGAA AATGCCCGCC 1380 

TTAAGGCAGA AACCATTTCT CAATTAACGG GCAAGATGGT TTTGGCAGAT GATTCTGGTC 144 0 

TCAAAGTCGA TGTCCTTGGT GGCTTACCAG GCGTCTGGTC AGCTCGTTTC GCAGGTGTGG 1500 

GAGCAACTGA CCGTGAAAAT AATGCCAAAC TCTTGCACGA ATTGGCCATG GTCTTTGAAC 1560 

TCAAGGACCG CTCGGCTCAG TTCCACACAA CCCTAGTCGT AGCCAGCCCA AATAAGGAAA 1620 

GTTTAGTTGT TGAAGCAGAC TGGTCAGGTT ATATTAACTT TGAACCTAAG GGTGAAAATG 1680 

GCTTTGGCTA TGATCCCCTC TTCCTTGTAG GAGAAACAGG TGAGTCATCA GCTGAATTAA 1740 

CCCTGGAAGA AAAAAATAGT CAATCTCACC GTGCCTTAGC CGTTAAGAAA CTTTTGGAGG 1800 

TATTTCCATC ATGGCAAAGC AAACCATCAT TGTAATGAGC GATTCCCATG GCGATAGCTT 1860 
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GATTGTGGAA GAAGTCCCTG ATCGCTATGT GGGCAAAGTC GATGCTGTTT TTCATAACGG 1920 

CGATTCTGAA CTACGTCCGG ATTCTCCACT TTGGGAGGGC ATCCGCGTTG TTAAAGGGAA 1980 

CATGGACTTC TACGCCGGCT ACCCAGAACG TCTGGTGACT GAGCTTGGTT CGACCAAGAT 204 0 

TATCCAAACT CATG GTC ACT TGTTTGACAT CAATTTCAAC TTTCAAAAGT TGGACTACTG 2100 

GGCTCAGGAG GAAGAGGCCG CTATCTGCCT CTATGGTCAC TTGCATGTGC CAAGTGCTTG 2160 

GTTGGAAGGC AAGATCCTCT TTCTAAATCC AGGTTCTATC AGTCAACCAC GAGCTACCAT 22 20 

CAGAGAATGT CTCTATGCTC GTGTGGAGAT TGATGATAGT TACTTCAAAG TGGACTTTTT 2280 

CACACGAGAT CACGAGCTGT ATCCAGGTTT GTCCAAGGAG TTTAGCCGAT GATTGCCAAG 2 34 0 

GAGTTTGAGA CTTTCTTGTT GGGGCAGGAG GAAACTTTTT TGACCCCTGC TAAAAATCTA 2400 

GCTGTGTTGA TTGATACCCA CAATGCGGAT CATGCGACCC TCTTGCTCAG TCAGATGACC 24 60 

TATACCCGTG TTCCCGTTGT GACAGATGAA AAACAGTTTG TTGGGACGAT TGGACTCAGA 2520 

GATATTATGG CTTAT CAG AT GGAGCATGAC TTGAGCCAAG AAATCATGGC GGATACGGAT 2 580 

ATCGTTCATA TGACAAAAAC GGACGTAGCG GTTGTTTCGC CTGATTTCAC CATTACGGAG 2 640 

GTCTTGCACA AGCTAGTAGA TGAGTCCTTC TTACCGGTTG TGGATGCAGA GGGTATTTTC 2700 

CAAGGGATTA TTACGCGCAA GTCCATCCTC AAGGCCGTTA ATGCCCTCTT GCATGACTTT 2760 

AGTAAGGAAT ATGAGATTCG ATGCCAATGA GAGACAGGAT TTCAGCCTTT TTAGAGGAAA 2 820 

AGCAGGGCTT GT CTGTCAAT TCCAAGCAGT CCTATAAGTA TGATTTGGAG CAATTTTTAG 2 880 

ACATGGTAGG TGAGCGGATT TCTGAGACCA GTCTCAAGAT TTACCAAGCC CAGCTAGCCA 2940 

ATCTAAAAAT CAGCGCCCAG AAGCGAAAGA TTTCGGCCTG TAACCAATTT C T AT ACTTTC 3000 

TCTATCAAAA AGGAGAGGTG GACAGCTTTT ACCGCTTGGA ATTAGCCAAA CAAGCTGAAA 3060 

AGAAGACGGA AAAGCCAGAG ATTCTATACC TAGACTCTTT TTGGCAGGAA AGCGACCATC 3120 

CAGAGGGCCG CTTGCTAGCG CTCTTAATCC TAGAAATGGG GCTCTTGCCC AGTGACATTT 3180 

TAGCCATCAA GGTTGCGGAC ATCAATCTGG ATTTTCAGGT GTTGCGAATC AGCAAGGCTT 32 40 

CCCAACAGAG GATTGTCACC ATTCCCACGG CCTTGCTTTC AGAATTGGAA CCCTTG ATGG 33 00 

GGCAGACCTA TCTTTTTGAA AGAGGAGAGA AACCCTATTC TCGTCAGTGG GCCTTTCGTC 3 3 60 

AGTTAGAATC TTTTGTCAAG GAGAAAGGTT TTCCATCCTT ATCAGCTCAA GTCTTACGTG 3420 

AACAGTTTAT TCTAAGACAA ATAGAAAACA AGGTCGATTT GTACGAAATT GCAAAAAAAT 34 80 

TAGGATTAAA AACAGTCCTG ACCTTAGAAA AATATAGATA ATGG AT ATT A AATTAAAAGA 3540 

TTTTGAAGGA CCCCTGGACT TGCTCTTGCA TCTGGTTTCT AAGTACCAGA TGGATATCTA 3 600 
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CGATGTGCCC 


ATTACGGAAG 


TCATCGAACA 


GTATCTAGCC 


TATGTCTCAA 


CCCTGCAGCC 


3660 


CATGCGTCTG 


GAAGTGACGG 


GTGAGTACAT 


GGTCATGGCT 


AGTCAGCTCA 


TGCTGATTAA 


3720 


GAGTCGTAAA 


CTCCTTCCGA 


AGGTAGCAGA 


AGTGACAGAC 


TTGGGGGATG 


ACCTGGAGCA 


3780 


GGACCTCCTC 


TCTCAAATCG 


AAGAATATCG 


CAAGTTCAAG 


CTCTTGGGTG 


AGCACTTGGA 


3840 


AGCCAAGCAC 


caagaacggg 


CCCAGTATTA 


TTCCAAAGCG 


CCGACAGAGT 


TGATTTACGA 


3900 


AGATGCGGAG 


CTTGTGCATG 


ACAAGACGAC 


CATTGACCTC 


TTTTTGACTT 


TTTCAAATAT 


3960 


CCTAGCCAAG 


aaaaaagagg 


AGTTTGCACA 


AAATCACACG 


ACGATCTTGC 


GGGATGAGTA 


4020 


TAAGATTGAG 


GACATGATGA 


TTATCGTGAA 


AGAGTCCTTG 


ATTGGACGAG 


ATCAATTGCG 


4080 


CTTGCAGGAT 


ttgttcaagg 


AAGCCCAGAA 


TGTCCAAGAG 


GTCATCACCC 


TCTTTTTGGC 


4140 


AACCCTAGAG 


TTAATCAAAA 


CCCAGGAGTT 


GATCCTCGTG 


CAAGAGGAGA 


GTTTTGGAGA 


4200 


TATCTATCTC 


ATGGAAAAGA 


AGGAAGAAAG 


TCAAGTGCCT 


CAAAGCTAGA 


CTTGATAGAG 


4260 


AGGAAAGATG 


AGTACTTTAG 


CAAAAATAGA 


AGCGCT CTTG 


TTTGTAGCGG 


GTGAAGATGG 


4320 


GATTCGGGTC 


CGCCAGTTAG 


CTGAACTCCT 


CTCTCTGCCA 


CCGACAGGCA 


TCCAGCAAAG 


4380 


TTTAGGAAAA 


TTAGCCCAGA 


AGTATGAAAA 


GGACCCAGAT 


TCCAGTTTGG 


CTTTGATTGA 


4440 


GACAAGTGGT 


GCTTATAGAT 


TGGTGACCAA 


GCCTCAATTT 


GCAGAGATTT 


TGAAGGAATA 


4500 


CTCTAAGGCG 


CCTATCAACC 


AGAGCTTGTC 


TCGGGCTGCC 


CTTGAGACCT 


TGTCCATTAT 


4560 


TGCCTACAAA 


CAGCCGATTA 


CGCGGATAGA 


AATTGATGCC 


ATCCGTGGAG 


TTAACTCGAG 


4620 


TGGAGCCTTG 


GCAAAGTTGC 


AGGCTTTTGA 


CCTGATAAAG 


GAAGACGGGA 


AAAAGGAAGT 


4680 


ATTGGGGCGC 


CCCAACCTCT 


ATGTGACTAC 


GGATTATTTC 


CTAGATTACA 


TGGGGATAAA 


4740 


CCATTTAGAA 


GAATTACCAG 


TGATTGATGA 


GCTTGAGATT 


CAAGCCCAAG 


AAAGCCAATT 


4800 


ATTTGGTGAA 


AGGATAGAAG 


AAGATGAGAA 


TCAATAAGTA 


TATTGCCCAC 


GCAGGTGTGG 


4860 


CCAGTAGGAG 


AAAAGCAGAA 


GAGCTGATTA 


AGCAAGGCTT 


GGTGACGGTT 


AACGGCCAAG 


4920 


TGGTGCGTGA 


ACTAGCAACC 


ACTATCAAGT 


CAGGCGACAA 


GGTCGAAGTT 


GAAGGTCAAC 


4980 


CTATCTACAA 


CGAAGAAAAG 


GTCTACTATC 


TGCTTAACAA 


ACCACGCGGT 


GTGATTTCCA 


5040 


GTGTGACAGA 


TGATAAGGGT 


CGCAAGACGG 


TTGTCGACCT 


CTTGCCCAAT 


GTCAAAGAGC 


5100 


GTATTTACCC 


TGTGGGTCGT 


TTGGACTGGG 


ATACATCAGG 


TGTCTTGATT 


TTGACCAATG 


5160 


ATGGGGACTT 


TACAGACGAG 


ATGATTCACC 


CTCGTAATGA 


GATTGACAAG 


GTTTATGTCG 


5220 


CGCGTGTTAA 


AGCTGTGGCC 


AATAAGGACA 


ATCTCCGCCC 


CTTGACCCGT 


GGTCTTGAGA 


5280 


TTGATGGTAA 


GAAAACCAAG 


CCAGCTGTTT 


ATGAAATTCT 


CAAAGTGGAC 


CCAGTCAAAA 


5340 


ATCGCTCTGT 


GGTGCAGTTG 


ACCATCCATG 


AAGGGCGTAA 


CCATCAGGTT 


AAAAAGATGT 


5400 
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TTGAACCTCT TGGTCTCCAA GTAGATAAGT TGTCTCGGAC TCGTTTCGGA CACCTAGACT 54 60 

TGACAGGACT CCGTCCAGGA GAATCCCGTC GTCTTAATAA AAAAGAAATC AGCCAACTAC 5520 
ACACCATGGC TGTAACTAAG AAATAATGAA ACGAATTTTA ATAGCGCCTG TGCGCTTTTA 5 5 80 

CCAACGTTTT ATCTCACCAG TCTTTCCACC CTCTTGTCGC TTTGAGCTGA CTTGTTCCAA 5640 
CTACATGATT CAGGCTATTG AAAAACATGG GTTTAAGGGG GTATTGATGG GCTTGGCTCG 5700 
GATTTTACGT TGTCATCCCT GGTCGAAAAC AGGTAAGGAC CCCGTTCCAG ACCGCTTTTC 5760 
CCTTAAACGA AATCAAGAAG GGGAATGAGG TGGGGTAAAT AGATTTCAAA ATGATAAAAA 5820 
CGCATCCTAT CAGGTTTGAG TGAACTTGAT AGGATGCG TT TTAGAATCTC AAAATTTTAT 5880 

ACTCTTCGAA AATCTCTTCA AACCGCGTCA GCTTTCATCT GCAACCTCAA AACAGTGTTT 5 940 

TGAGCAACCT GCGGC T AGTT TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTAAATTGA 6000 

GTTTGAAGTG GCTTATTTCA AAGCTTTTTG TATGTCTTCA AT CATG AGTT TTGTTGATTC 6060 

AAGTCCGCCT CCGCTTAGAT ACCAGAGGTC TGGTGTTAGT TGGATAATCT TACCATTTTT 6120 

AGCAGCAGGT GTTTCAGCGA TAAGGGCATT TTCTAGGACA CCGTCGTTGC TAGAGTTGTC 6180 

CCCACCGATG GCAAGGGTAC GGTTGATGAC AAAGAGGATG TCAG GGTTG A TTTCTTTGAC 62 40 

ACTTTCAAAG CTGACTTCTT GTCCGTGGCG TGAGTCTTCA AATTTTGTAT CAGTTGGTTT 6300 

GAATTTCAAG GTTTGGTACA AGAAAGAGAA ACGAGATTTG GCACCAAAGG CTGCCATTTT 6360 

TCCTTCATTA AGGAGGATCG CAAGGGCTTT TTTGTCAGAG CTTTCATTTT TAGTAGCGAC 6420 

TTCTTGGATG CTCTTGTCTA GCTTGGTCAA TTCTTC CTTG GCTTTCTGTG TACCAGTTTC 64 80 

GCCGAAGGCA CTTGCTAAGG ATTCGATATT AGCCTTGGTA GAAGTCCAGT AGTCGTCCTT 654 0 

GCTTGCTTGG AAGAGAACGG TTGGGGCGAT TTCTTTGAAT TTGTCTACGA ATTTTTGTGT 6 600 

ACGTGGCGAA GCGATAATCA AATCAGGCTC AAGGGCGGCG ATAGCTTCTA AATCAGGTTC 6660 

TTTCATAGAA CCAACATTTT TGACAGTTCC CACTAGGTCT TTTAGATAAG TCGGAACAGT 6720 

TTTTGTAGGC ATTCCGACGA TATTTTTTTC AAATCCTAAA GCGCGAATAG TATCCGCAGC 67 80 

GCCGAGGTCA AAGGTCACAA TCTTTTCAGG AACTTTGGAA AGTTTG AC CT CGTCCAGTGA 6840 

ACTTTTAATG GTTACCTCTG TTGGAGCAGA GCTACTGGTC TCTGTCTGAC TAGTGCTTGA 6900 

GTTTGTACTA CATGCACCAA GTAGGAGCAA GAAGCTGGCC ACTAGGGCAG TGAAATAAAG 6960 

TTTAAGGGAT GTTTTCATAA TTTCTCCTTT TTAAAATGTG ATAACGATTT AGGGAGTCTC 7020 

TTAATCTTAT TGACTAAGAG ACTGAAGGTT CTCTAACTTG AGCTTTTATG TTACTAGCTA 7080 

TAGATACAGA TCTTTTTGTC ATTGATATCA GCTAGCGTGA TGGGAATCTC ATAAAGTTGA 7140 
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CTGAGCAGGT CAGCCTGCAT GATTTGATCG GTTCTTCCCT TGCTAAAGAC CTGGCCGTCC 7200 

TTGAAGGCGA CAATTTCATC TGCATACTGA CTGGCCATGT TGATATCGTG GAGGACGATG 7260 

ATAATGGTCT TGCCGAGTTC CTCCACCAGT CGTCGAAGAA TCTGCATCAT GCTGACGCTT 7 320 

TGCTTGATAT CGAGATTGTT GAGTGGTTCG TCCAGCAAGA TAAAGTCCGT ATCCTGGGCC 7 380 

AGTACCATAG CGATAAAGAC GCGCTGGAGT TGCCCCCCTG AC AGG CT ATT GATGTAGCGG 7440 

TCTTTTAAGT TGGTCAGTTC TAAATAGTTC AGAGTTTCTC GG ATTTTTTC CCAGTCTTCT 7500 

GATCTAAGTC GACCTCGGCT GTAGGGAAAA CGTCCAAAAC TGACCAGTTC TTCAACAGTC 7560 

AATTTGGCTT GGTAATTGAT TTTCTGTTTT AGG ATGGTT A GTTCTTGGGC CAGTTCTTGC 7620 

GAATTCCAGC TCTCGATTTC ACGTCCTTTG ATACTGAGAA CTCCCTGATC TTTCTTGGTT 7 680 

AGCCTGCTCA TGATGGAGAG GAGAGTCGAT TTTCCAGCAC CATTTGGACC AATAAAGGCT 77 4 0 

GTCAGTTTTT GAGGACTGAC TTCAAGCGAA ATGCCTTGCA AAATATCCTG TTTTTGAATG 7 800 

GATTTGTCAA TGTTTTCCAG TTTCACTGAC GAGACCTCCT ATATAGTAAG ATAAAGAATA 7860 

AGAAGCCACC CACACTCTCA ATGATCATAC TGATACGAAT TTCCAGTGCA AAGACTCGTT 7 920 

CAATCAAGGC TTGCCCCAAG GTTAACCTAA TAAATCCAAC CAGAATGGCC ACTATAAAGA 7 980 

GTAACTTGTG CTGATAGTCT TTGACAATCA GGTAGGTGAG GTTGGCCAGT ATAAAGCCGA 8040 

AGAAGGCCAT AGGTCCTACC AAGGCAGTGG CCGTTGAGGT CAAAAGCACG ATTCCCCAGA 8100 

GGAGCTCTTT CTGTTCTTTT TCAACATCGA GTCCCAATAT CTGAGCCGTT TCTCTTTGCA 8160 

GGTGCAAGAC ATCTAGAACG ACTGCTTTTC GAAAGAAAAA GATTGTCAAA GCGAGGATGA 8220 

TCAGAGAACC GATGGCTAGG ATGGAAGTGT TGAGATGTTG AAAGGAGGCA AAAAGACTAT 8280 

TTTGCAGTTT ATCGTATTCG TTTGGATCCA TTAGGACTTG AAGGAAGGTG CTGATATTTC 8340 

G AAAGAG ACT TCTGAGCGCT AGACAGAT C A GCAGGACGAA GACCAGGTCT TGCTTCATCA 8400 

GTGTCTTCAA GTAACCTTGT AAGGCGAGAA AGAAGAGGGA CTGGACAAGA AGTAAGACTA 84 60 

GGAATTCTAA GATAGGGGAT TTGCCAAGTT GAAGAAACTT GCTTTCAAAA ACCAGTAG1 A 8520 

GGGTTTGTAG TAGGACGTAG AAGGATTCAA TTCCCAAAAT ACTAGGCGTC AGGAAGCGAT 3580 

TTTCCGTCAG GGTTTGAAAA CTAATGGTCG AAATCCCAGT CGCGATGGCT ACCAAGAGAT 8640 

AAACGATGAT CTTTTGGGAA CGCAACTTCC AAGCAAAGGC TGACAAGTGA GTGATGGGCC 8700 

AAAAGTAGAG AAGACAAGCT CCGATGGCAA GAATAATGAG AATCCAGAAG AGCTTGGTAT 8760 

GTTTGCTTTT AGTCTGCATC TTTTCGTCCC CCTCTCCAGA GAAGTAGGAT AAAGACGAGA 8820 

CTACCGATGA TTCCTAGCAA GAGACTGACA GACAACTCAT AGGGCCTAAT CAGAACTCGG 8880 

GATAGGATAT CGCAAGCCAG AACTAGATTG GCACCAACCA GTGCGACCAT GAGTTTGGTT 894 0 
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TGACTTAGAT TATCTCCATA GCGCTTGCGA ACAAGATTGG GAACGATAAC TCCGAGAAAT 9000 

GGTAGGCCAC CCACGGTAAT CATGGTGACG CTTGTCGTTA GCGCCACCAG AAAGAGGGCC 9060 

AGTTTTTCAA GTAGGGAGTA GGAAATCCCC AAACTCTCGC TGGTTTCTTT CCCTAGATTC 9120 

ATGATGGTGA AGGTTTGGGA TAATTTCCAA ACGGTTATCA GGATGATGAG GCCTAAGAAG 9180 

AGCCACTCAT ACTGATGGGT CTGAATCATG GAGAAGGAGC CCTGGGTCCA GGCAGTCATA 924 0 

CTCTGAACCA GATTGAAACG ATAGGCGATA ACTTCT GTG A CTGAGCCGAT AATCCCGCTA 9 300 

TAGATGATCC CAATCAGAGG CAACATCCAC CTTTCCTTTA CAGTAAAAAT GGTCATAAAG 9 3 60 

GCTAGGAAGA AGAGGGTGAA TACGATGGAT GAAACAAAAG CGAAGAGCAT CTTGTGGGTC 9420 

ACACTAGCCG ATGGAAAGAC AAAAAGGCTC AGCACCATTC CCAGTTTGGC GGCTTCAGTC 94 60 

GTTCCAACTG TACTCGGTGC AGCAAACTGA TTTTGGGTAA TAGTCTGCAT GAGAAGGCCT 9 540 

GCCATACTCA TACTAGAGGC AGTCAGGAGA ATACTGATAG TTCTTGGGAG ACGGGACTCT 9 600 

TGAAAGAGGA GCCAGGTCTG CTGGTCGAAA TCAAATAGCT TTCCCCATGA AAAATCACTG 9660 

GTCCCAATGC TAATAGAGAG AAAG ACT AGG AGTAGAACTA AGCCAGG 9707 

(2) INFORMATION FOR SEQ ID NO; 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5910 base pairs 
(S) TYPE: nucleic acid 
(C) STRANDEONESS : double 
(DJ TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

CCGCAATTAT GCTTGAAAAG GAGTATACTT ATAAGTAACG CAAACGTTTG CGTCTGAAAA 60 

ATACGCAACG TTCCATTATT TTAACACACG AGGTGC T ATT ATGAAAAAAC GTCAAAGTGG 120 

TGTGTTGATG CACATCTCTT CTCTTCCAGG AGCTTACGGA ATCGGATCAT TTGGTCAAAG 180 

TGCTTACGAC TTCGTTGATT TCTTGGTCCG TACAAAACAA CGTTACTGGC AAATCCTTCC 2 40 

ATTAGGAGCA ACTAGTTACG GGGATTCTCC TTACCAATCT TTCTCAGCCT TCGCAGGAAA 3 00 

CACTCATTTT ATCGATTTAG ATATCTTGGT GGAGCAAGGT TTGTTGGAAG CAAGTGACCT 3 60 

TGAAGGAGTT GACTTTGGTA GCCATGCGTC TGAAGTTGAC TATGCTAAAA TCTACTATGC 420 

ACCTCGTCCT CTTTTAGAAA AAGCGGTGAA ACGTTTCTTT GAAGTCGGAG ATGTTAAAGA 4 80 

TTTTGAGAAA TTTGCTCAAG ACAACCAATC ATGGCTT GAG CTCTTTGCTG AGTATATGGC 540 

TATCAAAGAG TATTTTGACA ATCTTGCTTG GACTGAATGG CCAGATGCAG ATGCTCGTGC 600 
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TCGTAAAGCT TCAGCACTTG AAAGCTATCG TGAGCAATTG GCAGACAAGT TGGTTTACCA 6 60 

CCGTGTGACT CAATACTTCT TCTTCCAACA ATGGTTGAAA TTGAAAGCTT ACGCTAACGA 7 20 

CAACCACATC GAAATCGTTG GGGACATGCC AATCTACGTA GCGGAAGATT CAAGTGATAT 7 80 

GTGGGCAAAT CCACATCTCT TCAAAAC AG A TGTCAATGGT AAGGCTACTT GTATCGCAGG 840 

ATGCCCACCA GAT G AGTTTT CTGTAACTGG TCAGCTTTGG GGTAATCCAA TCTATGACTG 900 

GGAAGCAATG GACAAAGACG GCTACAAATG GTGGATTGAA CGCTTGCGTG AAAGCTTCAA 960 

AATCTACGAT ATCGTTCGTA TCGACCACTT CCGTGGCTTC GAATCTTACT GGGAAATCCC 1020 

TGCTGGTTCC GATACAGCAG CACCTGGTGA GTGGGTGAAA GGTCCAGGTT ACAAGCTTTT 1080 

TGCAGCCGTT AAGGAAGAAC TTGGTGAGCT AAACATCATC GCAGAAGACC TTGGCTTCAT 114 0 

GACAGATGAA GTGATCGAAT TGCGTGAACG TACTGGCTTC CCAGGAATGA AGATTCTTCA 1200 

ATTTGCCTTC AACCCAGAAG ACGAAAGCAT TGAT AG CCC A CACTTGGCAC CTGCTAACTC 12 60 

AGTTATGTAC ACAGGAACAC ACGATAACAA TACGGTTCTT GGTTGGTACC GTAATGAGAT 13 20 

TGATGATGCG ACTCGTGAGT ACATGGCTCG TTACACGAAC CGTAAAGAAT ACGAAACAGT 13 80 

GGTACACGCT ATGCTTCGTA CAGTATTTTC ATCAGTTAGC TTTATGGCAA TTGCAACTAT 1440 

GCAAGATTTA CTAGAATTGG ATGAGGC AGC TCGTATGAAC TTCCCATCTA CCCTTGGTGG 1500 

AAACTGGTCT TGGCGTATGA CTGAAGATCA ATTGACACCA GCTGTCGAGG AAGGTTTGCT 1560 

TGACTTGACA ACAATTTATC GCCGAATTAA TGAAAATTTG GTAGATTTAA AGAAATAAGA 1620 

CAATAATCAG GAGACAACTA AACATGTTAT CACTACAAGA ATTTGTACAA AATCGTTACA 1680 

ATAAAACCAT TGCAGAATGT AGCAATGAAG AGCTTTACCT TGCTCTTCTT AACTACAGCA 1740 

AGCTTGCAAG CAGCCAAAAA CCAGTCAACA CTGGTAAGAA AAAAGTTTAC TACATCTCAG 1800 

CTGAGTTCTT GATTGGTAAA CTCTTGTCAA ACAACTTGAT TAACCTTGGT CTTTACGACG 1860 

ATGTTAAAAA AGAACTTGCA GCTGCAGGTA AAGACTTGAT CGAAGTTGAA GAAGTTGAAT 1920 

TGGAACCATC TCTTGGTAAT GGTG G TTTGG GACGTTTGGC TGCCTGCTTT ATCGACTCAA 1980 

TTGCTACTCT TGGTTTGAAT GGTGACGGTG TTGGTCTTAA CTACCACTTT GGTCTTTTCC 2040 

AACAAGTTCT TAAAAACAAC CAACAAGAAA CAATTCCAAA TGCATGGTTG AC AG AG CAAA 2100 

ACTGGTTGGT TCGCTCAAGC CGTAGCTACC AAGTACCATT TGCAGACTTT ACTTTGACAT 2160 

CAACTCTTTA CGATATTGAT GTTACTGGTT ATGAAACAGC GACTAAAAAC CGCTTGCGTT 2220 

TGTTTGACTT GGATTCAGTT GATTCTTCTA TTATTAAAGA TGGTATCAAC TTTGACAAGA 2280 

CAGATATCGC TCGCAACTTA ACTCTCTTCC TTTAC CCAG A TGATAGTGAC CGTCAAGGTG 2340 

AATTGCTCCG TATCTTCCAA CAATACTTCA TGGTTTCAAA CGGTGCGCAA TTGATCATCG 2400 



WO 98/18931 



PCT/US97/I9588 



1065 

ACGAAGCAAT CGAAAAAGGA AGCAACTTGC ATGACCTTGC TGACTACGCA GTTGTCCAAA 24 60 

TCAACGATAC TCACCCATCA ATGGTGATTC CTGAATTGAT TCGTCTTTTG ACTGCACGTG 2520 

GTATCGATCT TGACGAAGCA ATCTCAATTG TTCGTAGCAT GACTGCCTAC ACT AACCAC A 2 580 

CAATCCTTGC TGAAGCGCTT GAAAAATGGC CTCT T GAATT CTTGCAAGAA GTGGTTCCTC 2 640 

ACTTGGTACC AATCATCGAA GAATTGGACC GTCGTGTGAA GGCAGAGTAC AAAGATCCAG 2700 

CTGTTCAAAT CATCGATGAG AGCGGACGTG TTCACATGGC TCACATGGAT ATCCACTACC 2760 

GATACAGTGT TAACGGGGTT GCAGCACTCC ATACTGAAAT CTTGAAAAAT TCTGAGTTGA 2 820 

AAGCCTT CT A CGACCTTTAC CCAGAAAAGT TCAACAACAA AACAAACGGT ATCACTTTCC 2 880 

GTCGTTGGCT TATGCATGCT AACCCAAGAT TGTCTCACTA CTTGGATGAG ATTCTTGGAG 2 94 0 

ATGGTTGGCA CCATGAAGCA GATGAGCTTG AAAAACTTTT GTCTTATGAA GACAAAGCAG 3 000 

TTGTCAAAGA AAAATTGGAA AGCATCAAGG CTCACAACAA ACGTAAATTG GCTCCTCACT 3 060 

TGAAAGAACA CCAAGGTGTG GAAATCAATC CAAATTCTAT CTTTGATATC CAAATCAAAC 3120 

GTCTTCACGA GTACAAACGC CAACAAATGA ACGCTTTGTA CGTGATCCAC AAAT AC CTTG 3180 

ACATCAAAGC TGGTAACATC CCTGCTCGTC CAATCACAAT CTTCTTTGGT GGTAAAGCAG 3 24 0 

CTCCAGCCTA CACAATCGCT CAAGACATTA TCCATTTAAT CCTTTGCATG TCAGAAGTTA 3 300 

TTGCTAACGA TCCAGCAGTA GCTCCACACT TGCAAGTAGT TATGGTTGAA AACTACAACG 3360 

TTACTGCAGC AAGTTTCCTT ATCCCAGCAT GTGATATCTC AGAACAAATC TCACTTGCTT 3420 

CTAAAGAAGC T T CAGGT ACT GGTAACATGA AATTCATCTT GAACGGAGCT TTGACACTTG 3480 

GTACTATGGA CGGTGCTAAC GTGGAAATCG CTGAGTTGGT TGGAGAAGAA AACATCTACA 3 540 ~ 

TCTTCGGTGA AGATTCAGAA ACTGTTATCG ACCTTTACGC AAAAGC AG CT TACAAATCAA 3 600 

GCGAATTCTA CGCTCGTGAA GCTATCAAAC CATTGGTTGA CTTCATCGTT AGTGATGCAG 3660 

TTCTTGCAGC TGGAAACAAA GAGCGCTTGG AACGTTTTTA CAATGAATTG ATCAACAAAG 3720 

ACTGG TT CAT GACTCTTCTT GATTTGGAAG ACT AC AT C AA AGTCAAAGAG CAAATGCTTG 3780 

CTGACTACGA AGACCGTGAC GCATGGTTGG ATAAAGTCAT CGTTAACATT TCTAAAGCAG 3 840 

GATTCTTCTC ATCTGACCGT ACAATCGCTC AGTATAACGA AGACATCTGG CACTTGAACT 3900 

AATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTTAT CTGCAACCTC AAAGCAGTGC 3960 

TTTGAGCAAC TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT TTTCATTGAG TATAAGATAC 4020 

AAATTTATAC TAATACATTT TGTAAAAAAG CGAGTTTCGA TTGAAATTCG CTTTTTTAAT 4080 

GATGTAGATT TGGGTCAATC TTGTCTAAAA ATAGGGAAAT CCTAGATACA GTGAAGGCTT 4140 
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TAAATGCTGG TTTTTACTGT CCTCAGCCTT ATATTTTTTC GTAGTTGGTT ACCTCATATC 4 200 

TATTATATTC GCTTACATAA AGTATTATAA TATAATTGTA GGAAAGAAGG TGTTTTTATG 4260 
ATATACACAC TTAAATTGGT CTTGTTTATT ACCTTTCTTG TAATAAGCTT GTTACCTGAT 4320 
AAG A TTTTTG GAAAAAATAA AAAAATTTGG AAAATAGTTT TTGCAATATT GACGGCAGTG 4380 
GCAGCATTGT CATTTATGTA CTAAGTTATT TTAAGAATGT AGGGAAATAA ACCCTACATT 4 4 40 

CTTTTTAGTT TTTTCTGTTT TCTAAATTCT ATTTATCCAA G CGATTC AAC ATTTCTTGCT 4 500 

TCTTCG CTTC AAGTTCTGCA CGCTTTTCTT CGATTTCGGC ATGTTTTTTC TCGACTTCAC 4560 
AACAACTTGC ACCATTGCTA AATTCTTTTC GCCATCAGGA GATAGGGTGA GTCGACATGT 4 62 0 

CTATTACTCA CCCAAAGCAG TCCTACAAAG CAGGAATTTT CTCTTACTTT TTTGGAAATA 4 680 

GTAACGTTTA TACAGCTTTG ACACTTCCTA TCAAAGCGCC AAACACACTC CGAGGCCTTT 474 0 

ACAGAAAGCA GAAAAGGAAT GATCTGGTAT AAG AT C ATT C CTTTTCyCTC TTTTTCTTTA 4800 

AGTAATTATA TACAATGTAC GACGAAGTCG TCATTGCAAT GCTGATCCAC CACCTAAAGG 4 860 

GAACTTTAAA CAACATTGAT AAGATAAAGA ATATAAACAA CGAAAATACG TTATACCCAA 4 920 

TTAATTTTAT TGTATATCTC ATGATTAAAA GTTAATCCTT CCGTTGTTAG GAATGGCATC 4 980 

ATTTTTATCC CATAATTGTG CTAAATAAGT CCCCGGTGAT AATAAATTCA TAGCGAATTC 504 0 

TAAAGCAACA TCATTTACAA ACCAACTACC TAGATATCTA GAAATTGCTG AACGAATAGC 5100 

ACTTTTTGCT GCATGTTTTC CTTTTACTTT AATTAGATTT GCAAGGCCTG CAGTAGTTCC 5160 

TCCTAATGCT AAAGCTATTG CAGTATCTAA TAGAGCACCC ATTTGATTAA CTCTAATACC 5220 

TTGCCAAACT GCTCTAAATG GAGAGTATGT AGGTGGGATT GTATAATCGC CTTGTAATTG 5280 

T CGGTT AATT ACTTCTTTGA TCCATTGTTG TGAGACGTCT GGATGAAAAG ATTGGATTTC 5340 

GTTTGCAAGT GTATTGATTT GTTCTTCTGT TAGAGAAGTG ACAGGTTGAA G TTCC AT ATT 5400 

TGTTTCAATT TGTGATACTT GTTCAGAAGC GTATACAGCT GAAACACTTG GAATCGCTGA 54 60 

T AC AATT AAC ACAATTGACG TCAAAAAAAC CGAAATAAAT TTCATTAATT TGTTCATGAG 5520 

CTTTTCTCCT TTTTATTTGC ATCTGCTTAC ATTTT ATCAT ATACTGTTAT TATAGTCAAA 5580 

AAAATATGCT ATTATGTTAA AAAAATATTT TTCAAAATAT AAATGGACGG ATTTATTTTG 5640 

GATTTTATTT GTTATTTTGA CCTGCCTCTA TATTGGTAAC CATGATTTGT TTACTCTCAA 5700 

TCATCAAGAA TTCTCTTTTC CTGGTAGCGT TTGGGGTCTG GTACTGGCCT TATATCACTT 5760 

ACTATTCATT GATAAGTTTG TT AT AT CG AA TCGAAAATAA AGATTAGAGC TATGCTTGAC 5820 

TGTGTACTTT TAGGATTTAT TTTGGAGGAA GATTTTGTCT CTATTATTTA TT ATTTT AAA 5880 

TTTATTTATT TTGTATAAGA TCTATTCTTT 501 a 
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(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5406 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
GGCATAGCGA CTCATTTTTT CAACTGTCCA GGCTGGATAC CAGACTAATT TAACCTCAGT 
ATCCGTTACT TCTGGAACCT CTATCATAGC ATCATAAATC TGGTCTGTCA AAAGGTCTGC 
TAAGGGACAA CCCATAGTTG T CAAAGT CAT GTCAAT CTCT GTTT GCCCTG TGTCACCGTC 
AAAACGAATC TCATAGATCA AACCAAGATT GACAATATCG ATTCCCAACT CAGGGTCGAT 
CACTTCTTCC AAGGCTGTTA AAATCCGTGT TTTGATGTTT TCAATTTGCT CTTCTGTATA 
AGC CAT ATTT TCCTCACTCT TAGTCTTCAA TAAAATCACG AAGCGGTTTG CTACGACTTG 
GTTGGCGTAG TTTTCTCAAA GCCTTTGCTT CAATCTGACG G AT ACGCT C A CGAGTTACGT 
TAAAGACTTT CCCCACATCT TCAACTGTGC GCATTTTTCC ATCATCTAGT CCAAAACGTA 
GACGCAGAAC ATTTTCTTCA CGGTCTGTAA GAGTATCTAA GATTTCATCC AATTGCTCAC 
GCAAGACGAT ACGAGTCGTA TAATCCACTG GATTTTCAAT CACTTCATCT TCCATAAAGT 
CTCCAAGGTG GCTATCGTCC TCTTCACCGA TAGGAGTTTC AAGAGATACT GGTTCTTGGG 
CAATCTTCAA G ATTTCACG A ACCTTATCAG GTGTCATATC CATTCGTTCA GCAATCTGTT 
CTGGTGTCGG ATCTTGCCCC AATTCTTGAA GGAGATTCCG CTGTTCACGA ACCAATTTAT 780 
TGATAGTTTC AACCATGTGA ACTGGGATAC GGATGGTACG AGCTTGCTCC GC AATAGCAC 840 
GAGTGATAGC CTGACGAATC CACCAAGTTG CATAAGTTGA AAACTTGAAC CCTTTAGAAT 
AGTCAAACTT GTCAACCGCC TTCATCAAGC CCATATTTCC TTCTTGAATC AAGTCAAGGA 
ACTGCATACC ACGACCGACA TAGCGTTTGG CAATGGAAAC AACCAAACGA AGATTGGCTT 
CCGCAAGACG TTGTTTGGCT TCGATATCAC CAGCTTCAAC AGCCAGTGCC AACTCTTTCT 
CCTCTTCATT GGTCAAGAGA GGAACGACCC CTATTTCTTT CAAGTACATA CGGACAGGGT 
CATTGACCTT AGCAGAAGTT GACCCAATCA AGTCCTCATC GCTGAGTTCT GGTTCTTCTT 
CATTGCTGAG AACACGCGCA CTTGGATTTC CTTCGTTATC TGTGATAGAA ATGCCTGCAT 
CCTGAATCCG TTGCAAGAGA TCTTCAATCC CATCAGCGTC CAAGGTAAAA GGAATAACCA 
GACTTGCATT G ATTT CAT CA TCTGTTGCTG TCCCTTTTTG CTTATGATTA CGGATAAATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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CTGCTACCTG TACGTCAAAT GTTGTTACTT CTTTTTGTTT TGTTGCCATT ATTACTCCAT 14 40 

TCTTCTCTTT TGGGAAATTA AACGTTCCAA TTCTTCTAGG GCTGTATCTG TATCTCCTAC 1500 

ATGGCTAGCT TCCTGCACCT TCTTTTTGAT TCTCATATTG TCCTGATTCA AGAGAGCCTT 1560 

GTTTCGAGTC ATCTCTACTT CACTAAGTTC CTGCGGCGAT ATCTCAGCAG GCAAATCCTG 1620 

AGCTAAAACT TGGTACCAAG CTCTTTCAAC TTCCTCTGTC TGCTCTGCTA AAACTTCTGG 1680 

AGGAAGATTT CCATACTGGC CAAGCAAGTC ATATAAGACC TGAAATTCAG GTGTAGCAAA 174 0 

TGCAAAGTCT TCTCGCAAAC GGTAATCGTT CAAAACAAGA GGGGATTCCA TCATCCGATA 1800 

GAGTAGATGG GCTTCTGCCC TCATAATAGC CGATAACTGC TTGGTGACAG GCATGGTGAT 1860 

TGGCGTCGGT CTGGAAATTC CTTCCATGCG ATTCTGCCTT TGCACCTGAC GACTCTCATT 1920 

AACAATCTGC TCAATCTGGG TATAATCAAA GGACGCCAGA CTGTCAGCTA AAATATGAAT 1980 

ATAGCTGTTT TGAGCAGCGA TGGACTTTTC TTGAACAATC AAGGGACCTA TTTTTTCAAG 2040 

AAACT C AATC TGAGCCTGCA GATTTTCACT GTTTTCAGGT TTGTACTGAT GAATGTAGAA 2100 

CTCAATCGGA CTAATACGAG TTTTCGTTAA TAGATAGGCC AAGTCTTCTG GACCATTTTT 2160 

TTGTAGATAC TCATCAGGAT CCAAGTTATC AGGCATGCTG ACGATTTGCA CAGGCATATC 2220 

ACCAATTTCA TCCAATGCTT TCAATGTCGC GGCTTGCCCA GCCTTATCTC CATCGTAAAC 22 80 

AAGAACCAAT TTCTTGGTTA ACCTTTTCAG ATGCTCAACA TGCTCTCGAC TCAAGGCTGT 2340 

TCCCATCGAC GCCACAGCAT TTTCGATTCC AGCCCGATAG GCTGCAATAA CATCCATGAA 2400 

TCCTTCCATC AGGTAAATCT CACTAGCTTT TCCAGAAGAT CTTTTTGCCC TATCCATATG 24 60 

ATATAATTCG TAACTTTTGT TAAAAATTGC AGTCGATCGG CTGTTTTTAT ACTTAGAAGT 2 520 

TTGTGAATCC GTTTTTTGCC AGATACGACC TGAGAAGCCA ATGACCTTTC CTTGGTCATT 2 580 

TGTCAGGGGA AACATAATGC GATTGTGAAA GGTGTCTACA AATTGATTGG CATCCGAGAG 2640 

ATAAAACAGG CCTGAATCCA GTAAATCCTC TTCACGATAC TGATCAGACA AACGTTGATA 2700 

GAGATA G T T T CGTTCTGGAG GTGCTAAACC AATCCAAAAA TGTTTAAGCA CTTCATCTGT 27 60 

CAACCCCCGC TGATAAAGGT AATTTCTGGC CTCTTCGCCC ATAGTCGTTG TCATGAGAAT 2820 

AGCATGGTAA AATTTGGCTG CATCTTCGTG CAT ATCAT AA AGAGCTTGGT GAGGTGAGGC 2880 

TGACTTCTGC TCACTATAAA GCGGTTTTTC AACCTCAATT CCAACACGCT GACCTAAGAT 2940 

TTGGACTGCT TCTATAAAGG GAACCCCTTG GTACTCCTCG ATGAACTTAA AGACATCACC 3000 

TGAGCGACCA CAACCGAAAC AGTGATAAAA CTGCTTGTCC TCTACAACAT TGAAAGATGG 3060 

TGTTTTTTCA CCATGAAAAG GACAGAGCCC TAGATAGTTC CGTCCTGCCT TTTGTAAAGA 3120 

AATCACATCT CCTATGACTT CCACAATGTT GGCATTGTTT TTGATTTCTT CAATGACTTG 3180 
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TTTGTCAACC 


ATACACAATA 


CCTCCATGTT 


ATCATAGTTT 


ACTTTATATA 


GTATACTTTA 


3240 


TTTCAGAAAA 


AAAGTAAACC 


ATTTCACTCA 


TTTTCCCTAC 


TTTATTCAAA 


GAGTTGATAA 


3300 


TAATCAGAGA 


TTTTCATTTT 


TGCTTTTTCT 


TCTTGGTTTA 


AATCTTGGAT 


AATTCGTCCT 


3360 


TCTTTCATGA 


CAATCAAGCG 


ATTGCCGTAT 


TTGAGAGCAT 


CTTCCATATG 


ATGAGTAATC 


3420 


ATAAGGGCTG 


TTAGCTGATC 


TTTCTTAACA 


AATTCATCTC 


T C AATTCC AT 


CAAAGCAACA 


3480 


CTAGTCTTTG 


GATCCAGGGC 


ACCAGTATGC 


TCATCTAACA 


GGAGTAATTC 


AGGTCGCTTC 


3540 


AAGGTTGCCA 


TCAAGAGACT 


CAAAGCCTGT 


CTTTGTCCAC 


CTGATAAGAA 


CTCAATCGGT 


3600 


GTATTCAAGT 


GTTTCTCAAG 


ACCATTTCCT 


ACTTTTTCAA 


TGGTTGCCTG 


AAATTCATCC 


3660 


TTATAGCTAG 


TCAAGCGTCG 


TGGTAACAAT 


CCACGCTTTT 


CACCACGAAA 


CTTGGCGATT 


3720 


AAAAGATTTT 


CAGCGACCGT 


CATACGGGGA 


GCTGTCCCCA 


TCTTTGGATC 


TTGGAAGACA 


3780 


CGAGACAGGT 


ACTTGGCACG 


CTTCTCGGCT 


GAAAACTTAG 


TGAGATCTTC 


AC CT AAAAT A 


3840 


CGGATAGTTC 


CACTAGTTAG 


TGATAAGGTC 


CCTG CT AT AG 


TGTTAAAGAG 


AGTTGATTTT 


3900 


CCAGCACCAT 


TTCCGCCCAA 


AATCGTGATA 


AACTCCCGTT 


CAAAAATTTC 


TAAGGAAACA 


3960 


TCATTTAAAA 


TAATCTTTTC 


TTCATCAAAG 


CCATTTTTAA 


CGATTTTGCT 


TGCATTTTTT 


4020 


AATTCTACAA 


TTGCTGTCAT 


TTGCTTAACT 


TGGCTCCTTT 


CAAGATTGTT 


TGCTTAAATG 


4080 


TTGGAATCAT 


GAGGCAGACT 


GCTAAAATCA 


AGGCACTGTA 


TAAACGAAGG 


TAACTTGTAT 


4140 


TAAAGCCAAG 


TGCGATAACT 


GCCCACACTA 


AAAATTGATA 


AGCGATAGAA 


CCTACAACGA 


4200 


TAGTAACCAA 


ACGCTCTGCC 


AAGCTCAAAC 


TCTTGAAAAT 


AACTTCTCCA 


ATAATCAAAC 


4260 


TTGCAAGCCC 


CACAACGATA 


ACCCCGATCC 


CTCGAGACAC 


ATCGGCATAA 


CCTTCTTGCT 


4320 


GAGCAATGAG 


GGCACCTGCA 


AGGGCAATCA 


CACCATTTGA 


TAAGACCAAG 


CCCATGAGCT 


4380 


CCATGCGTCC 


AGTATGAATC 


CCGAAACTTC 


TAGCCATATC 


AGGATTATCC 


CCTGTAGCAA 


4440 


TATAGGCTTG 


TCCGAGTTTA 


GTGTCCAAGA 










CAAAGATGAG 


ACCTGTCAAG 


AGTTGATTCA 


AATCCGAATC 


AAAAGGCAAA 


ACATCCTGAA 


4560 


TTTGCTTGGT 


TCCAAGCAGG 


CCTAAATTCG 


CACCTCCCAT 


AATCAAGAGC 


ATGATTGAGT 


4620 


GACAAGAAGT 


CATCACCAAA 


ATCCCTGAGA 


GCAAGGTTGG 


GATCTTCCCT 


TTTGTATAAA 


4680 


GAAGGCCTGC 


TGCCATT CCA 


GCCAAACAAC 


CTGCTCCTAC 


AGCAACAAGT 


GTCGCTAAAA 


4740 


ATGGGTT CAC 


GCCTTTGGTT 


ATCAAAGTGA 


CAGCAAC AG C 


TCCCCCAAGA 


GGGAAGGAAC 


4800 


CTTCTGTCGT 


CATATCTGGA 


AAGTTTAAAA 


TCCTAAATGT 


CATAAAGATT 


CCCAGACCTA 


4860 


GAATAGCCCA 


GACAAATCCT 


TGAGAAATAA 


TGGAAACAAT 


CATATTTTAT 


TTAATCCTTT 


4920 
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CT ATATT CAT CTTTTTAAAA AATGGGAAGA GTCTCCTCCT CCCTACCTTA TTTATTCGAT 4 980 

GACTTGTCCT GCTTCTTTGA GAACAGACTC AGGAATAGTA ATACCTAGTT CTTGTGCTAT 5040 

TTTTTTATTG ATGACTGACT TACCAGTTGA AAAGACATTG ACTGGGGTAT CGGCTGGTTT 5100 

TGCACCTTTC AAGACTTGCA CAATCATTTT ACCTGTTGCC ACACCAAGGT CATGTTGGTC 5160 

AATTACAACT GATGCCAAAC CACCTACTTC T AC CAT AGCT GTCGCACTGG GATAAATTGG 52 20 

TTTCTTAGAA CTTTGATTGC TAGAGACAAC CGTTGGAAAT CCTGATGCAA TGGTGTTATC S2 80 

AATTGGAACC CAAATAGCAT CTACCTTGCT AGTCATAACA GTGACAGTTG AGGCAATTTC 5340 

ATTTGTTGAA GGAACTGCAA ATGTTTCCAC TGTCAGACCT GCCTTTTCAG CATAAGCCTT 5400 

AAATTC 5406 

(2) INFORMATION FOR SEQ ID NO: 167: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9711 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 



CAGCTTGCTC 


TTACTATTAT 


AGCAGATGTT 


ATAGCTGGAA 


TTATCTTGTA 


TTTCGTCTGC 


60 


AAATGGCTAG 


ATGGTAAGAA 


GTAGACCGAA 


TGACTAGCCT 


AT AAACAC C C 


GTTAAATCGC 


120 


TAAGATACGT 


CAAAAAAGCC 


CTTAACTATG 


GCACTAGTTA 


GGGGCTTTGG 


TGTTCTAATG 


180 


AACCTTATAC 


ACTAACTACA 


TTCTAGCATA 


TAAGCCCAGA 


TATTTCAAGA 


GTTTTATTTA 


240 


TTGTTTAAAG 


TTCTGAAAGG 


TCTATAATGA 


AGTTAGCCAT 


CTAGTATCAA 


AAAACCGACT 


300 


AGCTCTTATG 


AACTAGTCGA 


TTTCTCATCA 


ATGCGCCAAC 


ATTTCTTGGG 


CGATTTCTTG 


3 60 


GCCAGATAGG 


TTATCTGGGT 


AGTAGGTTGG 


CCAGTTGTCC 


ATTTCTTCAA 


AGAGGGCTTC 


420 


TTGGCTTGTG 


CCTCCAAAGA 


AGATATGGAA 


ATGTTCTGCC 


TTAACTGGGG 


CAACATTC* G 


480 


GTCACTAAAC 


TGAACATACT 


TGAATTGTCC 


AGCGTCAGCA 


TCTGTGGCTT 


CAAAGAGGAA 


540 


ACGCACGCCA 


CGATTGCCTT 


TCTTGTAAGT 


CAAAATTTTC 


TT AC C G A CAT 


ACTTGTAAGT 


600 


GTATTTCTTG 


CTTTGTCCAC 


CTTGAACAAA 


TTC CAT AGTA 


TTATCAGTAA 


TGTTAATCTT 


660 


AGTCACATCT 


GTATGATAGC 


CTTTTGTATA 


GTAACCCTTG 


TACTCAGCCT 


GGGTCATCTT 


720 


ACCAGTCAAC 


TTAGCCTTGT 


AGTCAAAGAC 


TTGGTCAAAC 


GTGCCGTCTT 


CAAGGAAAGG 


780 


ATAAACTGAT 


TGCCAGTTAC 


CTGCATAGTC 


ACTCAAGGTG 


CGGTCCTTGA 


CAGCTGCATC 


840 


CTCGAAGTAA 


CCATTTTGGA 


CTGTCTTGGT 


ATCCTCTGCC 


TTTTCAGGTT 


CAATTGCTGC 


900 
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GCCTTCTTGG TCTGTTGTTT GTTTCAAAGC CTTGAGGTTT TTCTCCATCA CGGAAATGTA 9 60 

GTTTTCTCCA GCCTTGGTGT CCTCTTCTGT CAGACTTTCT AAAGGATTGA GGACATCAGT 1020 

TTTGACACCT GCTTCTTTTG AAAGTGTGTT AGCAAGGGCT TGTGAGCCAT TTCTTCAAAA 1080 

TAGATATAGG CGATTTTATT TTTCTTGACA TACTCTGTCA ATTCTGCCAA GCGAGCAGCT 114 0 

GATGGCTCTG CATCTGGAGA AAGTCCTGAG ATTGCGACTT GTTTGAGTCC ATACTCCAAG 1200 

GCAAGATAGT TAAAGCCTGC GTGTTGAGTC ACAAAGCTCT TTTGTTTTGC TTGAGACAAA 1260 

CCTTCTGCCT AAGCCTTATC CAAGGCTTGC AATTTTTCGA TATAGGCAGC TGCATTCTTC 13 20 

TCAAAGGTCT CTTTTTTATC AGGATAATCT GCTGACAAGC TGTCGCGGAT GTGCTCTACT 1380 

AGTTTAATGG CACGAACTGG TGATAACCAA ACATGGGGGT CAAACTCATG GTCATGACCT 14 40 

TCTTCTCCAT GGTCATGGTC TCCCTCTTCT TCCTCGCCAC CTCGCAAGAG C AAC AT AT CG 1500 

CCTGTCGCCT TGATGGTTTT CACTTTTTTC TTATCCAAGG TATCTAGCAA TTTAGGTACC 1560 

CATGTTTCCA TGTTTTCATT TTC AT AAACG AAGGTATCTG CATCTTGGAT TTTGGCAACT 1620 

GCCTTGGCAG ATGGTTCGTA TTCATGAGGT TCTGTCCCAG CACCGATTAG GAG TTCT AC A 1680 

TTAGCCGTAT CTCCTGCGAC TTGCTTGGTA AATTC AT AG A CAGGGTAAAA GGTTGTCACG 174 0 

ATATTGAGTT TACCATCTGC CTGTTTTTGA TTGGAACAAG CCACTAAAAA CAAGGCACAT 1800 

AGACTGGCTA GTAATAAGCT AATTTTTTTC ACGTTCGTCT CCTATTTGAT AAAACGTCTT 1860 

ACTAAACTGA TTAGTATAAA GACAGTTACA AAAATAATGG TAATACTTGC ACTTGCAGGT 1920 

GTTTCTGCAT AGTAGGAAAT GTAAAGTCCT GCTACCATTC CCAAAAAGCC AATCGCACTG 1980 

GCAAGCAGCA TAACCGATTT AAAGTTTTTC CCCAGACGCA GGGCAATACT AGCTGGCAAG 2 040 

ACCATAATGG TCGATACCAG AAGAGCTCCT GCTGCAGGAA TCATAAGGGC AATAGCCACC 2100 

CCTGTCACCA TCTTAAAAAG AATGGACATG GTACGAACTG GCAAGCCATC CACAAAGGCC 2160 

GTATCTTCGT CAAAAGTTAA GATATACATA GGACGAAGAA AGAGAAAGGT CAAAATCAAA 2 220 

ACAACCGCCG CAATGACAAA GAGG GAAATG ACCTGTTCTT CACTGATAGT CACGATCGAA 2 280 

C CAAAG AG AT ATTGGTCCAA ACTCATTGAA CTCGAGCTTT TACCCTTGCT CATGACAATC 234 0 

AGAGAAACAG CCAGACCTGT TGACATGAGG ATAGCTGTCC C G ATTTCC AT AAAGCTCTTG 2 400 

TAAACCGTAC GGAGATACTC CAGAAAGACC GCCGCAATCA AGACAATGGC AATAGTAGAA 2460 

ACAGTTGGAG AAATCCCCAA AACCAGACCA AAGGCTACAC CTGAAAGTGA GACGTGGCTA 2520 

AGGGTATCAC TCATCAAACT CTGACGACGC AAGATGAGGA AGGTT.CCCAA TACCGGTGAG 2580 

AAAAGACTCA TAGCAATAAC CGCCAAAAAG GCGCGTTGTA TAAAGTCGTA AGATAATAAA 2 640 
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CTAAGCATGG 


CCCACCTCCT 


GGCCATTCTC 


ATGAACATTG 


AAACAACCCC 


ATGGCGAGTC 


2700 


TTGGTTACGG 


ACTAGATGAA 


TATTGCGATC 


CGCATAATCC 


TTAACTTCTT 


CAGGGTCATG 


2760 


GGTAATCATC 


AAAACAGCCT 


TGCCATGATG 


ATGGGCG CTG 


TGGTGCATGA 


GTTCGTAAAA 


2820 


TTCATTTTTA 


CTTCCTGCAT 


CCATCCCCGT 


TGTCGGCTCG 


TCTAGGATAA 


ACACATCAGG 


2880 


GTCAGAAGCA 


AACATACGCG 


CAATTACCGC 


TCGCTGCTTT 


TGTCCCCCAG 


ATAGAGACCC 


2940 


CAAGCGTTTG 


TCTCGATGTT 


CCCACATGCC 


AACTGAGTCC 


AGACTAGCCT 


TGATATGCTC 


3000 


CTCATCATGA 


GCATTCAAAC 


GACGGAACCA 


GCCTTTTCTC 


GGATAGCGAC 


CCGACTTGAC 


3060 


AAATTCATAG 


ACCGTACTTG 


GAAAACCAGC 


ATTAAAACTG 


GCAATTTGTT 


GAGGAAGATA 


3120 


GGCTATTCTC 


AATTTCTTAC 


CTTGCGTATT 


TGTCTTTGAA 


ATAGCCACCT 


TTCCAATGCG 


3180 


TGGTTGCAGA 


ATTCCAAGAC 


TAGCCTTGAT 


GAGCGTCGTC 


TTAGCCCCTC 


CATTTTCCCC 


3240 


AGTCAAGGTA 


ACAAATT CCC 


CACTATCAAC 


ACAATAATTG 


ATATGTTCAA 


GAACAGGCTC 


3300 


CTTATCATAA 


TAGAAGGACA 


AATCCTCTAC 


CGTAATATAT 


CTCATTATTT 


GATTTCTCCT 


3360 


ACTAAAGCAG 


TCAAAAACCG 


CTGAATCACT 


TTTTGTTCAT 


TTGGAGTAAA 


CTGAGTCGCC 


3420 


ACTTGTTCAT 


AGGTTAAAAG 


TGTATGCTCA 


TGGTGATGGT 


GGTGCTCCTC 


AG CG ATTGG A 


3480 


CGAGCCAAGT 


CAGTCAACTG 


ATAAAAAATC 


ACACGCGCAT 


CTTTAGAATC 


TTTAGATGTT 


3540 


TCCAACATCC CTTCCTTGAC CAAAGACTTA ATGGCCTTGG 


TAACTGCCGC 


CTGACTGACA 


3600 


TTGAGACGAC 


GGGCCAATTC 


TGAATTTGTT 


AAAGATTCCT 


CTGACAAGAG 


CATAAGGATA 


3660 


TGCTCCTGAG 


TATTGGTCAG 


GGCCACCTCG 


CTAGTGCAAT 


GACCTATTAG 


GATTTCATGC 


3720 


TGATTTTCCG 


CCTGCAAAAT 


CACCTCATTC 


AAAAAAGCAT 


TGATATCCTT 


TGCTAGCTGT 


3780 


CTCATATCTG 


ACTCCTTTCC 


TTTTAGACTT 


CTCTTTTTTA 


AGAGAAAAAT 


ACTATTCTTT 


3840 


GACATTTTGT 


TTACCAGTTA 


ATTATATCAC 


AAGCAAAAAA 


AGAGTCAAGA 


AAAAACGTGA 


3900 


AAACTAGTTT 


CATTCTTGAA 


CTCTTCTATA 


TTATATTATC 


TATTGAAATT 


CTTTGAC ATC 


3960 


TCCATCATAA 


GTCGCCCAAT 


CTTTGCTGAA 


AAAGCGCTCA 


TTCAGATGGT 


AAGTCGGAGC 


4020 


TGGTGTGGGA 


TTGGATAGGA 


AAGGATCAAC 


TGCCTTGTCA 


AAAGCCAACC 


AACCCAACCA 


4080 


ACCAAGGTGA 


ATGGTGTCCT 


TCATAAAGAA 


AGGCTCCCCG 


CCGTCCTTAG 


AAAAATCTGC 


4140 


TATATTGGTA 


AAACCTTGAC 


TTTCTAACTG 


GTAGCGAATC 


TTCTGCACCG 


TTTGTTGGTA 


4200 


CATATCCTCT 


CGTAGACCAG 


CATAGTTCAT 


CCATTTTTTA 


TTAACAGGTG 


GAATGATAAA 


4260 


AATCGGGTTT 


ACCTTAGATT 


TAGAAAACTG 


TGTTAAAACC 


AACTCCAAGT 


CATTATACTC 


4320 


TGGCGACTTG 


AGATAGGTAA 


AGCTTTTCTG 


AGAATCCTTT 


AATTTCTTCA 


AATCCTTCTT 


4380 


GATCTGCTCA 


TTATAGAAAT 


AATTTTCCAT 


TCCCATCTCA 


TTATTGGAAG 


TATrrrTTTC 


4440 
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AGCATCTGCT TTGACAACAT CTTCTATTGC CTGATAAGAA AACTGGTCTG GCAAGATTTT 4 500 

TAAATACTTA GCTACATGCT TATCGTAGTT AACATAGCCT CT AAC CG AAA ACTGACCAAA 4 560 

AAAGGAAGCT TGGCGTTCAT TAAAACGAGC CAATAATTCA ATCATTTCAT TGTCTGCTGT 4 620 

CGACAATTCT TCTTTACTTG CCAACTTCTG AACCAGGTCC TTC AT AG CT A CGTTTGGGAA 4 680 

CTGTTGCAGT AAGCGAGTCG CTGCATATTG ACTAG CCTG A TCCCCAGATT GATGTTTCAG 4 740 

AAAACTAGTC AACTGGTCTC CATTAAAATA CTGCTGGAAG GCTGCTGGAT CATACCCATT 4 800 

TTTACTGAAC CACTGAGGTG AGATAACATA CACAACTTGT TTATTCTCCA GCTGTGGTAA 4860 

CATCTGTTGC AT TCC AAAAT ATTGGTTAAG CGATGCAGCT CCCCCCTGTC CTAAAAGATA 4 92 0 

AGG ACGG TAG GAACGATTGT ATTT CTCAGC TAATACCGCA GGATGAGCAC CGTCAAAACG 4 9 80 

AAGCCATTCA CTAGAGCCAA AGAAGGGAAC AAAACGCACA TTTGGATCAG ATAGTGCTCT 504 0 

GACTTTTTGA CTTCGCTCCT TAAAACTATC GAT AGT ACTA GCCACTGCTG AACGCTTTTC 5100 

ACCTCCTAGA TTATGATGCA TCTCAGTAGG ATAAAAGAAA ATGAGCAGAA AAACCAACAA 5160 

ACCAGCGATC AAGACCGGTC CGAAGATCAT CCATAAGCGT TTAAGCATTT TGTAGCTCCA 5 2 20 

CAATACCAGC TATGATTTTA TTAGCTGTAT TCCAGTCGTC ACGACCAAAC TCTGTTACAG 5280 

GGACACGAAT GTCAAAACGG TTCTCAATCT CCACAATCAA CTCAACCGTT CCCATACTAT 5340 

CCAAGACACC TGCATCAAAA AG AT CTT CAT CCATCATGTC AGAAACATCT TCCATAAACA 5400 

ACT CATC AAT AATTTCAATA ACTTCTGATT TGATATCCAT ATTTTATTTC CTTTTATTTT 54 60 

TT AAAC CAT A GATTATTCAA GAATCCAGAA AAGATTAAGA ATGACAACAT GACAACATGG 5520 

AAAGTGACAA CCATGCCAAG CAACTGAATC CAGCGATTCT CAGGTAGGGC AGCCTTCCCT 5580 

GCTTTTTTCC GTTCCTTATT GAGCGTTTTT TTCTTGCGAA CCCAGGCATC ATTGATGACC 564 0 

AAGCCTAGTC CATGAAAGAG TCCATAGGCG ATATAGTACC AGGTCACACC ATGCCAAAAT 5700 

CCCATAATCA GCATATTTAC AATGTAGGCC ATGCTTGAGG TTACATTACG ATTTTTAAAG 5760 

ACTTTCTTTC TGGTTAACAC CATCACCATT CGCATAAAGA CAAAGTCACG GAACCAGAAG 5820 

GACAGACTCA TATGCCAGCG ATTCCAAAAC TCCTTTAAAT CCCTTGATAA AAAGGGCTTG 5880 

TTAAAGTTGA TAGGGCTACG GATTCCCATC AAGTTTGAGA TGGCCAAAGC AAACATAGAA 5940 

TAACCTGCAA AGTCAAAGAA GAGTTCCAGA CCAAAAGTAT ACATAACTGC CAAGG CAT AG 6000 

AGATTAAAGA AGCCACCTGA CTGCAAGGCT AAATTCTTCA GAGGAGGTAG TAACGTCTCT 6060 

CCTAAAACAT GAGCTAGGAT AAACTTATAC AAAAAGCCCC ACATGATATA CCGGACAGAT 6120 

TCATC CAGC A TATCCATCAA CTCATCTCGC TCAGGAATAG CCTGATAATT TTCATTAAAT 6180 
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CGCTTAAACC GATCGATTGC ACCACTCGAG AAAGTTGG C A TGAAGAGAAG GAAACGCAGG 624 0 

AATTCC C AG A GGGTAAAATC CTTAATCACT CCATCTCTCA GCTCG ATG AC AATTCCAACC 6300 

GAACGAAAGG TCAGGTAAGA AATTCCCAAG AACCCAAGCA AAGACTGCGT TCCATTGATA 6360 

GCTGGTTCCA CCTTGACAAA GATAATCGGA AGTAGGGACA CAAAACTAAC TAAGTAGAAG 64 20 

ACCCACTTGC CATCCTTGCT TTTTCGATAA T GCTTGT AG A AAAGCAGGAG CAATATTTCC 6480 

CAGCAAAGGT AAATACCCAA GGCAGCTAGT TGATTGGTCT TTCCACCCAC CAACATGGTG 6540 

ACAATAAAGA AGAGACTTAC C AAC A CTTCA TACCAGGCAA AGCGTTTCTT GAAAAAGAGA 6600 

CCTATAAAGA TGGGCAAGGT TGCAGCAATC ACATAAACAA AATACTGAGG ATTGCCATAT 66 60 

GGCTCTAAAT GAGGAAGCTG TTGAAAAAAC TC CAT C ATCT CTTATTCACC TCCTTAATCA 6720 

ATCCTTTGAT GTCAATCTTT CCATTTGGAG TTAGTGGCAA ACTGTCT CGG TAAAGGAATT 67 80 

TAGATGGCAT CATATAGGAC ATCATGATGT CTGTCAGGTC TTCCTTGATG GCCTTGGTAA 684 0 

TATCGATATC TCGCTCAAAC TGCTCACGAA C ACCGTCTTT TAAGATGACA TAAGCCAATA 6900 

GATTTTGTAC CTTGTGGTCC TTGTTATAGC GCGGTACTGC GACAGCAGAT TCGATAAAGC 6960 

GAGACTTGTT GAGGTTTTGA GAGACATCTT CTAACTCAAT GCGGTAACCG TTAAACTTAA 7020 

TCTGGAAGTC C ATGCGT CCG CCGTAGAGAA GCAAGCCCTC ATCTGTCATG GTTCCCACAT 7080 

CGCCTGTGTG ATAGGCTGGC AGATCTTCAA ACTCAAAGAA GGCTTCTGCT CTTTTTTCAG 7140 

GATTGTTCAT ATAACCTTTT GAAACAGCTG GCCCAGAAAC AATGATTTCT CCCTGCTCAC 7200 

CATTTGGCAG TTTATTTCCT TCCTCGTCAA TGATAAAGGT TGGAGAATCA GCCTTGGTAT 72 60 

AGCCGATTGG TAGGCGTTTG AG AGTCC CT A ACATCTCGTC TGTCACGGCA ACTGCTGACA 7320 

GAGCTACTGT CGCTTCTGTT GGGCCGTAAG CATTGATGAT ACGGGCATTT GGGAAACGCT 73 80 

CGCGCAGTTT TTGAGCTGTT TTGACCGTCA ATTCTTCACC ATCAAAGTAG AAATGCGTGA 74 40 

TTCCAGGCAT TTTCTCACTG TTGAAGTATT CAGACAACAT GGCCATATCT GCAAAGGATG 7500 

GTGTTGATGT CCAGATAGCG ATTGGCAATG AAAAG AT AG C CGCAAAGAGT TGCTTAAACT 7560 

CCTGAGT GAT GACTGAAGGA AGAGTGAAAA GCGTACCACC AAGTGCCAAG GTCGGTGCCC 7620 

AATACATGAC AGACAAGTCA AAAGAATAAG GTGGCTGTGC CAGCATTTGC GGACGACTCG 7680 

GTGTCGCAAA TTCCTTATCC GTAATCATCC AGTTTGTAAA GCTGAGGAGA TTATCATGTG 7740 

AAATCTGCAC TCCCTTAGGC TTACCAGTCG TACCAGAAGT AAAGATAATG TAGTAATTAT 7800 

CATCTCCCTT GACTGGATGC GTGATTTCAT AGTTATTCCC TTGGGCAAAG GCTTCTTGAA 7860 

CCTGAGCTAG ATTTATCATT GGTGTAGAAA CCTGCTCCAA GGGAAAGGCT GAAATGGCAA 7920 

TAATCAAGCT TGGCTCTGCT ACTTCTAAAA TAGCTGAAAC TCGCTCCAAG GCCGAATGGC 7980 



r 

v 
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TATCAATTGG AATGTAGGCA TGACCTGACT TAGTCAGCGC T AC AAAGGT T GCCAACATTT 8040 

CATATTCTTG GCC ACCAAAA ACAACCACAG GAGACTTCTC AGGCAAGCCT AGTTGGTCAA 8100 

TGACTGCAGC CAAACTATCC GAATCAGCCT TTAAATCGCC ATAAGTGTGT TCCTGCCCCA 8160 

AAACATTATA GACAGGATAG CTAGGCTGTG TCTGAGCAAA ATGCTCAATG GTTT CAATC A 8220 

TATCTGCTAT TGGTTTATTT GACACAATAG GG A TTCTCCT TCAAGTTAAA ATTCATTATA 8280 

GATAAAGCTT CCTTGACCCT GACCAAGATA GCTAAAGAAC TAAAGCAGCC CTAGAAAGAT 834 0 

AAGAAAATAC AAGGCTGTCC GACCAAGAAA GAGGTACAAT TCTTTTCTCT GTTT CATC AA 8400 

GAAAAACCAT TCATTTCTGT AATTTTTCGC T AAAATAAG A GTG ATTCTT A CTAGCTTATT 84 60 

TTTCTACCAT TGTACCACTT TATATAGTAT CTTTTCAATT GTTTACCGTA TGTTTCCAAT 8 520 

AGATTTCAGC TTATTTTAAG GATTATACAG TTTTTCTATG TATATTTTCA AATAGAGTGA 8580 

TCCTGCTTCA AAACTCCATT TCACGAGACA ATGAAGTAAA TCTTCCCATA ATAAAACACA 8640 

CAATATCAAG TTTTTTCAAC ACCTGATACT ATGCCCTTTT CTGATTTTTA AAGACTTTTT 8700 

AACCACTCTC TCATTTAAAA TAATCTCGTC TGATATAAAT TAAAATAGCT TCTATCATCA 87 60 

GACAAATGGC TGATAGCCAA AAACTGATGC TAATACCAAA ACTCTCAGTA ATATAGCTCA 8820 

TTAGCAAAAC AAATACTGAA AATGCTAATG TAGAAATCAC TTCAAGAACG GAATAGACAT 8880 

TAACTAAATG ATTTTCCTCT ACTGTTTCCT GAAGAAATAC ACTTTCAGGA ACTTCTTTTA 8940 

GTTGCGATAA CATACCAACT AAAGCTGAAA ATAATAAAAA CATCTGTGCG TTTGGAAAAT 9000 

ATAGAATAGT CAGTGTCACT ATTTCCATAG CTACAAGAGG AAAAAGAATA CTTTCCCCCC 9060 

AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC TCATGAGCCA 9120 

CTTTCTTCCT CCTCATGAGG TCACTTTTAC TTTCTGCTGT TCCAGTATCG TTTTTCCTCG 9180 

CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT TTTCATCTCG 9240 

ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGCTTCATA AGGAACAGGA 9300 

AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC GGAATAGGCA 93 60 

TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT TCCCACCACG 9420 

TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC TCCAGCTAGA 94 80 

TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATC AT ACG AACTTCGTTT 9540 

TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT CTCC TTGATG 9600 

AAATTCTCGG CTTGACCACG TCCACGATAA AGCTGAAACT GGTCTTGGcT gTTCCACTCG 9660 

TCATATTTGT AACGAGAGAA ATAACATCGT AGAACAAGTA TCCTTCTTTT C 9711 
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(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs 
{ B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



<xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


168: 






CCCCTTTGTC 


AAAACTGTAA 


AATTAACGAC 


TCAACAATTC 


ATCTTTACAC 


CAATCTCAAT 


60 


GGAAAACAAA 


AACAAATTGA 


CCTCTGTCAA 


AACTGCTATA 


AGATTATCAA 


AACAGATCCT 


120 


AACAATAGCC 


TCTTCAAAGG 


TATGACGGAT 


CTGAACAATC 


GTGACTTCGA 


TCCCTTTGGT 


180 


GATTTCTTCA 


ATGATCTAAA 


CAATTTCAGA 


CCTTCTAGCA 


ATACTCCTCC 


TATTCCCCCA 


240 


ACCCAATCAG 


GTGGAGGTTA 


CGGTGGAAAC 


GGCGGTTATG 


GTTCCCAAAA 


TCGTGGATCT 


300 


GCTCAAACTC 


CGCCACCTAG 


CCAAGAAAAA 


GGCCTGCTGG 


AAGAATTTGG 


TATTAATGTA 


360 


ACTGAAATTG 


CCCGTCGTGG 


AGACATTGAC 


CCCGTTATTG 


GGCGCG A CG A 


TGAGATTATC 


420 


CGTGTCATCG 


AGATTCTCAA 


TCGTAGAACC 


AAGAATAATC 


CTGTCCTTAT 


CGGTGAACCT 


480 


GGTGTCGGAA 


AAACGGCCGT 


TGTCGAAGGT 


CTAGCTCAGA 


AAATTGTCGA 


TGGCGATGTG 


540 


CCACATAAAC 


TCCAAGGTAA 


ACAAGTCATC 


CGTCTGGATG 


TGGTTAGCTT 


AGTTCAAGGA 


600 


ACGGGGATTC 


GAGGACAATT 


TGAAGAACGC 


ATGCAAAAAC 


TCATGGAAGA 


AATTCGCAAA 


660 


CGTGAAGACA 


TCATCCTCTT 


TATCGATGAA 


ATCCATGAAA 


TTGTTGGTGC 


TGGTTCTGCG 


720 


AGTGATGGTA 


ATATGGACGC 


AGGAAATATC 


CTCAAGCCAG 


CCCTTGCTCG 


TGGAGAACTG 


780 


CAACTAGTCG 


GTGCTACTAC 


CCTCAATGAA 


TACCGTATCA 


TTGAAAAGGA 


TGCTGCCCTC 


840 


GAGCGTCGTA 


TGCAGCCTGT 


TAAAGTCGAT 


GAACCAACGG 


TGGACGAAAC 


AATCACTATT 


900 


CTCAAAGGGA 


TTCAAAAGAA 


ATACGAAGAT 


TACCACCACG 


TT CAAT AT AC 


AGATGCTGCG 


960 


ATTGAAGCAG 


CTGCAACTCT 


TTCCAATCGC 


TACATCCAAG 


ATCGCTTCTT 


GCCTGACAAG 


1020 


GCCATTGACC 


TCCTAGATGA 


AGCTGGTTCT 


AAGATGAACT 


TGACCTTGAA 


TTTTGTGGAT 


1080 


CCTAAAGTAA 


TTGATCAGCG 


CTTGATTGAG 


GCTGAAAATC 


TCAAGTCTCA 


AGCTACACGA 


1140 


GAAGAAGATT 


TTGAGAAGGC 


GGCCTACTTC 


CGCGACCAGA 


TTGCCAAGTA 


TAAGGAAATG 


1200 


CAAAAGAAAA 


AGATCACAGA 


CCAGGATACT 


CCTAGCATCA 


GCGAGAAAAC 


TATTGAGCAC 


1260 


ATTATCGAGC 


AGAAAACCAA 


TAT CCCTGTT 


GGTGATTTGA 


AAGAGAAAGA 


ACAATCTCAA 


1320 


CTCATCCATC 


TAGCCGAAGA 


TCTCAAGTCT 


CATGTTATTG 


GTCAAGATGA 


TGCAGTCGAT 


1380 


AAGATTGCCA 


AGGCTATTCG 


CCGTAATCCT 


GTCGGACTTG 


GTACCCCTAA 


CCGCCCAATC 


1440 



♦ 
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GGAAGCTTCC 


TCTTCGTTGG 


GCCAACTGGT 


GTCGGTAAGA 


C AG AACTTT C 


CAAACAACTG 


1500 


GCTATCGAAC 


TTTTTGGTTC 


TGCTGATAGT 


ATGATTCGCT 


TTGATATGAG 


TGAATACATG 


1560 


GAAAAACATA 


GTGTGGCTAA 


GTTGGTCGGC 


GCTCCTCCAG 


GTTATGTTGG 


CTATGATGAG 


1620 


GCTGGTCAAT 


TAACTGAAAA 


AGTTCGCCAC 


AATCCATATT 


CTCTCATCCT 


TCTCGATGAA 


1630 


GTGGAAAAAG 


CTCACCCAGA 


TGTTATGCAC 


ATCTTTCTTC 


AAGTCTTGGA 


CGATGGTCGT 


1740 


TTGACAGACG 


GGCAAGGACG 


CACCGTTAGC 


TTCAAGGATG 


CCATCATTAT 


CATGACCTCA 


1800 


AATGCAGGTA 


CAGGAAAGAC 


CGAAGCTAGC 


GTTGGATTTG 


GTGCTGCTAG 


AGAAGGACGT 


1860 


ACCAATTCTG 


TCCTCGGTGA 


ACTCGGTAAC 


TTCTTTAGCC 


CAGAGTTTAT 


GAACCGTTTT 


1920 


GATGGCATTA 


TCGAATTTAA 


GGCTCTCAGC 


AAGGATAACC 


TCCTTCAGAT 


TGTCG AG CTC 


1980 


ATGCTAGCAG 


ATGTTAACAA 


GCGCCTCTCT 


AGCAACAACA 


TTCGTTTGGA 


TGTAACTGAT 


2040 


AAGGTCAAGG 


AAAAGTTGGT 


TGACCTAGGT 


TATGATCCAA 


AAATGGGAGC 


ACGCCCAcTT 


2100 


CGTCGGACTA 


TTCAAGACTA 


TATTGAGGAC 


ACAATCACTG 


ACTACTACCT 


TGAAAATCCA 


2160 


AGCGAAAAAG 


ATCTCAAAGC 


AGTTATGACT 


AGCAAGGGAA 


ACATTCAGAT 


TAAATCTGCC 


2220 


AAAAAACCTG 


AAGTTAAAAG 


TTCTGAAAAA 


GAAAAATAAA 


TCCTATAAAA 


AAGGAGTAGA 


2280 


AAATGAAATT 


TTTCTGCTTC 


TTTTTTTACT 


AAAATAACTG 


TAATTTCTTG 


ACAGCTTGCC 


2340 


CTTTGTCCAT 


TATGATATAT 


AGTAGACTGA 


ATCTGAAATA 


GTACGAAACA 


ATTGCTAAAA 


2400 


CATTTATAGA 


AATTAATTTT 


ACTTTCCCAA 


TCGATTTGTT 


CTCATCTTAT 


TTCAATCTGC 


2460 


TATAGTCAAT 


TGAAACAAGA 


ACAAGACAAA 


AGAGCCTCAT 


AAAAGGTATT 


GCAACTTGGT 


2520 


AATACCTTTT 


TGAGGTGCTT 


TTTGATATGA 


GCCCATGTTT 


TCTCAATAGG 


ATTGTACTCA 


2580 


GGTGAGTAGG 


GAGGAAGAGG 


TAAAAGTTTA 


TACCCAAACT 


CTTCACACAA 


GAGTTCTAAC 


2640 


TTACCCATTC 


TATGGAATCT 


TGCATTATCC 


ATAATAATAA 


CCGATGGTGT 


GGTTAATGTT 


2700 


GGTAAGAGAA 


ACTTCTGAAA 


CCAAGCTTCA 


AAAAAGTCGC 


TCGTCATCGT 


CTCTTCGTAA 


2760 


GTCATTGGAG 


CGATTAACTC 


ACCATTCATT 


TGTTAGACCT 


GCAACCAAAG 


AAATTCTCTG 


2820 


ATATCTTCTT 


CCAGATACTT 


TGCCTCTTCT 


TAACTGACCT 


TTTAATGAGC 


GACCATATTC 


2880 


TCGATAAAAA 


TAAGTATCGA 


ATCCTGTTTC 


GTCAATCTAA 


ACAGGTGCTA 


GGTGCTTTAA 


2940 


ACTATTAAAA 


TTCTTAAGAA 


ATAAGGCTAC 


TTTTTCTGGG 


TCTTGTTCAT 


AGTAGGTGTA 


3000 


GTTCTTTTTT 


TTTTCGAGTG 


TAGCC 








3025 


(2 ) INFORMATION FOR SEQ ID NO: 169: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4104 base pairs 



WO 98/18931 



PCT/US97/19588 



1079 

<B> TYPE: nucleic acid 
(O STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


169: 






TTTAAGGTTT 


TAAAAAAAGT 


TTTCGAAAGC 


TTTCTTCTTT 


ATTTTTTAAG 


GGAGAGATAA 


60 


CGTTGATATC 


TAAATCGTfU^ 




CAATTTTTCC 


TTTAGATGTG 


TATTGGTGAA 


120 


TATCATAATC 


TAAATCAGTT 

* ****** A ^Aw L 1 




TCTCCAAAAA 


TCCTGAGTCT 


GAGCCGTAGA 


180 


CGGAATCCAA 


ACAGAHGTAA 


ACTTGCCTGT 


ATCAATACTG 


TGTTCTTCCA 


TGAAGTAGAC 


240 


ACCAACGTAG 


ATGCCGATGT 


1111 AuLnL 


CAGTGATGCT 


AGTTTTGCTC 


GAAAGTTTTC 


300 


GACACCTTCG 




ArAWPTTTT 

n\-Al\ilji 1 i 1 


GTCTTCCACG 


TCAAGCCAAT 


AGTAACTAGG 


360 


GCTGTAAGGA 




i i n«/uift«L 


TTCGGCAGCC 


TTTTCCATTT 


CTTGGACACT 


420 


TTTTCCAGCT 


ACATAAGCGT 


AfiarAnriAr 


TGGGACATTC 


CGCTTTTGAA 


GTTCAGTGAT 


480 


ATGACTCTTA 


TAGGCCTTfTT 


/"••I' l 'fTV^f* & 'T w I 1 

\— 1 /\ 1 1 V- 1 1 


GATAAATGAA 


GCATCATTTT 


CTTTTGTCGT 


S40 


TTG AG C AC C A 


CTGTGAAPAr' 


UAaLAA 1 


ACCTGAAATA 


TTTTGTGAGA 


GGGCATCGTA 


600 


GTTGATTTCC 






GAGGTCAATA 


ATCGGTTTGT 


CTAAGTGTTT 


660 


CAAAGCCTGT 




« l v» W Inlrti i 


GGATTTTGTT 


TTAAACGATT 


GGCTGTCATT 


720 


AAGTGGGCGA 


TTGATGATTA 


AAiTrairan* 


CATAATCCCA 


AAAAAACTAA 


ATAAAATAAG 


780 


TGGATGAATT 


A W A X 1 i U 1 *»/\ 


TATrr^aTi a 

1 i M 1 Afl 


TTCTACCCTA 


AAAATCAAAA 


AAAATCAAAA 


840 


AAATGGGTTA 


AGGAAGAGAC 


TTTAflAfir AT 


TTTTTCATTC 


AAGAGTGCGG 


AATGATTTGA 


900 


AATATGGTAT 


AATAAAAGGG 


AATTTCTACA 


GAAAAGAGAA 


GATTATGTCA 


AATTTTGCCA 


960 


TTATTTTAGC 


AGCGGGTAAA 


GGGACTCGCA 


TGAAATCTGA 


TTTGCCAAAA 


GTTTTGCACA 


1020 


AGGTTGCGGG 


TATTTCTATG 


TTGGAACATG 


TTTTCCGTAG 


TGTGGGAGCT 


ATCCAACCTG 


1080 


AAAAGACAGT 


AACAGTTGTA 


GGACACAAGG 


CAGAATTGGT 


TGAGGAGGTC 


TTGGCTGGAC 


1140 


AGACAGAATT 


TGTGACTCAA 


TCTGAACAGT 


TGGGAACTGG 


TCATGCAGTT 


ATGATGACAG 


1200 


AGCCTATCTT 


AGAAGGTTTG 


TCAGGAC AC A 


CCTTGGTCAT 


TGCAGGAGAT 


ACTCCTTTAA 


1260 


TCACTGGTGA 


AAGCTTGAAA 


AACTTGATTG 


ATTTCCATAT 


CAATCATAAA 


AATGTGGCCA 


1320 


CTATCTTGAC 


TGCTGAAACG 


GATAATCCTT 


TTGGTTATGG 


ACGAATTGTT 


CGTAATGACA 


1380 


ATGCTGAGGT 


TCTTCGTATT 


GTTGAGCAGA 


AGGATGCTAC 


AGATTTTGAA 


AAGCAAATCA 


1440 


AGGAAATCAA 


CACTGGAACA 


TACGTCTTTG 


ACAACGAGCG 


TTTGTTTGAG 


GCTTTGAAAA 


1500 


ATATCAATAC 


CAATAACGCT 


CAAGGCGAAT 


ACTATATTAC 


AGACGTCATT 


GGTATTTTCC 


1560 
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GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 1620 

TAAATGACCG TGTGGCGCTT GCGACAGCTG AGTCAGTTAT GCGTCGTCGC ATCAATCATA 168 0 

AACACATGGT CAACGGTGTT AGCTTTGTCA ATCCAGAAGC AACtfTATATC CATATTGATG 1740 

TTGAGATTGC TTCGGAAGTT CAAATCGAAG CCAATGTTAC CTTGAAAGGG CAAACGAAAA 1800 

TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 18 60 

GAGCGGTCAT TACCAATTCT ATGATTGAGG AAAGTAGTGT TGCAGACGGT GTGATAGTCG 1920 

GTCCTTATGC TCACATTCGT CCAAATTCAA GTCTGGGTGC CCAAGTTCAT ATTGGTAACT 19 80 

TTGTTGAGGT GAAAGGATCT TCAATCGGTG AGAATACCAA GGCTGGTCAT TTGACTTATA 204 0 

TCGGAAACTG TGAAGTGGGA AGCAACGTTA ATTTCGGTGC TGGAACTATT AC ACT CAACT 2100 

ATGACGGCAA AAACAAATAC AAGACAGTCA TTGGAAACAA TGTCTTTGTT GGTTCAAATT 2160 

CAACCATTAT TGCACCAGTA GAACTTGGTG ACAATTCCCT CGTTGGTGCT GGTTCAACTA 222 0 

TTACTAAAGA CGTGCCAGCA GATGCTATTG CTATTGGTCG CGGTCGT C AG ATCAATAAAG 2280 

ACGAATATGC AACACGTCTT CCTCATCATC CTAAGAACCA GTAGGAGCCT ATCATGGAGT 2340 

TTGAAGAAAA AACGCTTAGC CGAAAAGAAA TCTATCAAGG ACCAATATTT AAACTGGTCC 2400 

AAGATCAGGT TGAATTACCA GAAGGCAAGG GAACTGCCCA ACGGGATTTG ATTTTCCACA 2460 

ATGGGGCTGT CTGT G TTTTA GCAGTAACGG ATGAACAAAA ACTTATCTTG GTCAAGCAGT 2520 

ACCGCAAAGC TATCGAGGCT GTCTCTTACG AAATTCCAGC CGGAAAATTG GAAGTAGGAG 2580 

AAAACACAGC CCCTGTGGCA GCTGCCCTTC GTGAATTAGA GGAAGAAACA GCCTATACAG 2 64 0 

GG AAA TT AG A ACTCTTGTAC GATTTTTATT CAGCTATTGG CTTTTGTAAT GAGAAGTTAA 2700 

AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 27 60 

AAACCTTGGA AGTCCTTGAA GTGAGCTTAG AAGAAGCGAA AGAATTAATC CAATCAGGTC 2 820 

ATATCTGTGA TGCCAAGACA ATTATGGCTG TTCAGTATTG GGAGTTGCAG AAAAAATAGA 2880 

GGAGGTCAGT ATGGGTAAAT CTTTATTAAC GGATGAAATG ATTGAAAGAG CTAATAGAGG 2940 

CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA TTTTACCAAC 3 000 

CTCTTCTTCC C G T TT TGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC AGGAAACCTT 3060 

GAAGATTCAG GT CGAAC CAT CTATTCATAA AAGCCGTCGT ATTGAAAATA CCAAGAGAAA 3120 

TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT TGATTTTGCT 3180 

TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA GGAATTATTG 324 0 

CTGCTATGCC AGAAGAACTG GCTTATCT GG TCCAGCATTT AG AT AATGC C CAGGAGCAAG 3300 
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TTGTTTTTGG GAATACCTAT CATACAGGAA CCATTGCTTC TCATGAAGTC GTTCTTGTAG 3 3 60 

AAAGTGCAAT TGGTAAGGTC ATGTCTGCTA TCAGTGTGGC GATTTTGCCT GATCATTTCC 3 420 

AGGTGGATGC CCTTATTAAT ACGGGTTCAG CTGGGGCACT AGCAGAAGGT ATCGCTGTTG 34 80 

GGGATGTCGT GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3 54 0 

ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCGA ATCAGACAAA ACCTTTGTTG 3 600 

CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGCT TTGATTGCTA 3660 

CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAAGTCC CATTTCCCAG 3720 

AAGTTTTAGC CGTGGAGATG GAGGGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780 

TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AAC ATCTTTT 3840 

TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AG TCTTGTTG ACCTTTTTGA 3900 

AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATCATAAGA TTTAAGTAAA 3 9 60 

GAAAAGCTAG AAAACGTTTC AGAGGATATT ATGAGTATTG AAATGACCGT CAGTGAGATT 4 020 

GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080 

GAAGACACAG ATAAAAATGA CAAG 4104 



(2) INFORMATION FOR SEQ ID NO: 170: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8876 base pairs 
tB) TYPE: nucleic acid 
(C) ST HANDEDNESS : double 
CD) TOPOLOGY: linear 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CACGGATAGG CTCGGCTTTC ATCACTCCTC AGGCTGATTT ACTAATAGCA ACTTTCCTCG 60 

ACAAACTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TT ACGCTG AT ACCTTTGCTG 120 

GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGGTAC 180 

AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCCGATCCT TGTGTTGATA 24 0 

GCCTTTCCCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 3 00 

AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 3 60 

GAAAAATCCC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 4 20 

TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 4 80 

GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540 

AATCTCCAAC TGCCACCACT TGACTAGGGT CTTTGATAAA CTTGCGACTC ATATGGGAAA 600 
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TATGGATGAG 


ACCGTCCTCA 


TGAATTCCGA 


TATCAACAAA 


AGCACCGAAA 


TCAACAACGT 


660 


TACGCACCAC 


TCCTTCTAGC 


TTTTGTCCAA 


CCACTAAGTC 


CTTGATATCT 


AGGACATCTT 


720 


GGCGAaCACA 


GGTGCGTCAA 


AGGAATCACG 


GAAATCTCGA 


CCTGGTTTGA 


GAAGATCTGC 


780 


AATGATATCT 


TTAAGAGTTT 


CTGGACCAAG 


GTCTAACTCT 


TGCGCCATTT 


CCTTGACTGA 


840 


AAGCGACTTG 


AGTTTGCTTT 


GGGCTTCTTC 


GTTTAGCTCT 


TTAATATCTA 


AACGTTTGAA 


900 


GAGTTCCTTA 


ACTG CAGTGT 


AATTCTCTGG 


GTGAACTCCT 


GTATTATCAA 


GGATATTGCT 


960 


ACTTTCAGGG 


ATACGAAGGA 


AACCAGCAGC 


CTGCTCAAAG 


GCCTTGGCTC 


CCAGACGAGG 


1020 


AACTTTCTTG 


ATTTGGGCGC 


GTGAAGTGAT 


TTTTCCTTCT 


TCCTCGCGGT 


ATTTGACAAT 


1080 


ATTTTCAGAG 


ATAGTTTTGT 


TGAGTCCAGC 


TACGTGTGAA 


AG AAGAG CTG 


GGCTAGCTGT 


1140 


ATTGACATTG 


ACACCAACTT 


GGTTAACCAC 


TGTATCGACA 


ACAAAGTCCA 


GACTCTCAGA 


1200 


TAGTTTCTTC 


TCACTGACAT 


CGTGTTGGTA 


TTGACCGACA 


CCAATTGACT 


TAGGATCGAT 


1260 


TTTGACCAAT 


TCCGCAAGAG 


GAT CTTGCAA 


ACGACGGGCG 


ATAGAAATGG 


CAGAGCGTTT 


1320 


TTCAACGGTC 


AAGTCTGGAA 


ACTCCTGACG 


AGCAAGTTCG 


CTGGCAGAAT 


AGACAGAAGC 


1380 


ACCACTTTCA 


TTAACGATAA 


CATAGCTGAC 


TTCAGGGAAA 


TCTTTCAGAA 


CTTCCGCTAC 


1440 


AAAAGCTTCA 


CTTTCACGAC 


TGGCCGTTCC 


ATTTCCAATG 


GCAATAATCT 


CTACACCGTA 


1500 


TTGACCAATT 


AAATCTGCTA 


AATCTTTCTT 


GGCTTCTTCG 


ATTTGACGAG 


CTGATGCTGG 


1560 


TTTAACAGGA 


TAAATAACCT 


GAGTTGTCAG 


CATTTTTCCT 


GTTGC AT CCA 


CGACAGCTAG 


1620 


CTTGGCACCT 


GTACGAAAGG 


CTGGGTCAAA 


TCCAAGAACC 


ACGCGCCCTT 


TCAGTGGAGC 


1680 


AACCAAGAGG 


AGATTGCGCA 


GATTGTCAGA 


AAAAAGTTGG 


ATAGCTCCTT 


CTTCAGCTTT 


1740 


CTC AGTTAAT 


TCTGTCCGAA 


TACGACGCTC 


GATAGCAGGC 


AAGACCTTTT 


TCTTAACGGA 


1800 


TTGCTGAACA 


ACTTCATCAA 


TATAAGCATT 


TTTCACCTTG 


AAACGAGTAG 


CAAAGAAGGC 


1860 


AAGAATACGG 


TCCGTCGCAT 


GTTCAAAACC 


GATCTTCAAG 


ACACCAAGTT 


TCTCCCCACG 


1920 


ATTGAGAGCC 


AAGGTACGAT 


AGCCTTGCAT 


AGTTCCAACT 


GTCTCTGAAA 


AATCATAATA 


1980 


AATCTGAAAA 


ACCTGCTTTT 


CATCAAGACT 


TTCATCCTTG 


GCTTGAGAAG 


TAAGTTTACA 


2040 


GTGTCTCAGC 


ACTTCCTGAT 


AAGTCATAGA 


ACGCAAGGTC 


AC ATCTTCCG 


ATAAGGCTTC 


2100 


GACCAAAATA 


TCAACTGCAC 


CGGTCAAGGC 


TTCCTTGCCA 


GTCGCAAATC 


CTTCACAGAC 


2160 


GAACTTTTCA 


GCTTCTTTCT 


CTAAGTCAAC 


TATATTCTGC 


AAAATCAAGC 


GAGCAAGAGG 


2220 


AAAGAGTCCA 


GCTTCACGGG 


CAATGGTTGC 


CTTGGTACGA 


CGCTTTTCCT 


TATAAGGAAG 


2280 


ATAGAGTTCT 


TCAACGTCTG 


CTAATTTTTC 


GGCAACTAAG 


ATAGCTTCTT 


CCAATTCCTT 


2340 
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GGTCAACTTA CCTTGTTCTT GAATCTTAGC TAAGACAGCT TCCTTACGGT CATTGAGATT 2400 

TGTCAGACTT TTATCCAAAT CAATAATAGC CTT AATCGC C ACCTCATCCA GACTACCAGT 24 60 

CATGTCCTTG CGATAACGCG CGATAAAGGG AATAGTCGCC CCTTCAGCTG TCAAACTTAG 2520 

AACGGTATCA ATTTGCTTTA ACGTCACTCC CAAATCCTGA GAGATTTTTT CATATTTTTT 2580 

ATCCATAAAT CTATTATACC ACAAGCTAAA CGTTTCAAAT TAACTCGTAG AACATTTAAA 2640 

AAATATGTAG GAAATAGATT TATATGCTAC AGCGCAATAA CTTGCACTTA AAGAGCATTG 2700 

CCACCTTTTT TTAACCAAGC CATGAT AT C A AAAGTATTTA ATGGATCAGA CATAATAGCC 27 60 

AGTTCTGGAA GATGTTCCTG ACCTGGAATA ACACATTGAC TTTTCAAATT TTTATATGGA 2820 

CGATTGACTA AAATTAATTT ATTAGAATAA GGAAGATTAT C CAT CTT ATT TAAAATTTCT 2980 

TCACTAGCTG AATCTTTATT ATCAAATTTA AAATAAAGAT TATTCCAATT TATGCGTTTT 294 0 

TTTCTTTTTT CCCACTTAGT TCGTGCTTCT TCAATACTAG AATAATGTAG AAAATGAATA 3000 

TCTATATCTC CTAAGTGCCC CAAAGGATAA ACTTCATGAG TCCAGCTCGG TCAAATAAGT 3060 

TCCTCTTCGA AAACAAGTTC TTGTTCCATA TAATAACGAA AATGCTTTGT AAGTTTATAA 3120 

TAATCATCAG GAAGAATAAA TAAACCAACA AAAGGTGTTC TATATTGAAA ACCAAGCTGT 3180 

TTATAAATTA ATCCTCCAAC ACAATTATTA CTTATAATCG TAAAATCTAA TCTATCAAGC 3 240 

TCAAGAAAAG GGAAAATTCC TTTCTCTGCA GCTATTAACT TATGATAAAC AATATCAGAA 3 300 

TCTAAATATT CACCGTCATT TTTTAACCAA GCACTAAAAT TTGCCAATTC TTGAATATAT 3 360 

TGTTTTTTCG CTCTTTCTAT ATCATAGTTT TCTAAGACGG CGCAATCTTT GATTCTATTT 3420 

T C AT AATTTT CTAATATGAT TTTGTAGGAG TCTTTTAGAG GTTTAGCATC TATAACAGGT 3 480 

TTATAGATAT ATGTCGGGAA ATTAATATAG GTTGCAGTTT TAGAGTGAAT ATAAAGTCTC 3 540 

CAAATAAGCT TGTTTATATC AAATTGATTT ATTTTTCGTA AAAGCTTACT ATTGAATAAT 3 600 

TTTCCAAATA ATGAGCGATA TTGTTTTCTA ATTCGATGAT CTGTATCATC CATCTTTTGT 3 660 

AAAACTTGAA CATTCGTTAA ATTTTCTGTC AACCAATTAT CCCCCCAAAA AGG AT AAA/ \G 3720 

TAAAATACTC CATCAACCAA ATCAGCAAAA TGACCAAGAA CAACATCAGA ATCGGATAAT 3780 

TTTATCGCAT GATACATCTT TTCAAATGTC CAATCAAATA ATGAATCATT TGAAGATAGA 3 840 

AACGTAATAT AATCTCCTGT AATCATATCA GACAACTCAG CAAAAGAATT CTCATCTATA 3 900 

AT CTT AAT AT TAAATGATAG ATTCATCTGT TGGCTAATGG AAGCTATCTC CTCTGTAGAT 3 960 

TGATTTACAA TAATAACTTC TATATCTTTT AATGTTTGTC TCTCCACTAT TGACAAAGAC 4020 

TCTAATAAAC TATTTTTATC TCCTTGATGT AACAAAACAA C ACT AATTGA GTAAGTCAGT 4080 

TTGACTACCT CCCATAATTT TCTGATAATG ATTTTCTTTT TATTTAATTA TAGCACAATT 4140 
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ATCATATATA 


. TCAGGTAATA 


> TCAAGCTATA TTATCTCTTA 


, CCTACTCAAT 


TTGAAATTTT 


4200 


AACTTTTCCC 


TTTTCCGCAA 


. AATAATAGTA 


. TAATAGAGGT 


AGAATCTAGA 


ATCGAGGTAC 


4260 


ACCTATGGCT 


GTCAAATTTA 


CAAAACGAGA 


CGACTTGGAC 


AAGATGTTTG 


AAGAGTTTGC 


4320 


TAAACTCCCT 


GATTTGAAAC 


AAGTTACTTT 


CCCTGATGAC 


AAAGAGAAAA 


AAGTCAAAGC 


4380 


AGAAAAGAAA 


AACTAGATGA 


CTGCTTTTCA 


ACAACTCCCA 


TCTAGTGTAC 


TTCAAACTGG 


4440 


AGCCATTTTT 


CTCTCCATT A 


TCATTGAAGC 


CCTTCCCTTC 


GTTCTGATAG 


GAAGCATTGT 


4500 


CT C AGGGCTG 


ATTGAACTTT 


ATATCACACC 


TGACAAGGTT 


TATCATTTTC 


TCCCTCGAAA 


4560 


TCGTTGGGGG 


AGAATCTTTT 


TTGGGACCTT 


TGTCGG TAT A 


CTTTTCCCTT 


CTTGTGAATG 


4620 


TGGAATCGTC 


CCCATCATCA 


ATCGTTTTCT 


GGAAAAAAAG 


GTTCCAAGTT 


ACACGGCCGT 


4680 


TCCTTTTCTT 


GTGACAGCAC 


CTGTTATCAA 


TCCCATTGTT 


CTTTTTGCGA 


CCTATTCTGC 


4740 


CTTTGGCAAC 


TCCTTCCATG 


TCGCCCTATT 


ACGAGCTCTG 


GGTTCCATTC 


TTGTGGCTGT 


4800 


AATACTAGGA 


ATTTTTCTAG 


GATTTTTCTG 


GC AAGAACCG 


ATTCAGAAAG 


AAAATCGTCT 


4860 


GGCTTGTCAT 


G AGCATG ATT 


TTTCTTACTT 


GAGTTCTGCA 


AAAAAAGTTT 


TTCAAGTCTT 


4920 


TGTGCAGGCC 


ATTGATGAAT 


TTTTTGATAC 


GGGGCGTTAT 


TTGGTATTTG 


GCTGCCTCTT 


4980 


TGCTTCTATA 


ATACAGGTCT 


ACGTT CCGAC 


TCGGATTCTG 


ACCTCTATCA 


GTGCGACCCC 


5040 


TCTTTTTGCC 


ATCCTGCTCT 


TGATGATTTT 


AGCCTTTCTT 


CTTTCGCTCT 


GTAGTGAGGC 


5100 


GGATGCCTTT 


ATAGGTGCTT 


CTCTTCTCTC 


GAGTTTCGGT 


TTGGCACCAG 


TTCTGGCCTT 


5160 


TCTCGTCATT 


GGTCCAATGC 


TGGATATCAA 


AAATATTCTC 


ATGATGAAAA 


ATTACTTGAA 


5220 


AGCACGATTT 


ATCAGTCACT 


TCATAACAAT 


TGTAACTCTT 


GTCGTCTTAG 


TCTATTCTCT 


5280 


CTTGATTGGA 


GTTATCCTAT 


GATTCGATTT 


TTAGTTTTAG 


CTGGCTATTT 


TGAACTGACT 


5340 


ATTTACCTCC 


ATCTGTCGGG 


CAAACTAAAC 


CAGTACATCA 


ACATGCACTA 


TTCCTATCTG 


5400 


GCCTATATCT 


CCATGGTGCT 


TTCTTTTATC 


TTGGCTATCG 


TTCAATTGTA 


TATCTGGATG 


54 60 


AAGCAAGTCA 


AAACCCACAG 


TCATCTGAAC 


AGCCGATTAG 


CCAAGATAAC 


GAGTATTTCT 


5520 


CTTCTGGCTA 


TTCCACTTGT 


CATCGGCTTA 


ACTTTCCCAA 


CTGTTAGCTT 


GGATTCTCAG 


5580 


ACTGTTTCTG 


CTAAAGGTTA 


TCATTTCCCC 


CTATCGGAAG 


GAACGGATCT 


AGCCATTCAG 


5640 


ACAAGCGAAG 


GGACGACAAG 


CCAATATTTG 


AAACCAGATA 


C C AGTTCTT A 


TTTTTCAAAA 


5700 


TCAGCCTATG 


AAAAGGAAAT 


GCGAACGGCG 


GCGGATAAAT 


ACTTATCCCA 


AGATAGTATT 


5760 


CAGATCACTA 


ATGAAAACTA 


TATGGAAGTC 


ATGGAGGCTA 


TCTACGACTA 


TCCAGATGAG 


5820 


TTTGAGGGCA 


AGACAATCCA 


GTTTACAGGC 


TTTGTCTATA 


ACGACCCCAG 


TCATGCCAAT 


5880 
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AGTCAATTTC 


TGTTCCGATT 


CGGCATTATC 


CACTGTATCG 


CAGATTCTGG 


TGTCTATGGA 


5940 


TTGCTGACCA 


AGGGCAATAC 


CCGG C ACT AT 


GAAAACAACA 


CTTGGATAAC 


AGCCAAAGGA 


6000 


AAACTGGTCA 


ATCACTACCA 


TAAAGAACTC 


AAACAAAACC 


TTCCAACCTT 


GGAAATCGAC 


6060 


AGCTTTACCA 


AAGTCGATAA 


ACCAGAAAAT 


CCCTATGTAT 


ATAGAGCTTT 


TTAACAAAAT 


6120 


CAAGATAAAA 


ACGAACAAGT 


TCTCTTCTGA 


ATAACAGAAA 


AAGAGCCTGT 


TCGTTTTTl'G 


6180 


TTATATGAAA 


ATTAGTGACT 


TGTAGATTTT 


CATCTTATAC 


CATTCCCAGC 


AATACAAGTA 


6240 


GCTCATAGAA 


AATAAGCGAG 


CCACTCATTC 


ATTAGACTAG 


CGATTTCTTT 


AGGTGCTTGA 


6300 


GTATAAAGCT 


CATGGCCAAA 


GTTTTCTAAA 


AAAATAGTAT 


CAAAATAGTC 


TGGCAATTCT 


6360 


TTTAGGGCTT 


CCTCTCTCCA 


TGTAGCTTCA 


TTAGGATAGC 


GACGACTAAT 


AAACAAGGTA 


6420 


TCTCCCACTT 


CTCTCTTAAA 


AGCTTGTATT 


TTTCTCCGTA 


GcGGAGTATC 


GCTTCTATAT 


6480 


TTTCATAATT 


TATAGCCAAC 


TCATATCTAT 


TATACTCAAC 


ATTCCAGTGA 


TAAGACTGTC 


6540 


TTACAGCTTT 


CTCCATATTT 


TCTGACCAAT 


GCTTTGCTTC 


AGATTTTTCT 


TTAGAAGTAA 


6600 


GAACATCTAA 


GTCCGAAACA 


ATTTGAGATT 


TGATATAATT 


TTTAGTTTCC 


TCTAACTCTG 


6660 


TATCCAAAGG 


TAAAATCTTA 


TCTAAATCTA 


GATAGCCACC 


ATCCAAAAGA 


ATCAGTTTCT 


6720 


TTACTTCTTC 


AAATTCCGAT 


GCGAAATAAC 


GAGCTAAATC 


TCCTCCAAGA 


GAATGGCCTA 


6780 


TCAGACAGAT 


AGATTCTTCC 


TCTACAATTT 


CATTTTTAAA 


CCATGATTTC 


AATTCTGTTT 


6840 


CATCTCGAAC 


ATGCTTTTCA 


TATGGATTTA 


GAAAATAGAC 


CTGCGAATCT 


AGTTCTTGAA 


6900 


GAAAATCCTT 


GCTATGATAG 


GCATTGCTTC 


CCAAACCGCC 


AAT AAAAT AT 


TTTTTCATTC 


6960 


TCTACTTAAT 


ACTATGCTTA 


TTCATCTTTT 


GTTCAAACAT 


ACTTGTGATA 


ATCTGACGCA 


7020 


ATTCTTCGCG 


TTT TTTTTCT 


GGAATCTCAC 


CACTTGTTTG 


AGCTACAGCG 


TAGAGTTCAG 


7080 


GGTATTCAAT 


TGAAATGCCT 


TTAATCGTAC 


GTGTTGTAGC 


ATGTTTTCTG 


ACAAAAAACG 


7140 


GGATTCGCTT 


AATCAAGTCT 


TGTGGGACTA 


GCGCCAGAAT 


CTTCTCAGTA 


GTTTCTTTGT 


7200 


CACTAATATT 


AGACATTGTA 


AGCCTTTTCT 


TAATCATTTC 


CTGTTCTTTT 


TCTGT AAAAT 


7260 


CTTTTAATTC 


CATTCGATTA 


GTCCTCCTAT 


TTTCTCTAAG 


TTAAATTATG 


TACT AAT AC A 


7320 


GATG AAACT A 


CAAAGAATAA 


ACTTTAAGAA 


ATCTTCTCAC 


TGATAAGATT 


TTAGCATTAG 


7380 


ACTTCCTGCG 


AAACAAAATA 


TGGTATAGTA 


GTTCTATGAA 


TTATGAAGCA 


AGTAAACAAC 


7440 


TAACTGATGC 


ACGATTTAAA 


CGTCTTGTTG 


GTGTTCAGCG 


CACGACTTTT 


GAAGAGATAT 


7500 


TAGCTGTATT 


AAAAACAGCT 


TATCAACTTA 


AACACGCAAA 


AGGTGGACGA 


AAACCTAAAT 


7560 


TAAGCCTAGA 


AGACCTTCTT 


ATGGCCACTC 


TTCAATATGT 


GCGAGAATAC 


CGCACTTATG 


7620 


AAGAAATTGC 


GGCTGATTTT 


GGTATTCACG 


AAAGCAACTT 


AATCCGTCGG 


AGCCAATGGG 


7680 
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TTTAAGTAAC TCTTGTTCAA AGTGGTGTTA CGATTTCAAG AACTCCTCTC AGTTCTGAGG 77 4 0 

ACACGGTAAT GATTGATAGC CATTCCCATC AATATCGTAT CTTTGGACAT AGCCAATAAA 7800 

TGTTTCATTT TTGCGTGGTT TCTGGCTATT AACGATTGAA ATAACCCACC AACTTATCAA 7 860 

AAATAGAAAT AAAAATCCTA AGATTACTGT CATATCATAA C ACT ATT AAA GTTTAACCCA 7 920 

CTTATCATTA TCCATGATAA AAGGCTTAGC CAGTCCCTCG CCTGT AT AA T CCGCATACTT 7 98 0 

GGTG CCC AAA TACTTGTAGC AATCTTCCTT ACTAGCAAAT TTAATCGCTT GGTAGGGCTC 804 0 

TTCGAAAGTC AATTTCTCTA CAAATAAGAA ACCGTCATCA GCAGGTACTA AGACCCCAAC 8100 

GTGGCCTACA AACAGATACT CGCCATCCAA ATTGTCGTGC AAGACTACAG ACAGCATTCG 8160 

AGCTTTTTCA TTGAATTGAA ATTGTGAGAA GAATGCTTCC ATCTTTTCAG CGTGAACCTT 82 20 

GACATCTGTA GTTGACTCAG TTGGAACTCT CGAAAATAGA ATATCAAACT CTTCCTTATC 8280 

TTGTGAATCA AAGACCTTTC CTTTATCAAT CGCATCATTA TCTAGGAAAA GCAACTGGTC 8 340 

ATTCTTTTCA AGCTTTGGAA TGGTG ACT G A ATTTTTCAAA AGACAATAAC TATTGATACG 8400 

GCAGTTGGTC CCAACAAAAT CGCCCTTCTT TTG ATTC C AG AGATGACTGA TTTTCTCAAC 8460 

ATCGTATTCG GTGTGAGTAA AGGAAGTGAA ATCTCCTCAT AAGCCAGTTG AGCCGACAAT 8520 

GGT ATT AT AG TCATTAACGA GATTAAAAAA TGCATCAACA CTATTTGGAT CCAAGTGAGC 8580 

TGATAAGAGA GATTTGACCT CTTCTGTACT TACCTGGTTG TTTAGGTTGG TGTATGAAGC 8640 

TTTCCATGGA ACTTTCGCTG AACTGCTTTG CCTTTGATTC GTCCCCTCAG AAGTAGCATG 8700 

TTGTTGTTGA CAAGCAGCCA AGCCTAAAAA CAAGGCTGAA CAGATTCCTA ATGTGGCTAA 87 60 

TTTTCTTGAT TTCTTCATTT CTTTCTCCTA AATGTCTTGG ATTAAAGTTT CTTTAACTAT 8820 

TGCTTTACAG ATATTGATTA CTTTCTCATT TAATGTGTTC ATCGTCTTTC CTCCGG 887 6 
(2) INFORMATION FOR SEQ ID NO: 171: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

CGCAAACTTT CGCGGTCGGA AGGTAGTTTT ATGACACGAT TTGAGATACG AGATGATTTC 60 

TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120 

CCAGAGGATT GGTATCATTC GCTCTATAAC TTGAAGGCTC TTGGTTTTAA TACGGTAGAG 180 
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ACTTATGTTG 


CTTGGAATTT 


ACACGAGCCT 


TGTGAAGGTG 


AGTTTCATTT 


TGAAGGTGAT 


240 


CTGCATTTAG 


AGAAATTTCT 


CCAAATAGCG 


CAGGATTTGG 


GTCTCTACGC 


AATTGTGCGT 


300 


CCGTCTCCAT 


TTATCTGTGC 


GGAATGGGAA 


TTCGGTGGCT 


TACCAGCTTG 


GCTCTTGACC 


360 


AAGAACATGC 


GAATTCGCTC 


ATCCGACCCA 


GCATATATCG 


AGGCAGTTGG 


TC GOT ACT AT 


420 


GATCAGTTAT 


TGCCAAGACT 


GGTGC CTCGT 


TTGTTGGACA 


ATGGTGGCAA 


TATTCTCATG 


480 


ATGCAGGTTG 


AAAATGACTA 


TGGTTCTTAC 


GGAGAAGATA 


AGGCTTACCT 


GAGAGCGATT 


540 


CGACAGCTAA 


TGGAAGAGTG 


TGGCGTAACC 


TGTCCCCTCT 


TTACATCACA 


TGGTCCATGG 


600 


CGAGCTACTC 


TGAAAGCTGG 


AACCTTAATT 


GAAGAGGACC 


TCTTTGTAAC 


AGGAAACTTT 


660 


GGTTCTAAGG 


CACCTTACAA 


CTTTTCGCAG 


ATGCAGGAAT 


TCTTTGATGA 


A CATGGT AAG 


720 


AAATGGCCAC 


TCATGTGTAT 


GGAGTTCTGG 


GATGGTTGGT 


TCAATCGCTG 


GAAAGAACCG 


780 


ATTATCACAC 


GGGATCCTAA 


GGAATTGGCA 


GATGCAGTTC 


GAGAGGTTTT 


GGAACAAGGC 


840 


TCTATCAATC 


TTTACATGTT 


CCACGGTGGT 


ACAAACTTTG 


GTTTCATGAA 


TGGTTGCTCA 


900 


GCTCGAGGAA 


CTTTGGACCT 


GCCACAAGTT 


ACGTCTTATG 


ATTACGATGC 


CCTTCTGGAT 


960 


GAAGAAGGAA 


ATCCAACTGC 


TAAATATCTT 


GCAGTCAAGA 


AGATGATGGC 


AACACATTTT 


1020 


TCAGAGTATC 


CGCAGTTGGA 


ACCACTCTAC 


AAAGAGAGTA 


TGGAGTTGGA 


TGCTATTCCA 


1080 


CTAGTTGAAA 


AAGTTTCTTT 


GTTTGAAACC 


TTAGATAGCT 


TGTCAAGTCC 


TGTAGAAAGT 


1140 


CTCTATCCTC 


AAAAGATGGA 


GGAGCTGGGA 


CAAAGTTATG 


GCTACCTACT 


TTATCGAACA 


1200 


GAAACAAACT 


GGGATGCAGA 


AGAAGAAAGA 


CTTCGTATCA 


TTGATGGTCG 


AGATAGGGCC 


1260 


CAGCTGTATG 


TCGATGGTCA 


GTGGGTTAAA 


ACTCAATATC 


AGACAGAGAT 


TGGGGAAGAT 


1320 


ATTTTTTATC 


AAGGTAAAAA 


GAAAGGGCTA 


TCTAGGTTAG 


ATATCTTGAT 


AGAAAATATG 


1380 


GGGCGTGTCA 


ACT ATGGG C A 


TAAGTTCTTA 


GCGGATACGC 


AACGTAAGGG 


AATTCGGACA 


1440 


GGGGTCTGTA 


AGGATCTGCA 


TTTCTTACTA 


AACTGGAAAC 


ACTATCCACT 


CCCACTAGAC 


1500 


AATCCTGAGA 


AAATTGATTT 


TTCAAAAGGA 


TGGACTCAAG 


GACAACCAGC 


CTTTTACGC V 


1560 


TATGACTTTA 


CAGTCGAAGA 


GCCAAAAGAT 


ACTT AC CT AG 


ACTTGTCTGA 


GTTTGGTAAG 


1620 


GGGGTT GCCT 


TTGTCAATGG 


GCAGAATCTA 


GGACGTTTTT 


GGAACGTTGG 


CCCAACTCTC 


1680 


TCACTTTATA 


TCCCTCATAG 


CTATCTCAAG 


GAAGGTGCCA 


ACCGCATCAT 


TATCTTTGAA 


1740 


ACAGAAGGTC 


AATATAAAGA 


AGAGATTCAT 


TTAACTCGTA 


AACCTACACT 


AAAACATATA 


1800 


AAGGGGGAAA 


ACTTATGACA 


ATTGTAGGAT 


G CCGTATTG A 


TGGACGTTTG 


ATCCACGGAC 


I860 


AAGTAGCCAA 


TCTTTGGGCT 


GGAAAACTAA 


ATGTTTCACG 


CATTATGGTT 


GTAGACGACG 


1920 


AAGTTGTCAA 


CAACGATATT 


GAAAAGAGTG 


GTTTGAAACT 


TGCGACACCA 


CCAGGTGTGA 


1980 
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AATTGAGTAT TTTGCCAGTT GAGAAAGCTG CAGCCAATAT TCTTGGTGGC AAATACGATA 2040 

GCCAACGTCT CTTTATCGTG GCTCGTAAAC CAGACCGCTT CCTTGGTTTG GTAGAAGCAG 2100 

GTGTACCACT TGAAACCCTT AATGTTGGGA AT ATGTCTC A AACACCAGAA ACTCGTTCTA 2160 

TTACACGTTC TATCAACGTA GTAGACAAGG ATGT GGAAG A CTTCCACAAA CTGGCAGAAA 22 20 

AAGGTGTTAA AC TT ACTGCT CAGATGGTTC CAAATGATCC AATTTCAGAC TTTTTGAGCT 2 280 

TATTAAAATA GGAAAAAAAT TTTTAGGAGG TCATTGTTAT G AT AC AATGG TGGCAAATTT 2 3 40 

TACTTCTCAC TTTGTACTCA GCTTATCAAA TCTGTGATGA GTTGACGATC GTTTC AT CTG 2400 

CAGGTTCCCC TGTATTTGCT GGTTTCATTA CTGGTTTAAT C ATGGG AG AT GTGACTACTG 2 4 60 

GTTTACTTAT CGGTGGTAAC TTGCAACTGT TCGTTCTTGG GGTTGGTACC TTCGGTGGTG 2 520 

CTTCTCGTAT CGACGCAACT TCTGGTGCGG TTCTTGCGAC ACCTTCTCTG TTTCACAAGG 2580 

AATTGATGCA CCGCTTGCCA TTACTACAAT CGCTGTACCA GTAGCAGCTC TCTTGACTTA 2640 

CTTCGACGTT CTTGGTCGTA TGACTACTAC CTTCTTCGCT CACCGTGTGG ATGCTGCAAT 27 00 

CGAACGCTTT GACTATAAAG GTATTGAACG CAACTACTTG CTTGGTGCGA TTCCGTGGGC 27 60 

TCTATCTCGT GCCCTTCCAG TCTTCTTTGC CCTTGCTTTT GGTGGTGCCT TTGTACAATC 2820 

AGTAGTAGAC TTCGTTGAAG CCTACAAATG GGTTGCAGAT GGCTTGACAC TTGCAGGACG 28 80 

TATGCTTCCA GGTCTTGGAT TTGCAATCTT GCTTCGTTAC CTTCCAGTTA AACGTAACCT 294 0 

TCACTACCTT GCTATGGGAT TTGGTTTGAC AGCTATGTTG ACTGTT CTTT AC TC AT ATGT 3000 

AACAGGTCTT GGTGGCGCTG TTGCTGGTAT CGTAGGTACT CTTCCTGCTG AAGTTGCTGA 3060 

AAAAATTGGT TTCGTGAACA ACTTCAAAGG TTTGTCTATG ATTGGTATTT CTATCGTAGG 3120 

TATTTTCCTT GCAGTGCTTC ACTTCAAAAA TAGCCAAAAA GTAGCTGTAG CAGCACCTTC 3180 

TACACCATCA GAAAGTGGGG AAATCGAAGA TGACGAATTC TAATTACAAA CTTACAAAAG 32 4 0 

AAGATTTTAA TCAAATCAAC AAACGTAGCT TGTTTACTTT CCAATTAGGT TGGAACTACG 3 3 00 

AACGTATGCA AGCTTCTGGT TACCTTTACA TGATCTTGCC TCAGTTGCGT AAAATGTATG 33 60 

GTGATGGAAC TCCTGAATTG AAAGAAATGA TGAAAGTTCA TACTCAATTC TTCAATACTT 34 20 

CACCATTCTT CCATACCATT ATCGCTGGTT TTGACCTTGC CATGGAAGAA AAAGATGGTG 34 80 

TAGGTTCAAA AGACGCCGTT AACGGTATCA AGACAGGTTT GATGGGACCA TTCGCTCCTC 354 0 

TTGGGGATAC AATCTTTGGT TCACTTGTAC CTGCTATCAT GGGGTCAGTC GCAGCAACTA 3600 

TGGCTATCGC TGGCCAACCT TGGGGGATCT TCCTTTGGAT TGCAGTTGCA GTAGCGTATG 3 6 60 

ACATCTTCCG TTGGAAACAG TTGGAATTTG CTTACAAAGA AGGGGTTAAC CTT ATCAAC A 37 20 
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ACATGCAAAG TACCTTGACA GCTTTGATTG ACGCTGCATC TGTACTTGGT GTCTTCATGA 37 80 

TGGGTGCTCT TGTAGCAACA GTGATTAACT TTGAAATTTC TTACAACTTG CCAATCGGTG 3840 

AAAAGATGAT TGATTTCCAA GACATCTTGA ACCAAATCTT CCCACGTTTG CTTCCAGCAA 3 900 

TCTTTACTGC CTTTATCTTC TGGTTGCTTG GTAAGAAAGG TATGAACTCT ACTAAAGCTA 3960 

TCGGT ATT AT TATCGTACTT GCTTTGGCTC TTTCTGCCCT TGGTCACTTT GCACTTGGAA 402 0 

TGTAATTCCT TATGACTAAA TCATTAATTT TGGTGAGCCA TGGTCGCTTC TGTGAGGACC 4080 

TTAGAGGTAG CACAGAAATG ATTATGGGCC CACAAGACAA CATTTACACA GT AGCTCTT C 4140 

TTCCAGAAGA TGGCCCAGAA GAATTTACTG CTAAATTTGA AGCTGTTATT GAAGGATTGG 4200 

ATGATTTCCT AGTCTTTGCG GATCTTCTCG GTGGGACACC TTGTAATGTG GTGAGTCGCT 4260 

TGATCATGGA AGGTCGTGAT ATTGACCTTT ACGCAGGGAT GAATCTTCCA ATGGTGATTG 4320 

AATTTATCAA TGCGAGCCTT ACAGGCGCAG ATGCGGACTA CAAGAGCCGT GCTGCAGAAA 4 380 

GCATTGTGAA AGTTAATGAC CTGTTAGCGG GCTTCGATGA TGACGAAGAT GAATAATACT 444 0 

CTTCGAAAAT CTCTTCAAAC TACGTCAACG TCGCCTTGCC GTAGgTATAT GTTACTGACT 4500 

TCGTCAGTCT TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTCTTATC 4560 

CGGCAACCTC AAAGCAGTGC TTTGAGCAGC CTGCGGCTAG TTTCCTACAG ATTTTAGTTG 462 0 

GAACTCGATT CAATTCATGT GACAACGTGA AAATCGTTAG AGCATTTTAT ATAGAATATA 4680 

CATGGGAATG TAGCTTACTC CCATTCCCAT ATTT AAT AG A AAAAGAGGAA CTCAATGCTA 4740 

CATTATACAA AAGAAGACTT GCTCGAATTG GGTGCAGAAA TCACTACGCG TGAAATCTAC 4 800 

CAACAGCCTG ATGTATGGAG AGAAGCTTTT GAATTTTATC AAGCAAAACG TGAAGAAATT 4 8 60 

GCAGCCTTCC TACAAGAAAT CG CTGAT AAA CAT G ACT AT A TTAAGGTTAT CTTGACAGGT 4920 

GCTGGGACTT CTGCTTATGT GGGAGATACC TTGCTACCTT ATTTTAAGGA AGTCTATGAC 4980 

GAACGCAAAT GGAATTTCAA TGCTATTGCG ACAACAGATA TCGTTGCCAA TCCAGCAACC 5040 

TATTTGAAAA AAGATGTGGC AACTGTCCTT GTGTCTTTTG CTCGTAGTGG GAATTCGCCT 5100 

GAAAGTTTGG CGACTGTTGA TTTGGCCAAA TCCTTGGTGG ATGAGCTTTA TCAAGTGACG 5160 

ATTACTTGTG CAGCAGATGG TAAATTGGCT CTTCAAGCTC ACGGTGATGA TCGTAATCTC 5220 

TTGCT C TTGC AACCAGCTGT CTCTAATGAT GCTGGATTTG CCATGACTTC TAGCTTTACG 52 80 

TCTATGATGT TGACAACTCT CTTGGTCTTT GATCCTACAG AATTTGCTGT TAAGTCTGAA 5 3 40 

CGTTTTGAAG TTGT AT CT AG TCTTGCCCGT AAAGTTTTAG ACAAGGCAGA AGATGTCAAA 54 00 

GAGCTCGTTG ATTT AG ACTT TAACCGTGTC ATCTATCTAG G CGCTGGTCC TTTCTTTGGA 54 60 

CTTGCTCATG AAGCTCAGCT CAAGATTTTG GAATTAACTG CTGGTCAAGT TGCGACCATG 5520 



WO 98/18931 



PCT7US97/19588 



1089 

TATGAAAGCC CAGTTGGCTT CCGTCACGGT CCAAAATCTC TTATCAACGA CAATACAGTT 5580 

CTTTTGGTCT TTGGTACAAC GACAGACTAC ACTCGTAAGT ACGACTTGGA CTTGGTTCCT 564 0 

GAAGTTGCTG GTGACCAGAT TGCTCGTCCT GTTGTCCTTT TGAGTGATCA AGCTTTTGGT 5700 

CTTGAAAATG TCAAAGAAGT GGCCCTTGGT TGTGGCGGTG TCTTGAATGA TATTTACCGT 5760 

GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTGCTTTAT TGACTTCACT CAAGGTAGAA - 5820 

AATAAACCAG ATACACCGTC TCCTACAGGT ACAGTAAACC GTGTAGTACA AGGTGTCATA 5880 

ATTCACGAAT ATCAAAAGTA AGACAGTGTT TATGAATTCT TGACAAGAGG ATTTGTAAAT 594 0 

TATCAGATAA ACCATAGATT GTCAGTACGC TTTCTATGGT TTGTTTGCTT GAGAGAAATA 6000 

GTAAAAGGAG AACAGAATGA AAGC AT AC A C AGAGCGTGTA TTTGGAAATG TTGAGGGTGA 6060 

GGATGTCTTG GCCTATCGAT TTGAGACAGA CGGTGGCTAC CAACTTGAGG TTATG ACTT A 6120 

TGGTGCGACT ATCTTGCGCT ATGTCGCACC TGACAAGGCT GGAAATTTTG CCAATGTTAT 6180 

CTTGGGATTT GATGACTTTG ATAGTTATGT AGGCAATAGT CCCAAGCATG GAGCAAGTGT 6240 

AGGTCCTGTA GCGGGTCGTA TTGCAGGTGC GACCTTTGAG CTCAATGGTA AGACCTATGA 6300 

CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 6360 

CTTGTTTGAA GTTGAAGAAG TAAGCGATCA TGGCTTGACT CTCTACACAG AGCGT AC AG A 6420 

TGGGACAGGA GGGTTCCCTG GAAATCTCAA GATTTGGATC AGTTATCACT TGGAAGAAAC 6480 

TGGTGCCTAT GAAATCAGCT ACAAGGTAAC GACCGATCAG GATACGCTGG TCAATCCAAC 6540 

CAACCACAGC TATTTCAACT TGTCTGGTGA TTTCACGCAG ACGATTGACC GTCATGTCTT 6600 

CCAACTAAAC ACAG AGGGC A TTTACTCAAT CGCTCCTGAC GGTGTTCCTG CCAAAACTCC 6660 

AGAAGCCAAC CGTGATGTGG TCAAACACGT CTACAATCGT ACCTTGTTGA AGGATATCTT 6720 

TGCAGAAGAA GATGAGCAAA TCCAGCTGGC ATCAGGTTTG GATCATCCAT TTGCCCTTCC 67 80 

TGCAGGCCAT GACAATGCTG GATTCCTTTA TGACCAAAAT TCAGGTCCCT TCCTGCTTTT 6840 

CAAGACAGAA GCTCCTTGCT TTGTGGTCTA CACAGCAAAC TTTGTGGATG AAAGTGTCAT 6900 

CATAGGAGGT CAGCCAATGC TACAGCACAA TGGGATTGCT CTTGAAGCGC AAGCTTTACC 6960 

AGATGCCATT CACAGTGACC TTAAAGGCCA AGTCATTCTT AAAGCTGGTC AAACCTTCAC 7020 

CAGTAAGACA CGTTATGAAC TTGTTGTGAA GTAAAAGAGT CATTGCGCCT ACTTTTGGGA 7080 

GCTAGGAATA GGTACGCAGA GACAAATAGT AGGAAAATAT GATATAACTA AGCGTTGAAA 7140 

GCTATCTGTT AATATAATAT TCAAACTACA ATAAGGAGTA AGAAAGAAAC GAAGAAAATT 72 00 

GTATTTGCTA GTGCCTTGGC TTTGACCTTG GCTGGAGCAG TTTTGACAAA TGATGTTTTT 72 60 
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GCGAACGACA CACTTCTGGC AACACAAACT ACTGATGCTA AAAATGAAAA TGTATTGACC 7 320 

TCAGAGGTGC TAAAACCTTC TAGTGGCAAT GTTTTGGTTG GAATCAAAGG AGAATTTGTG 7 380 

GCTCCTCATC AACAATCTAT TTTGGATGCC ATCAATGCTA TCTGTAAAGA AGCGGCTGAC 74 4 0 

GAAGGTTTGG T AG AT AAGT A TGTCCCTATC AAATGATCAA CTGAC CT AG A AAAGGCAGCT 7 500 

TTTGCCAGAG CTACAGAAGC ATCTATAACC ATGGATCATA CCCGTCTTTC TAGCAAAGAT 7S60 

CTTTGGAGTG CCTTTCCAAC TTCTAATAGT ATAATGGGAG AAAATTTGGC ATGGAATCAT 7 620 

GACGGTTTTC TAAAAGCTAT TGAACAATGG CGTGCTGAAA AAGCAGATTA TGTGGAGAAA 7680 

AAAATAGTGG TTCAGACAAC GGGAAATCTG GTCACTATGA GTCGCTAATT AACCCTAAAT 774 0 

TTACACACAT GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA 7800 

TTGCTCAAAC TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG 7 860 

CTGTTCAGTG TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTT AAAACT AAAGCTACGG 7 920 

TTGTAGAAAA ACCACTGAAA GATTTTAGAG CGTCTACCTC TGATCAGTCT GGTTGGGTGG 79 80 

AATCTAATCC T AAATGGT AT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA 804 0 

CAGATGGTAA ATGGTACTAT TTGAATGACT TAGGTGTC AT GCAGACTGGA TTTGTAAAAT 8100 

TTTCTGGTAG CTGG T ATT AC TTGAGCAATT CAGGTGCTAT CTTTACAGGC TGGGGAACAG 8160 

ATGGTAGCAG ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG 8220 

AAAATGGCAC TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG 8 280 
TCGGACCACA CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC 8340 
CAGATGGTTA CCCTGTAAAT GGTAATGGTG AATGGCTAAA CTAGGCTCAG GCCATAGGTA 8400 
AAGCATTCAT CTTACTTAGC AAAAAGAATG AACGATAAGA AAGAGGTTGA TGGCGAACAT 84 60 

TGGCCTCTTT TGATTTATAA AG ATTGG ATT CTTGTCGCCT CAATTTCAGA CTTTTCTATT 8520 
GTAAGCTAAT ATTTTATAGC CCATTAAAAG CATAAGCGGT AAT CT AATTT AAAAAATGCT 8580 
GTAATTAGTC TGAAGTCCAC ACTTACTTGT TGAGATGTTA TCTCTGTTTT TTATCGTTA? 8640 
AATTT ACTGT ATTTTTTATA GTATGCAGAA TATTTTTAAG TATATTTCAA TACAAATTTC .3700 
TATCGATTTA TTGTATAATG ATAAGTAATT GTTGAAAAGT ACT C AG AAAA TTCCATACTA 87 60 

TATTA' l TTTT ATGTTTATAC TTTTATGCTA T AAAAT AT AG ATTGATATAA AGAATATAGA 8820 
AAAAGCGAGG TTAATATGAG CCGAAAAAGC ATTGGTGAGA AACGCCATAG TTTCTCGATG 8880 
AGAAAGTTGT CAGTGGGATT GGTATCAGTT ACTGT ATCTA GTTTCTTTTT GATGAGTCAA 89 40 

GGGATTCAAT CGGTATCGGC CGATAATATG GAAAGTCCAA TTCATTATAA GTATATGACC 9000 

GAGGGTAAAT TG AC AGACGA GG AAAAATCC TTGCTGGTAG AGGCCCTTCC ACAACTGGCT " 9060 
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GAAGAATCAG 


ATGATACTTA 


TTACTTGGTT 


TATAGATCTC 


AACAGTTTTT 


ACCGAATACA 


9120 


GGTTTTAACC 


CAACTGTTGG 


TACTTTCCTT 


TTTACTGC AG 


G ATTG AG CTT 


GTTAGTTTTA 


9180, 


TTGGTTTCTA 


AAAGGGAAAA 


TGGAAAGAAA 


CGACTTGTTC 


ATTTTCTCCT 


GTTG ACTAG C 


9240 


ATGGGAGTTC 


AATTGTTGCC 


GGCCAGTGCT 


TTTGGGTTGA 


CCAGCCAGAT 


TTTATCTGCC 


9300 


TATAATAGTC 


AGCTTTCTAT 


CGGAGTCGGG 


G AACATTT AC 


CAGAGCCTCT 


GAAAATCGAA 


9360 


GGTTATCAAT 


ATATTGGTTA 


TATCAAAACT 


AAGAAACAGG 


ATAATACAGA 


GCTTTCAAGG 


9420 


ACAGTTGATG 


GGAAATACTC 


TGCTCAAAGA 


GATAGTCAAC 


CAAACTCTAC 


AAAAACATCA 


9480 


GATGTAGTTC 


ATTCAG CTG A 


TTTAGAATGG 


AACCAAGGAC 


AGGGGAAGGT 


TAGTTTACAA 


9540 


GGTGAAGCAT 


CAGGGGATGA 


TGGACTTTCA 


GAAAAATCTT 


CTATAGCAGC 


AGACAATCTA 


9600 


TCTTCTAATG 


ATTCATTCGC 


AAGTCAAGTT 


GAGCAGAATC 


CGGATCACAA 


AGGAGAATCT 


9660 


GTAGTTCGAC 


CAACAGTGCC 


AGAACAAGGA 


AATCCTGTGT 


CTGCTACAAC 


GGTGCAGAGT 


9720 


GCGGAAGAGG 


AAGTATTGGC 


GACGACAAAT 


CATCGACCAG 


AGT AT AAACT 


TCCATTGGAA 


9780 


ACCAAAGGCA 


CGCAAGAACC 


CGGTCATGAG 


GGTGAAGCCG 


CAGTCCGTGA 


AGACTTACCA 


9840 


GTCTACACTA 


AGC C ACT AG A 


AACCAAAGGT 


ACACAAGGAC 


CCGGACATGA 


AGGTGAAGCT 


9900 


GCAGTTCGCG 


AGGAAGAACC 


AGCTTACACA 


GAACCGTTAG 


CAACGAAAGG 


CACGCAAGAG 


9960 


CCAGGTCATG 


AGGGCAAAGC 


TACAGTCCGC 


GAAGAGACTC 


TAGAGTACAC 


GGAACCGGTA 


10020 


GCGACAAAAG 


GCACACAAGA 


ACCCGAACAT 


GAGGGCGAAg 


c GGCAGT AG A 


AGAAGAACTT 


10080 


CCGGCTTTAG 


AGGTCACTAC 


ACGAAATAGA 


ACGGAAATCC 


AGAATATTCC 


TTATACAACA 


10140 


GAAGAAATTC 


AGGATCCAAC 


ACTTCTGAAA 


AATCGTCGTA 


AGATTGAACC 


ACAAGGGCAA 


10200 


GCAGGGACAC 


GTACAATTCA 


ATATGAAGAC 


TACATCGTAA 


ATGGTAATGT 


CGTAGAAACT 


10260 


AAAGAAGTGT 


CACGAACTGA 


AGTAGCTCCG 


GTCAACGAAG 


TCGTTAAAGT 


AGGAACACTT 


10320 


GTGAAAGTTA 


AACCTACAGT 


AGAAATTACA 


AACTTAACAA 


AAGTTGAGAA 


CAAAAAATCT 


10380 


ATAACTGTAA 


GTTATAACTT 


AATAGACACT 


ACCTCAGCAT 


ATGTTTCTGC 


AAAAACGCAA 


10440 


GTTTTCCATG 


GAGACAAGCT 


AGTTAAAGAG 


GTGGATATAG 


AAAAT C CTGC 


CAAAGAGCAA 


10500 


GTAATATCAG 


GTTTAGATTA 


CTACACACCG 


TATACAGTTA 


AAACACACCT 


AACTTATAAT 


10560 


TTGGGTGAAA 


ATAATGAGGA 


AAATACTGAA 


ACATCAACTC 


AAGATTTCCA 


ATTAGAGTAT 


10620 


AAGAAAATAG 


AGATTAAAGA 


TATTGATTCA 


GTAGAATTAT 


ACGGTAAAGA 


AAATGATCGT 


10680 


TATCGTAGAT 


ATTTAAGTCT 


AAGTGAAGCG 


CCGACTGATA 


CGGCTAAATA 


CTTTGTAAAA 


10740 


GTGAAATCAG 


ATCGCTTCAA 


AGAAATGTAC 


CTACCTGTAA 


AATCTATTAC 


AGAAAATACG 


10800 
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GATGGAACGT ATAAAGTGAC GGTAGCCGTT GATCAACTTG TCGAAGAAGG TACAGACGCT 10860 

TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920 

ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980 

CCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040 

GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100 

TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160 

ACTGTTTCTG CTGATAGTAA AGAAAATGTC GCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220 

AATATTAATA ATGTTGCAGT AGAAGGAAAA ATCTCAGGTG CGAAATCTGT TGCGGGATTA 11280 

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCGT TTACAGGGAA ACTTATCGCA 11340 

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 114 00 

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 114 60 

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520 

GTTGCTACTG GAGAAATACG AAATGGTCAA GG AT ATT CT A GAGTCGGAGG AATAGTAGGA 115 80 

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640 

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700 

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760 

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAAACGTAAT 11820 

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880 

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940 

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000 

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060 

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAACAT TCAAAGAAAA CTTCATAAAC 12120 

AGTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGC AT TT 12180 

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACACTT 12240 

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12 300 

TACTTAGATA G AC AATTTG A AG AAGTT AAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12 3 60 

TTAGCGATGG ATAAATCAAT GAATACTACA GGAGACGGTG TAGTTGAATA CGTAAGTGAG 12420 

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCT T A CTTATATGAA CCGTTGGTAC 12 4 80 

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12540 

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12 600 
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AACCTGAGAG CTTCAAAT AC TGTAGGTTTA TATGCGAATA AACTTGCATC GGTAAAAGGA 12660 

GAAGATTCAG TCTTTGACTT CGTAGAAGCG TATAGAAAAC TGTTCTTACC AAACAAAACA 12720 

AATAACGAGT GGTTTAAAGA AAATACAAAG GC AT AT AT AG TCGAAATGAA GTCTCATATT 12780 

GCAGAAGTAC GAGAAAAACA AGAATCACCA ACAGCCGATA GAAAATATTC ATTAGGAGTT 12 840 

TACGATAGAA TATCAGCACC AAGTTGGGGG CATAAGAGTA TGTTATTACC ACTACTAACT 12 900 

TTACCTGAAG AATCTGTGTA TATTTCATCG AAT ATGTCT A CACTTGCATT C GGTTCGT AT 129 60 

GAAAGATATC GTGATAGTGT GGATGGAGTT ATTCTTTCAG G AG ATGCTTT ACGAACTTAT 13020 

GTAAGAAATA GAGTTGATAT AGCAGCGAAA AGGCATAGAG ACCATTATGA TATTTGGTAC 13080 

AATCTTCTTG ACAGTGCTTC AAAAGAAAAA CTTTTCCGTT C TGTG AT ACT TTATGATGGA 13140 

TTCAATGTAA AAGATGAGAC AGGAAGAACT TATTGGGCAA GGTTAACGGA TAAAAACATC 132 00 

GGCTCTATTA AAGAATTCTT CGGACCTGTT GGGAAATCGT ATGAGTATAA TAGTACTGCA 132 60 

GGAGCGTATG CGyAtGGAAG TTTAACGCAC TTTGTGTTAG ATAGATTATT AGATGCTTAT 13 3 20 

GGAACGTCGG TTTATACTCA TGAAATGGTT CATAATTCTG ATTCTGCAAT CTACTTTGAA 13 3 80 

GGAAATGGTA GACGTGAAGG ATTGGGAGCG GAGTTATACG CACTTGGTTT ACTGCAATCT 13440 

GTAGATAGTG TAAATTCTCA TATTTTAGCT TTAAATACGT TAT AT AAAGC AGAAAAAGAT 13 500 

GATTTGAATA GATTGCATAC ATATAATCCG GTGGAACGTT TCGATTCGGA TGAGGCGCTT 13 560 

CAAAGTTATA TGCATGGATC ATATGATGTA ATGT AT AC AC TTGATGCGAT GGAAGCAAAA 13 620 

GCGATATTAG CTCAAAATAA TGATGTTAAG AAAAAATGGT TTAGAAAAAT AGAAAATTAT 13 6 80 

TACGTTCGTG ATACTAGACA T AATAAAG AT ACACATGCAG GAAATAAAGT CCGTCCATTA 1374 0 

ACAGATGAAG AAGTAGCTAA CTTAACATCG TTAAACTCAT TAATCGACAA CGACATCATA 13 800 

AATAGACGTA GCTATGATGA TAGTAGAGAA TATAAACGAA ATGGCTACTA TACT AT AAGT 13 860 

ATGTTCTCTC CTGTATACGC AGCGCTAAGC AATTCGAAAG GTGCTCCTGG AGATATTATG 13 920 

TTTAGAAAAA TAGCTTATGA ATTACTTGCG GAAAAAGGTT ATC ACAAAGG ATTCCTACCT 13 980 

TATGTTTCTA ATCAGTACGG AGCAGAAG C A TTTGCCAGCG GAAGCAAAAC ATTCTCATCA 14040 

TGGCATGGAA GAGATGTTGC TTTAGTGACA GATGATTTAG TATTTAAGAA AGTATTCAAT 14100 

GGTGAGTACT CATCATGGGC TGATTTCAAA AAAGC AATGT TTAAACAACG TATAGATAAA 14160 

CAAGATAATC TGAAACCAAT AACAATTCAA TACGAATTAG GTAATCCTAA TAGTACAAAA 14 220 

GAAGTAACTA TAACAACGGC TGCACAAATG CAACAATTAA TTAATGAAGC GGCTGCGAAA 14 280 

GATATTACTA ATAT AG ATC G TGCAACGAGT CATACCCCAG CAAGTTGGGT GCATTTATTA 14 340 
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AAACAAAAAA TCTATAATGC ATATCTTCGC ACTACAGATG ACTTTAGAAA TTCTATATAT 14400 

AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT GTAAGGATGA CGACTCAGAT 144 60 

GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT TTGATTGAAT GCAGATTGCA 14 520 

GG AATCATCT TCAACTCATC AACGACC AAT GGTGACAAGG TGGATTTCAA TCCCACAGAA 14 580 

AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT AAATACAAAA CAATCCTAGA 14 640 

AG ATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT CAAGTTCTCT GGCTTGACTT 14 700 

TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 147 36 



(2) INFORMATION FOR SEQ ID NO: 172: 

til SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 11770 base pairs 
(B) TYPE: nucleic acid 
fC) STRAND EDN ESS : double 
(D> TOPOLOGY : linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60 

TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120 

AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180 

GACTTTTAGA GAGATTCT AG GTGAGTGAAA AT GAT AG A CT GATTCAGTTT ATCGTTTTTC 24 0 

TTTTTAGTTG ATTGCACATT TGTGCTTATA TAAACAAAAA TACTTTATCT GTTGTTTTTG 300 

GATTGACAAC TTTATTATGT ACTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360 

AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420 

GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATC CGAAACAACA TATTTATAAA 480 

GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540 

GATGCAGCCT T ATT AGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCGTCT 600 

GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660 

ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720 

TAATACTAGA GT ATT AT CAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780 

GGTTTGTTTT GTGCAGCAGG TTTTTCTACT GGTATGCTTG TAAATAATAT GAAAATTGCA 84 0 

GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTT CTCAGT CTAA ATT AGCGGAT 900 

TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TT GCTT AT AC ATT AGAT AAA 960 

TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCG ATAGCTG TTATTCCCAT GATGGACTAT 1020 
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GGTATGTTAG 


ATGGGAAAAA 


AGTATTAGAT 


TTGGCCCTAT 


CTTTGATTAG 


TGGGTAAGAA 


1080 


AAGGAGATTT 


ATTATGTCAA 


AGATGGATGT 


TCAGAAAATC 


ATTGCACCGA 


TGATGAAGTT 


1140 


TGTGAATATG 


CGTGGCATTA 


TAGCTCTAAA 


AGATGGGATG 


TTAGCAATTT 


TGCCATTGAC 


1200 


AGTAGTTGGT 


AGTTTGTTCT 


TGATTATGGG 


ACAATTGCCG 


TTCGAAGGAT 


TAAATAAGAG 


1260 


CATTGCTAGT 


GTTTTTGGAG 


CTAATTGGAC 


AGAGCCGTTT 


ATGCAAGTAT 


ATTCAGGAAC 


1320 


TTTTGCTATT 


ATGGGTCTAA 


TTTCTTGTTT 


TTCAATTGCC 


TATTCTTATG 


CTAAGAATAG 


1380 


CGGAGTAGAG 


GCTTTACCAG 


CTGGAGTTCT 


ATCTGTATCT 


GCATTCTTTA 


TTTTGCTAAG 


1440 


ATCATCTTAT 


ATCCCTAAAC 


AAGGTGAGGC 


GATTGGGGAC 


GCTATTAGTA 


AAGTTTGGTT 


1500 


TGGAGGCCAA 


GGAATTATCG 


GTGCTATCAT 


TATAGGTTTG 


GTAGTAGGAA 


GTATTTATAC 


1560 


CTTCTTTATA 


AAGAGAAAAA 


TTGTTATTAA 


GATGCCAGAA 


CAAGTTCCAC 


AAGCTATTGC 


1620 


CAAACAGTTT 


GAAGCAATGA 


TTCCAGCATT 


TGTAATTTTC 


TTATCTTCTA 


TGATTGTATA 


1680 


TATTTTAGCG 


AAGTCATTGA 


CTAATGGCGG 


AACATTCATA 


GAAATGATTT 


ATTCTGCTAT 


1740 


TCAAGTTCCG 


TTGCAAGGTT 


TAACTGGATC 


TTTGTATGGT 


GCTATTGGAA 


TTGCATTCTT 


1800 


TATATCATTT 


TTGTGGTGGT 


TTGGTGTTCA 


TGGGCAATCG 


GTAGTAAATG 


GAGTAGTGAC 


1860 


AGCTCTGCTT 


TTATCTAATC 


TTGATGCTAA 


TAAAGCTATG 


TTAGCCTCTG 


CTAATCTATC 


1920 


ATTAGAAAAT 


GGTGCACATA 


TTGTTACTCA 


ACAATTTTTA 


GATTCATTTT 


TAATTCTATC 


1980 


AGGTTCAGGG 


ATTACGTTTG 


GTCTTGTAGT 


TGCCATGCTT 


TTTGCAGCAA 


AATCAAAACA 


2040 


AT AC CAAGC C 


TTAGGAAAAG 


TTGCAGCTTT 


TCCAGCAATA 


TTTAACGTAA 


ATGAGCCAGT 


2100 


TGTATTTGGA 


TTTCCGATTG 


TCATGAATCC 


AGTTATGTTT 


GTACCTTTCA 


TTCTTGTTCC 


2160 


TGTACTTGCA 


GCTGTGATAG 


TATATGGAGC 


TATTGCAACA 


GGTTTCATGC 


AGCCATTCTC 


2220 


AGGGGTAACA 


TTGCCTTGGA 


GTACACCAGC 


TATTTTATCA 


GGATTT TTGG 


TGGGTGGATG 


2280 


GCAAGGAGTT 


ATTACTCAGC 


lljul VjA IA1 1 


AvUun 1 u 1^-1 


f\ X X UU 1 * * 


ATTTTCCATT 


2340 


CTTTAAAGTA 


CAGGATCGTT 


TAGCTTACCA 


AAATGAAATC 


AAACAATCTT 


AGAGGTATTT 


2400 


GTGTGTTACT 


GTTAAACTCA 


CACATTTGTG 


CTAAAAATTA 


GAGAGTTAAA 


ATTTTTCTAG 


2460 


TTAAAAGCTT 


GAAAATTTCT 


ATAAAAATCG 


GTATTATATT 


TTCGAAAGAA 


ATAAAAATAT 


2520 


TTTCGAAAGA 


AAGGTGCTTA 


CGATGGTAAA 


TACAGAAGTA 


GCAAGAACAA 


CAATCAAGAC 


2580 


AGAATATTTT 


GGCAGCCTTA 


CTGAAAGGAT 


GAACAAATAT 


CGAGAAGATG 


TTTTAAATAA 


2640 


AAAACCTTAT 


ATTGATGCTG 


AGAGAGCAGT 


TCTAGCAACA 


CGCGCCTATG 


AACGATACAA 


2700 


GGAACAACCT 


AATGTCCTAA 


AACGTGCATA 


TATGCTGAAA 


GAAATTTTGG 


AAAAT ATG AC 


2760 
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TATCTATATT GAAGAAGAAT CTATGATTGC GGGAAATCAA CCTTCTTCCA ATAAAGATGC 2820 

TCCTATTTTT CCGG AAT ATA CGCTAGAATT TGTTCTCAAT GAGTTGGATC TTTTTGAAAA 2880 

GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940 

TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGAGCTGGT GCCTTATTAC CTGAAGAAGT 3000 

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGKTGAATT CTGGAGATGC 3060 

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 312 0 

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180 

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AG CGC TTT GT 3240 

TGCTCTTGCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3 300 

GATTGCAGAT ATTTGCTCTA GAGTCCCATA TGAACCGGCA ACT ACT TTTG CAGAAGCTAT 3 360 

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 342 0 

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 34 80 

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGCATTAAGA CAATTACAAT 354 0 

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTCAGCA GGAAGTCCTT TATATCAAAA 3 600 

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3 660 

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AAT CT AACTG TACGTTACCA 3720 

TGC AG GTTT A GATGCTCGTT TCATGAATGA GTGTATTGAA GTGATGAAAC TTG CT TT TC G 37 80 

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840 

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3 900 

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 39 60 

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020 

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080 

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAG AAGTTC C 414 0 

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 42 00 

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 42 60 

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 432 0 

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 43 80 

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT CCTGACAAAT TGGTTACTGC 4440 

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AAT ACACGTT ATGGAAGAGG 4500 

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4 560 
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ACGTGGAACA 


TTAGCAACTC 


CAGATGGACG 


CAACGCGGGT 


ACACCGTTAG 


CAGACGGTTG 


4620 


TTCACCATCA 


CATAATATGG 


ATCAACACGG 


CCCTACATCT 


GTTTTAAAAT 


CTGTTTCAAA 


4680 


ATTACCAACA 


GATGAAATCG 


TAGGTGGGGT 


TCTCTTAAAT 


CACAAAGTAA 


ATCCTCAAAC 


4740 


GTTAGCCAAA 


GAAGAAGATA 


AATTAAAACT 


AATTGCTTTG 


TTACGAACAT 


TCTTTAATCG 


4800 


TTTACATGGG 


TACCATATTC 


AATACAATGT 


TGTTTCCAGA 


GAGACGCTGA 


TTGACGCTCA 


4860 


GAAACATCCT 


GAAAAACACA 


GAGACTTAAT 


TGTTCGTGTT 


GCAGGATACT 


CTGCATTCTT 


4920 


CAATGTTCTT 


TCTAAGGCAA 


CCCAAGATGA 


CATTATAGGA 


CGTACTGAGC 


ATACTTTGTA 


4980 


AAATAAAGAG 


GTTCTTTTTA 


TGGAATTTAT 


GCTTGACACA 


TTAAATTTAG 


ATGAGATTAA 


5040 


AAAGTGGTCT 


GAAATTTTGC 


CGCTAGCTGG 


GGTAACTTCA 


AATCCCACTA 


TTGCAAAAAG 


5100 


AGAGGGTTCT 


ATTAATTTTT 


TTGAACGAAT 


CAAAGATGTA 


AGAGAATTGA 


TTGGCTCTAC 


5160 


ACCCTCTATT 


CATGTTCAGG 


TGATTTCTCA 


AGATTTTGAA 


GGCATCTTAA 


AGGATGCTCA 


5220 


TAAAATTCGA 


AGACAAG CAG 


GAGATGATAT 


AT TT AT C AAA 


GTACCTGTTA 


CTCCAGCTGG 


5280 


ATTACGTGCA 


AT AAAGG CGC 


TAAAAAAAGA 


GGGCTACCAT 


ATCACTGCAA 


CAGCTATTTA 


5340 


TACAGTTATT 


CAGGGATTAT 


TAGCTATCGA 


AGCAGGAGCG 


GATTACCTAG 


CTCCATATTA 


5400 


TAATAGAATG 


GAAAATCTGA 


ACATTGATTC 


AAATTCTGTC 


ATTCGTCAAT 


TAGCTCTTGC 


5460 


TATTGATAGA 


CAGAACTCTC 


CTAGTAAGAT 


TTTAGCTGCA 


TCCTTTAAAA 


ATGTAGCACA 


5520 


AGTAAATAAT 


GCTTTAGCTG 


CAGGTGCGCA 


TGCTCTTACA 


GCAGGAGCGG 


ATGTTTTTGA 


5580 


ATCAGCTTTC 


GCCATGCCAT 


CTATCCAAAA 


GGCGGTTGAT 


GATTTTTCTG 


ACGATTGGTT 


5640 


TGTTATTCAA 


AATAGTCGTT 


CCATTTAGAT 


AGAGAGGAAA 


TACATATGAG 


AATTTTTGCT 


5700 


AGTCCTTCTA 


GATATATTCA 


GGGGGAAAAT 


GCCTTGTTTG 


AAAATGCCAA 


ATCAATTTTG 


5760 


GATTTGGGAA 


ATTGCCCTAT 


TCTATTATGC 


GATCAGTTGG 


TTTATGATAT 


TGTTGGAAAA 


5820 


K-\3I^ 111 uAAO 




T AGGT ATGGT 


TTCCATATTG 


TTCTGGCGCT 


ATTTAATGGT 


5880 


GAAGCTTCTG 


ACAATGAAAT 


CAATCGAGTT 


GTTGCCTTGG 


CTGAGAAAGA 


AAATTGTGAT 


5940 


AGTATTATCG 


GTCTTGGTGG 


GGGAAAGACG 


ATTGATAGCG 


CAAAAGCTAT 


TGCAGATTTG 


6000 


ATTGAAAAGC 


CTGTTATTAT 


TGCTCCAACA 


ATTGCATCGA 


CCGACGCACC 


TGTATCTGCT 


6060 


TTATCTGTTA 


TTTATACAGA 


TGAAGGTGCA 


TTTGATCATT 


ATCTATTTTA 


TTCTAAAAAT 


6120 


CCAGATTTAG 


TTTTGGTTGA 


TACAAAAGTT 


ATTTCACAAG 


CCCCTAAGCG 


TTTATTAGCG 


6180 


TCTGGTATTG 


C AG ATGGTTT 


AGCAACTTGG 


GTTGAGGCGC 


GTGCGGTTAT 


GCAGGCAAAT 


6240 


GGAAAAACTA 


TGTTGGGACA 


ACAGCAAACA 


TTGGCTGGAG 


TTGCAATTGC 


GAAGAAATGT 


6300 
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GAAGAAACGC 


TGTTTGCAGA 


TGGTTT AC AG 


GCTATGGCAG 


CTTGTGAAGC 


TAAAGTGGTG 


6360 


AC ACCAGC AT 


TAGAAAATAT 


TGTTGAAGCT 


AATACTTTAT 


TGAGTGGTCT 


AGGTTTTGAA 


6420 


AGTGGAGGAT 


TAGCTGCGGC 


GCATGCAATT 


CATAATGGTT 


TTACTGCATT 


GACAGGTGAC 


6480 


ATTCATCATT 


TAACACATGG 


TGAAAAAGTA 


GCTTATGGAA 


CTTTAGTACA 


ACTATTATTG 


6540 


GAAAATAGAC 


CTAAAGAAGA 


ACTTGATAAG 


TATATTGAGT 


TTTACAAAAA 


AATTGGTATG 


6600 


CCAACAACTC 


TAAAAGAAAT 


GCATTTGGAT 


CAAGTTGGAT 


ATGATGATTT 


AATAAAAGTT 


6660 


GGTAAACAAG 


CAACTATGGA 


GGGTGAGACA 


ATTCATCAGA 


TGCCCTTTAA 


GATTTCGCCT 


6720 


TCAGATGTTG 


CTCAAGCTAT 


TATCGCTGTA 


GATGCCTATG 


TAAATTCAAA 


ATAAACAATA 


6780 


AGGACTACTG 


TTTTCCAAAT 


GGTAGTCTTT 


TATTGATCCC 


TGTATTGAAT 


TCTATAGAAG 


6840 


ATTGAAATAG 


GATGAGAACA 


AATCGATTGG 


GAAAGTAAAA 


TTAATTTCTA 


TAAATGTTTT 


6900 


AGCAATTGTT 


TCGTACTATT 


TCAGATTCAG 


TCTACTATAT 


GTTCTTCATA 


AATCAAAAAG 


6960 


CGACATAGGT 


TGTCGGCTAT 


TTATTGTGAA 


TACATTAATT 


AGCATTCCAG 


TTTTATCTTC 


7020 


GGTCTAAAAT 


AAGTATTTTG 


TGCTATACGA 


GATAAGCTTC 


TTGACTTACT 


CCTTGATTTA 


7080 


CTGCATAACA 


ATGGGATAAA 


AAGTGGGAGA 


TAGAGCAATT 


CATAGTCATC 


AAAATTAATG 


7140 


AGATACAGTA 


TACAGTTTTT 


CCTTTAAACA 


CATTTCAAAT 


TCCCTCAAAA 


ATGGTATAAT 


7200 


AGTAACATCA 


CAAAATTGGA 


GAGAGACCAT 


GAGTTTTTAC 


AATCATAAAG 


AAATTGAGCC 


7260 


TAAGTGGCAG 


GGCTACTGGG 


CAGAACATCA 


TACATTTAAG 


ACAGGAACAG 


ATACATCAAA 


7320 


ACCTAAGTTT 


TATGCGCTTG 


ATATGTTCCC 


TTATCCGTCT 


GGAGCTGGTC 


TGCACGTAGG 


7380 


ACACCCAGAA 


GGTTATACTG 


CAACCGATAT 


CCTCAGTCGT 


TACAAACGTG 


CGCAAGGCTA 


7440 


CAATGTCCTT 


CACCCAATGG 


GTTGGGATGC 


TTTTGGTTTG 


CCTGCAGAGC 


AATACGCTAT 


7500 


GGATACTGGT 


AATGACCCAG 


CAGAATTTAC 


AGCGGAAAAC 


ATTGCCAACT 


TCAAACGTCA 


7560 


AATTAATGCG 


CTTGGATTTT 


CTTATGACTG 


GGATCGTGAA 


GTCAACACAA 


CAGATCCAAA 


762Q 


CT ACT ACAAG 


TGGACTCAAT 


GG ATT TTC AC 


CAAGCTTTAC 


GAAAAAGGCT 


TGGCCTATC-A 


7680 


AGCTGAAGTG 


CCAGTAAACT 


GGGTTGAGGA 


ATTGGGAACT 


GCCATTGCCA 


ATGAAGAAGT 


7740 


GCTTCCTGAC 


GGAACTTCTG 


AGCGTGGAGG 


CTATCCAGTT 


GTCCGCAAAC 


CAATGCGCCA 


7800 


ATGGATGCTC 


AAAATCACGG 


CTTACGCAGA 


GCGCTTGCTC 


AATGACTTAG 


ATGAACTAGA 


7860 


TTGGTCAGAG 


TCTATCAAGG 


ATATGCAACG 


CAACTGGATT 


GGTAAATCAA 


CTGGTGCCAA 


7920 


TGTAACTTTC 


AAAGTAAAAG 


GAACAGACAA GGAATTTACA 


GTCTTTACTA 


CTCGTCCGGA 


7980 


CACACTTTTC 


: GGTGCGACTT 


TCACTGTCTT 


' GGCTCCTGAA CATGAATTAG 


i TAGACGCTAT 


8040 


CACAAGTTCA GAGCAAGCAG 


! AAGCTGTAGC AGACT AT AAA CACCAAGCCA GCCTTAAGTC 


8100 
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TGACTTGGCT CGTACAGACC TTGCTAAACA AAAAACAGGG GTTTGGACTG GTGCTTATGC 8160 

CATCAACCCT GTCAATGGTA AGGAAATGCC AATCTGGATT GCAGACTATC TCCTTGCTAG 8 2 20 

TTATGGAACA GGTGCGGTTA TGGCTGTGCC TGCCCACGAC CAACGTGACT GGGAATTTGC 8 280 

CAAACAATTT GACCTTCCAA TCGTCGAAGT ACTTGAAGGT GGAAATGTCG AAGAAGCTGC 8 34 0 

CTACACAGAG GATGGCCTGC ATGTCAATTC AGACTTCCTA GATGGATTGA ACAAAGAAGA 84 00 

CGCTATTGCC AAGATTGTGG CTTGGTTGGA AGAAAAAGGC TGTGGTCAGG AGAAGGTTAC 84 60 

CTACCGTCTC CGCGACTGGC TCTTTAGCCG TCAACGTTAC TGGGGTGAGC CAATTCCAAT 8 520 

CATTCATTGG GAAGATGGAA CTTCAACAGC TGTTCCTGAA ACTGAATTGC CGCTTGTCTT 8580 

GCCTGTAACC AAGGATATCC GTCCTTCAGG TACTGGTGAA AGTCCACTAG CTAACTTGAC 864 0 

AGATTGGCTT GAAGTGACTC GTGAAGATGG TGTCAAAGGT CGTCGTGAAA CCAACACTAT 87 00 

GCCACAATGG GCTGGTTCAA GCTGGTACTA CCTCCGCTAT ATTGACCCGC ACAATACTGA 8760 

GAAATTGGCT GATGAGGACC TCCTCAAACA ATGGTTGCCA GTAGATATCT A CGTGGGTGG 882 0 

TGCGCAACAT GCTGT ACTTC ACTTGCTTTA TGCTCGTTTC TGGCATAAAT TCCTCTATGA 8880 

CCTCGGTGTT GTTCCGACTA AGGAACCATT CCAAAAACTC TTTAACCAAG GGATGATTTT 894 0 

GGGAACAAGC TACCGTGACC ACCGTGGTGC TCTTGTGGCA ACCGACAAGG TTGAAAAACG 9000 

TGATGGTTCC TTCTTCCATG TAGAAACAGG GGAAGAGTTG GAGCAAGCGC CAGCCAAGAT 9060 

GTCTAAATCG CTCAAGAACG TTGTTAACCC AGACGATCTG GTGGAACAAT ACGG TGCCG A 9120 

TACCCTTCGT GTTTATGAAA TGTTTATGGG ACCACTCGAT GCTTCGATTG CTTGGTCAGA 9180 

AGAAGGTTTG GAAGGAAGCC GTAAGTTCCT TGACCGAGTT TACCGTTTGA TTACAAGTAA 9240 

AGAAATCCTT GCGGAAAACA ATGGTGCTCT TGACAAGGTT TACAACGAAA CAGTCAAAGC 93 00 

TGTTACTGAG CAAATTGAGT CTCTCAAATT CAACACAGCT ATTGCCCAAC TTATGGTCTT 9 3 60 

TGTCAATGCT GCTAACAAGG AAGATAAGCT TTATGTTGAC TATGCCAAAG GCTTTATTCA 94 20 

ATTGATTGCA CCATTTGCAC CTCACTTGGC AG AAGAACT C TGGCAAACAG TCGCAGAAAC 9480 

AGGTGAGTCA ATCTCTTATG TAGCTTGGCC AACTTGGGAC GAAAGCAAAT TGGTTGAAGA 9 54 0 

TGAAATTGAA ATTGTCGTCC AAATCAAAGG AAAAGTTCGT GCCAAACTCA TGGTTGCTAA 9600 

AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 9660 

AATTGACGGT AAGGAAATC G TGAAAGTAAT TGCGGTACCG AATAAACTCG TTAATATCGT 972 0 

CGTTAAATAA CGAGTTTATT AGCT CTATCT GCCACCTTCA ATAGTCCACT GG ACT ATT GA 97 80 

AsCCAACTAA ATTAGTTAAC ATTGTTGTGA AATAAGATAG GAGTCCTTCA GAGTAGAATC 9840 
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TGGAGGATTT TTTGAATCTT CTTATGAAAG TATGATATAC TATGGGCAAC TAT AAAGTTT 9 900 

GAAAAGTGAA ATAAGGAGAA TAAGATGCCA GTAAATGAAT ATGGTCAAAT GATTGGGGAG 99 60 

TCAATGGAAG CTTATACTCC AGGTGAATTG CCTTCTTTTG ATTTCTTAGA AGGGCGTTAT 10020 

GCTAGGATAG AGGCTCTTTC AGTGGAAAAG CATGCGGAGG ATTTATTAGC TGTTTATGGC 10080 

CCTGATACGC CTCGGGAGAT GTGGACCTAC CTCTTTCAGG AGTCAGTAGC AGACATGGAG 10140 

GAACTGGTCA GCCTTTTAAA TCAGATGTTG GCTCGTAAGG AC CGTTTTT A TTATGCAATC 10200 

ATAGACAAGG CAACTGGTAA GGCTTTGGGA ACTTTTTCCC TCATGCGAAT TGATCAGAAT 102 60 

AACCGAGTAA TAGAAGTGGG AGCTGTCACT TTTTCTC C AG ACCTCAGGGG GACACGGATA 10320 

GGAACAGAAG CCCAGTATCT CTTGGCTTGC TATGTCTTTG AGGAGCTTAA CTATCGTCGC 10380 

TATGAGTGGA AATGCGATGC TCTTAACCTG CCATCCAGAC GAGCAGCGGA ACGTTTGGGA 10440 

TTTATTTATG AAGGAACCTT CCGTCAGGCA GTGGTTTATA AGGGGCGTAC AAGAGATACG 10500 

GATTGGTTGT CTATGATTGA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAAT ATGG 10560 

TTGCGTCCTG AAAACTTTGA TAAAAATGGA CGACAGCACA AG AGCTTGAG AGAACTTTAA 10620 

GAGGTGTTGA GATGATTACT ATTAAAAAGC AAGAAATTGT CAAGCTAGAG GATGTTTTGC 10680 

ATCTCTATCA GGC TGTCGGT TGGACAAACT AT ACC CATC A AACAGAGATG CTGGAGCAGG 1074 0 

CCTTATCTCA TTCATTAGTA ATTTATCTGG CACTTGATGG TGATGCTGTG GTGGGCTTGA 10800 

TTCGTTTGGT TGGAGATGGT TTTTCATCAG TTTTTGTACA GGATTTGATT GTTTTGCCTA 10860 

GCTATCAGCG TCAAGGGATT GGTAGCTCCT TGATGAAAGA GGCTTTAGGA AATTTTAAAG 109 20 

AGGCCTATCA AGTCCAGCTG GCGACAGAAG AGACAGAAAA AAACGTGGGA TTTTATCGTT 10980 

CTATGGGCTT TGAAATCTTA TCCACCTATG ACTGTACAGG AATGATTTGG ATAAACAGAG 11040 

AAAAATAAAA AAA C TT G TTT GTTCTTAAGC AAAGTTT AAG GATGGTCTAG TATCATATAG 11100 

TCATTAAATA AAGACCTCCT AACTTTATTT AATAAAATCC TAAACTTTTT TCATCACAAT 11160 

CTCCTAATGA AGCCACCCAA TCAGGTGGCT TTTTTGCGGT AC G ACGGGC A TGTCGTATAT 11220 

CTGAGGTGTA AGTCCTCAGC CTGACTATCG TGAGGTAGCA GGGAGAGGAA GGGATAGCGA 11280 

AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT AAGGAGGCTT 11340 

CAAACTCTAA AGTCCAAAAA GGTAGTCGTA ACCTATATGT GTAAATCACG AGAGTAATTG 114 00 

AATTCGGACT AAGGTTTGTG TGAAAAAGAT AAATCTTTCT AG AGTCT AAA GACTCTGCGT 114 60 

CAGATTTCCT ATTTTCACTG TAACCTTTTA ACGTCCTCAT ATCTTGTATA AACGAGGAAA 11520 

GATGTACGAC TTATCCCGTG AGGTTTCATG AGCGCTGAAA GCGTAGTAAC AACGAATCAT 11580 

GAGAAGTCAG CCGAGCCCAT AGTAGTGAGG AAACTTCCGT AATGGAAGTG GAGCGAAGGG 11640 
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GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC 11700 

GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 117 60 

G ACTTTGAGG 11770 

(2i INFORMATION FOR SEQ ID NO: 173: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4185 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGCGAAACTA 


CTTTCTTAGT 


ATAACACTTT 


CAGAATCATT 


GTCAATAGAA 


ATGACTTGAT 


60 


TTTTTCAATT 


TTTTCAAGCT 


ATTTCCAAGG 


GTTGTAAAAT 


CGTCCCTGAT 


TCTGCAAGAT 


120 


AAGTAGTAAA 


CTAACTACTA 


AAAACAAGGT 


TGCCAAGAGC 


AAGGTAATAT 


AGTCTCCTTT 


180 


TTTCAAGGCC 


TGATAACTAT 


ACCATGTGCG 


TTTTTTCTCT 


TTCCCAAAGC 


GGCGAACTCC 


240 


ATGGCAGTCG 


CAATGGTATC 


AATGCG TTCT 


AGCGAGCTAA 


AAATCAAGGG 


CGTAATAATG 


300 


AGCAGATTGC 


CTTTGATTCG 


TTGCATAAGA 


GAAGCTTTCT 


TGGATAATTC 


CATCCCACGC 


360 


GCCTCCTGAG 


ACATCTTGAT 


AGTAAAGAAT 


TCTTCCTGCA 


AATCTGGAAT 


ATAGCGCAAG 


420 


GTCAGGCTGA 


CAGAATAAGC 


AATCTTATAG 


GGCACACCAA 


TTTGATTTAA 


ACTGGAAGCA 


430 


AACTGACTAG 


GATGGGTTGT 


CATCAAAAAG 


ATAATAGCCA 


GAGGAATGGT 


GCAAAGATAC 


540 


TTAATGGCCA 


AATTTAGCAG 


ATAAAAGAGC 


TCCTGGCTGC 


TTAGAGTGTA 


GACACCGATT 


600 


CCCTGCCAAA 


TCACACTTCT 


CTCTCCATAA 


AGTCCAACCC 


CATACTCCGC 


AGAAAAGAGA 


660 


TAGACCATCA 


AAACGTTTAA 


AACGGCAAAT 


ATCGTCGCAA 


AAACGGCTAC 


AAAGGAAACA 


720 


TCTTTAAAGC 


GAATTTCTGA 


TAAATAGAGG 


AGAAAGACTG 


AAAAGATGGC 


AATCAGCAAG 


780 


AGCATTCTGG 


TATCATAGCT 


AATCATGGCC 


GCCAATGATA 


CCAGAATGAA 


AAAGAGAAGT 


840 


TTCCCAGCTC 


CTGACAAGCG 


AT G AATCACA 


GTATCTCTAT 


GCTGGTAACC 


GATTAATTTA 


900 


GCTTGCATCC 


CTCTCTCCTT 


TCTTTGTAAA 


ATGCCGTTAA 


ATCC AGTGG A 


TCCACATCTA 


960 


GTTTCTTAGC 


CAAGTTAAAG 


ATGG AGGTT T 


CTTTTAGATT 


GGCTTTTACT 


AACAGCTCAG 


1020 


GATCGCTCAA 


CAGACTGGCT 


GGAACAGTAT 


CGGCAATCAA 


TTCTCCATCC 


ACCATGACAA 


1080 


GGACCCGGTC 


TGAATAATCC 


AGCATCAATT 


GCATATCATG 


GGTAATCATG 


ACAATGGTAT 


1140 


GCCCTTTTTG 


ATGTAACTCT 


TCGAGAAATT 


CCATAATCTC 


AGTATAGTTC 


TTCTGATCTT 


1200 
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GACCTGCAGT CGGTTCATCT AGGAGAATAA TTTCAGCTCC T AAG AC C AAA ATTGAAGCAA 1260 

TGGTGACACG TTTTTTCTGA CCAAATGACA GGGCAGAAAT AGGCCAATTA CGGAATTCAT 1320 

AAAGTCCACA GATTTTCAAG GTTTCATATA CTCTCGTTTC AATTTCCTTC TCATCCACAC 1380 

CTCGCAAACG GAGCCCTAGA GCCACCTCAT C AAAAAT C AT ATTGGTTGAA ATCATTTGAT 1440 

TAGGATTTTG TAGCACATAT CCTACTCGTT CCGCCCGCTC TGCAACAGAA TCGCCTTTTA 1500 

TATCCTGTTT TTCCCAAAGA TAGCGTCCTT CCGTCTGAAT AAAGCTACTT ATAGCCTTGG 1560 

CTAGAGTTGA TTTCCCTGCT CCATTTTTTC CGACAATAGC AATCTTTTCA CCCTTTTTAA 1620 

TATCTAAATG TAGGGATTTT AAAATCGGTC TATCATCATA AGAAAAAGAT ACTTCCTCTA 1680 

GTCTAAAGAG TGACTGCAAT GCTGGGGTTT CTTTTGCCAG TTCATTCTGC AACTGAACCT 1740 

GACCTTTTGA GAT AG AGAAG TTATCCAGAT TCGCTAATTG TTCTTCCTTG ACTAAGTCCA 1800 

CACCTAATTG ACGGAG ACT C GTTAGATAAA GGGGTTCTCG AATTCCATTT TGAGTCAATA 1860 

AATCAGTCGC AAGCAACTGG TCAGGGCTCC CATTAAAAAG GATACGACCA TCGTTTATCA 1920 

AGACAATCCG AT CCACAGGG CGATGCAGAA CGTCCTCCAA ACGGTGCTCG ATAATAAGAG 1980 

TCGTCGTCCC CTCTTCCTTA TGAATCTGGT CAATCAATTC GATAATATCC TGACCTGACT 2040 

TGGG ATCT AG ATTGGCGAGT GGCTCATCAA ACAAGAGAAT CGGACTTTCA TCAATCAAGA 2100 

CACCAGCCAG ACTGACTCGC TGCTTTTGTC CACCTGACAA ATCCTGAGGA CGCTGATCCA 2160 

GTAAAGGAAG AAGGTCCAGC TTTTCAGCCC ATTTATAAAC ACGAC C TTTC ATCTCATCTA 2220 

GGGCTGTCAC ATCATTTTCC AGAGCAAACG CCAAATCTTC TGCCACAGAC AAGCCAATAA 2280 

ACTGCCCATC TGTATCCTGC AAAACTGTGC TAACCAGATG AGACTTATCA TAGATGCTCA 2340 

TATCAAAGGC TACTTGACCC TTTATCAAAA ATTCTCCATA TGTCTGACCC TTGTAAATAT 2400 

TGGGAATAAT CCCATTCAAA CACTGACCCA AGGTAGATTT ACCTGACCCA GATGGTCCAA 24 60 

CAATTAAGAC TTTCTCTCCC TTGTAAATGG T C AACTCT AT CCCTTGCAAG G TCGGTTCTT 2 520 

GTTGTGTTTC ATACCGGAAA GAGAAATCCT TCCACTCAAT T a T AGCTTCT TTCATCT1 AC 2 580 

TCTCTTCATT CGCTTCTTAG ACTTCTATTT TATCATAAAT CAAGCCCTTC TTGCAGTCTC 2 640 

TCCTCTTAAA ATCTT AGCG C CAAAAAGATT CCTATCCTAG CTTACTTGCC TAACTAATCT 2700 

ATAAACATCG AAAAAGACTA GTTGCC CAGC CTTCCCCATC ATTTTATACT CTTCGAAAAT 2760 

CTCTTCAAAC CACGTCAGcT TCGCCTTGCC GTAGGTATGG TTACTGACTt CGTCAGTTTC 2820 

ATCTACAACC TCAAAACCAT GTTTTGAGCc TGCTTCGTCA GTTCTATCCA CAATCTCAAA 2880 

ACACTGTTTT GAGCAACtGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT 2940 

TAGTCCTTTT TCAAACTTCC TGCACGAGTT TGGGTTCCTG CATAGGCAAG TAAGAGAAGA 3000 
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GTTCCTGCAA 


TAGCTACAGA 


TACACCATTG 


GCAATTCCCG 


CAACAATCCC 


TTGTGCAAAT 


3060 


ACTTTTTCTG 


CCGCTTCTTG 


ATAAATCACA 


ACATCTCCAA 


GTGGTCCCAA 


GACACCCCAA 


3120 


A C AAGGG CAT 


TTCCAAGTAC 


TTGAATGAGA 


TTAAAAATAA 


GAATATCTTT 


CCAGTCAAAA 


3L80 


ACACCATTGA 


TCACGCGAAC 


GTACTTTCTA 


AAAAGTCCCA 


CAACTAAACC 


AAAGAGTCCG 


3240 


CTAGCGATAA 


TCCAAGTCCA 


CCATAGACCA 


TAACCAACAA 


GAGAGTCCTT 


GATTGCATGA 


3300 


CCAATCAACC 


CGACAAGCAA 


ACCGATAATC 


GGTCCAAAAA 


TAATAGAAAG 


TAGCGCTTGT 


3360 


ACCGCATACT 


GAAGCTGGAT 


GCTTGTATTT 


GGAACAGGGG 


TTGGAATGTT 


GATCATCCCG 


3420 


ATGACGACAA 


AGAGGGCAGC 


GCCAATTCCG 


ACAGCAACAA 


CrrGTTTAAT 


TGTAAATTTG 


3480 


ATTTCCATAC 


TATTCTCCTA 


TTTTATCCTT 


CTATTTTCTT 


TATTTC AATG 


GTCCAAGATG 


3540 


AACCGACACC 


TACATTATAG 


GCCTTGGCAA 


AGGAACCTTG 


GTTGATAGCC 


AAACCTAAAC 


3600 


GAT AG AG AG A 


GTTGATGTAA 


AGGATGGGTT 


GCCCAATTCT 


CACATCTGCA 


AATG ATTTG C 


3660 


CATAGACAAC 


CTGATTTTGA 


TAGACCAGCA 


TATCAGCATG 


AT AG ATGGT C 


ACTTCAAAAC 


3720 


GAT CACCAAA 


TTCTGGTTCC 


AGCTTGTAAA 


ATTCTTCCCC 


TGTGATAGAG 


GTCCAAAGCG 


3780 


AACCGAAACG 


CACATCCAGA 


AT AT CAATGG 


CTCCCTTCAC 


CAGATGATCT 


TCTATGATGG 


3840 


TCGCTACGAC 


TGGAAGCTCT 


ACAAT CTGTT 


CCACACTGAG 


CTCTGGCCCT 


ACTTCCTCAA 


3900 


AAGTAATGTG 


ACCACTGGCC 


AGTTTAGCAC 


CAGTATAGGC 


ATAGACATCA 


CGACCGTGGA 


3960 


AGGTATAAGA 


ATGCTCTGTG 


TTTTGACGCC 


TATTGGCCAC 


CTCAGAAATC 


TCACGAATGG 


4020 


CTACAATGCC 


AACGTGTTTC 


TTGATAAAGG 


AAAGCGTCCC 


ATTATCTGGC 


GTGACAATGT 


4080 


ATTGATTTTT 


TGCAGTCTTG 


GCAACTACAC 


TCTTACGTTT 


C GAACCG AC A 


CCTGGATCGA 


4140 


C AAC C GAT AC 


AAACGTCGTT 


CCCTCAGGCC 


AGTAATCCAC 


CGTCT 




4185 


(2) INFORMATION FOR SEQ ID NO: 1*74: 









<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 
TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 
CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 
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CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CTTCTTTTCT 2 40 

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 300 

ATTGTAGTGA CTCTTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 360 

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 420 

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 480 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 540 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 600 

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 660 

AT ATTT CTCC AGAACATCTC CAT CCTT AAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 720 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAAGCG CAGGAGAAAT CGACCAATAG 780 

GAAAGAAATA GC CTT AG AAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 840 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 900 

TTCTTCAGAT TTCTT CATC A ACTCAATCAG CTGTAAAATA CGGCTCTTCA TCTGATTATC 960 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTCTA TTTGCACAAA CCAAATCAGA 1020 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 1080 

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTC ATTTCCT TGAGATAAGA TCAAGACATC 1140 

TCCTATTTGA ACATCGGAAC TCTTGATACT AACAAC CGT A TCGCCCTGTA CT AGGAAT AC 1200 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATCTGTT GCAGTTTTTT TCAAGGACCA 1260 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 1320 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 1380 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 1440 

ATAGCGAATA GG AT ACGGC A CAAAACTACG AAAAAGTACA CGCTTAACCG CTGCACCTGA 1500 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT 1560 

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 1620 

TATGCGTAGA GTAAAGTGCT GTTGAT CCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 1680 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 174 0 

TAGTCGGATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 1800 

CGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA I8 60 

CATCGTCTCC ATGTTGCTTC AC AAC AG AAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 19 20 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 1980 
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AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 2040 

CAACGGTTAA TACTTCTTTT AACATAGGG 2069 

f 2 > INFORMATION FOR SEQ ID NO : 17 5: 

<U SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs 
IB) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
AAATCTTGCC CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT CCCCGGACTG 
TTTCAACGTC CCAAGACATA ATCTTAGGCA GATTTCTAAA ATTACACTCA AAGTGGAAGT 
CATTGACCTT TCGAATGACA GTTGAAGTTG AAATGGCCAG CTGATGGGCA ATATCGGTCA 
TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA GGAATTTGGT 
GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA TAGCACTTAA 
AACGACGTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG TAAGGAATTT 
TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA TATGATTCTT 
TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT TTCTACCCAA 
AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG AGCCCAAAGT 
TTTAGGTCGC TTGATAATAT GCGTTTTTTG AATTTTATAG ACTGCTCGTT TAAACTCTAT 
TTACTTCGTA CCTT CTGG AG CGAGACGGAA TATTAGTCAC ATACAAAATG ACT ACT ATT A 
GGATTTTATT TTCATGTACA ATTTC AG CCA GTCTTGTTAT AATCAGCCTA TAGGAATCAA 
GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG AAGTCCTTGA 
TGT CTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT TTTGAATCTG 
GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT CCTAATCAAT 
TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC AGTTTGTTAT 
GGAACGCTCT CAGTTCGAGG AGATGTTC C A ACTTCAAAGT AGTCGCTTGA CGACGCAAGA 
AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG CTAAGAATTT 
TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT GGAAGCAGTT 
GCCACCTGAA AAACG GAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT CTCATTTTCG 
TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT ATTTTTTGGT 



60 
120 

iao 

240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
340 
900 
960 
1020 
1080 
1140 
1200 
1260 
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ACTGGATTTC 


GATGAAGGAG 


ATTGGAAAGA 


AGCTGGTTTA 


ACCATTCGAA 


GAATAGCCAG 


1320 


GGAACGCCAG 


ATGGAAGCCC 


ATTTAGAGAT 


TTCTCGTTCG 


GGTCACGGAC 


TCCATATTTG 


1380 


GTTCTTCTTT 


GAGGAAGCGA 


TTCCGAGTCG 


AGAGGCTCGC 


TTGTTTGGAA 


AGAAACTGAT 


1440 


AGAACTGGCA 


ATGCAGGAAA 


GTATGCAACT 


GTCCTTTGAT 


TCTTTTGATC 


GCATGTTTCC 


1500 


AAATCAGGAT 


GTCCTTCCTA 


AGGGGGGATT 


TGGAAATTTG 


ATTGCCTTGC 


CTTTTCAAGG 


1560 


AGAAG CTT AC 


CATCAAGGGC 


GAACGGTCTT 


TGTGGATGAA 


CAGTTTCAGC 


CTTATGAAGA 


1620 


CCAATGGAGG 


TATCTACAAG 


AAATTCAGAG 


GATTTCAACT 


GCTAAAGTGG 


CACTGTTAAT 


1680 


CCAAGAGGAG 


TTAGGCAAGC 


AAGAATTGGA 


TAAGGAGTTG 


AAGGTCCTTT 


TATCCAATAT 


1740 


GATCCAACTT 


GAAAAATCGT 


CTGTGACATC 


CAAGGCACTT 


TTTTCTTGAA 


AAATATGGCT 


1800 


TCCTTTTCTA 


ATCCCGAATT 


TTATAGTAGA 


TTGAAACTAG 


AATAGTACAC 


CTCTGCTTCT 


1860 


AAAACATTGT 


TAGAAATCGA 


TTTGACTTTC 


CTGATCGATT 


TGTCCTCTTA 


TTATTTCATT 


1920 


T TACT AT ATT 


TAAAGCAGGC 


TATGCGACAG 


CCAACCTATC 


AAATTCCTGA 


GAGAATGTAT 


1980 


TTATTTGGAG 


AATCCGATCA 


TTATTTATGG 


TTGCCAAGAG 


GTTTGCTGTA 


TCCATTGCAA 


2040 


GAT AAATTT A 


AGCAGGTATC 


TGTGGAAGAT 


AGGAGAAAGG 


TACAAAGGTC 


TATTAGCGTG 


2100 


GAATTTAAGG 


GAGAACTCAC 


TTTTGAGCAA 


GAGTTAGCCC 


TGT C AG AT AT 


GACTTCTAAA 


2160 


GAAAATGGTT 


TACTTCATGC 


GGAGACTGGT 


TTTGGGAAGA 


CCGTTTTAGG 


TGCTGCTCTT 


2220 


ATCTCTGAAC 


GGAAAACAAA 


AACAATTATT 


CTAGTCCATA 


ATAAGCAACT 


CTTAGACCAA 


2280 


TGGCTAGATC 


GCTTAAACTG 


CTTTTTGACT 


TTCGAAGAGG 


AGGAGGCTAT 


CCGTTATACG 


2340 


GCATCAGGTC 


GTGAAAAGGT 


AATCGGCTAT 


GTTGGGCAGT 


ACGGTGGGAC 


TAAGAAATGG 


2400 


CTGAGTAAAC 


TGCTTGATGT 


CGTTATGATT 


CAATCTCTAT 


TTAAGTTGGA 


AAATAGTCAA 


2460 


AGTCTTTTGG 


ATGAGTATGA 


GATGATGATT 


GTGGATGAGT 


GTCATCATGT 


CTCTGCCTTG 


2520 


ATGTTTGAAA 


AAGTTGTTGC 


TCAGTTTAGA 


GGGAAGTATC 


TTTACGGTTT 


GACGGCTACG 


2580 


CCTGAGCGTA 


AGAATGGTCA 


TGAGCCTATT 


GTTTTTCAGA 


GAATTGGTGA 


GATACTCC^T 


2640 


ACTGCTGATA 


AGAGGGAAAC 


GGATTTTAAA 


CGGCAATTGC 


AATTAAGATT 


CACTTCTTTT 


2700 


GGTCATTTGG 


AAATTGAAAA 


GACCAAAGCA 


AGTAATTTTA 


TACAGCTTAG 


TGATTGGATT 


2760 


GCTACTGACT 


CAGTGAGGAA 


TCAGATGATT 


CTCAAGGATA 


TTCTAGCCCA 


AGTGGCAGAA 


2820 


GGACGGAATA 


TCTTGGTTTT 


AGTTAATCGA 


ATTCAACAGA 


TAGATGTCTT 


TGAAAAATTA 


2880 


TTGAAAGAGA 


AAGAGGTTGA 


TGACTGTTAC 


ATTATTAGCG 


GAAAAACCAA 


AGTCCGAGAG 


2940 


AGAACGAGTT 


TACTGGAGAC 


GTTAGAACAG 


TTAGATAAAG 


GGTTTGTTTT 


GTTGTCTACT 


3000 


GGAAAATACA 


TTGGCGAAGG 


TTTTGACTTA 


CCTCAGTTGG 


ACACGCTTAT 


CTTGGCAGCA 


3060 
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CCCTTTTCTT 


GGAAAAATAA 


TTTGATTCAG 


TATGCAGGTC 


GGATTCATAG 


AAACTACAAG 


3120 


GATAAGTCTT 


TGGTGCGTAT 


TTTCGATTAT 


GTGGATATTC 


ATGTTCCTTA 


TTTAGAAAAG 


3180 


ATGTTTCAGA 


AACGACAAGT 


AGCTTATCGA 


AAGATGGATT 


ATCGTGTCAT 


CGAGGGTGAG 


3240 


GAGAAACAAT 


TCGTTTATGT 


TGATAGTAGA 


TATGAGAAGG 


TGTTGAGAGA 


GGACTTAGCA 


3300 


GGGGAAAGAC 


AGGAATGTCT 


GCTTATTTTA 


CCTTATGTGC 


ACCAGACAAA 


ACTGATGAAT 


3360 


TTTCTAAAAG 


AATTTAGGAT 


TAGTCAAATT 


GAGATATGTA 


TACCAGAGAC 


GGTTGCAAAT 


3420 


AAAGCATGGC 


TAGACCAGTT 


GAAGAGCCAG 


AAAATTAAAG 


TGTCTTTTAC 


TCAATCAAAA 


3480 


ATAGTAACGC 


CTATTCTTTT 


GGTGAATAAG 


ACTATTGTTT 


GGTATGCTGC 


AATGCCATTA 


3540 


TTAGGGAAGG 


TAGATGAGAT 


GACCATATTA 


CGTTTGGAAT 


CAGCTAGTAT 


AGTTTCTGAA 


3600 


CTAGTGGCAG 


GTTTACGATA 


GAGAAAATTT 


TTAAAAATTT 


CTATGTATGA 


TTTTCATTTC 


3660 


TTTAGTGAGA 


CTGTTGC CAT 


TATCACATTC 


GAATCACACA 


AAATAAAAAA 


ATTTTTATAA 


3720 


GTACTTGACA 


AATAGATTGA 


AATATCATAA 


AATAAAAACG 


GTTACAGAGT 


TATTAATTAT 


3780 


TTAAGCTTCA 


TGTCACCATT 


AAAAATTGAA 


ATAAAAGGAT 


GTTATCACTA 


AT AC AAGTG A 


3840 


GCAGGAACCT 


ATTTAATCAC 


ATCAGAAGAA 


GTTTCTTGAT 


GTTTTTAAGT 


AGGTTCCTTT 


3900 


TATTTTAAAA 


GGGAAATTTT 


ATGATCATAA 


AACGAATACT 


AAACCACAAT 


GCCGTAATTG 


3960 


CGCAAAGTAA 


AAAAGATATC 


GATATTCTTC 


TTTTTGGAAG 


GGGAATAGCT 


TTTGGAAGAA 


4020 


AAACTGGAGA 


TAAAGTAAAT 


CCAATTGATA 


TTGAGAAAAG 


TTTTTTT CTC 


AAAAATAGAG 


4080 


AT AATATG AC 


CCGTTTTACA 


GAGATGTTTA 


TTAACGTTCC 


TTTGGAGTTG 


GTGTACATCA 


4140 


CCGAAAAAAT 


AATTAAC CT A 


GGTAAAATAA 


CATTGGGTAA 


TAATTTTGAT 


GAAATTATCT 


4200 


ATATTAATTT 


AACGGATCAT 


ATTTCTTCGA 


GCATAGAACG 


TTATAAAGAA 


GGGATTATTA 


4260 


TTTCGAATCC 


CCTACGCTGG 


GAAATATCGA 


AATATT AT AA 


AGAAGAATTT 


GAACTTGGGA 


4320 


AAAGGGCTTT 


ACAAATAATA 


AAAAAAGAGT 


TAGGTATTGA 


ACTTCCAATT 


GACGAAGCTG 


4380 


CATTCATAGC 


GCTACATTTT 


GTTAATGCTA 


ATTTAGAAAA 


TAATTTTCAA 


GAGTCGTATA 


4440 


AAATCACTGA 


AATAATTATG 


GGAATTGAGA 


AAAT CATTC A 


AGATTTCTAT 


TGTACTGAGT 


4500 


TTAACCAAGA 


TTCTATTGAT 


TATTATAGAT 


TCATAACTCA 


TATGAAATTA 


TTTGCCCATC 


4560 


GCTTGGTTGA 


GAATACAACT 


TATTGTGACG 


ATGATGA 






4597 


(2) INFORMATION FOR SEQ ID NO: 176: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 984 base pairs 
(B> TYPE: nucleic acid 
( C > STRANDEDNESS : double 
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<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CGGCTTATTT 


ACTACTTGTT 


CCATCATATA 


TGGAATATGC 


ATGAAACCTG 


CTCTCATATT 


60 


AGGGAATTTT 


TTATCCACTA 


AATAAAGAGC 


TTGGTACATC 


AAATG ATTGC 


AAACAAAGGT 


120 


TCCTGCACTA 


TTGGATACAA 


CTGCCGGAAG 


TCCCTGTTTT 


TTGATAGCTT 


GTACCATCGC 


180 


TTTGATAGGT 


AAACTACTAA 


AATAGGCCGA 


TGCTCCATCA 


ATACGAATCG 


GTGTATCAAT 


240 


TGGTTGATTG 


CCTTCGTTAT 


CAGGTATGCG 


AG CATC ATCT 


TGATTAATAG 


CCACTCGTTC 


300 


AGGTGTTAAG 


CCGGTCCTGC 


CGCCTGCTTG 


TCCAATACAA 


AGTACAGCAT 


CTGGTTGATA 


360 


TCGTAATATT 


TCTGCCTCTA 


AAACTTCTGA 


CGACTTATAA 


AAAACCGTTG 


GAATTTCTAC 


420 


CCAGCGAACT 


TCAGCCCCAT 


TAATCTCAGA 


TGGTAATAAT 


TTTACAGCCT 


CCAAAGCTGG 


480 


ATTAATCTTT 


TCACCTCCAA 


AAGGATTAAA 


ACCTGTAACC 


AATATTTTCA 


TTTTATTTTC 


540 


CTTTACTAAA 


ATGCGAGAAA 


GTACATTAAG 


AATATGTGAA 


TAACAATCAT 


TACTAGAGCA 


600 


ACACCTGCTT 


GAGCCTTTAT 


AACGCCATTC 


TGAT CTTTC A 


TATCCATCAA 


TGCTGCTGGT 


660 


AGAGCGTTAA 


AATTAGCAGC 


CATTGGGGTC 


AATAAGGTCC 


CACAATAACC 


TGCTGTCATG 


720 


GCAAGAGCAC 


CAGCCACAAT 


TGGATTAGCT 


CCCAGAGCAA 


ATACAAAGGG 


AACTCCAACA 


780 


CCTCCTGTAA 


TAACGGTGAA 


TGCTGCAAAA 


GCATTTCCCA 


TAATCATTGT 


GAATAGAACC 


840 


ATTCCAAGAA 


CATAGGCCAA 


AACTCCTATA 


AAGCGACTAT 


CTGAAGGAAC 


AATACCGCTA 


900 


ATCAGATGAG 


AGATAACATC 


ACCAACACCT 


GCTACAGTAA 


AAATAGCCCC 


CAAAGCCCCT 


960 


AATAATTGAG 


GAACAATCCC 


ACTTGTTGAA 


ACTTGCTGAG 


TCATTCCATT 


ATTTTCTGAT 


1020 


AACAGACTCT 


TAGGGTGACT 


ATTGGT AATC 


ACAAGAACAG 


AAATTGTAGC 


AAACAAGGCG 


1080 


GCAAGGCTAA 


TCGAAATCTT 


GCTAAATTCT 


GG AATC ATT T 


GCGCTAAGAC 


CAACGCAAGT 


1140 


ATTGCCATCA 


GCATAACTGG 


AATAAAAATT 


TTATTTTTCA 


AC CTGTT AG A 


TTCAATATTG 


1200 


GCTTTCATTT 


CATCTAAGGA 


TGGCAAGGTT 


CCGATACGGA 


CTTGCTTAAA 


CAATGTTAAC 


1260 


AGCGATAATA 


GGATTACAAT 


AATACCAATA 


CTCATATTTG 


GCATATAGGA 


ACCACCTATA 


1320 


AACGTAATAG 


ACAATAGAGT 


CCAAAATGCA 


GATGTCCCAA 


GTCGAACTGG 


GTTTGTTTTA 


1380 


TCTTTATAAC 


TACAATAGGC 


TGTATGGAGA 


AATTGACAAC 


CAATCACAAT 


ATAGGTCAAC 


1440 


TCTAATAGTT 


GCTTTCCCAA 


CTCTGTCATT 


TTTGTTCTCC 


TCCCCTAGTC 


TTTTTTGATA 


1500 


TCAATTTTTT 


ATCAAATAAA 


TAATTATAAA 


TCCCCACTAC 


AATAAGTGTT 


ATAACAGCAA 


1560 


CAATAATAGA 


TGTAGAAGCA 


ATC CCTGC AT 


AATTG CTTTC 


ATAGCCTAAC 


TGATCTAATG 


1620 
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CAAGAGGACT 


CCCCCAGCAC 


CTACAAACGT 


ATTTTGAGCA 


AAGAAATTTC 


1680 


CAAAATTTTC 


ATTCGCAGCC 


GCACGCGCTT 


TTATTGTCTC 


ATCTTCAACC 


TCTGTTAACT 


1740 


TTCTACCTAA 


TTGAGACTCT 


GCAGCTGCTT 


CTCCCATACG 


TTGAACCAAA 


GGTCTGACAA 


1800 


ACTGAGGGTG 


TCCTCCTAGA 


CGAATTGAAA 


AGAAACCAGC 


TAACTCTCGA 


ATAAAGAAAT 


1860 


AAACTGTATA 


GAAGTTTCCA 


ACTGTC AG AC 


CTTTAATCTT 


TCGAATCAAA 


TCGATTGATC 


1920 


GTTGCTTGAG 


TCCAAAGGTT 


TCTGACAGCC 


CCACAAGAGG 


CAAGGTAACC 


ATAAAAATCG 


1980 


TGAGCACTCG 


CTGATTGCTA 


AATTCTTTTC 


CCAAAATCTC 


CAAAAATTCA 


ACGAGAGAAA 


2040 


CACCTGAAAC 


TAAAGCTGTA 


ACCAAACCAG 


CTAAGACTAC 


TGTTGCAATT 


GTATCAAATT 


2100 


TTAAAATAAA 


ACCCACAACA 


ATGATTGCTA 


TTCCTATTAA 


TCTAATCCAC 


TCCATATCAA 


2160 


ACTCCTTTAT 


ATTCAAAATG 


ACAGTATTTT 


TAAAATTTTA 


TCAAGATCAA 


TACCATTCCT 


2220 


TATTTAATGT 


GTTTTTCTAG 


TTCTTTTTGG 


TATTTGCTAT 


TGGATTCCAA 


TTTTTCTTTT 


2280 


TGCCATTTTT 


TAAAAACCTC 


GTTATATTCT 


TTTGTTGTAA 


CAATATCTTT 


TTGCAATTTC 


2340 


ATTCCTTTAA 


AGATATATGG 


ATCCCCCTTA 


ATACCAACTT 


GTGAGTATGG 


TTTTGAGAAT 


2400 


GGTACTACGT 


TACTTACAAC 


TGGAGAACCA 


CCAGATGAAG 


CTGTTGGCAT 


CAATAATGAA 


2460 


CTATCTGTCG 


AC CAAGCTTG 


AGCTTTGGCA 


TATTTTTCAT 


ATCTTTTCTC 


TAGGTCAGTG 


2520 


GTCTCAGAAA 


CAGCATCTTC 


TAACAATTTC 


TTATATTTAT 


CCAAACCAGG 


TTTAGCTACA 


2580 


ACATCCTTAT 


CTTTTCCTTT 


CGTAATACCA 


AGGTGTTTCA 


TGGCAGAACC 


AGATTTTGGA 


2640 


TCTATAATAT 


TCAAGTGAGA 


CGCTGGATCT 


TGATAGCTTG 


CAGCCCATCC 


TGTACTGTTC 


2700 


AAATCAT AG T 


CTTTTTGAGA 


AGGAGCAACA 


TTGCCGTATT 


TATCATTTTC 


CATCAAACCA 


2760 


TCAATAACAT 


TTCCAATAAC 


GTCTGT CCTC 


GATGTTCGAG 


TCGCTATACT 


GTAGCCCAAT 


2820 


GATGCTGGAT 


CT ACTG CAT A 


GACATAAGAA 


AATGTTGTCG 


GTGCATCTGC 


TTCTTTATCA 


2880 


n*l v *T w T , 'i w f~C* C*J^C 
O l i. l J> l v» \w f\^— 


AAGCCACTAA 


AATAGCTGAC 


GTGCTCAGGA 


CCACTCCTGC 


TGTTAAGAGC 


2940 


CACTTTTTCT 


ATTTCATAAA 


GAATCTCCTT 


TGGTTTATTT 


TAATCTACTT 


TTACAATCCA 


3000 


ACCTTCTGGC 


GCTTCAATAT 


CGCCAAACTG 


AATACCCGTC 


AATTCATTAT 


AT AATTT A CG 


3060 


CGTCACAGGA 


CCTACTTCTG 


TTTCACTATA 


GAATACATGG 


AAATCATCAC 


CATGTTGAAT 


3120 


ACCTCCAATT 


GGAGAAATAA 


CCGCTGCTGT 


ACCACAGGCA 


CCTGCCTCTA 


CAAAACGGTC 


3180 


AAGATT AT CA 


ATTGGAACAT 


CACCCTCAAT 


AGGAGTTAAT 


CCCAAGCGAT 


GTTCTGCCAA 


3240 


ATAAAGCAAG 


GAATACTTGG 


TAATAGATGG 


CAAGATAGAT 


GGACTCAATG 


GTGTTACAAA 


3300 


TTCATTATCA 


GCTGTAATTC 


CAAAGAAGTT 


AGCTGATCCG 


ACTTCTTCAA 


TCTTTGTATG 


3360 
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AGTTGATGGG T CC AG AT AG A TAACATCTGA GAAATGACGT GACTTGGCCA TTTTTCCTGG 34 20 

TAAGAGACTT GCAGCATAGT TTCCACCAAC CTTAGCCGCA CCTGTACCAT TTGGTGCTGC 34 80 

ACGGTCGTAC TCATCCTGAA TCAAGAAGTT GGTTGGG AC C AAACCACCTT TAAAGTAATT 3 540 

TCCAACTGGC ATAGCAAAGA TCGTGAAAAT GTACTCTTCT GCCG G TTTTA CCCCGATAAT 3600 

ATCTCCGACA CCAATCAAAA GAGGGCGAAG ATATAAGGTT CCACCTGTTC CGTATGGTGG 3 660 

TACGTATTCT T C ATT CGC AC GGACAACTGC TTTACAAGCT TCTACAAACA TCTCTGTCGG 3720 

AACTTGTGGC ATCAAGAGAC GGTCAC AT GT ACGTTGCAGA CGTTTAGCAT TTTCATCAGG 3780 

ACGGAACAGT TGAACACTGC CATCCTTAGT ACGATAAGCT TTCAAACCTT CAAATGCTTG 384 0 

TTGTCCATAG TGAAGACTTG GAGAAGACTC TGAAATATCC AAAGTTGCAT CCTCTGTAAC 3900 

CTCTCCTTGA TCCCATTGTC CATTTTTGAA ATGAGCAAGA TAGCGATAAG GTAATTTCAT 39 60 

ATAGGAAAAA CCGAGGTTTT CCGG 3984 



(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 8703 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(Di TOPOLOGY: linear 



<xii SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TATCTAATTA TTGGT TT TTA TCGCTGACCT TGGCTATTGT TGGG G TTGTT TTACCCTTGT 60 

TGCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTTGTTT CTCCAGAAGT TCCAAGCGAT 120 

TCGAAGATTG G C TTTATCAT ACCAAGCTCT AT C AAGCAT A TGTAGCTGAT TTTCGTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC CTCTTGATCG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTC AAGGTC A TTCCAGACAA AGAATAGTTA AAACAGTAG T 3 60 

TATTTGCCTT GATAAAATTG AAAGC AT AT T CATAACAATA TGATATAATA AAATTGAAGT 4 20 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 4 80 

AAAAGCGATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG 540 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 6 60 

TGCTATCATG CTAGAAGACA TTC G TTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 72 0 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 
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ATCAAAAGGA 


ATGGAAATTG 


TCTTTCACAT 


GGCCTTTGAT 


GAACTAAGTG 


ATGAAGATCA 


840 


AGCGGAAGCT 


ATTGACTGGC 


TCAGTCAAGC 


C GGTGT C ACT 


CGTATCCTAA 


CTCGTGCTGG 


900 


TGTGTCTGGC 


GACTCCTTAG 


AAAAACGTTT 


TGTTCACTAT 


C AC AG AATTT 


TGGAGTACGC 


960 


TAAAGGTAAA 


ATTGAAATTC 


TACCAGGTGG 


GGGGATTGAC 


CTTGAAAACC 


GTCAAACCTT 


1020 


TATCGACCAG 


GTGGGGGTAA 


CACAATTGCA 


TGGTACTAAG 


GTTGTTTTTT 


AAAAAATAGA 


1080 


AAGGAACTGC 


TAGCTTTGGG 


TAGCAGTTTT 


CACTTATGTT 


TGAAATTTTT 


AAATCCT ATC 


1140 


AATTTAATCA 


AGAAAAGGCT 


CATGATTATG 


GTTTT AT AG A 


AAATAGCGAA 


GTCTGGACAT 


1200 


ATAGTTGCCA 


GATTTTGCAA 


GGTGACTTTG 


TCATGACTGT 


GTCCATCACT 


GCTGATAATG 


1260 


TGAACTTTCA 


AGTCTTTGAC 


CAAGAGACTG 


GTGACCTCTA 


TCCTCACGTT 


TATATGGAAA 


1320 


GCATGAGGGG 


AAGTTTTGTC 


GGAAATGTCC 


GTGAGGCTTG 


TCTGGAGATT 


CTTTACCAGA 


1380 


TTCGGAAGGC 


TTGTTTTGAT 


GTGCAAGATT 


TTATCTGTCA 


TCAGACTAAG 


CGTATCATGA 


1440 


CTCAAGTTCA 


GGAAAAGTAT 


GGAAACCAGT 


TGGAGTATCT 


GTGGGAAAAA 


TCGCCTGATA 


1500 


CAGCTGTATT 


GCGCCATGAA 


GGCAATCAAA 


AGTGGTATGC 


CGTCTTGATG 


AAAATCTCTT 


1560 


GGAATAAGCT 


GGAAAAGGGC 


AGAGAAGGAC 


AAGTGGAAGC 


AGTCAACCTC 


AAGCATGACC 


1620 


AAGTAGCTAA 


TTTGCTTTCA 


CAAAAGGGGA 


TTTATCCAGC 


CTTCCATATG 


AGCAAGCGCT 


1680 


ACTGGATTAG 


TGTGTCCCTT 


GATGATACTT 


T ATCAGATG A 


AGAAGTACTG 


GAATTGATAG 


1740 


AAAAAAGTTG 


GAACTTAACC 


TCTAAAAAAT 


GAAATATTTT 


AATAATTTTC 


ATGAACTTTC 


1800 


AATTAGCTAA 


ATATTCTTTA 


CTGAAGAGAT 


TTTTAGAAAA 


TATAGGATTT 


AC C AC ACT AG 


1860 


AGGAATATGG 


TGCCATCTTC 


AAATACCTGA 


TTGAGAATGT 


CAAGACGGAT 


CGTCAGATCA 


1920 


TCTATTCGCC 


TCACTGTCAT 


GATGACCTCG 


GAATGGCAGT 


GGCAAATAGC 


CTTGCTGCTG 


1980 


TCAAGAATGG 


TGCAGGACGT 


GTTGAAGGGA 


CTATCAATGG 


TATTAGGGAG 


CGAGCTGAAA 


2040 


ATGCTGCTTT 


GGAAGAAATT 


GCAGTGGCTC 


TCAATATTCG 


CCAAGATTAC 


TACCAAGTAG 


2100 


AAACCAGTAT 


TGTCCTAAAT 


GAGACCATCA 


ATACGTCAGA 


AATGGTTTCT 


CGCTTCTCTG 


2160 


GTATTCCAGT 


TCCTAAAAAC 


AAAGCCGTCG 


TTGGTGGCAA 


TACCTTCTCC 


CACGAATCTG 


2220 


GTATTCACCA 


AGATGGAGTC 


CTTAAAAATC 


CTCTC ACTT A 


TG AGAT CATC 


ACACCTGAAT 


2280 


TGGTTGGTGT 


TAAGATTCTG 


CTTGGAAAAT 


TATCTGGTCG 


CCATGCTTTT 


GTTGAGAAAC 


2340 


TGAGAGAATT 


GGCCCTAGAT 


TTTACAGAAG 


AGGATATCAA 


ACCACTCTTT 


GCTAAGTTCA 


2400 


AGGCACTGGT 


CGATAAGAAG 


CAAGAAATCA 


CAGATGCAGA 


TATTCGAGCT 


TTGGTAGCTG 


2460 


. GAACCATGGT 


TGAAAATCCA 


GAAGGCTTCC 


ACTTTGATGA 


TTTACAACTT 


CAAACTCATG 


2520 
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CAGATAATGA 


CATTGAAGCG 


CTCGTTAGCC 


TAGCCAATAT 


GGATGGTGAG 


AAAGTCGAAT 


2580 


TTAATGCGAC 


AGGGCAAGGT 


TCCGTTGAAG 


CAATCTTTAA 


TGCTATCGAT 


AAGTTCTTTA 


2640 


AC CAATCTGT 


TCGTTTGGTG 


TCCTACACTA 


TCGATGCGGT 


AACAGATGGA 


ATCGATACCC 


2700 


AGG ATCGGGT 


TTTGGTCACT 


GTTGAAAACA 


GAGATACAGA 


AACCATCTTT 


AATGCAGCAG 


2760 


GGCTTGATTT 


TGATCTGTTG 


AAGGCTTCTG 


C T ATTGTCT A 


TATAAACGCT 


AATACCTTTG 


2820 


TTCAAAAAGA 


GAATGCAGGT 


GAG AT GGG AC 


GCAGTGTTTC 


TTACCACGAT 


ATGCCTAGTG 


2880 


TGTAAAGGAG 


AAGGCTATGG 


CAAAGAAAAT 


AGTAGCTCTA 


GCAGGAGACG 


GAATTGGCCC 


2940 


AGAAATCATG 


GAGGTTGGTT 


TAG AAG TTCT 


GGAGGCTCTA 


GCTGAAAAAA 


CAGGTTTTGA 


3000 


CTATGAGATT 


GACAGACGAC 


CGTTCGGAGG 


TGCAGATATT 


GATGCAGCAT 


GACCTCCCTT 


3060 


AC CTG ATGAA 


ACCCTTAAGG 


CAAGTAGGGA 


AGCAGATGCT 


ATCCTACTAG 


TAGCTATCGG 


3120 


TAGTCCTCAG 


TATGATGGAG 


CAGTGGTTCG 


CCCTGAACAA 


GGCCTGATGG 


CTCTCCGTAA 


3180 


GGAACTCAAT 


CTTTACGCTA 


ATATTCGTCC 


TGTAAAAATC 


TTTGACAGTC 


TCAAGCATTT 


3240 


GTCACCACTC 


AAACTGGAAC 


GAATTGCTGG 


TGTAGACTTT 


GTCGTGGTGC 


GTGAATTGAC 


3300 


AGGCGGGATT 


TACTTTGGAT 


ATCATATTCT 


TGAAGAGCGC 


AATGCGCGTG 


ATATCAACGA 


3360 


CTATAGCTAT 


GAGGAAGTGG 


AGCGGATTAT 


TCGCAAAGCC 


TTTGAAATTG 


CAAGAAATCG 


3420 


CAGAAAAATC 


GTTACTAGTA 


TCGATAAGCA 


AAATGTTCTA 


GCGACCTCAA 


AACTCTGGCG 


3480 


GAAAGTAGCT 


GAGGAAGTCG 


CACAGGATTT 


CCCAGATGTA 


ACCTTGGAAC 


ATCAGCTGGT 


3540 


AG ACT CAGCT 


GCTATGCTTA 


TGATTACCAA 


TCCTGCTAAG 


TTTGATGTTA 


TTGTAACGGA 


3600 


GAATCTTTTT 


GGAGATATTT 


T AT CTGAT G A 


ATCAAGCGTC 


TTATCTGGTA 


CACTTGGGGT 


3660 


TATGCCATCA 


GCCAGTCATT 


CTGAAAATGG 


ACCAAGTCTC 


TATGAACCTA 


TTCACGGTTC 


3720 


AGCACCTGAT 


ATTGC AGGTC 


AAGGAATTGC 


CAATCCTATT 


TCCATGATTT 


TATCAGTTTC 


3780 


CATGATGTTG 


AGAGATAGTT 


TCGGACGTTA 


TGAGGATGCA 


GAGCGTATCA 


AACGTGCTGT 


3840 


TGAGACAAGT 


CTGGCGGCAG 


GAATTTTAAC 


GAGAGATATA 


GGAGGTCAGG 


CTTCAACAAA 


3900 


GGAAATGACG 


GAAGCTATTA 


TTGCAAGGTT 


ATGAAGTTAG 


ACGAAAAAAT 


TACTCTAGTC 


3960 


CTTTTGATTT 


GG AATGT CAT 


CATTTTCTTG 


ATTTATGGTA 


TTGACAAATC 


TAAGGCAAGG 


4020 


AGAAGAGTTT 


GGCGCATCCC 


TGAGAAAATC 


TTACTTATTT 


TAGCCTTTAC 


TTTTGGTGGT 


4080 


TTTGGTGCCT 


GGCTAGCAGG 


AATCATCTTT 


CACCACAAGA 


CTCGAAAATG 


GTACTTTAAA 


4140 


ATAGTTTGGT 


TTCTTGGGAT 


GGTGACCACA 


CTAGTAGCCT 


TATATTTTAT 


TTGGAGGTAA 


4200 


TGGATGGCAG 


GGTCTTCGAG 


GGAATACGCT 


GCTTGGGCTC 


TAGCGGACTA 


TGGTTTTAAG 


4260 


GTCGTGATTG 


CAGGATCTTT 


CGGTGACATT 


CATT AC AAT A 


ATGAACTCAA 


TAATGGCATG 


4320 
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TTGCCAATCG 
CAGGTAACTG 
GAG AT AG AT A 
TTGCAGTATG 
TAGAAAAAAT 
GCTTAACATC 
CATTATAAAA 
GCGGATGTAT 
GACTTGAGCG 
GCGGTAGCAT 
ACTAAAAACA 
CCTTTGAAAA 
GTACTAGCTG 
GGGAAGAAGG 
CTATGAGCAA 
AGGAGGAGCA 
TAAAATCATG 
TTATTAGGAT 
TTGTTTAGAA 
TAATAAGTAA 
GAAAAGTTAG 
ATAAGGCAAC 
GGCCTTTGGA 
AAAATAAGTC 
TTTCTGGTGG 
AAGATCAAGC 
ATTTGT ATG A 
TGAAAAGATC 
TCTATACTGA 



TTCAGCCTAG 
TGGACTTGGA 
GCGAGTGGAA 
AAGAGTTGAT 
AGAAAAGGAG 
CATTTCCAGC 
TT ATG AC AAA 
CGTTTGCCCT 
TAAGTTCGCA 
GGATGAACTT 
TTGGGATGAA 
ACTCAAAGCA 
TTTTGTTATC 
TAAAATGGTC 
GGTGTAATGG 
CGTTCAACTT 
ATG CACCTGT 
AGAGAAGAAG 
TTTCTTTTCT 
TGCCAATATT 
TTTAGAAGAT 
TGTTGGTGAA 
TAGGATTTTA 
TTTATGCTAC 
CTCAGCTTCA 
TAATAATGAG 
AATTATCAAA 
AACAAATGAA 
GGGAGATCCT 



AG AGGT TAG A 
ACAACAAAAA 
ACATAAACTC 
TGCTGCTTAT 
ATATAGTAAA 
AATTTTTTAG 
ACACATTCAC 
ACTAAAGTTG 
GCCAAAGAAC 
TGCGCTTTAG 
GATATTCTTC 
T ACGGGG ATG 
AAGTTTGGAA 
TACGCCTCAT 
TGTCTATGGT 
CACCAGCTCC 
C AG AT ACT TT 
TCTAGTGTTA 
TTTTCTATAG 
TAAACATCAT 
TTTTATAAAT 
AAGTTTAATG 
ACAATGTCAA 
GCTCTTGAGA 
AAATTTGGTA 
ATTTCAGAGT 
GAAGGTATTC 
TTTATTGGTC 
TTCTTTGGTG 
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GAG AAACT AG 
ATCATCTCAC 
CTAAATAGTT 
GAAAAACAAC 
CTGAAATAAG 
AAACT ACAGT 
AAGAAGGCTA 
GTT ACATT AT 
GTAAGCGTAA 
CGCAACTCAA 
TTGGTTGTAT 
GCCGTGAAGA 
AAGCAGGTGA 
CTGCTTCAAT 
CGATAAGGAC 
AGTTGTGATC 
TAACTCATGG 
TGAGATATTA 
GATATGGTAT 
TACTAAAAGG 
GGTATAGTCT 
ATAAATTAAA 
TCGATGAATA 
AGGGAAAATA 
TTTATTGGAA 
TGGATCAGCG 
GTTTTAACTT 
GTTCTGCTAT 
TAAATATTAA 



CCCAGCTAAA 
C AGTTGAAG A 
TCGATGATAT 
GACCAGCCTA 
ATGTAAACAA 
GGACTATTCT 
CGACATTTTA 
CATGACCAGT 
CAAACCAGGT 
C C CAG AAATT 
CCTTCCTTGG 
ACTTATTACT 
ACAATTGGCT 
GACAAAACGA 
GGCAAACTCA 
CGTAAAGGGC 
GACTACCGTC 
AAGCTCCTAA 
TCTATGTAGA 
AGTTAGATTG 
AAATAAAGAA 
AGAAGAGTTT 
TGTAATAGGA 
CAAAAATCTA 
TAAAAAAACA 
ATTTTCAAAA 
TGAAAATCCT 
GGTGACAAAA 
TAGTCAGAAA 



ACCAACCGAC 
ATTCACCTTC 
CGGTATTACC 
CTGCCAGGAT 
ATGAATTGGA 
GGATTCAACA 
AAAGGTGAGG 
GACAAGGCAG 
GTTGTTCTCT 
GAAGCATTCT 
AAACCAGAAG 
GATGTACGTG 
GCCAAGCTTT 
TTGAAACTCG 
TCCCAGAACA 
TTGACATTGA 
AGGTTGAGTA 
CACTGGGCTT 
AAATATATGT 
ATGAATAAAA 
GAGTTATTAA 
CTCCAGGAAT 
AAGGGACAGC 
TTTCTTGGAA 
AACAAATATA 
TTAAAATCAG 
ATTTT TGATA 
TTACTTTGTA 
GAATTTTGGA 



4380 

4440 

4500 

4560 

4620 

4680 

4*740 

4800 

406C 

4920 

4990 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 
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ACCACTTTGT 
AACTGGTGTC 
AGTATTCTAA 
ATTTTCGTCA 
CTCCTGGCAC 
ACGAAGATCA 
AAGGTTTAAG 
TTTTTAAAGA 
TTGATGAGGC 
TAACAAAAAC 
GTGGTGTTCC 
ATTATAATAA 
AGAGATTTGG 
TCTTCATCAT 
TCTCTATCGA 
TACACGAAAC 
ATGATATTGA 
TTGAAGTTAC 
AAGAAGCAAA 
ACAGTCATTA 
ATGAATTACT 
ATGATGAGGT 
AAAAAGATCA 
GATAATCAAC 
CCTCTTTTAG 
AATGATTwGA 
CAGAAAATCA 
ATTTCCTCAC 
GTTCTTCATA 
CAACTTTTGG 



TTCTCAGACA 
CAAAACT T AT 
GCTTTTTATG 
TCAATTAACT 
GGGAAAAACT 
AATCGGATTT 
ACCAGTATCA 
TTTTTGTCAG 
TTGGGATTCT 
AT CTT ACTT A 
AGGATGGTCA 
GCAAGAATAC 
TTTGAAAGAC 
CGATGAGATC 
CCCCGGCTAT 
TGATGAAAAG 
TCGTT CAGTG 
TGTCGAGGGT 
AATTCGTCTA 
TCATATTGGA 
CTGGTCTGAT 
TGAAACTTTG 
GGCAGTAGCT 
ACAAGATTAT 
ACAGAACCTT 
CTCATACTCC 
AGACAGGGAA 
GATTT TCT G A 
TCAATCTCAC 
TGTATCTCTT 



AATCAAGGTG 
CCTGAGTTGG 
GAAAATAAGG 
CAATCTCTAT 
TATCTTGCTA 
GTACAATTTC 
AATGGGGATG 
AAAGCAAAAG 
TACTTAGAAT 
TCTGTTAATA 
CTACCTAGCA 
TACAAAAGTG 
TATGTTTCCC 
AATCGTGGGG 
CGTGGTG AAA 
TTCTATATCC 
GATACCTTTG 
CAAGCTGCCA 
AGAAACTTGA 
CCAAGTTATT 
TATATTAAGC 
GAaACTTTGA 
GATGACAATG 
TAAAGAAAAA 
GGAAAGTCTA 
TGATTTGGAT 
CGTGATTGGT 
TGAGAGTAAT 
TAGTTTAGAT 
T CCCAAGT AT 



1114 
GACCTTATCT 



AGCCATCGAA 
AAG AC AAT AG 
TAAAGTCTCC 
AAGAAATTGC 
ACCCATCATA 
GAGCTATTGA 
AAACCCAATT 
ATATAAATGT 
GTAGACAAAA 
AATATCTTTA 
GTGGAAAAAC 
CAACAGAAAT 
AGATTTCTAA 
AAGGAAGTGT 
CCGAAAATGT 
ATTTTCCTAT 
TGTTGGATAA 
ACGCTGCTAT 
TTCTTAAGTT 
CTCTCCTAGA 
AAAAAGCATT 
AAGGCGATGA 
TTTGTTGAAG 
TCC CAAGATG 
AAGGACCAAA 
TTTCTTGGAT 
GACCACTTTT 
GTTGCTTTGT 
CTACAAGCTG 



GCAAAATCAT 
ATTAGGAACT 
TACAATGGAT 
AAACCTCATC 
TAAAGAATTA 
TGATTATACG 
GTTTAGGCTA 
GATTGGAGGA 
TGCTGAAGAA 
TTTGTCAGTA 
CGAGTTGTAT 
TGTCCTAGAA 
TGATACTGAT 
GATTTTTGGC 
TTCTACCCAA 
TTACATCATC 
GCGTCGTCGT 
AGAGTTGAAT 
CGAAAATATT 
GAAGGATGTA 
AGACTACTTG 
TGATCTGACA 
AAACGATGAT 
AAT AT C CT AA 
AACGTATTTT 
AGATTTTTGA 
ATGGTCAGGA 
TGCATTATCT 
CTCGTGAAGA 
CTATTCGAAA 



AAAATAATTG 
ATGCTTTTTG 
TCATCAAACA 
CTCCGCGGTC 
ACGGATGGCA 
GATTTTGTAG 
CAGGACGGTA 
CAAGATAATT 
AAAGAATATA 
AATTATGATA 
AAAGATAAAA 
AC ATTGAGAA 
AAGAATTTTG 
GAACTCTTTT 
TATGCAAATC 
GGAACTATGA 
TTTCGTTTTG 
ATCCATGCAG 
CAGGAATTAA 
GATTTTGACT 
CGAGGTTCTT 
AATAATGAGC 
GCGGATTACT 
ACTAAGCAAT 
CATTTTTCCA 
AACAGTCAAT 
AAGATTAACG 
CTTAAACAAG 
GAGGCTTTAT 
AGGTCTTTAT 



6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 
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AAGGAATATC ATCGATTTTC TCATAACGAC AGTCATGTTA AGGGAGTGAT TGATGTAAGA 7920 

AACCATCTCA AGAAAAATCT TCCTTTCACG GGAAATATTG CCTACGCAAC GAGAGAGTTC 7980 

ACCTATGATA ATCCCCTCAT GCAGTTGGTC CGTCACACTA TTGAATACAT TAAGAATCAG 8040 

AAAAGCATTC GTCAAGGGGT ACT AG AT AA T CTCTCAACTA GTCGTGAAAA CGTATCTGAA 8100 

ATCGTGCGTG TAACGCCCTC TTATAAACTA GCTGATCGTG CTAAGATTAT TCCGGGAAAT- 8160 

CAATCTAAAC CTATACGTCA TGCATACTTT CACGAGTACA G AAACTT A C A AGAACTTTGT 822 0 

CTGATGATCC TAAACCAAGA AAAGCACGGT TTAGGGTATC AAGATCAAAA AATCTATGGT 8280 

ATTCTCTTTG ATGTTGCCTG GCTTTGGGAA GAGTATGTTT ACACCTTCTT GCCAAAAGGT 8 340 

TTTGTACATC CCAGAAATAA GGATAAGACG GATGGAATTT CAGTATTTTC TGTTGGGAAA 8400 

CGAAAAGTAT ATCCAGATTT TTATGACAGA GAACGAAAGA TTGTTCTAGA TGCAAAATAT 84 60 

AAAAAACTGG AATTGACTGA AAAAGGAATC AACCGTGAGG ACTTATTCCA GCTGATTTCC 852 0 

TATTCTTATA TTTTAAAAGC TGAGAAGGCT GGACTGATTT TTCCTAGTAT GGAGCAGTCA 85 80 

GTAAATAGTG AAATAGGAAA AGTAGCTGGC TATGGAGCTC AATTGAAGAA GTGGTCTATT 864 0 

CGAATCCCTC AGAATGCCTC ATTCTATAGT ACATTTTGTA AAATGATGGA AAATTCAGAA 8700 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4854 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

CATCACCAGT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 6 0 

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGAGAAG CACTACGAGA 120 

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180 

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGCTA 240 

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TT ATGGCT AT GAGAAAACAT 300 

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 3 60 

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420 

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 4 80 
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GACCTTTTCA TAAGGTTCGC TTGATGTACC AAGATGAGGC TGGTTTCGGT AGAATCAGTA 54 0 

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TG T C CAT ACT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA TTTTTCTTAA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT ACAACACCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGG ACAATG CT AT ATGGCATAAA TCAAGTACCT 7 80 

TAAAGATTCC G ACT AAT ATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 84 0 

CATTGAACAA GTGTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTCAT 960 

AAAGTCCATC GTTAATCGGA G ATGG ACT AG AATGCTTTTT GAAAACAGAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 1080 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

C AGT ACGGCC AGCACGGTGT GTGTAGGTTT CG CT AT CT CT AGGAATATCA AAGTTTACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTGCA AGAAGCAGGG 13 20 

TTAGTTGGTT AT C TTT AAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCGGCA TTCCCAAGGT 1440 

CTGACAGGCT ATTGAAGAAG ACTAGACCAC GGAAATCCTC TACATGAGCC AGTTTTCGTA 1500 

GCATATCCAC TCGATGACGT TGGTCTACCT GCATGTAGAA ATGCTGGATA TTGTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCGGCACAAT CTTTTCTTGG TCAAACTTGG 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTGC GTAGTGAGTG ATTTT T TCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG ATGGTTTCCA 174 0 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 18 00 

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT GAACCTGAAA I860 

GG AAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAAAATCT 192 0 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT TTCTT T TGTA 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGGAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATCTAATGC CAGCAGTCTG GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAG TTTGGTAGAG ATTCAGATCT GGTTGCTGGA TCATGTGTGC 2280 
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AAATGCAGCG 


i ACTTCCTCAG 


1 TCATCGTATG 


1 AGGAGCCTGT 


' TGGATAGGAA 


. GCTGGACTTG 


2340 


ATTTCCTTGG 


TGGTCGGTAA 


AAATAGCTGA 


> GCGAATATGC 


TCAATCGTGT 


TGAGAGTCAA 


2400 


CGTTCCATCT 


GTTGTATAAA 


TCTCGCAAGG 


AAGATTGGAA 


GTCATGTTTT 


TTCCAGCCTT 


2460 


GATGTGAACT 


TGATAGTCTG 


GGTAGAAGAG 


GATACCATCT 


CCATTTAGGT 


CAATGCTATT 


2520 


GTCAAGCTGT 


TGAGCATGGT 


AAGTCGCGTC 


ATTGGCTTTT 


CCAAAAAGAC 


CAACAGCAGC 


2580 


ATAGAGGGGA 


TAAATCCCCA 


AATCCATGAG 


GGCTCCACCA 


GCAAAACGGT 


CTGAAAAGAC 


2640 


ATTTGGTGTT 


TGTCCAGCCA 


ACAAGTCAGG 


CATCTTGGAA 


GAGTATTTGG 


CATAGTTGAA 


2700 


ATCTGCTCCT 


AACACTTGCT 


TATCTGCTAA 


AAAGTTTTTG 


ATAGTAGTAA 


AGGCTTTCTC 


2760 


GTGGTAATTA 


CGAGCTGCTT 


CAAAGATAAA 


ACAGTTATTT 


TTTTCAGCTC 


TTTGAATCAA 


2820 


ATCAAACCAT 


TCTTGTGGTT 


GAGAGACAGC 


TGGCTTTTCG 


AGAATAACAT 


GTTTACCAGC 


2880 


AGACAAGGCA 


GCTTTTGCCT 


GAGCAAAATG 


TAAGGAGTTT 


GGACTGCCGA 


TATAGACTAA 


2940 


ATCAAAAGAA 


GATTTGAAGA 


AGACTTCTAA 


TTGATCGAAT 


AGTTGGATAT 


TCTGATAGCG 


3000 


AGAAGCAAAG 


GTTGCTGCAG 


TTTCTAGTTT 


TCTAGAATAG 


ATTGCGACCA 


GTTGGTATTC 


3060 


TCCACTGGTA 


TGGGCTGCTT 


CTATGAAATG 


ATGGCTGATA 


GCGCCAGTTC 


CGATGACACC 


3120 


TAATTTTAGC 


ATAAATACTC 


CTTTTCCGAT 


TTTAAATCCT 


TCTTTCATTA 


TAACATAGAT 


3180 


AGACGGGACT 


ATCCAACACA 


GAGGAGAAAA 


TTTCAAATAA 


GCTATTAGCT 


TTCTTTTCCG 


3240 


AATAAATAGA 


TAGAAGCATA 


GAATCTAGCA 


AAC C TAG ATT 


TAAAAATGTG 


CTATAATAGA 


3300 


AGGAGGAAAA 


GGAGGATTCT 


CAGACATCTA 


GGTATCAGCC 


CAACTAATGA 


TTTGTCAATT 


3360 


TAT CCGCG AT 


ATGCTGGACT 


TGCCAGCAAA 


AAATGTGACG 


ATTTTGGAGG 


GAAGTAACAT 


3420 


TC ACGTCTTG 


CCTTCCATGC 


CCTACTCAGC 


GTAAGATTTC 


TATACTAGTA 


TAGACGTCTT 


3480 


GGCGGAGTTA 


GATAATGGAA 


TCCAAGTTAT 


CAT CG AAATT 


CAGGTTC AT C 


ATCAGAATTT 


3540 


TTTCATCAAT 


CGCCTATGGC 


CTTATCTGTG 


CAGTCAGGTT 


AATCAAAACC 


TAGAAAAAAT 


3600 


TCGCCAACGT 


G AAGGTGAT A 


CCCACCAGAG 


CTACAAACAA 


ATCGCACTAG 


TATACCCTAT 


3660 


CGCAATTGTC 


GATAGTAATT 


ACTTCTCAGA 


TGACCTAGCT 


TTTCAT AG TT 


TTATAGTAAA 


3720 


ATGAAATGAG 


AACAGGACAA 


ATCGATCAGG 


ACAGTCAAAT 


CGATTTCTAA 


CAATGTTTTA 


3780 


GAAGTATAGG 


TCTACTATTC 


TAGCTTCAAT 


CT ACT AG AAA 


TTCCATAGAT 


AGAAAACTAC 


3840 


ATAATCTCTA 


CAGATACGGA 


TGTTGGAGTT 


GATGTAAGAT 


GCTTTGGCTT 


GCTAGAGGAA 


3900 


TTGTGGATTG 


CCAAATTGTA 


TCATTGAAAT 


TATTGCTCAA 


ATTTGTTATG 


ATATAAATAT 


3960 


GAATAAAAGT 


AGACTAGGAC 


GTGGC AGACA 


CGGGAAAACG 


AGACATGTAT 


TATTGGCTTT 


4020 
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GATTGGTATT 


1 TTAGCAATTT 




ATTAGGCCCA 


TTTATTGCTT 


TTAAGATCTA 


4080 


CCAGCAAAAA 


Am**T w T v T'f % tkC^C 
l i 1 1 \j/\vj\- 


AAAAGATTGA 


ATCGCTCAAA 


AAAGAGAAAG 


ATGATCAATT 


4140 


GAGTGAGGGA 




AGCATTTTCG 


TCAGGGGCAA 


GCCGAAGTGA 


TTGCCTATTA 


4200 


TCCTCTCCAA 




TGATTTC CTC 


TGTTAGGGAG 


CTGATAAATC 


AAGATGTTAA 


4260 


GGACAAGCTA 




ACAATCTTGT 


TTTCTACTAT 


ACAGAGCAAG 


AAGAGTCAGG 


4320 


TTTAAAGGCiA 




GTAATGTGAC 


CAAACAAATC 


TATGATTTAG 


ttgcttttaa 


4380 


GATTGAAGAG 


iuAAAAGA 


CCAGTCTAGG 


AAAGGTTCAC 


TTAACAGAAG 


ATGGGCAACC 


4440 




GAC CAACTGT 


TTTCAGATGC 


TAGTAAGGCT 


AAGGAACAGC 


TGATAAAAGA 


4500 


GTTGACCTCC 


TTCATAGAGG 


ATAAAAAAAT 


AGAGCAAGAC 


CAGAGTGAGC 


AGAT T GT AAA 


4560 


AAACTTCTCT 


GAC CAAG ACT 


TGTCTGCATG 


GAATTTTGAT 


TACAAGGATA 


GTCAGATTAT 


4520 


CCTTTATCCA 


AGTCCTGTGG 


TTGAAAATTT 


AGAAGAGATA 


GCCTTGCCAG 


TATCTGCTTT 


4680 


CTTTGATGTT 


ATCCAATCTT 


CGTACTTACT 


CGAAAAAGAT 


GCGGCCTTGT 


ACCAATCTTA 


4740 


CTTTGATAAG 


AAACATCAAA 


AAGTTGTCGC 


TCTAACCTTT 


GATGATGGTC 


CAAATCCAGC 


4800 


AACGACCCCG 


CAGGTATTAG 


AGACCCTAGC 


TAAATATGAT 


ATTACAAGCG 


GGGT 


4854 



(2) INFORMATION FOR SEQ ID NO: 179: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2186 base pairs 
O) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


179 : 






TAAACAGGTG 


TTAGGTGCTC 


TAAACTATTA 


AAATTCTAAG 


GAAATAAGGC 


TACTTTTTCT 


60 


GGGTCTTCTT 


CATAGTAGGT 


GTGGTTCTTT 


TTTTCGAGTG 


TAGCCCATAG 


CTTTGAGCGC 


120 


AT AGTGGATG 


GTAGTTGGAT 


GACAGCCAAA 


TTCAGAAGCT 


ATTTCAGTCA 


AATAAGCAIC 


180 


TGGATTGTCA 


GTAAGATAGT 


TTTTAAGTCT 


ATCTCTATCA 


ACTTTTCTTG 


GTTTTGTTCC 


240 


TTTTACTTGG 


TGGTTTAGCT 


CTCCTGTTTT 


CTCTTTTAGC 


TTTAACCAGC 


CATAAATGGT 


300 


ATTACGTGAG 


ATTTGGAAAA 


CGTGTGATGC 


TTCTGTTATA 


CTACCTCTTC 


GCTCACAATA 


360 


AGAGAGAACT 


TTTTTACGAA 


AATCTATTGA 


AT ATGC CAT A 


AGAAGATTAT 


ACCACATTGT 


420 


GTACTATTTT 


TGGTTCATTT 


TACTATATTT 


CTAAACACTT 


AGAAATAATA 


AAACAAATTA 


480 


AATATTATTT 


CTAAATATTT 


GAAAATAACA 


TCTATTTGTA 


TTATACTATC 


TTTGAGGTAA 


540 


CTATTATGAA 


CTATATCAAA 


AGACCACATT 


ATTTAG ATTT 


TTTAAGAAAA 


CATCGTGACC 


600 
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GACCAATCAT 


CAAAGTTGTG 


ACTGGAGTTA 


GACGAGCTGG 


TAAATCTGTG 


CTTTTTCAAC 


660 


TCTATAAAGA 


GGAGTTACTA 


GCAACTGGGG 


TAGACGAGGA 


TCAGATTATA 


TTCATCAATT 


720 


TCGAAGATTT 


GAGTTACTAT 


GATCTGCGAC 


ATTTTCAAAC 


ATTATTCGCT 


TATATAAAAG 


780 


ATCAATTAGT 


TAG CAAG AAA 


ACATACTATA 


TCTTTTTAGA 


TGAAATTCAA 


TATGTTGAAA 


840 


AATTTGAACT 


GGTAGCAGAT 


AGTCTATTCA 


TCTTAGCAAA 


TGTAGACCTC 


TATTTGACTG 


900 


GATCTAACGC 


CTACTTTATG 


AGTAGCCAAT 


TAGCAACAAA 


CTTGACTGGT 


CGGTATGTTG 


960 


AGATAGAGGT 


TCTTCCTTTG 


TCATTTGAAG 


AATATCTATC 


AGGTCAATCT 


CTCACAGAGA 


1020 


ATCTGAATAC 


AACAGAAATT 


TTTAACAATT 


ATCTCTTTAG 


TGCTTTCCCT 


TACTTATTGC 


1080 


AAACATCATC 


TTACGATGAA 


AAAATTGACT 


ATCTCAGAGG 


AATATATAAC 


TCCATACTGT 


1140 


TAAATGATAT 


TGTCACTAGA 


TTGGGAAAAC 


CAAATCCTAC 


TATTATTGAG 


CGCATTGTCC 


1200 


GAACCCTTCT 


CAGTAGTACA 


GGTAGCTTAA 


TATCAACAAA 


TAAGATTCGC 


AATACCCTAG 


1260 


TCAGCCAAAA 


TGTTTCAATA 


TCCCATAATA 


CTTTGGAAAA 


TTATTTGACA 


ACTTTGACAG 


1320 


ATAGTTTACT 


TTTTTATTCC 


GTTCCACGTT 


TTGATGTAAA 


AGGTAGAGCA 


TTATTGCAAC 


1380 


GTTTAGAAAA 


ATATTATCCC 


GTTGATTTAG 


GTTTACGACA 


TCTCTTATTA 


CCAGACCAGA 


1440 


AAGAAGACAT 


TAGGCATATC 


TTGGAAAATA 


TGGTATATTT 


GGAATTGAGA 


CGTAGATATT 


1500 


CACAAGTATA 


TGTTGGTAAT 


TTAGATAAGT 


ATGAGGTTGA 


TTTTGTTGTT 


GTAACTGATC 


1560 


TTGGCCACTA 


CGCTTATTAT 


CAGGTCAGTG 


AAACAACACT 


TGCTCCAGAA 


ACACTAGAAA 


1620 


GAGAACTTAG 


ACCACTAGAA 


GCCATTAAAG 


ATCAATTCCC 


TAAATATCTA 


TTAACAATGG 


1680 


ATACGATTCA 


GCCAACAGCC 


AATTACAATG 


GAATCGAGAA 


GAAAAGCATT 


ATAGATTGGT 


1740 


TACTAGAAAA 


ATAGATAAAT 


ATAAATCATA 


CAGCTAATTA 


GATTTGCAAC 


AGTCTGTTAT 


1800 


CAATGATTCT 


ACCCAAATCC 


TAACAAGATA 


TAGTGAATTT 




f\ i. t\ i rtn 1 


18 60 


GACACTTGAA 


AATAGAAATT 


GGGGATGAAA 


GGGGATCTAT 


AATTTCTGGA 


AGTACTATCA 


1920 


AAAATTAATA 


TCATAGTCTT 


ATTAGAGAAT 


AGCATCACCC 


ACTTTCTCAA 


ATAAGATTAA 


1980 


ATTGTAACTG 


AATTATAATG 


AAAAAGAGAC 


TGAGCAATCA 


GTCTTTAAAA 


TCAGAAAAGC 


2040 


GCATAGTATC 


AGGTATTGAA 


CAACCTTGAT 


AATATGCGTT 


TTATTATGGA 


AATATTTGCT 


2100 


TCATTTTCTC 


CTGAAATAGA 


GCTTTTGCTA 


TCCTATTTTT 


CTCTATTTCT 


AATGATTTAC 


2160 


TTCAACTTCT 


TACCTCTTGG 


GAAAAA 








2186 


(2) INFORMATION FOR SEQ ID NO: 180: 









(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 3236 base pairs 
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(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


180: 






GTCACACGTT 


TGACTTCACG 


TATTTCATAA 


GTATAAACTT 


TATI'TITATC 


GGTTAGATAA 


60 


ATCTTCATGC 


CATTTTTAGC 


ATTATCTAAA 


GGAGAAAATA 


ACATTTTATT 


AGCATTATCA 


120 


ACACCAAAGA 


TATGGTGACT 


AGCTAGACTA 


TAATTTCCTT 


CTCCCATTAC 


TTGCTCGCGT 


180 


TTCATTGTAC 


CAGCTCCGTA 


GAAGAGATTA 


ACATTATCAA 


GTCCTTTAAA 


AATCGGCAAA 


240 


TTCATTTCCA 


ATTCAGGAAT 


TGCAATTCCC 


CCAATAACTG 


GTAATTTTTG 


AGCATCCCAT 


300 


TGAGAAGTTA 


GAACAGCTTC 


CGAAGAGATA 


GCTTTGACAG 


AATCAAAGTC 


AAAATTCCCT 


360 


TCTGTATCCT 


GATTTTCTTC 


TAATTTTTCT 


TTTGATACCT 


GGCTAACTTG 


ATACTTATTG 


420 


GTATTCCAGA 


CTATGAAAAT 


ATTTCGAATT 


TGAGTATTAA 


AAATCAAAGC 


CAGTGACAGT 


480 


AATATCAGAA 


ATCCTGCTAG 


GATATTTGTC 


AGCAGATTTT 


TTCGCTTGTT 


TTTCTTTTTA 


540 


TTATTTTTTT 


GAGACATTAT 


GCTTCACCTT 


CTGTTT CGTT 


TTCTGTCCCA 


ACTTCTTCTT 


600 


TTTCTGCCAC 


CGCAACCGTT 


GTGAAAGTCA 


CTATCTGAGC 


ATCTTGATCC 


AGGCGC ATT A 


660 


CTTTAACTCC 


CATAGTTGCA 


CGTCCTGTTT 


GTGAAATATT 


GGCAAGATTG 


GTTCGAATCA 


720 


TGACACCTGT 


ATCAGTGATA 


ATCATCAAAT 


CCTCATCCCC 


TTGAACAGTC 


ATAAGACCGG 


780 


CCAGCAAGCC 


ATTTTTTTCG 


GTAATTTTAG 


CTGTCTG C AT 


TCCCTTACCA 


CCACGACCTT 


840 


TTGTTGGGTA 


TTCAGTAGCG 


ACTGTACGCT 


TACCATATCC 


TTTTTCTGTG 


ATAATAAGAA 


900 


CCTCATCTTG 


ATCAGTAATC 


AAGCTGG C AC 


CAACAACTGT 


GTCTCCTTCA 


CGAAGGTTAA 


960 


CACCTTTCAC 


ACCAGTGGCG 


ATACGGCTCA 


TACCACGAAC 


GGCTGATTGA 


TTAAAGCGAA 


1020 


CTGCATAACC 


AAACTTGGTA 


CCAATGATAA 


TATCCATATC 


TCCTTCTGCC 


AACAAGACAT 


1080 


TGATTAACTC 


ATCTTCATCC 


TTTAAATTCA 


GCGCTTTGAG 


ACCATTTTGA 


CGAATATTGG 


1140 


CAAACTCCTT 


AACACTGGTT 


CTCTTCACAA 


TACCGTGACG 


GGTTGTAAAG 


AAGAGATAAG 


1200 


CAT CATC ACT 


GCGATCAGAC 


TCAACATTGA 


TAACCGTCTG 


AAT ACTTT CG 


TCTTCATCCA 


1260 


ATTTCAAGAG 


ATTGACTACT 


GGTAGCCCTT 


TGGCAGTCCG 


AC CAT ACT C A 


GGAATTTCAT 


1320 


AACCTTTAAG 


ACGATAGACA 


CGTCCCTTGT 


TTGTGAAGAA 


GACCACATGA 


TCATGGGTGC 


1380 


TAGTTGACAC 


TAACTCACGA 


ACAAAGTCAT 


CATCTTTCAC 


TCCCGTTCCT 


TGGACACCAC 


1440 


GACCCCCACG 


TTTTTGAGCA 


GTGAACTCGT 


CCTGATCCAA 


ACGCTTAATG 


TAGCCTCTGT 


1500 


TAGAAAGGGT 


AATCAAGACA 


TCCGATTCTT 


CAATCAAGTC 


CTCATCCTCG 


AGACTCAAGA 


156.0 
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CCTGTCCAAT CATCAACTCT GTACGGCGCT TATCAGAAAA TTTACGTTTA ACTTCATCCA 162 0 

ATTCGTCTTT GATAATTTGA GAAACACCTT CAGGCTTAGC AAGAATATCT GCTAAATCCG 1680 

CAATCAGAGC CAAGAGGTCA TCATACTCAG ATTGAATCTT ATCGCGTTCC AAACCTGTCA 1740 

AACGACGAAG ACGCATATCA AGCATAGCTT GACTTTGACG TTCAGAAAGC TTAAACTTGC 1800 

TCATCAACTC AGCTTGAGcT TCCGCATcCG tTTCACTACC ACGGATGATA CGAATCAyTC i860 

GTCGATATGG TCTAGCCCAA TCAAGAGACC TTCTAAGATA TGAGCGCGCG CTTCCGCTTT 1920 

TTCCTTATCA AAACGTGTAC GACGAACAAC CACTTCTTTT TCGTGCTCCA TATAAGCATC 1980 

CAAAATCTGA CGAAGAGACA AAATTTTCGG TATACCATTT TGGATAGCGA GCATATTGAA 2040 

AC CAAAATTG GTTTGCATTT GGCTCATTTT GAAGAGGTTA TTGAGAATAA CATTGGCTGA 2100 

GGCGTCGCGC TTGACTTCAA TAACAAATCG AACACCTTCA CGGTTTGACT CATCACGTAC 2160 

TGCTGTGATA CCCTCAATGC GTTTTTCCTG AACCAAGCGA ACAATATGCT CATGCACCTT 2 220 

GGTTTTATTG ACCATGTAAG GAAATTCTGT T ACAAC G AT A CGCTCACGAC CAGTCTTAGT 22 80 

CGTTTCAATC TCTGTACGAG AACGTAGGAC AATCGAACCT TTACCTGTTT CATAAGCCTT 234 0 

ATGGATACCT GATTTCCCCA TGACAAGAGC ACCAGTTGGA AAATCTGCTC CAGGCAAGAC 24 00 

TTCCATCAAG TCCTTGGTAG TCACTTCAGG ATTATCCATG ACCAACTTCA CTGCATCAAT 24 60 

gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA TACCAGTTGC 2520 

TCCATTAACC AAAAGGTTTG GAAAACGCGC TGGCAAGACC AAGGGTTCCC GTTCATTGGC 2580 

ATCATAGTTA TCAACGAAAT CAACTGT AT T TTTGTTGATA TCACGAAGCA TTTCCAGAGC 2 64 0 

AATCTTGCTC ATACGTGCCT CGGTAT AACG TTGAGCGGCA GCACTATCTC CATCCATGCA 2700 

ACCAAAATTC CCATGACCAT CTACAAGCAT GTAACGGTAG CTCCACCATT GAGCCATACG 27 60 

GACCATGGCT TCATAAATAG AGGAATCCCC GTGTGGGTGA TATTTACCCA TGACATCCCC 2 820 

TGTAATACGA GCAGATTTTT TATGGGGTTT GTCTGGGGTC ACACCCAATT CATTCATTCC 2 880 

GTAGACAATG CGACGGTGAA CAGGTTTTAA GCCATCTCGA ACATCAGGAA GAGCTCGCGC 2 94 0 

TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC ATCTCCTTTG TCAGATTGAC 3000 

ATTCACTAAA TTTTTATCCT GCATTAATAA ATGCCTCATT TCACAATTAG TAAGTAACAA 3 060 

CATTATACCA TAAATTCCCA TCTATTTCAG CCTCTAAACC ACTAAAACGT TTACATCGAG 3120 

AACTATAAGG CATATTCGTG ACAAAGTTTT TTAAAAGTGA TAG AAT GAAG TTGTCTAGGG 3180 

AAAACCCCTA ATAGAATAAG G AGATGGTT A nACAATCACT CTGACTAACA CACAAA 323 6 
(2) INFORMATION FOR SEQ ID NO: 181: 
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(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 8651 base pairs 
f B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


131 : 






AGGTCCTGAA 


> t» TATTGGAAC 


AGGAAGGTCA AGAGTTTTTG 


I GAACATTTCA 


AAAAACTCTT 


60 


GGAGTCAGTT 


WlftbTAGTAG 


CCATCTCAGG 


TAGTCTGCCA 


GCTGGCCTTC 


CAGTTGATTA 


120 


CTATGCGAGC 


1 I L*CT i AGAAC 


TTGCTAATCA 


AGCTGGCAAG 


CATGTAGTCT 


TGGACTGCTC 


180 


AGGTGCAGCA 


CTTCAGGCTG 


TTCTTGAATC 


ACCCCATAAA 


CCAACAGTCA 


TCAAACCAAA 


240 


TAATGAAGAA 


TTGTCTCAGC 


TTCTTGGAAG 


AGAAGTTTCT 


GAGGATTTGG 


ATGAATTAAA 


300 


AGAAGTACTT 


LAAUAAClTT 


TGTTTGCAGG 


GATTGAATGG 


ATTATCGTTT 


CACTTGGTGC 


360 


CAACGGTACT 


1 1 iAjt-CAAAC 


ATGGTGACAC 


TTTCTACAAG 


GTAGATATTC 


CTAGAATTCA 


420 


GGTGGTAAAT 


i t» rrcGAT 


CTGGAG ACT C 


TACTGTGGCA 


GGAATTTCTT 


CAGGACTTCT 


480 


TCACAAAGAA 




AATTACTCAT 


CAAGGCAAAT 


GTCCTTGGTA 


TGCTCAATGC 


540 


TCAAGAAAAA 


A rQjACTGGTC 


ATGTCAACAT 


GGCCAACTAT 


CAAGCTCTAT 


ATGATCAATT 


600 


AATAGTAAAA 


tiAtrt*TATAAA 


ATGGCTTTAA 


CAGAACAAAA 


ACGTGTACGC 


TTAGAAAAAC 


660 


TTTCTGATGA 


& & A 1*f*f**T* x m/^ 
/Wv\ 1 L»Vj 1 A 1 1. 


ATCTCAGCTC 


TTGCATTTGA 


CCAACGTGGT 


GCTTTGAAAC 


720 


GCCTCATGGT 


1 AAAt. ACCAA 


ACAGAAGAAC 


CAACTGTGGC 


CCAAATGGAA 


GAACTTAAAG 


780 


TCTTGGTAGC 


AGATGAATTG 


ACTAAATATG 


CTTCATCTAT 


GCTTCTTGAC 


CCTGAGTATG 


840 


GACTTCCAGC 


AACTAAAGCT 


CTTGATGAAA 


AAGCTGGTCT 


TCTCCTTGCT 


TATGAAAAAA 


900 


CAGGTTATGA 


CACAACAAGC 


ACAAAACGCT 


TGCCAGACTG 


CTTGGATGTT 


TGGTCTGCAA 


960 


AACGTATTAA 


AGAAGAAGGT 


GCAGATGCAG 


TTAAATTCTT 


GCTTTACTAT 


GATGTAGATA 


1020 


GCTCAGACGA 


ACTCAATCAA 


GAAAAACAAG 


CCTACATCGA 


ACGCATCCGT 


TCTGAGTCTG 


1080 


TGGCTGAAGA 


TATCCCATTC 


TTCCTTGAAA 


TCCTTGCTTA 


CGATGAAAAA 


ATTGCGGATG 


1140 


CAGGTTCTGT 


AGAATACGCT 


AAAGTAAAAC 


CACACAAAGT 


TATCGGCGCT 


ATGAAAGTCT 


1200 


TTTCAGACCC 


AC GCTTTAAC 


ATTGATGTTT 


TGAAAGTTGA 


AGTTCCTGTT 


AACATTAAAT 


1260 


ATGTTGAAGc 


kTCGCTGAAG 


GTGAAGTAGT 


TTATACACGT 


GAAGAAGCAG 


CAGCCTTCTT 


1320 


CAAAGCGCAA 


GATGAAGCAA 


CGAACTTGCC 


AT AC AT CT AC 


TTGAGTGCTG 


GTGTATCAGC 


1380 


TAAACTCTTC 


CAAGATACTC 


TTGTATTTGC 


TCATGAATCA 


GGTGCGAACT 


TT AACGGAG T 


1440 


TCTTTGTGGC 


CGTGCTACAT 


GGGCAGGATC 


AGTTGAAGCT 


TACATCAAAG 


ATGGTGAAGC 


1500 
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AGCAGCTCGC CAATGGtCCC ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 1560 

TTCAAAGAAC AGCAACTTCA TGGAAAGAAC GCGTGTAAGA AAGTCCTCCT AGTTTAGGAA 1620 

CATGAATCTA AAAAAATTTA AAAAAAGTTG TATGTAAAGG CTTACAAAAT AACTTACTTG 1680 

TGCTATACTT AAATCACAAG TTAATATGAA TTAGAAAGTA ACTATATGAA GTATAATAAA 1740 

AAT AGGAT AT AGTTTATTTT ACGAGCTAGG AAGGAAAAAT ACGGAAACAA TATTGCCAGA 1800 

ATAAACTATA TTTAGATGCA CATTTCATTC ATTGTTTTAT AAAAGGAGAA GATAAACGCC I860 

TACTAAAAAG AGTTTTAAAG CGTTAGTTGT AGGACTAGGT ATTGTTTCAA TATTCTTATC 1920 

AGCCTTACCT ATGGTTACTG CTTCTGTATT TGCAGATAGT GCCCTAACTA CAGTAGATAA 1900 

AGCAAATGAT ATTGTTTTGA ATGTTGATGG GAATAAATTT TATAATGTTT CGGTTTCAGA 204 0 

AGATATTGTA AATGCTGGTC AAATTTTGGA AGATTATTTT TATGTAGATA AATTTGGAAA 2100 

TATAAATTTA AAAGGCACTC CTGAAGAGTT AGCAAAAAAT ATTGGTATTT CTGTACAAGA 216 0 

AGCAAGTTTG ATGTATGGAG CTGTAAAAGA GTTACCCAAC GTTTACGAAA GAG GTCCTGT 2220 

AGGTTTTCGT TTCAATCTTG GTCCTCAAGT GAGGGGGATG GGTGGCTGGG CTGCTGGAGC 2280 

TTTCGCTACT GGATATGCTG GATGGCATTT GAAACAATTT GCGGTTAATC CTGTTACATC 2 340 

TGGATTTGTT GCTGTAATAA GTGGTGCGAT TGGCTGGGCT GTAAAAACTG CTGTAGAAAA 2400 

TTATTGGACA GTTGCTGTAG CTACAGTAGA AGTGCCGTTT GTGAACCTTC TTTACACCAT 24 60 

AGATTTACCT TAGAGGTTAT TTCTTTATGA ATCATTCTTT TAAAAAAATA ACTGTATTTT 2 52 0 

GTTTTATACT TTCTTG TGTT CTTTGTTTAT TAGACTTAAT GAATTTTAAA AATGTAGCTA 2 580 

CTTTTTTATT TTTCTGTCTT CCTGTTTTTG TTTTGATTTA CAAAAATAAA TAAAAACAGA 2 64 0 

GCCTCTCTTT GATGAATTTT AGAACATAGT TAAGTTTTAA AAAAAGTTGT ATGTAAAGGT 2 700 

TTACAAAATA ACTTACTTGT GCTATACTTA AAT CACAAGT TAATACAAGG TGAGTGTTAC 2 7 60 

TAAGTAATAT TAGGCATGAT CACAGGTGAA TTAGAAATCA GCTGATTTTC TAGTTCATTT 282 0 

GTGGTCATTT TTTGTACTTA TATACCTTTA AGATATAAAA GGAGGTTGAC ATGTATCGAA ' 2880 

TTCTAAATCC AATG AATCA C AATGTCTCGC TTGTCAGAAA TGATAAGGGA GAAGAGGTGA 294 0 

TTGTAATTGG TAAGGGAATT GCATTCGGAA AGAAGAAGGG GGATTTGATT GCTGAAAATC 3000 

AGGTTGAGAA AATCTTTCGG ATGAAGACCG AAGAGTCCAG AGAAAACTTT ATGGCTCTTC 3060 

TCAAAGATGT TCCGCTTGAT TTTATCACAG TGACCTATGA AATCATTGAT AAGCTATCAA 3120 

AGAAATATCA TTATCCGATT CAAGAGTATC TCTATGTAAC CTTGACAGAT CATATTTACT ' 3180 

GTTCTTATCA AGCTCTAACT CAAGGAAGGT ACAAGGATAG T AATCTGC C A GATATTTCCG 3240 
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CTAAGTATCC TGTCGCTTTT CAAATCGCAA ATGAAGCTTT TGAAATTTAC CGTCAGAAGC 3 300 

TAGCAGATCA TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC ATTAATGCTG 3 3 60 

AAGGTGAAAA TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA ATTCTCAGGA 3420 
ATGTTGAAGA AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT AACCATTTCT 34 80 

ATGATCGCTT TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA TCTAGAGATG 3540 
ATAACCAATC ACTTCTGGAT ATGGAAGATC ATATTAAACA ATCCTATCCA AAAGCCTTCG 3 600 

AGATTGGTTC CAAGATCTAT GATGTGATTA CCCAACATAC GGGTCTTGAT TTCTATAAAA 3660 
GTGAACGAGT TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA TTTATTTAAA 3 7 20 

ACTATATAAG GAGAATTCTA TCATGAATAG AGAAGAAGTA ACATTGTTAG GTTTTCAAAT 3780 
CGTAGCCTAT GCTGGCGATC CTCGTTCAAA ACTATTCGAA GCCTTGAAGG CTGCTGAAGC 3840 
TGGTGATTTT GAAAAAGCGG ACGCTCTGGT AGAGGAAGCT GGTAGCTGTA TTGCAGAGGC 3900 
TCACCACGCC CAAACAAGTC TATTGACTAA GGAAGCTTCA GGTGAGGACT TGGCTTATAG 3960 

TGTAACCATG ATGCATGCCC AAGACCACTT AATGACAACT ATCTTGTTAA AAGATTTGAT 4020 

G CATC ATTTA ATTGAACTCT ACAAGAGAGG AGTTCAATAA TGAATAAACT AATTGCATTT 4 080 

ATCGAGAAAG GAAAGCCTTT CTTTGAAAAA CTATCTCGTA ATATCTATCT TCGTGCTATT 4140 

CGTGATGGTT TCATTGCAGC TATGCCTGTT ATTCTCTTCT CAAGTATCTT TATCTTGATT 4200 

GC CTTTG T AC CAAACTCATG GGGCTTTAAA TGGTCTGATG AAGTTGTAGC CTTTCTGATG 42 60 

AAACCTTATA GCTATTCTAT GGGTATTCTG GCTCTCTTGG TAGCTGGTAC AACAGCTAAG 4 3 20 

TCATTGACTG ACTCAGTAAA CCGGAGCATG GAAAAAACCA ATCAAATCAA GTATATGTCA 4 380 

ACATTGTTGG CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT CGAAAGTGGT 44 4 0 

CTAGCTACTG GATTCTTGGG GACAAAAGGT TTGCTTTCAG CCTTCCTTGC TGCCTTTGTT 4 500 

ACTGTAGCCA TCTATAAGGT TTGTGTTAAG AACAACGTCA CTATTCGTAT GC CTGACGAA 4560 

GTTCCACCAA ATATCTCACA AGTCTTTAAA GATGTGATTC CATTCACTCT ATCTGTTGTT 4 620 

TCTCTTTATG CTCTTGACTT ATTAGCACGT TATTTTGTTG CTTCTAGTCT GGCAGAATCA 4 6 80 

ATCGGTAAAT TCTTCGCACC ACTCTTCTCA GCACCAGACG GATACCTTGG TATTACCATT 474 0 

ATCTTTGGTG CCTTTGCCTT CTTCTGGTTT GTTGGGATTC ATGGT C CATC TATCGTTGAA 4 800 

CCAGCTATCG CAGCTATTAC CTATGCCAAT GCCGAAGTTA ACTTGAACCT TCTCCAACAA 4860 

GGGATGCATG CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTT AT CGT TACCATGGGT 4920 

GGT ACAGGT G CGACATTGGT CGTTCCATTT ATGTTCATGT GGTTGACAAA ATCGAAACGT 4 980 

AACCGTGCAA TCGGACGTGC TTCAGTAGTT CCTACCTTCT TCGGTGTAAA TGAACCAATC 5040 
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TTGTTTGGTG CACCTCTTGT TTTGAATCCA ATCTTCTTCA TTCCATTTAT CTTTGCTCCA 5100 

ATTGCAAACG TATGGATTTT CAAATTCTTT ATTGAAACTC TTGGAATGAA CTCATTCACT 5160 

GCTAATCTAC CATGGACAAC TCCAGCTCCA CTAGGTCTAG TTCTTGGAAC TAACTTCCAA 5 2 20 

CTGCTATCAT TCATTCTTGC TGCCCTTCTA ATCGTGGTTG ACGTTGT CAT TTACTATCCA 52 80 

TTC CTT AAGG TCTATGATGA ACAAATTCTT GAAGAAGAAC GTTCAGGTAA GTCTAATGAT 53 40 

GAATTGAAAG AAAAAGTTGC TGCAAACTTC AACACTGCAA AAGCGGATGC TATTCTTGAA 54 00 

AAAGCGGGTG TCGATGCAGC ACAAAATACC ATCACTGAAC AAACAAATGT CCTCGTTCTC 54 60 

TGTGCAGGTG GAGGAACAAG TGGTCTCCTT GCAAATGCTT TGAATAAGGC AGCAGCAGAA 552 0 

TACAATGTCC CTGTGAAAGC AGCAGCAGGC GCCTATGGTC CTCACCGTGA AATGTTACCA S580 

GAGTTTGATC TTGTTATCCT TGCCCCTCAA GTTGCTTCAA ACTTTGAAGA TATGAAAGCA 564 0 

GAAACAGATA AGCTCGGTAT TAAACTAGCG AAAACAGAAG GCGCTCAATA CATCAAATTA 5700 

ACTCGTGATG GAAAAGGTGC TCTTGCATTC GTACAAGCGC AATTCGATTA AGGCTAGAGA 5760 

CTCTGAAATA GTCTCCCATC GTTACGGAAA TCGCTATGGC GAATTTCCTA TTATTAATTC 5820 

GTCGGTAAAA AGATATCGTT TTTACCTCCT CATGTCACAA TTCGGTGACT TGGTACAAGA 5880 

AGTGAGATGG AG AAGGATG G CTCACTGACT CCTCTCCTCT CACTTTTACT TTATTTAAAT 594 0 

CAAGAAATAG GTGAAAAAAA TGACAAAAAC ACTTCCAAAA GACTTTATTT TTGGTGGCGC 6000 

AACAGCTGCT TATCAAGCAG AAGGTGCTAC ACATACTGAT GG AAAAGGAC C AGTTGCT TG 60 60 

GGATAAATAT CTTGAGGATA ACTACTGGTA CACTGCCGAA CCAGCTAGTG ATTTTTACAA 6120 

TCGATATCCA GTTGACCTCA AGCTAGCAGA AG AGT AT GGT GTCAATGGTA TTCGAATTTC 6180 

TATTGCTTGG TCACGTATTT TCCCGACTGG TTACGGCCAA GTAAATGCTA AAGGTGTTGA 62 4 0 

GTTTTATCAT AATTTATTTG CAGAGTGTCA CAAACGTCAT GTTGAGCCTT TTGTAACTCT 6 300 

TCATCACTTT GACACGCCAG AAGCTCTCCA CTCAAATGGA GACTTCTTAA ACCGTGAAAA 63 60 

TATCGAACAT TTTGTAGACT ACGCTGCCTT CTGTTTTGAA GAATTTCCAG AAGTAAACTA 64 2 0 

TTGGACAACC TTTAATGAAA TTGGACCAAT CGGTGATGGT CAATATTTGG TTGGGAAATT 6480 

CCCTCCAGGT ATCCAGTACG ACCTTGCCAA AGTCTTTCAA TCACACCACA ATATGATGGT 654 0 

GTCTCATGCA CGCGCGGTAA AATTGTACAA AGAGAAAGGC TATAAAGGGG AAATTGGTGT 6600 

TGTTCACGCC CTGCCAACTA AATATCCTCT AGATCCTGAA AATCCAGCAG ATGTTCGTGC 6660 

AGCTGAGTTG GAAGATATCA TCCACAATAA ATTCATCTTA GACGCAACTT ATCTAGGTCG 672 0 

CTATTCAGCT GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTGGTAGTTT 6780 
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GGATCTTCGT 


GAAGAAGATT 


TTACAGCATT 


AGAAGCTGCA 


AAAGACTTGA 


ATGATTTCCT 


6840 


AGGAATCAAC 


TACTATATGA 


GTGACTGGAT 


GGAAGCCTTT 


GATGGAGAAA 


CTGAAATTAT 


6900 


CCATAATGGT 


AAAGGTGAAA 


AAGGAAGCTC 


TAAGTATCAA 


AT C AAAGGTG 


TTGGTCGTCG 


6960 


TGTAGCTCCT 


GACTATGTAC 


CACGCACGGA 


TTGGGATTGG 


ATTATCTACC 


CTCAAGGTTT 


7020 


GTATGACCAA 


ATCATGCGTG 


TGAAGAAAGA 


TTATCCTAAC 


TACAAGAAGA 


TTTACATCAC 


7080 


TGAAAATGGT 


CTCGGCTATA 


AAGATGAGTT 


CGTTGATAAC 


ACTGTTTACG 


AT G ATGGTCG 


7140 


TATTGATTAC 


GTGAAGCAAC 


ACTTGGAGGT 


TTTATCTGAT 


GCGATTGCAG 


ATGGAGCTAA 


7200 


TGTAAAAGGT 


TACTTCATTT 


CGTCATTAAT 


GGATGTCTTC 


TCATGGTCAA 


ACGGTTATGA 


7260 


GAAACGTTAT 


GGTCTCTTCT 


ACGTAGATTT 


TGAAACTCAA 


GAACGTTATC 


CTAAGAAATC 


7320 


AGCTCACTGG 


TACAAGAAAG 


TAGCGGAAAC 


TCAGATTATA 


GACTAGTAGA 


ATTAGTCATT 


7380 


AGATATAGAA 


TTTTAGTGAG 


TCAAAAAGAT 


GTTCAAAGAT 


TTTATCCAAT 


CTATTTATGA 


7440 


AAAAAAGTTT 


AT ATT ATAAA 


TTTCGAAAAA 


TGCTCTCAAA 


TACCGTGTTT 


GACGAGTGAA 


7500 


GAATTGAAAA 


GTCTTGGAAA 


ATGGTATGTC 


TCGACTGGTA 


AAGAATGGAT 


TTGTCATTCA 


7560 


GATGATGAGC 


TGGAAGAATT 


TAAAAATCTA 


TTTTTAAATT 


TTATCAATCC 


TGAAGAATGG 


7620 


GATACTATCT 


CCTTTGATTC 


AGATTTTATG 


CCGTTTCAAC 


AATCGTAACC 


AATTTCTCAA 


7680 


AAAAGTTAAA 


TCTTATATTT 


AGTACTCTGT 


AAAACTCTTA 


TCTAATCACG 


TTGCTTATAC 


7740 


TCAATGAAAA 


TCAAAGAGCA 


ACTTTAAACT 


AGGAAGCGAG 


TCGCAGATTT 


CTCAATGCAT 


7800 


AG CTTTG AGG 


AATTGGGCAA 


AAAGTCTTTG 


ATATAGAAAA 


ACGCATAGTA 


TCAGGTGTTT 


7860 


CAACACCTGA 


TACTATGCGT 


TTTATTGTGG 


GAAGATTTAC 


TTTTTTTCTT 


CTGAAATTGA 


7920 


GTTGTTACCC 


AGGCTCTTTC 


AGTTTATTAA 


GGCTTGATGA 


CTTTAATGTG 


TTTAGATAGC 


7980 


TTAAAAAGGA 


TTGAATCACT 


TAGTTTAGAA 


TCTGAAACAA 


TAGTATCAAG 


ATTTGATACA 


8040 


TTATAAAAAG 


T AT AAAAATC 


AAACTTATTG 


AACTTGCTAT 


G ATCTGC GAG 


TAAATATTTT 


8100 


TTATTAGAAT 


TATTTAAAGC 


GATGCGTTGA 


GCCTCTCCCT 


CTTCCTCGCT 


AAAAGTAGCT 


8160 


AGAGCTCCGT 


TTTGAATACC 


ATTACAGCTA 


ACGAAAG CTT 


TAGAAAATTG 


GAGATTAGAG 


8220 


AGATTTTGTA 


GGGTCAATGT 


ACCAACAAAA 


GCACCTGTAA 


TATCGCGATA 


ATTTCCACCT 


6280 


ATTAAAATCA 


AATCTGTTAA 


TTTTCGTTCG 


CTTAAAATCA 


GAAAAACAGG 


TAGACTGTTG 


8340 


GTTACGACGC 


GGATATTGTC 


AATAGGCAAC 


TCACGCGCAA 


AAAACTCTAA 


TGTTGTTCCT 


8400 


GGTCCAATGA 


AAATAGTTTC 


TCTTTCTTCT 


ACTAGACTGC 


CTGCAAAATG 


GGCTATTTCT 


8460 


TGTTTTTCTG 


CCGTTTGGAG 


GGCTTGTTTT 


TCAATATTTG 


ATCGCTCATT 


AGTCAAAAGG 


8520 


GAGTTGGTTC 


GAAGTTTTTC 


AGCTCCACCA 


TGCACACGAA 


TCAGCAAATC 


TTTATCAGCT 


8580 
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AATTCCTCTA AATAGCCCCT TGCAGTCATA TCTGAAACGG CTATTTCGTC CATAATCTGT 8640 
TTAACTGTTA T 8g51 
(2) INFORMATION FOR SEQ ID NO: 182: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3786 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

AATCTCCAAT CAGTGCCACT TCAGCTACAA AGAAGAGGAG GATAATAACT CCGTTCACAA 60 

GGACAGACAA GAATAATTGA TAGAAGGAGT CGCTTTCACT TGCTTGACTT GGTCTTGTAA 120 

TGATwTGGAG ACTGGCAAGC AGAATG ATT C CAATGCTAAT CACACACAAG AGGCCTGTAA 180 

ATCGTAGGCT ATCAAAGAAA GCAAAGAAAC TAGCAATAGC AGTGAGGAmG ATTGGAATTG 24 0 

CCAAGAGTTG ACTATATTGT TGGAGAACCT TGTCTAGCGT CCAGTCCTTT TCCTGGTGGA 3 00 

TAAATCGTCT CACAACGAAA CTACCCAAGA GGAATGAAAA GAAGAAGAGT GTTGTCGCTA 3 60 

CTAGGATAGA GATGATAGAA AAAAGAGTTA AAGGAGCTAG CTGCTCAGGG AAGCGACTGT 420 

TAATGCTTGC TATATGTCCA TAGTAAGCAT GTTTGATGTG ATAGATACTA AAGAAAAAGG 480 

AAGATGCAGA AAACAGAATG AGCAAGAGAA AGGCTGTGTA ACTGTGTGTG ATACTTGTTT 54 0 

CCAACTTACT TGTAGGAGAT T TG ATCGCTT CCACTAGCCA AGACCAAAAA TCAAGCACTT 600 

GCTCTTTCCA TTTATCCCTA GATTTTGGAG CTTGGTCGGG GATATAAGGA CTTTCTAAAG 660 

ATTTACTGAT AAGAAGTGGC TCTTTCGTGG TTGCTTTTTG CTGAGGAAGA GCTTCTTGGC 720 

TCTCTTCAGC TATAGTGACT TTTTCTGTTT CTTTAGAAAG GTCTGCCTCT TCTTCAGTAG 780 

AATTAGATGC CTTCTTTTCT TCTATTTCTG TTCTCGCTTC ACTGTCTTCA GGAGCTTCAA 840 

TTTTCTCTTC TTGCTGGCTT TCCAATTCGA CTTCAGCTTG AGGGACTTCC TCCTCTAACT 900 

GAGTATTTTT TTCAATTGGT GTATCGAGAT CGGCTATCGT TTCTTCAGCC TTGTCTGCAA 96 0 

CCTCTTGAGC TTGCTCTTCA GGCTTGTTCT TGCTTGTTGT TTTTACAAAA TCATTACTTT 1020 

CAAACCATTC TTGTTTCATG GTAGAACCTC CTTTTTAGTT AGATAAATAT GTTTC CAT AG 1080 

TAG CAAATGT AAGCGTTTTT GTCAACGTCT GCTTGGTGTG GATATTAGAT CAATATTATC 1140 

AT C AG ATCT C GCAATGAGTT GATCCTTGAC ATCGGTTTTT TCAGTTTTGT AAGGGTTGCT 1200 

TAATTCCGTA CCTCTTGATT CAGGCTTTTC TCTTGTGAAT TGGAAGATAG AACCATAGTT 1260 
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GCTTGAGATG TCCCAGTTAA TTCGTTGGCT 


, TTCTTTCTGG 


TCTAGGATGA 


, TTCTGAGATA 


1320 


ATCTTTGGCA GTCAGTTCAA 


> 1 1\iV.LA1(j 


: GACTTGGATA 


TTTTCAGCGT 


GGAAGTGATT 


1380 


CTGT*WS A C 




I A l \» rtiTAAG 


AACTGTATCA 


AAGATATTAA 


CGATATTGGG 


1440 




1 i Al- iui 1 I 1 


I vjA rACGACT 


TCCTTCAATT 


CGGAGGATAT 


AGCTGTTTGT 


1500 


a r v r m & cinnTT* 


OCATVi i CAA 


GGCT AGC AT T 


TATGATGGTG 


GTTTGTCCGC 


GATTGGCTGA 


1560 


\»«» t v> 1 lun 1 


CCTTTTAGAG 


TTCTCCCTTT 


TGGTAGTCGG 


AGAATAACTT 


CTTCAAAACG 


1620 


ACTAGAGTAn 




TATGAAGAAT 


CCCACCAATT 


CCAGAAGAGA 


GAAACGGAGT 


1680 




1 1 (_ 1 1 A 1 v_ A\j 


1 uAuACTCAG 


AGTTCTATCG 


TTCTGATTGG 


TGATAAGATC 


1740 






GATGGTAAGA 


AATGTGGATT 


TGATCATCGA 


AAGAGTCTGT 


1800 


GATGGTGAnr 1 




GGAGAGTAAT 


TTCTAGGTTT 


TCGACTTCCT 


TGCCAAAGCT 


1860 


TAGCTTTTfT* 

4 *\ vJ ^— I 1 * 4 l« Sw 


u i At_\jOi_ i A I 


CATAGACAGG 


TTCTTTGGAC 


ATGGAAAGTA 


GGCTCTTAAC 


1920 




i 1 ACL. 1 A 


CAAAAAGCAG 


GATAAAGCCG 


ATAACGGTAG 


TCACCACACC 


1980 


AAAGATGAfIA 


AATCCTTTTG 


1LLAI 1 IALCj 


C ATGCTG AT T 


ACCTCTCTTT 


CCTTTTTTAA 


2040 


GAACAAATTG 




A(_AATGAG T A 


GACCGAAGAA 


GCGAGTTGCA 


TAGGAAATGC 


2100 


rAAGTl A A If* 
LnAy i /\>VAAl_ 


I AuLuAAbAA 


GCACCGATAG 


CCAGTAAACC 


AGAACCAAAA 


ATCAAGATAA 


2160 




GGCTTGGGCG 


AGGACAGTGA 


AACTTTCAAC 


TAAAAATAGG 


AATCCGCCGA 


2220 




1 A 1 uuAAAL I 


GCAAAGAAAG 


CCAGAATGAC 


AGTCAAAGCG 


GCTACAAGAA 


2280 


TTGC GAAP A fi 


VJVj 1 LALLiAWj 


ATGO CGATT C 


CCAGAGGAAT 


GCCGATAGGT 


GCTGCAAGGA 


2340 


GGGCTAAr AA 


vKjv.uA 1 A AAj i 


AAAAA I IajTC 


GGTTATTTTT 


TTGAGCGGGT 


GCTTCATTGA 


2400 




uAljAAuA 1 iu 


OATAGAACTT 


CGTGGGCCGC 


TTCTTTGGGA 


GTTCCCAAAC 


2460 


TAGCCATflAn 


TTCT TCTT CT 


CCTTCGACTC 


CAGCATCGTC 


AAAGAGCTCT 


CTGAAATAGT 


2520 


CCATGGCTTC 


GATACGGTCA 


GL1TCAGGTA 


GTTTCTTGAG 


ATAGAGTTCT 


AGCTGAGTCA 


2580 


GGTATTCAGT 


TCTTGTCATG 


GCGGATACTC 


CCTTCTATGA 


TGCCATTGAT 


GGTGTCT GT A 


2640 


TAGAGTGCCC 


ATTCATCTTT 


TAGGCTCAAG 


AGCTGCTCTA 


TACCACCGTT 


TGTCAAGGAG 


2700 


TAGTATTTGC 


GCATGCGACC 


TTGGAACTCT 


CTAGAATAGG 


TTGTCAGAAA 


GCTATTGCCT 


2760 


TCCAATTTTT 


TGAGAATGGG 


ATAGAGTGTG 


GATTCTTTGA 


TATTAGCGAT 


CAGCTTAATG 


2820 


GTTTGGCTAA 


TCTCATAACC 


ATAAGAATCA 


CCCTGCTCCA 


GTACAGCCAA 


GATGAGAAAT 


2880 


TCAATCAAGG 


CAGAGGATGT 


TGGAAAGTAC 


ATGGGAAACC 


TCCTTTTCTA 


ATGTGTAAGA 


2940 


TTTTTATATA 


TAATTTTTCT 


ACACATACAT 


TGTACATCTA 


AAAGAAAGCC 


CTGTCAAGAG 


3000 


AAATGTGTAA 


AATTTTTATA 


TATAAAAAAC 


TTCTAGCTAA 


AACTAGAAGT 


TTAAAGGATC 


3060 
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TTATCCGCTC TCTCCACTGT AAAGAGGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120 

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180 

CCAAAGGCTG CTAGGATATA GCCTCCTGTT TG AG AT AGGC CGGACAATTG GGCTGTCTTT 324 0 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGAGAT AAGGGAAGAG GGCACTGGTT 3 300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 3 3 60 

AGCATGGAAA TGCCGACCAC ACCAGCTAGT GAAACCAGAG TGAGCATGAG CTGACGGTTC 34 2 0 

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGGAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGGTCATGAC GGTOTAAAAG ATCAAGCATT GAAAACCTGA AAAGATAATA 3600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TTTTTTGTTT GGTTTGTGGA 3660 

GCTAGTCTAT GATTATAGCG GTGATTTGGG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 3720 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTCTAATGGG CACAGCTAGA 3 7 80 

TAGGAA 37 86 



(2) INFORMATION FOR SEQ ID NO: 18 3: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3054 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
TCAGCTAAAA AACATTGCTA AATTGATTGA AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GAT AC AAAAG GACCTGAAAT CCGTACAGAA 180. 

TTGTTCGAAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGCA 24 0 

ACT AAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATG ATGATGTTGA AGTTGGTCGT CAAGTTTTGG TTGACGATGG TAAACTTGGT 3 60 

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATCG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 4 80 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AACAAGGTAT CAACTTCATC 54 0 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600 
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ACTGGAAACG 


GACATGTTCA 


ATTGTTCGCT 


AAAATCGAAA 


ACCAACAAGG 


TATCGATAAC 


660 


TTAGATGAAA 


TCATCGAAGC 


AGCTGATGGT 


ATTATGATTG 


CTCGTGGTGA 


TATGGGTATC 


720 


GAAGTACCGT 


TCGAAATGGT 


TCCACTTTAT 


CAAAAAATGA 


TTATCAAGAA 


AGTCAATGCT 


780 


GCAGGTAAAG 


TTGTTATCAC 


TGCAACAAAC 


ATGCTTGAAA 


CAATGACTGA 


AAAACCACGT 


S40 


GCAACTCGTT 


CAGAAGTATC 


AGATGTATTC 


AACGCTGTTA 


TCGACGGAAC 


TGACGCTACA 


900 


ATGTTGTCAG 


GCGAGTCTGC 


AAACGGTAAA 


TACCCACTCG 


AGTCAGTAAC 


TACAATGGCT 


960 


ACAATCGACA 


AGAACGCTCA 


AGCTCTTCTT 


AATGAATACG 


GACCTCTTGA 


TTCAGATTCA 


1020 


TTTGAGCGTA 


ACTCTAAGAC 


AGAAGTAATG 


GCTTCTGCTG 


TTAAAGATGC 


TACTAGCTCA 


1080 


ATGGATATCA 


AATTGGTTGT 


AACTCTTACT 


AAGACAGGTC 


ATACTGCACG 


TTTGATTTCT 


1140 


AAATACCGTC 


CAAATGCTGA 


CATCTTAGCA 


TTGACATTTG 


ACGAATTGAC 


ACAACGTGGC 


1200 


TTGATGTTGA 


ACTGGGGTGT 


TATCCCAATG 


TTGACAGATG 


CTCCATCTTC 


AACTGACGAT 


1260 


ATGTTCGAAA 


TCGCTGAACG 


TAAAGCGGTA 


GAAGCAGGTC 


TCGTTGAGTC 


AGGCGATGAT 


1320 


ATCGTTATCG 


TTGCTGGTGT 


GCCAGTAGGA 


GAAGCTGTTC 


GCACAAACAC 


AATGCGTATC 


1380 


CGCACAGTAC 


GTTAAGAAAA 


AT AT AAAAAC 


CTATCATATC 


CAGCTTTAGA 


GCTTGTGTGA 


1440 


TAGGCTTTTT 


GTATAGAGGG 


TAAGAAATAG 


GCAAAACTTT 


CATAATGGAT 


TGATACTCTT 


1500 


CGAAAATCTC 


TTCAAACCAC 


GTCAGCGTCG 


CCTTACCGTA 


TATATGTTAC 


TgACTTCGTC 


1560 


AGTTCTATCT 


ACAACCTCAA 


AGCAGTGCTT 


TGAGCAACtG 


CGGCTAGCTT 


CCTAGTTTGC 


1620 


TCTTTGATTT 


TCATTGAGTA 


TGAAATAAGA 


TATGCACAAA 


TTGATTAGAA 


AGTCAAATGA 


1680 


ATTTCTACAA 


ATGTTTT AG C 


AATCGTAATG 


TACTTGTCTA 


GATTCGATCT 


GATATATTTT 


1740 


CGATTTAATG 


ATATGGTATT 


TAAAACCTCC 


AAAGTAGCTT 


ACTCCATTCT 


TTTACTTACG 


■ 1800 


TGAGTGTAGA 


TGTTATTTAC 


TGTTTTAGCG 


TTTTTGTGTT 


CCACTCTAAC 


CATTATAGCA 


1860 


TTCTTCTCAG 


CTAGTGTACT 


AAGGAGTGTG 


TGCCTGAAAA 


TATGGGAACT 


AAGGGGCTGG 


1920 


TTTATCGGTT 


TCTCTAGTTT 


AGTATTTGCC 


TTTTG CAAAG 


TGATCTTAAA 


TGCCTTTCl'C 


1980 


TAAATTTACA 


TAT C ACT ATT 


GTTTAACAAA 


ATCTAATCTA 


TTTTAGGTCA 


CTTATTCTTT 


2040 


TTTTGAAATG 


TAGAATGAAC 


TTTTTCAAAG 


TTTTTCGAAT 


CTTTTAAAAT 


CTGTTTGCTT 


2100 


TATATCGCCA 


TTCTCCCCCC 


TTTTTTAATT 


CTCCCTATAT 


AGC CTGAC AG 


CTTTCCCGAT 


2160 


GGTACGAATA 


TGGTTGCTTT 


CGTCTAGGTG 


GATGTCGGGG 


TATTCGGGAT 


TGAGTTTTTT 


2220 


TGAGGCAGCC 


TTGGCGGAGT 


TTCTTGACAT 


AGTTAGTGCC 


GTCTACTTGG 


AAGATGCCGA 


2280 


T GGT ATT AT A 


GTCAATCTGT 


GGGGTATTCT 


TGATAAATAG 


GTAGTCGCTG 


TTTCTTATCT 


2340 


TTGGCTCCAT 


GGACTTGCTG 


ACGACATAAG 


CGATTGGGTC 


GTAGTC GTCT 


GGGATAATGG 


24 00 
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AAACTCCATA TCTAAATCGT TGTCCTGCAT CGAGCGGCTA CCTGCAGAGA TAAACTACCT 24 60 

AACACGAGAG TAAGTAGTCT GTCTGTAGTC G TCC AGTCTG ATGATTTTTA CGATACTTCG 2 52 0 

TTTTTCTGAT CATACAGTTG CCTCTCGGCA TAGGTCAGAA CTTTACCTTG TCTGGGTGCT 2580 

TCCCGTTGGT CGTAGATAGA TTGGATATCG CTAGGAGAAT CCTTTTGAAC TGGAGGAAAG 2640 

AGGGCATCGA TCAAGCTACT GAATACTTTA ACT AAGT C AA ATATAGTATT TTTCTTAGTA 2 700 

GACCTAACCC TTTTTTCATA ATTTCTAATG GTGTTTTTAC TTATACCTAT CTTAGTACCC 2760 

AATTCTTATT GAGTCCAACC ATTACTAGTC TATATTGTTT TATAGTTGAT TGAGTTTGGA 2820 

ATAGTACGCT GTAGCTGCTA AAACATTTCT AGAAATTAAT TTGACTTTCC TAATAGAGTT 2880 

GTTCATATCT TATTTCAATC TATTATGTTT TTCACCTCTA ACAATCGCAA TCTCTTCTTT 2 940 

ATCCATGAAT GAAATCGCTT TCTATTTTTG TAAGTAAAGC ATAACACGAA ATCCACGAAA 3000 

ATGAAAACCT TTGTTGTGTT TTCGTAAAAA ATTTGTTGAC AGAGCACGAA ACGC 3054 

f2) INFORMATION FOR SEQ ID NO: 184: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TGTGATTTTC yGAAAATTTG GTAAAATATA TCTTAATCAT TTTCAGGAGC ACAAAAATTT 60 

GACAAGATAT CAGAATTTAG TAAATGGAAA ATGGAAATCA TCTGAACAAG AAATTACGAT 120 

TT ATT C AC C A ATCAATCAAG AAGAATTGGG T AC AGTTCC A GCCATGACTC AGACTGAAGC 180 

TGATGAGGCT ATGCAAGCTG CGCGTGCAGC CCTGCCAGCA TGGCGAGCTT TATCAGCAGT 240 

TGAACGTGCG GCTTATTTGC ATAAAACAGC AGCTATTTTA GAACGCGATA AGGAAGAAAT 300 

TGGTACTATC CTTGCCAAAG AAGT AG C AAA AGGGATTAAA GCAGCAATTG GAGAAGTAGT 3 60 

GCGTACAGCA GACTTGATTC GTTATGCTGC TGAGGAAGGT CTCCGTATCA CTGGACAAGC 420 

AATGGAAGGT GGTGGTTTTG AGGCAACAAG TAAAAACAAA CTGG CTGTTG TCCGTCGTGA 480 

ACCAGTTGGT ATCGTGCTAG CGATTGCTCC CTTTAATTAT CCAGTTAATT TATCTGCTTC 540 

TAAAATTGCA CCTGCCTTGA TTGCAGGGAA TGTGGTCATG TTTAAGCCAC CAACACAAGG 600 

TTCCATTTCT GGACTCTTGT TGGCTAAAGC ATTTGAAGAA GCAGGGATTC CGGCAGGTGT 660 

TTTCAACACC ATTACAGGTC GTGGTTCAGA AATTGGGG AT TAT AT C ATTG AGCACAAAGA 720 
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AGTCAACTTC ATCAACTTTA CAGGTTCAAC T CCTATTGG A GAACGTATTG GTCGTTTAGC 780 

TGGTATGCGT CCTATCATGT TGGAACTTGG TGGGAAAGAT GCAGCTCTTG TACTAGAAGA 84 0 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG CCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC AGTTGGTGAT C C ATTTG AC A ATGCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 108 0 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GGCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGC C A ACGAATCTCA 1260 

ATTCGGCCTT C AAT CATC AG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAG TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATAGCATTGA 1440 

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500 

TTGTTTTCCT GCTTTTATTT TTTTGCTATA AAATAATAAT AATTATAGAA AAAATACGAA 15 60 

CTTTTTGGTA TTATAATAGA TTGAAACCGG l590 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4848 base pairs 
(B> TYPE : nucleic acid 
<C> STRAND EDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CAGCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACT AGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACC AG AGCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGACCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA CTTTAAAGCA ATAACAGAGA 540 
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GAACAACCGG TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATAAAAA GGTGAAGCAC 600 

CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660 

TTCT ATAAG A AAATGGG AT T TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 7 20 

CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 780 

CTGCTAAAAG TGTACTTGGT AGTAACCAAG GAAGTAGAGC AC CAT ATTC A AATAAGAAAT 84 0 

CAAAACGAGA TTTATGTTTT CTGACAACAC GAGCAAATAC AACTGCGAGA ATTGTTGCTG 900 

TTGTCGCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 102 0 

GAATTGCAAC TGGATCTGTA AATGAGTATA AT ACT AT AAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140 

TTTGTTTTTT AAGAGGCGCT TT AGTCTT AG AGATAGAAAT ATAATTTCCA CCTTTTTCTA 12 00 

TCTTATTCAT GATAGTAAGC AAAATTGTAG TTGCAATACC TAAAATAATT GCAAGTAGGG 12 60 

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTCCTACTGC AGATAAACCA CTAAGAAAAA 13 80 

CCATAATAGT AAGTGCAAAT AGAGTTGGAA TTAAGGTTGG TAACACTACT TTTCGGAAAA 14 40 

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAAC CACTC 1680 

CTC CAT AAAT TAAAGAGGTC AT AT AACCT A ATTTTAAAAT TTTAGC AC CT TTAATATCAA 1740 

AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1300 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACGAT 18 60 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TT AC AAAT AC ATTCACTACT AG AT C AAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 19 80 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTGT T C AGTTTCAC GATT GAATCG 2220 

AACTTTCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 2 2 80 
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TAATACTTCG 


TGCACTGTTT 


CATCGGTCAA 


AACATTAATA 


TCTCCAATAA 


AATC AC AT AC 


2340 


AAATTCAGTT 


TGAGAATTAT 


GATAAATCTC 


TACTGGTGTA 


CCGACCTGTT 


CCATGTATCC 


2400 


ATTGTTAAAG 


ACTGCAATTC 


TATCAGATAA 


AGTCAAGGCT 


TCCTCTTGAT 


CATGAGTAAC 


2460 


ATATAAAGTA 


GTAATACCTA 


ACTCTTTTTG 


AAGTCTTTTC 


AACTCTTTTC 


TCAAATCTAC 


2520 


ACGTAATTTT 


GCGTCAAGGT 


TTGACAATGG 


TTCATCTAGA 


CAftAGAATTT 


TAGGTTCAAG 


2580 


AACCAGAGCA 


CGAGCCAATG 


CTACCCTTTG 


TTGTTGACCC 


CCAGATAATT 


CTGATACATT 


2640 


ACGCTGTAAC 


TGTTGATCAG 


AGATCTTAAT 


TTTTGCTGCC 


ACTGCTGATA 


CTTTAGCTTT 


2700 


AATAACATCT 


GGAGCTACCT 


TCTTAACTTT 


TAAACCAAAT 


GCAATATTAT 


CAAAAACAGT 


2760 


CATAGTTGGA 


AATAGCGCAT 


AAGATTGAAA 


TACAATACCA 


ATTCC AC GCT 


TTTCAGCTTC 


2920 


CAAATGAGTG 


AC ATCTGTT C 


CATTAACTTC 


AATACTTCCT 


GATGATCGAT 


CT AG AAAACC 


2880 


TACCAATGCT 


CTCAAAGTAG 


TTGATTTACC 


ACATCCTGAA 


GGCCCAAGAA 


ATGTAAAAAA 


2940 


TTCCCCTTCA 


TGTATATCTA 


AATTCAGATT 


ATCAATTGCA 


ACAAAATCAC 


CATATTTAAT 


3000 


TTGAATATTA 


TCAAATTTAA 


TCATCTCACT 


AACTCCCTCT 


ATTACTAAAC 


CAAAAGCCTC 


3060 


TCTTT ATTT C 


TTCCATAAAT 


TTAGAAATAA 


TAGAGAGACT 


TGGACATAAA 


AATTAACTCT 


3120 


TATTTCTTAT 


TGTACGTATT 


CTAATTCAGC 


TTTTTCTACC 


CATTCATCCA 


AATGCTTTCC 


3180 


AACAGCTTCC 


CAGTCAATAT 


TTTGTGGTTT 


CACTTGATCA 


ACAAATTTCT 


TCGTATCTTC 


3240 


aggtagatct 


TTGAGGGCAT 


CTTTATTTGC 


AGGAATAGAT 


CCAAAGTTCT 


TACTATATTC 


3300 


TACTTGAATT 


TCTGATTGAC 


CAAACCAATC 


AATAAATTCT 


TTAGCTAACG 


CTTGTTTTTr 


3360 


ACTAGTCCTT 


AAAAC CAT AG 


TTTGTTCAGT 


TACAAATGGT 


ACACCAATCT 


CAGGAGTCAT 


3420 


AACTTTGAAA 


ACAACATTTT 


GTTCTTTTTG 


TCCAACTAAT 


GCACCAGAAC 


CCCACATCAT 


3480 


TCCATATTGT 


ATTGGATCTT 


CTTTGTCTAA 


CATCTTAACA 


ATTGAACTTT 


CTCCCTTTTG 


3540 


AAGAGTGTAT 


GCATTTTTCA 


AATATTCTTT 


TGCTACTTCC 


CAACCTTTTT 


CGGAAACACC 


3600 


TAATTCACCT 


TTATCATCAA 


GGTATCGAAC 


TAAGATACTT 


GCTAGAATTG 


CCCGTCCTOT 


3660 


ACCTCCTTGA 


AGACCAGAAA 


TTGAATATTT 


ACCTTTATAC 


TTACTACCTA 


ATTCAGTCCA 


3720 


ATCTTTAGGC 


ATTTCTTTTA 


CATCAGGCGC 


CCCAATTAAA 


ACTAATGGTT 


GAACAATCAC 


3780 


AGGATTATAA 


TAATTATCTT 


TATCTGATAA 


AGATTGATCA 


ATTTTATCTA 


ACCATTTAGG 


3840 


CTTGTACTGT 


ACTAGTAATT 


TTTGATCTCT 


AATTTTATTT 


GAATCAACAG 


CACCAATTCC 


3900 


AAATACCATA 


TCTGCAACTG 


C ATT ATTCTT 


CTCAGCAATA 


ACACGGTCTG 


CTAATTGAGC 


3960 


GCCAGCGATA 


T CAACC ATTT 


TT AT ATT AAA 


ACCAGCTT CT 


TTTGCTTTAG 


CAGTTAACCA 


4020 


ATCACCACGA 


CCATTTGAGA 


CTGAGTTCGA 


AT AG AT AA CT 


AATTCTTGAC 


TTTTATCAGC 


4080 
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TTTTTCTT C A GATGAAGAAG C AGTCGT AG A ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 414 0 

AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4 200 

TTCTCCTTTT TTATTATTTT ATTTAAATTT TTCGTGATAT GGAACAAATT GTCTCATATC 4 2 60 

TTCAAATACA GTATAGTCAA TACGGTTTAC AGTAATAGTT GGAATCTTCT CTAATAAAAT 4 320 

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4 380 

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTAGCGA AACAAACACC 44 40 

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG C TTC ATCTGC 4 500 

AGGTTCAACT AGTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560 

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4 620 

ATCTAGAGGA AGATTACTGC TCCAAAACTC g CTCACGATT TTCATGGCAC AACAATTCAA 4 680 

TTACAGTCAT GTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCGATA AATGGTAATT 4 74 0 

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA G TAT AAAG AT 4 800 

TTGCTACAAC TGTATTCCCA GGGAAATGAT TAAATTCCCC ATTCTCGG 4848 



(2) INFORKAT ION FOR SEQ ID NO: 186: 

(ii SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3763 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 186: 

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 120 

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 18 0 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 24 0 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 3 60 

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420 

TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480 

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TGTGTTTTTT TCTACACTTA T AC CAT AAAG 540 

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
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ATCTAAGCTA AGGCAAGGAT TCTGGATGGT TTTT AG ATTT GGGGTGAATA ATTGGGGATT 660 

TAGGAGAAAT GATGGTATCT TCCAAATCAA AATCAACTTC ACTCCATAGT CTCAACTGAT 7 20 

TGATTTTCCC ATCTTGATAG GTCACATCCT TGTCAAGGAT AAACTGAGTC AACACCTCAT 780 

GTTGACCTTG ACACCTGATG TCATCTACCA AGAGCCAGAC ATCCTCTACC AACATGAGGA 840 

TTTTTCTCCT GTGAAGATAA GGCAAATCAG GTTCTGCTGA CCAATAAGCC CCCT CAAT AT 900 

AATGCACTCC CTCCCTTTCT TTATGGTGAC AAAACAGGGA GTGAGGATAG TATTCATATT 960 

CCCAGGATCC CGTGATTCTT TCCGGAGCTT TCCCATCTAC AATGCAGGTC GAATGACTCC 1020 

AAGCACTCTT TAAGAGATAA CGTTCATATA TCTCCCGATA AGAATAACGC CCAGCATCTA 1080 

TGAAAATAGG TTGGCCTTGA TACTGTAAGC AAAAACTATT CTCGTCACTA TGACTATGGG 114 0 

CACTTCCTAG CGGACCATTT TTGAAAAATA GATAACGATG TTCATCCTTA ATGCAGACAT 1200 

GTCCAGAGTC TTCAAAGATC ATGGACTTAG GCTGCCAAGC TCTCTTTTCA AATT CCTGC A 1260 

GTCG CTTG AC CTTTTCTCGC CCCAGGAACA AGAGGCTAAG CAAATCAACT TTAACATCCA 1320 

GACCGTTAAG AAGGTCTTCC TGGTTCAAAA CCACAGCAGA CAGGCTCAAA ATTTCTGTCG 1380 

TTTCTGTAGA ATCGCTATCA C C AAAAGCC A AAGTCCGTCC ATCTAAGCCT GTCATCATTT 1440 

GAATATAGGT CGCCATCTTT TCCAGCAACT CTTGGTAACT ATCTTGCAAG TCTGGAAGCA 1500 

AGAGACACAA ATCCAGCAAG GCTTTATAAA CCTCTACATG ATAGAGAATC GACTGTTCAA 1560 

ACTGGCTTCC ATCTCCTAAA ATCTGTGTCT CAATTTGCTG TTTCAACTCC TCTGAAGCAA 1620 

AATGGTAAGC TTCTTCTAGA TCCATCTTAT CTGAAAAGAA ATGATAGATA GCAAGCATCG 1680 

GAATTGTTTG TAAAATCCCC CAGTTACTAA GGGTGTACTT GGCGCGATAG TAG CTTTTCA 174 0 

TAAAGTCAAT CTGCTTTTCT AGACTGACCA AAATTTTCTC TAGTTCTTTC TCCTCTAGCA 1800 

AGTCAAATTT CAAGAGGAGC AAGAGTAGTT TCAACCAAGT AAAGGAACGA ATACCCGTAT 1860 

CCAAGGTTCT AGTCATCAAG GATTGAGGAG AAAATTCTCT CACCTGCTCA ATCCAATCAA 1920 

ATAGAAAGAA CTTGCACTTT TGAATATAGT CCTTATCTCC TTCTACCAGA TACCCTATCA 1980 

TAAACTGCAA GAGATATTCT TGTCGATTGA GCATATAAGA C C ATTCTGG A TCATCTTCAA 2040 

ATACTTGATC CCATACCATC GGCTGGATTT GATGGATTTT TGAACAAGGC TCCATATCCC 2100 

AAGGACTATC AAACATAAAA CGATTGTCCA TCAAGCGTTC AAGGGAACTC TTGACTTTCT 2160 

CATAGTCTTT TGAACAGTGC GACAAGATAT AATCACGACA TTGATTTCCA TCGACTCTTT 2220 

CAAAAAATTG TCTTCTTTCT TCTTTCATTA TCTATTACCA GAAAAAGAAC TACTTAAAAA 2280 

GCAGTTCTTT TGTCTTTCCC ATTACACTTT CCTTTTCTAC ATGGAT GACC ACACCTTTTG 2 340 

CAATCTGCAA GGAGACCAAG TCATCTTGGA TAGAAATGAT TTTTC C ATGA ATTCCAGACA 2400 
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ATAACAACAC 


TTCATCACCA 


AATGTTAAAG 


AAGCTAAATA 


CTCTTCTCGT 


TGCTCCATCT 


2460 


GTTTGCGAAG 


CAACTTTTGC 


TGACGAATAG 


AATGAAAGCT 


TGACAGTAAA 


AGGGGACTCA 


2520 


CTGCCAAGAC 


AATCACTATT 


CCATAAAACA 


ATGTTGTATC 


CATTAAGCTA 


TAATCTTAAG 


2580 


CCAGCTTCCG 


ATAATTCCGA 


TGATAACTGT 


TAAAATAACG 


AGTTTATATG 


TTGTCCATTT 


2640 


CTTTTCTTTG 


ATCAAGTAGT 


AAACTAAAAG 


TGTAAATAGG 


GCTGGTAGAA 


GAGCTGGAGC 


2700 


AACCTTATCA 


AGCATTCCCT 


GAATACTTAC 


GATACTTTGT 


TTAG CGTCTG 


CTTTAACTTC 


2760 


CCCTGCAGCA 


AAGGTAATCG 


GCACCATAAT 


CTTAACAGAT 


GTCGCTGCCA 


AACCAGCAAT 


2820 


TACG t T AC AC 


CGATAATATT 


GGCAATACGA 


GAAATCGTTG 


CCATCTGTTC 


GCTTAGTTTA 


2880 


TCAATCACAG 


TTGTTCC TAG 


TTTGTATCCA 


TACAGACCAG 


TTGACAATTT 


AATCGCTGTT 


2940 


AAAATCGTAT 


TCATCGCAAG 


GAAGAACAAG 


ATTGGACCGA 


CAACCAAGCC 


TTCTTGAGCA 


3000 


AACGAAGCTG 


CGATGGTTGA 


GAACAATGGA 


GCTAAACAGA 


ATTGAGAAAG 


AGAATCCCCA 


3060 


ATACCTGCCA 


ATGGTCCCAT 


CAAGGCCATC 


TTGATGCTAC 


GTGTTTCTTT 


TGCCGGACGG 


3120 


CCATTTTCCA 


ACATTACAAG 


ATGCAAGCTG 


GTAATAAAAG 


GCAGGAAGTG 


TGGGTTGGTA 


3180 


TTATAGAATT 


CACAGTTTTC 


TTCCAAGGCT 


TGGTAGAAAC 


CTTCCTGATC 


CTCTCCATAG 


3240 


TGTTTTTTCA 


AAGCAGGATA 


CATCACATTG 


GCATATCCCA 


ACCCTTGATA 


GTTACTATAG 


3300 


TTAAATCCAT 


TTTGACAAAA 


GAATGCCCGC 


AAAGAC GTTT 


TAAGATAATC 


ACGTTTTGTT 


3360 


AATTTGTTAG 


ATCCAGTCAT 


CGTGTGCTTC 


CTCCTCTACC 


ACATGATCCG 


CTGTTTTTGG 


3420 


CTTGTTATAA 


AATTCAATCA 


AAGCAAAGAT 


AGTACCTACA 


ATTGCAATAC 


CAATTGTTGG 


3480 


G ATGTTT AG A 


TAAGCTGCAC 


AAACATATCC 


CAACAAGACA 


AAGGGAATCA 


ACTCTTTCTT 


3540 


AGCCATCACT 


GACAAGATCA 


TCGCAAAAC C 


GATAGCTGGG 


AGCATTTTAC 


CAGCAACTGT 


3600 


CAAACCTGTA 


AGTAATACCG 


GTGGAATGTA 


GTCTACGAGT 


TTCAACAAGG 


TATCCATTGA 


3660 


AAGGGCACCA 


AGCAACCCAA 


GGTAAATCCA 


ATAAAGGCAA 


ACAACCAAAT 


TGTTGCATTT 


3720 


AGAGTGAACT 


TAAATTTCTT 


CAAATTATGG 


TTTTTCAAGT 


GCT 




3763 


(2) INFORMATION FOR SEQ ID NO: 187: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5053 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
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t. AATCTCTGA 


, GTATGTGCGG 


TCAATACTAw 


CAAAGGGAAT 


yCCTGACGTC 


AAGTAATGTT 


60 


CAAi iGGmCT 


ATAGGTAATG 


GCAACCACTC 


CATCAACTTT 


ATTATGACGC 


AACATCTCCA 


120 


GATAGTCTTG 


CTCTCTATTT 


GTACCATTGA 


TAG AACAT AA 


GAGTAATTTG 


TTATTTCTCT 


180 


TATAGACTTC 


ATTTTCCACA 


TGCATAGCAA 


ATTCTGAAAA 


GAAGGGATGC 


CAGATACTTG 


240 


GTACAATGAT 


TGCAATCGTT 


TCTGTTCGAT 


TTTTTTTCAT 


TCCTCTAGCG 


TAGTAATCTG 


300 


GAATGTAATT 


CAAAGTTTTA 


AT CGCTTGTT 


CCACTTTTTT 


CAAAGTTACT 


TCTTTAATGC 


360 


CTTTTTCTTT 


ATTAATTACA 


CGTGAAACAG 


TTCCAACACT 


AACTCCTGCT 


TCTAAAGCAA 


420 


CATCTTTCAT 


GGTAATTGAT 


TTTCTTTGTT 


CTACCATATT 


ATCACCTCCT 


TTCAATATAT 


480 


AGTATCATGC 


AAATGCTTTT 


TAAGCAACTA 


TTTCTCAATC 


ATTTTTGGCC 


AGATCATTTA 


540 


TCCCATCATG 


AATAAAATCA 


CTCCAATTAG 


CTTTTGAAAA 


TACTTCAATT 


TTCATGTGTA 


600 


AACATCTACA 


TAAAACAGGA 


AAAGCCTTGG 


TTTCATGGCT 


TTTTTCGTAT 


CTTCTATAAA 


660 


AAAAGCAAGA 


GTTTTAGATG 


GCTATAAATC 


TAGATGTACA 


TTTTCCTTAA 


ATGATTGAAG 


720 


GTCTTTTCTT 


AACAAAAACA 


CCCCCAAAAT 


TAGACTTTTT 


CTGTCTAACT 


TTTGAGGTAC 


780 


AGTTCAAACG 


CGAAATAGCG 


TTTTTTTGTT 


ATTTTTGGTT 


ACTCATCTAA 


TCGAATAAAC 


840 


ATCATGGCAT 


TTAACAAGTA 


TATGAGTGAG 


ACCGTGTTTA 


TATTATTTGA 


ATAGATGAGT 


900 


CTCTTATTTT 


CAATAGGAGG 


AATAATAAAA 


TTAGAAATAA 


TGATATCATA 


AGGTGAATCT 


960 


TCTAAAGATT 


CCTTTGATAA 


TT CTAATTC A 


GTCCAAACTT 


CCAGTTCAAA 


ATTATTGCTA 


1020 


CAATAATAAG 


AAAGTGTCTC 


TGCAACGAAT 


TTTGCATGAT 


ACTGATCAAA 


ATTACTCATA 


1080 


ACTAAAACCT 


TTAGTTTAGG 


CTGATTTTGT 


AGCAAATTAA 


TCACCAAATG 


TTTGGTATGA 


1140 


GTGATGAAGG 


TATAAGATAG 


ATGATTTACC 


ATCATTGAAC 


TAGAACAAAC 


CTCAAGAGTC 


1200 


T CT AAATAGT 


GAGAAAGCTC 


TTTTTTTATA 


TCTGAAACAA 


ATTTTGGAAA 


AATATTTTGA 


1260 


AAGTTCCTGA 


TTGTATTCCC 


TTTTTGATCA 


AATAAAATAA 


ACTCAGTAAA 


CAACTCTTGA 


1320 


CGATACAGAT 


GTGCGGTATT 


ATGCAGATGC 


CAAATCAGAT 


TATCCTTATT 


CTCCATTTCA 


1380 


ATCTGATACT 


TGACTGAAAT 


CTGATCAATA 


AAATCACTCA 


ATAGATGGTA 


AGATTTTTCA 


1440 


ACATAACTAT 


ccrrrrtTAC 


GC ATTTCAT A 


AAGAGACTTT 


CATCTATGAA 


AAACATTTTT 


1500 


TGAAAGTAAG 


ACACAAATAA 


TTGGCAAACA 


ACTTCTTCAT 


CTAAAGAGAT 


ATTGTATTCT 


1560 


GATTCAAAAC 


TCTGAGCAAC 


ACCTTCTATT 


CCTTCTGCCT 


GC ATT AAAAA 


ATCCAAACTT 


1620 


TGGTCGTTAA 


AAGAATCTTT 


ATCTACTTCC 


ATAAAATGAC 


CAAACTTTAT 


TCTATATAGG 


1680 


TTCGTAACTA 


GGAGCAACTT 


TAGCATTCTA 


TGCGTTGACA 


AATTCATTGG 


AAAGCTTGTT 


1740 


TCCTTATAAA 


CCAATTCTAA 


CAATTGAGAT 


AGTGGCTCTG 


ATGAAAAATT 


TTCAAATGGC 


1800 
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CATTCTAGGA AATAATATTT TTCTGAAAAA TATTGTGCAA AAAAGTAACG AATGTCTCTC I860 

TCATTTCCAA TGATTTGAAC AGGGGTCAGA CTAACTTCAA ATTGAAATTG CCTTTTAATC 19 2 0 

ACTTTATTGA TTTGGCTAAT AATACGATAG AGCGAAGATG AACTGATATA AAATTCTTTA 19 80 

CAAATACTCT CAGCTTGACA ACCTTCATTA AAGAAGATGA ATTCTAAAAT CGAAAAATGA 204 0 

GTTGAATGTT TAAAGAAATG ATGGTAAACC ATTTCAATAT CACTATCATC GGT ATT AAT A. 2100 

ATGCGTATAC CATTAGTAGA AGAATGAAAA ATCAAGTCAG GAAAAGCAGA TTTAACATGG 2160 

GATAGATCAT CTTTGACTGC ACGTTCTGTA CAATTTAATA ACTCTGCTAG TTCAGAACGA 22 20 

TGAAACCAAC GTTTATG TTC AAATAATAAT TCTAATAATT CTAATTGCCT ATGACTTTTT 2280 

TTAGATAATA AATCTCTCAT GAATATCTTT CTCTCTTTAT AAATTATCGG ATTAAACCTC 2 34 0 

TTGCAATTAT ACCACAAAGA ATAGGTATAC CATGATATAA CGACTTTTCC TAAAATCTTT 2 400 

TATTTCGTAT AATAACACTA CGGAGACAAT ATATAAACAA TTTT CTTATT TTACCGTCTA 24 60 

TTGAGGGCGT GAATACAGAA TCAAATTCAA GTCTAAAGAT TATATTTTTA ATTTTAAAAA 2520 

TTATATAATA GCAACAATTA AAGAATTTGA TTTTTTAAAA TT AT AT AAT A ATAACAATCG 2 580 

AAATAATTGA CTTTTCTATA TTAAAGTTAT ATAATAGTAA TAATCAAAGA AATTGATTTT 264 0 

TTGATATTAA AATAAAAAAG GAGGGTAGGC AGTGTTGTGA TCAATTATTG CTGGAGGTCT 2700 

TATTGGTCTC TTGGCAGGTA AAATCACTAA AAAAGTAGTT CTATGGGAAT CATCGCAAAT 27 60 

GTATTCGCTG GTTTAGTCGG GGCATATGCA GGACAATCTC TTTTAGGTAG TTGGGGTCCA 2820 

GCAATCGCTG GAATGGCTTT GCTCCCATCT ATTGTAGGTG CAGCGATTGT GATTACTGTA 2 880 

GTGTCATTCT TTACAGGTAG AAAGTAAACT TTTCGCCAGT AAAGTTAGCA AACTATTTTT 2940 

AAATCAATGA CGGGAAAAAT AGTTTAAATG TTAAATCGAA AGGATTGTAT ATCTCAAAAC 3000 

CAAAGAAAAT ATGTTTGATT ATTTT CTGT A TTTTAATCTT G AC AATTTT C CTTCCTGTTT 3060 

TGATAGATTA TCATCAAGTT AGTGATCTAG GTATTCATCT ACTT AG CTGG AGACAGAACT 3120 

CCGTAGTTGA ATTCTATCTT GCTAGATATG TCTTTTGGGG GACAGTGGTT CTATCAACTT 3180 

TAGTTTTATT ATCCATTTTA GTTGTGATGT TTTATCCTAA ACGTTACTTG GAAATCCAAC 3 24 0 

TTGAAACTAA AAACGATACA TTAAAATTAA AGAATTCGGC AATCGAAGGT TTTGTTAGAA 3 300 

GTTTGGTGAG TGATCATAGA TTGATCAAGA ACCCAACTGT TCATGTAAAT TTACGAAAAA 3 360 

ATAAATGTTT CGTTCATGTA GAAGGTAAAA TTCTTCCTTC AGACAACATC GCTGACAGAT 3 420 

GCCAAATAAT TCAAAATGAA ATAACTAATG GATTGAAGCA GT TT TTTGGT ATTGAGCGTC 3480 

AAGTAAAACT TGAAGTTGCA GTAAAAAATT ACCAACCAAA ACCTCAAAAC AAAAAGACTG 3 54 0 
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TTAGTCGTGT 


GAAGTAAGGA 


ACTAAAAAAT 


GGAATGGCTT 


AAACAATATC 


GATATCCAAT 


3600 


TATCGCTGGT 


CTCATAGGCG 


TATTTCTGGC 


TTGTTTGATT 


GTCTCCTTTG 


GCTTCTTCAA 


3660 


AACAATATTT 


GTATTGATTT 


TAGGAGCACT 


GGGAGTTGCA 


GCTGGATTAT 


ATATCGAAAA 


3720 


AAACTATATA 


GATAAATAAA 


AAAATAAAAA 


TTACTAATTT 


AATTAAAGGA 


GTTTCATATG 


3780 


TCAAACGAAA 


AAAACACAAA 


CACTAACGTA 


GAAAAGAAAG 


ATGCTACTGT 


TGTAGCTCAC 


3840 


GAAATCAAAG 


GGGAACTTAC 


TTACGAAGAT 


AAAGTTATCC 


AAAAAATCAT 


TGGTCTTTCA 


3900 


CTAGAAAACG 


TTTCAGGTCT 


TTTGGGAATC 


GATGGTGGTT 


TCTTCTCAAA 


TCTTAAAGAA 


3960 


AAAATCG TT A 


ACAGCGATGA 


CGTAACAAGT 


GGTGTTAACG 


TAGAAGTTGG 


TAAAACACAA 


4020 


GTTGCAGTTG 


ACTTAAACGT 


TATTGTTGAG 


TACCAAAAAA 


ATGTTCCAGC 


TTTATATTCA 


4080 


GAAATCAGAG 


AAATCGTATC 


TTCAGAAGTT 


GCTAAAATGA 


CTGACTTGGA 


AATTGTTGAA 


4140 


ATCAACGTAA 


ACGTTGTCGA 


CATCAAAACT 


AAAGAACAGC 


ATGAAGCAGA 


CTCAGTAAGC 


4200 


CTTCAAGATC 


GCGTATCTGA 


CGTTGCTGAA 


TCAACAGGAG 


AATTCACTTC 


AGAACAATTC 


4260 


GAAAAAGCTA 


AATCTGGTCT 


TGGATCTGGT 


TTCTCAACTG 


TTCAAGAAAA 


AGTTAGCGAA 


4320 


GGTGTAGAAG 


CTGTTAAAGG 


TGCAGCAAAT 


GGTGTAGTAT 


CTCACGAAAA 


CACTCGTGTA 


4380 


AACTAAGATA 


AAATAAATAT 


AACAGGAGAA 


ATTATCATGT 


CAGTAGAAGA 


AAAATTAAAT 


4440 


CAAGCTAAAG 


GTTCTATTAA 


AGAAGGTGTT 


GGGAAAGCCA 


TCGGTGATGA 


AAAAATGGAA 


4500 


AAAGAAGGTG 


CAGCTGAAAA 


AGTTGTTTCT 


AAAGTAAAAG 


AAGTTGCCGA 


AGACGCTAAA 


4560 


GACGCTGTAG 


AAGGTGCTGT 


AGAAGGTGTT 


AAAAACATGT 


TG AG TGGCGA 


CGATAAATAA 


4620 


GGTTAAAAGT 


TACTTTATCT 


TTTTAGTAAT 


ATTAGTCAAA 


AGAGTCTGAG 


TCAAGATGAT 


4680 


TCTCAGAAAA 


CAAAAAGCTA 


GAGATTCCCA 


ATTGCGGAAC 


TCTAGCTTTT 


TAATTTTGCC 


4740 


TCTTTCTCTT 


ATTATATTTC 


AGCAGGTTGT 


TGGCCATGAG 


TACGAATCCC 


ATGTCAATTC 


4800 


TCACTTGACG 


CTTACCTCTC 


AGATGACATC 


TCTTATAACC 


CAAACAAACC 


TTTATCTGCC 


4660 


CAAAGACAGA 


TTTCATATCA 


ATCTTACGTT 


TAGCGAAAAT 


TTGTCTACCC 


TTGGAAGATA 


4920 


AAAGTGCCTG 


ATATTCTTTA 


GTTTTTAAAC 


ACTGGTAACG 


TTCATTCATA 


TACAGTCTCT 


4980 


TTTGAGGGGC 


TGATTCAGGT 


TCATAATCGC 


AGTCAACATT 


GATTTCAAGG 


CTGTTTGCTT 


5040 


TCTATCTCCC 


CGG 










5053 


(2) INFORMATION FOR SEQ ID NO: 188: 









< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 
AATTCTCTTT TTTCCAACAA AATGT ATC AC CTGCACTTGA ATACTTCTCA TTGTTTGTAC 6 0 

ATTCATCTAC TTTCATATAA TCTTTTACAA AATCATAATA TCACATAACA CACTATCCCT 120 
TTTAGACAAT ATTCCAATTA GCCTTATTAA TTC AAAACT A TTGTATTAGT AATTATAACA 180 
GATGTATAAT AGAAAAGCAA TGATAGATAT TATCAATTAA GCGAATTTAT ATCTAAAAGG 24 0 

GATATTAAAG AAAGGAGATA TGCTTATGAA GATTTACAAA AAACTATTTG CTTATGTCCA 300 
AG AT AAG AAA TATCTTGGGG TTTTGGCCAT AATTTTTTCT CCTATATCTC CTGCACTTAC 3 60 

AGTATATGGA TATTATTTAA TCTACAAATT TCTAGATAAG TTAATAATTA ATTCAAACTT 420 
ATCCGGTGCA GAGAGTATAG CATTAAAATC TGTTATTACA CTAACAAGTG GAGCGATATT 4 80 

TTATTTTGTC TCAGG AATGT TTTCACATAT CTTGGGATTC AGGCTTGAAA CAAATTTAAG 540 
AAAAAGGGaA TCGATGGTCT GGAAAAAGCA AGTTTTAGGT TCTTTGACTT AAATCCATCT 600 
GGTCAAATAA GAAAGATTAT AGATGJCAAT GCTG C AC AAA CTCATCAGGT GGTAGCACAC 6 60 

ATGATTCCCG ATAGTTCTCA GGCAATAATC ACACCCGTAC TTGTACTTGC ACTTGGCTTT 7 20 

ATAGTAAGTA TAAGAGTTGG CATAATTTTG CTTGCTCTTA CTATAATTGG TGGCTTAATT 780 
TTAGGGGCAA TGATGGGCGA GCAAGAATTT ATGAAGATAT ACCAAGAATC CCTATCTAAA 840 

CTAAGTGCTG AAACTGTTGA G T ACGTG AG A GGAATGCAAG TTGTAAAAAT ATTTAAAGCA 900 

AATGT AGAGT CTTTTAAAAG CTTTTATAAG GCGATAAAAG ATT ACTC AAA GTATGCTTAT 9 60 

GATTATTCCC TATCTTGTAA AAGGCCTTAT GTTTTGTATC AATGGTTATT TTTTGGACTG 1020 

ATTGCAATTT TAATTATTCC TATAGTTTAT TTTATGACTA GCTTAGCTAG CGCAAAGGTG 1080 

ATTTTACT'iO AGCTTATCAT r --. TTTT ATTT TTATCAGGAG TTCTCTTTGT TTCATTCATG 114 0 

AG AATG ATGT GtACTCCATG TATATTTCTC AAGGAAATTA TGCAGTAGAT ACTTTAGAGG 12 00 

CGCTTTACGA AGATATGCAA AAAGACAAAT TAGTGCATGG TAATGTCAAT AATTTTAAAA 12 60 

ACTATAATAT AGAATTTGAG AATGTTAGCT TTGCTTATAA TGATAAACCT GTCATTGAAA 13 20 

ATTTATCCTT TAATTTAGAA GAAGGAAACT CCTACGCACT TGTCGGTTCA TCTGGATCAG 13 80 

GCAAATCAAC AGTAGCAAAA CTTATATCAG GTTTTTACAA TGTTAATAAA GG AAG C AT AA 144 0 

AGATAGGCGG GATAGCAATA AGTGAATATT CTGACGAACC CTTAATTAAA GCCATTTCCT 1500 

TTGTTTTTCA AGATTCAAAA TTATTCAAGA AGAGCATTTA TGATAATCTA GCGTTAGCTA 15 60 

ATAAAGATGC GACGAAAGAT GACGTTATGA GAGCCTTAAA ATTAGCAGGA TGCGATTTAA 1620 
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TATTAGACAA ATTCCCAGAA AGAGAAAATA CAATCATAGG CTCAAAAGGT GTTTATTTAT 1680 

CCGGTGGAGA AAAACAAAGA ATTGCAATTG CTAGAGCAAT TTTAAAGGAT TCCAAAATTA 1740 

TTATTATGGA TGAAGCATCA GCATCTATTG ACCCAGATAA CGAGTTTGAA TTGCAAAAAG 1800 

CTTTTAAAAA TCTTATGAAG GATAAAACAG TTATCATGAT TGCACACAGG CTATCTACAA 1860 

TTAAAGACCT TGATGAAATT ATTGTCATGG ATAGTGGAAA AATTATAGAA AGAGGGTCTG 1920 

ACAAAGAATT AATGTCAAAA GATACAAGGT ATAAGAGCCT GCAAGAGATG TTTAACAGTC 1980 

CGAATGAATG GAGGGTTTCA AATGAAAGAG TTTTATAAAA AAAGATTTGC TCTTACAGAT 2040 

GGAGGAGCAA GAAATTTAAG TAAAGCAACA CTGGCTTCAT TTTTCGTTTA TTGTATAAAC 2100 

ATCCTTCCTG CCATATTACT TATGATTTTT GCTCAGGAAG TTTTGGAAAA TAXGGGCAAA 2160 

AGCAATGGCT TTTATATAGT ATTCTCAGTT TTGATTTTGA TAGCAATGTA TATTTTGCTT 2220 

TCTATCGAAT ACGATAAATT ATATAACACA ACCTATCAAG AAAGTGCAGA TTTAAGAATA 22 80 

AGGACAGCGG AG AATTT AT C AAAATTACCT CTATCTTACT TTTCTAAACA TGACATTTCC 234 0 

GACATTTCAC AAACAATCAT GGCTGATATT GAAGGCATAG AGCATGCAAT GAGCCACTCA 2400 

ATACCAAAGG TGGGCGGCAT GGTACTGTTT TTCCCATTAA TATCTGTAAT GATGCTAGCG 24 60 

GGCAATGTCA AGATGGGTTT AGCTGTAATT ATTCCATCTA TTTTAAGCTT T AT ATTT AT A 2520 

CCTTTATCTA AAAAATATCA GGTTAATGGA CAGAATAGAT ATTATGATGT CTTAAGAAAA 2580 

AACTCAGAAA GCTTTCAAGA AAATATCGAA ATGCAAATGG AGATTAAAGC AT AT AATTT A 2 640 

TCGAAGGATA TTAAAGATCA CT TAT AT AAA AAAATGGAAG ATAGTGAGAA AGTACACTTA 2700 

AAGGCGGAAG TAACTACAAT TTTAACTTTG TCTATATCTT CAATATTTAG CTTTATATCT 2760 

CTTGCTGTTG TGATATTTGT CGGCGTAAAT CTAATTATTA ATAAAGAGAT AAATTCTCTC 2820 

TACCTTATAG GAT ATTT ACT AGCTGCTATG AAGATAACAG ACTCTTTAGA TGCATCTAAA 2880 

GAGGGCTTGA TGGAAATATT TTATTTATCG CCCAAAATAG AAAGATTAAA AGAAATTCAA 294 0 

AATCAAGATT TACAAGAAGG CGATGACTAT AGCTTAAAAA AATTTGATAT TGATCTAAAA 3000 

GATGTTGAGT TTGCCTACAA TAAAGACGCA AAAGTTTTAA ATGGTGTAAG TTTTAAAGCT 3060 

AAGCAGGGAG AGGTCACTGC TTTGGTAGGT GCAAGTGGCT GCGGTAAAAC AACTATCTTG 3120 

AAACTTATAT CAAGACTTTA TGATTATGAC AAGGGACAAA TCTTAATCGA TGGCAAAGAT 3180 

ATAAAGGAAA TATCAACAGA ATCCCTTTTT GATAAGGTGT CTATTGTTTT CCAAGATGTG 3 24 0 

GTTCTCTTTA ATCAAAGCGT TATGGAAAAT ATTAGAATCG GTAACCAAGA TGCAAGTGAC 3300 

GAAGAGGTTA AAAGAGCAGC AAAACTTGCA AATTGCACAG ATTTTATAGA AAAAATGGAT 3 360 

AAAGGTTTCG ATACAGTTAT TGGTGAAAAC GGAGCTGAGC TATCAGGAGG AGAAAGACAA 3420 
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AGATTATCAA 


TAGCCAGAGC 


CTTCTTAAAA 


GATGCGCCGA 


TATTGATCTT 


AGATGAGATA 


3480 


ACAGCAAGCC 


TTGATGTTAA 


C AACGAG AAA 


AAGATTCAAG 


AGTCTTTAAA 


TAATTTAGTT 


3540 


AAAGATAAAA 


CTGTTGTAAT 


CATTTCACAT 


AGAATGAAAT 


C CAT AG AAAA 


TGCAGACAAG 


3600 


ATAGTAGTTC 


TTCAAAACGG 


AAGAGTAGAA 


AGCGAAGGTA 


AGCATGAAGA 


GCTTTTACAA 


3660 


AAATCAAAAA 


TTTACAAAAA 


TTTAATAGAA 


AAGACAAAAA 


TGGCAGAAGA 


ATTTATTTAT 


3720 


TAGGAGGACT 


ACAATGGATA 


ATAAAAAATT 


AAAAGTAAAA 


GATTTAGTAA 


GCATCGGTGT 


3780 


TTTTGGCGTA 


ATTTATTTTG 


CCTTCATGTT 


TGGAGTTGGT 


ATGATGGGCT 


TGATTCCAAT 


3840 


ATTGTTCTTA 


ATATACCCGA 


CAGTATTAGC 


CATAGTTGCA 


GGAACTGTTG 


TTATGTTATT 


3900 


TATGGCTAAG 


GTTCAAAAGC 


C ATGG GC ACT 


ATTTATATTT 


GGT AT GAT AT 


CACCACTTGT 


3960 


GATGTTTGCA 


GCTGGTCATA 


CCTACGTAGT 


TGTGGTTTTA 


TCACTTATAG 


TAATGATAAT 


4020 


AGCAGAATTA 


ATTAGAAAGA 


TTGGTAATTA 


TAATTCATTT 


AAATACAATA 


TGCTTTCTTA 


4080 


TGCAATCTTC 


AGCACATgGA 


TATGTAGCTC 


TTTAATGCAA 


ATGCTTTTAG 


CAAAAGAAAA 


4140 


ATATATGGAG 


TGGTCTTTGA 


TGACTATGGG 


AAAAGATTAT 


GTTGATGTAT 


TAGAAAAGTT 


4200 


AATAACTTAT 


CCTCACATGG 


CTTTAGTAGC 


CTTAGGTGCT 


TTCTTAGGAG 


GAATTCTTGG 


4260 


AGCATATATA 


GGCAAGGCTC 


TATTGAAAAA 


ACACTTTTCA 


AATGGATTAT 


ATTGTGTGGG 


4320 


ATACTTTACT 


CCTTGCCTAA 


TTTTATGGTG 


CTATCTGAAT 


TAAACCCTAT 


AGTTAAGATG 


4380 


TTTTTGAGTA 


TACCTATTGT 


TATTAGAATG 


TTTATTTTAC 


CATTTATGGC 


AGCAAGCTTT 


4440 


ATGATAAAGA 


CCTCGG AT GT 


AGGCGCAATA 


ATTTCATCGA 


TGGATAAGCT 


TAAGATTTCA 


4500 


AAGAATGTAT 


CCATACCTAT 


TGCGGTTATG 


TTTAGATTCT 


TCCCATCTTT 


TAAGGAGGAG 


4560 


AAGAAAAACA 


TCAAAATGGC 


TATGAGAGTA 


AGAGGGATAA 


ATTTTAAAAA 


CCCAGTCAAA 


4620 


TATCTTGAAT 


ATGTTTCTGT 


GCCACTACTC 


ATTATATCAT 


CT AAT AT AT C 


AGATGACATT 


4680 


GCAAAAGCGG 


CAGAAACAAA 


GGC AAT AG AA 


AATCCAATTG 




n i l l LUC 


H t H U 


GTAAAGATAC 


AGCTAATTGA 


TTTTGTTTAT 


GTTTTAGCGG 


TTGCTGGACT 


TATTGTGGGA 


4800 


GGCTTAATAT 


GGTTGAAATA 


AAAAATTTAA 


GTCTTGATTA 


TGGTGAAGAG 


CATATATTAG 


4860 


ATGATATATC 


ACTATCCATA 


GCCGAGGGAG 


AGTGCGTGCT 


ATTTACAGGA 


AAAAGTGGAA 


4920 


ATGGTAAGTC 


ATCTTTAATA 


AATTCAATCA 


ATGGACTAGC 


TGTAAGGTAT 


GATAACGCAA 


4980 


AGACAAAGGG 


CGAAATAATT 


ATTGATGGTA 


AGAATATAAA 


AAATTTGGAA 


CTTTATCAAA 


5040 


TCTCAATGCT 


TGTTTCAACT 


GTTTTTCAAA 


ATCCTAAGAC 


ATATTTTTTT 


AATGTCAATA 


5100 


CGACATTAGA 


ATTATTATTT 


TATTTGGAAA 


ATATCGGTCT 


TGCAAGAGAA 


GAGATGGACA 


5160 
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GGCCTTTGAA GGATATACTT GAGATATTCC CGATAAAAAA TCTTTTGAAC AGAAATATAT 5220 

TTAATCTATC CGGCGGTGAA AAACAAATTC TTTGCATTGC AG CTTCTT AT ATAGCAGGTA 5280 

CAAAGATTAT AGTTATGGAT GAGCC TTC AT CGAATTTAGA TATTAAAAGC ATAAGTGTTT 5 34 0 

TGGCAAAGAT GCTAAAGATA TTAAAAGAGA AAGGCATAAG CATAATTGTT GCAGAGCATA 5400 

GAATTTATTA TTTGATGGAC ATAGTTGACC GTGTATTTTT AAT AG AT AAA GGAAAGCTTA 54 60 

AAAAAACTTA TACTAGAAGT GAATTTTTAA AGCTAGATAA AAATGAATTA AATGCTTTAA 5520 

GTTTAAGAGA TAAAGAATTA AGTAAATTAA AAGTTCCTTA TTTAAAAGAA GGTGGAGAGT 5580 

ATC AG AT AAA AAATCTTAGT TACAAATTTA CTGATGATGA GTGTTTAAGC TTAAAAGATA 5640 

TTTCGTTCAA GCTTGGGAAA ATTTATGGCA TAATAGGATC CAACGGACGA GGAAAATCAA 57 00 

CGCTTTT A AG ATGTTTAATA GGTCTTGAGA AAAAATCAAA AGAAGAAATT TATTTTAAGG 57 60 

GAGAGAAGCT ATCTAAAAAA GAAAGACTCA AAAACTCTTC ACTTGTTATG CAAGATGTAA 5 820 

ATCATCAATT ATTCACAGAT GAAGTATTCA ACGAGCTTAG ATTAGGAGTA AAGAATTTTG 58 80 

ATGAAGAAAA GGCGAAAATC ATTTTAAACC CCAATTATTC ACCCCAAATC TAAAAACCAT 5 940 

CCAGAATCCT TGCCTTAGCT TAGATCCTGG ATGGTTTCTT TTTTCACCCA ATGGGTGTTT 6000 

TTTACTAGAC AAAAAAGAGT TTCCCCTTTA TGGTATAAGT GTAGAAAAAA ACACAAAAAG 6060 

AAAGGAAACT CACATGAACA GTTTACCAAA TCATCACTTC CAAAACAAGT CTTTTTACCA 6120 

ACT AT CTTTC GATGGAGGTC ATTTAACCCA GTATGGTGGT CTTATCTTTT TTCAGGAACT 6180 

TTTTTCCCAG TTGAAACTAA AAGAGCGGAT TTCTAAGTAT TTAGTAACGA ATGACCAACG 6240 

CCGCTACTGT CGTTATTCGG ATTCAGATAT CCTTGTCC AG TTCCTCTTTC AACTGTTAAC 6300 

AGGTTATGGA ACGGACTATG CTTGTAAAGA ATTGTCAGCT GATGCCTACT TTCCAAAATT 63 60 

GTTGGAAGGA GGGCAGCTTG TTCACAGCCA ACCTTATCCC GTTTT CTTTC CAGAACTGAC 6420 

GAGGAAACAG TCCATAGTTT GCGATGCCTC AACCTTGAAT TGGTCGAATT CTTTTTACAT 6480 

GTTCACCAGC TG 64 92 
(2) INFORMATION FOR SEQ ID NO: 189: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 
AACTGAAGGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATGACCT 60 
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TAAGAAAGCT CAAGAAGACA ACAACTTGGA CGACATGAAA ACAAAACTTG AAGCATTGAA 120 

CGAAAAAGCT CAAGGACTTG CTGTTAAACT CTACGAACAA GCCGCAGCAG CGCAACAAGC 180 

TCAAGAAGGA GCAGAAGGCG CACAAGCAAC AGGGAACGCA GGCGATGACG TCGTAGACGG 24 0 

AGAGTTTACG GAAAAGTAAG ATGAGTGTAT TGGATGAAGA GTATCTAAAA AATACACGAA 300 

AAGTTTATAA TGATTTTTGT AATCAAGCTG ATAACTATAG AACATCAAAA CATTTTATTG 3 60 

ATAATATTCC AATAGAATAT TTAGCTAGAT ATAGAGAATT ATATTAGCTG AACATGATAG 420 

TTGTATCAAA AATGATGAAG CGGTAAGGAA TTTTGTTACC TCAGTATTGT TGTCTGCATT 480 

TGTATCGGCG ATGGTACCAG CTATGATATC ATTAGAAATA CAAACATATA AATTTGTAAT 540 

ACCGTT CAT A ATTGGTATGA TTTGGACAGT AG TTGT ATTT CTTATGATCA ATTGGAATTA 600 

TAT AGGC AAA TACTAAGAAG AGACAAAAAT ATATAAATAT TTCTGTACTT ATAGGATATT 660 

TAAAATCAAA ATAAAGTTAA TTTACTTATT TGCAGAGGTT GCAACCCAGC CTCTGTTTTT 720 

CGATAAAAAG GCACGGAATC TCATTTGTTT GGGTTTTGTC TCATCAATAG AAAGGAACAA 7 80 

AGAGTGTTCG TAACTGAACA CGGGTTTCAG AATTTC TTAC TAAATATAAA AGAAAGGAAT 840 

TGAACCCGAC CTAAATGGTG GTTCGATTCA GAACATCAAT AGAAAGGAAT AAGGGTGTTC 900 

GTAACTGAAC ACGGGCTATG GACTGTGCCA AAAAGATAGT TTTTTCTAGG ACGTAAGCGT 9 60 

CCGTCGTCAA AACTCCTAGA TGGCTGTGTC CGTTTGACGC CCTTTGTATC TTGAATTATG 10 20 

AACAATACTG AATTTTATGA TCGTCTGGGG GTATCCAAAA ACGCTTCGGC AGACGAAATC 10 80 

AAAAAGGCTT ATCGTAAGCT TTCCAAAAAA TATCACCCAG ATATCAACAA GGAGCCTGGT 1140 

GCTGAGGACA AGTACAAGGA AGTTCAAGAA GCCTATGAGA CTTTGAGTGA CGACCAAAAA 1200 

CGTGCTGCCT ATGACCAGTA TGGTGCTGCA GGCG CCAATG GTGG TT TTGG TGGAGCTGGT 12 60 

GGTTTCGGCG GTTTCAATGG GGCAGGTGGC TTCGGTGGTT TTGAGGATAT TTTCTCAAGT 13 20 

TTCTTCGGCG GAGGCGGTTC TTCGCGCAAT CCAAACGCTC CTCGCCAAGG AGATGATCTC 13 80 

CAGTATCGTG TCAATTTGAC CTTTGAAGAA GCTATCTTCG GAACTGAGAA GGAAGTTAAG 144 0 

TATCATCGTG AAGCTGGCTG TCGTACATGT AATGGATCTG GTGCTAAGCC AGGGACAAGT 1500 

CCAGTCACTT GTGGACGCTG TCATGGCGCT GGTGTCATTA ACGTCGATAC GCAGACTCCT 1560 

CTTGGTATGA TGCGTCGCCA AGTAACCTGT GATGTCTGTC ACGGTCGAGG AAAAGAAATC 1620 

AAATATCCAT GTACAACCTG TCATGGAACA GGTCATGAGA AACAAGCTCA TAGCGTACAT 16 80 

GTGAAAATCC CTGCTGGTGT GGAAACAGGT CAACAAATTC GCCTCGCTGG TCAAGGTGAA 1740 

GCAGGCTTTA ACGGTGGACC TTATGGTGAC TTGTATGTAG TAGTTTCTGT GGAAGCTAGC 1800 
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GACAAGTTTG AACGTGAAGG AACGACTATC TTCTACAATC TCAACCTCAA CTTTGTCCAA 18 60 

GCGGCTCTTG GTGATACAGT AGATATTCCA AC TGTTC ACG GTGATGTTGA ATTGGTTATT 1920 

CCAGAGGGAA CTCAGACTGG TAAGAAGTTC CGCCTACGTA GTAAGGGGGC ACCGAGCCTT 1980 

CGTGGCGGTG CAGTTGGTGA CCAATACGTT ACTGTTAATG TCCTAACACC GACAGGCTTG 2040 

AACGACCGCC AAAAAGTAGC CTTGAAAGAA TTCGCGGCTG CTGOTGACTT GAAAGTAAAT 2100 

CCAAAGAAAA AAGGCTTCTT TGACCATATT AAAGATG CCT TTGATGGAGA ATAATACTCT 2160 

TCGAAAATCT CTTCAAACCA CGTCAGCGTT GCCTTGCCGT ATATATGTGA CTGACTTCGT 2220 

CAGTCGTATC TACAACCTCA AAACAGTGTT TTGAGCAGCC CGTGGCTAGT TTCCTAGTTT 2 280 

GCTTTTTACT TTATAGATTT TTTAAGACTT TCCTAAGTAA TGACGGACCG TAGTGACCTC 2 34 0 

CTTCGAAGTT CCATACCTAA ACTTTGAACC TAAGTTTTAA AGTTTCCGGA CAGCTGAAAC 2 400 

CAAGCTGTTT CAGGTGTTTT CATTACGGCA GAAAGTCTTC GATTTAGTTG TGAAATGGTG 2460 

AATGATACTC TTCAAAAATT TCTTCAAACC ACGTCAGCGT CGGCTTGTCA TGGCTATGGT 2 520 

TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACAGTG TTTGAGCTGA CTTCGTCAGT 2 5 80 

TCTATCCACA ACCTTAAAAC GGTGTTTTGA GCAGTCTGTG CCTAGCTTTC TAGTTTGCTT 264 0 

TTTGATTTTT ATTGAGTATG AATTACCTAA ATTATGATCC ATAGTTGATG GGATATATAT 2700 

AATAGATTGA AAT AGAAT AT GAACAAATTG ATAAGAGGAT TTTAAAGTAA TCTCTAACAA 2760 

TGCTTTAGAA ACTATGGTGT GCTATTCTAA ATTCAATTCA CTATAACTTG TTTACGTTTT 282 0 

AAAAAAGAGC CGTCGGGCTC TTTTTACTTA TCTTCAGTTC CCTGCATTTC TTTTATCACA 2880 

GCTAGTCTAG TCTGGATATC CTTTTCCAAG ACCTTAAACT TGTAAGTCAA GTCTTCTTGG 2940 

TATTCCTTGA TAAGTTCTTT TTGCTGGTTA ATGATTTGCA GGCTGTTTTG GATAATATCC 3000 

AC ATCG TCCT TGATAGCTTG AACGCGGTCA GTGGTATTCA AGACTTCATC TGTGATGGTT 3060 

TGGCGATTTT TTGTAACCAG ATAACTTCCG GCTGCAGCTC CTGCAAATAG CAGTAGGTTG 3120 

G AT AATTT C A TAGCAACTCC TTAAGCGTTT TTGATGGTTT CAGCGACTTG AGCAAGTTTG 3180 

TCAAAGTCTG GTTCGTGGGC GATAAAATCA ATCTTGAGGT CATCGTCAGC ACTGTAGCGA 3 240 

GGCACAAGGT GAACGTGAGT ATGAAAAACT GTTTG AC C AG CGACTTCTTC ACAGTTGGAA 3300 

ATGATATTCA TACCAGCAGC CTTAGTGACT TTCATGACTT TTTGAGCTAC TTTTGGTACT 3360 

TGGGC AAAG A GTTGGcTGGC GCTCGTAGCA TCCATCTCCA AAAGATTGCG ATAGTGTTCT 3420 

TTTGGCACGA CCAAGGTGTG TCCTAGTGTT ACTTGAGAGA TATCAAGAAA GGCAAGGACC 3480 

TGCTCATCTT CATATACTTT TGAAGCAGGA ATTTCCCCTG CGATGATTTT ACAAAAAATG 3540 

CAATCTGACA TAAAATCTAC CTCTACTGTA CTGAATTTTG ATATAATATA GCTACATTAT 3 600 
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ACCAGATTTG GAGAAAATAT GTTAGAAATT AAAAACCTGA CAGGTGGCTA TGTTCATGTT 3660 

CCTGTTTTGA AAG ATGTGTC CTTTACTGTT GAAAGTC GGC AGTTGGTCGG TTTGATTGGT 3720 

CTCAATGGTG CTGGGAAATC AACGACGATC AATGAGATTA TCGGTCTGTT GGCACCTTAT 37 80 

AGTGGCTCCA TCAATATCAA TGG CCTG ACT CTGCAAGGAG ATGCGACTAG CTACCGCAAG 3 840 

CAGATTGGCT ACATTCCTGA GACGCCTAGT CTGTATGAGG AATTGACCCT CAGAGAGCAT 3 900 

ATCGAAACGG TTGCTATGGC TTACGGTATT GAGCAAAAAG TGGCTTTCGA ACGAGTAGAG 3 960 

CCCTTGTTAA AAATGTTCCG TTTGGAACAG AAATTAGACT GGTTCCCTGT TCATTTTTCA 4 020 

AAAGGGATGA AGCAGAAGGT CATGATTATC TGTGCTTTTG TGGTGGATCC AAGTCTTTTC 4080 

ATCGTGGATG AGCCTTTCCT TGGTCTTGAT CCGCTGGCTA TTTCTGATTT GATTCAGCTT 414 0 

TTGGAAGTGG AGAAGCAAAA GGGCAAGTCT ATTCTCATGA GTACCCACGT GCTGGATTCG 4200 

GC GG AG AAG A TGTGTGATGC CTTTGT C ATT CTTCACAAGG GAGAGGTGCG TTCCAAAGGC 4260 

AATCTCCTGC AACTACGTGA AGCCTTTGAT ATGCCTGAGG CTAGTTTGAA TGATATTTAC 4320 

TTGGCTCTGA CCAAAGAGGA GG AT CTATG A AAGACTTGTT TTTAAAGAGA AAGCAGGCCT 4 3 80 

TTCGTAAGGA GTGTCTTGGT TATCTGCGCT ATGTGCTCAA TGACCACTTT GTCTTGTTCC 444 0 

TGCTTGTCCT GTTGGGCTTT CTAGCCTACC AGTACAGTCA ACTCTTACAA CATTTTCCTG 4 500 

AAAATCATTG GCCTATCCTT TTGTTTGTAG GAATTACGTC TGTTTTACTT TTACTTTGGG 4560 

GAGGAACTGC CACCTATATG GAGGCTCCAG ACAAGCTCTT TCTCTTAGTT GGAGAAGAGG 4 620 

AAATTAAGCT CCATCTCAAG CGTCAAACTG GCATTTCCCT AGTCTTTTGG CTCTTTGTAC 4 680 

AG AC C CTTTT CTTGCTGTTA TTTGCGCCTT TATTTTTAGC AATGGGTTAT GGCTTGCCAG 4 740 

TTTTTCTGCT CTATGTGCTT TTATTGGGGG TAGGAAAATA TTTCCACTTT TGTCAAAAGG 4 800 

CCAGCAAATT TTTCACTGAA ACTGGACTGG ACTGGGACTA TGTTATTTCT CAAGAAAGCA 4 860 

AGCGTAAGCA AGTCTTGCTT CGTTTCTTTG CCCTCTTTAC GCAGGTCAAG GGAATTTCAA 4920 

ACAGCGTTAA GCGTCGTGCC T AT C TGG ACT TTATTTTAAA GGCTGTTCAG AAGGTGCCTG 4980 

GGAAGATTTG GC AAAATCTC TATCTGCGTT CTTATCTGCG AAATGGCGAC CTCTTTGCTC 5040 

TCAGTCTTCG TCTTCTCTTG CTTTCCTTGC TGGCGCAGGT TTTTATCGAG CAAGCTTGGA 5100 

TTGCGACAGC AGTGGTAGTT CTCTTTAACT ACCTCTTGCT CTTCCAGTTG CTGGCCCTCT 5160 

AT CAT GCCTT TGACT AC C AG T ATTTGACC C AACTCTTTCC GCTGGACAAG GGGCAAAAGG 5220 

AAAAAGGCTT ACAGGAGGTA GTTCGAGGAT TGACCAGTTT TGTTTTACTT GTGGAATTAG 5280 

TTGTTGGGTT GATTACCTTC CAAGAAAAAC TAGCCCTTCT AGCCTTACTA GGAGCTGGTT 5 340 
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TGGTTTTACT AGTCTTGTAT TTGCCTTATC AGGTAAAACG TCAGATGCAG GACTAACATT 5400 

GCTGATACGA CACTAAAAAA GAAGTTGAGT TCAGTCTGTC TCAACTTCTT TTTTGTTACT 5460 

ACAGGATAAT GGTTGGTCCG TAGAGACTTA T ACTCTT CG A AAATCTCTTC AAACCACGTC 5 520 

AGCCTCCTCT TACCGTACTC AAGTACAGCT TGCGCCTAGC TTCCTACTTT GCTCTTTGAT 5S80 

TTTCATTGAG TATTAACTTG GTCTTGACTT GGTCAAAGTG G AAGCGGT C A TAGGCCCGCC 564 0 

AAGCGGCGCG AGTTGGAGCA TCTGGATCAA GAGCGCTGAG TCCCATGAGA AGACTCCAAC 5700 

TCTGGTAAAA TTTTTCTAGT TCAATCAAGA ATCGATTATC CACTGTTTCA GCCTTGGCTA 57 60 

GAAAACCAAG AATAGAGTTT AATTGCTCCT GAAAGCGGAC GTCGTCAGCG CTTCCCTCTT 5820 

TGCATGCTTG GTAGGCTTTG TTTAAGTCAG TAATCAAAGT ATGAGCTCTT TTGATGGGGT 5880 

CTGTATCTGT CATGGGAATG CCTCCTTTAA TCTGGGTGCC AGTCTTACTT CTGGCAACTG 5940 

TGTTT TG AT A CTCTTAGTTT ATCACTTTTA ATTCTTTTTT TTTATTCAAA TCTTTAATTG 6000 

TCATTGAAAT GTCTTGAATT GCGCTGAGTG AATTTTATGA TAAAATAGTT GTAAGCTCAT 6060 

CATGATGTTG TAGAAAATAA TCCTTTTAGG AGTTTTCAAA GACTGTTTAG GATTGGCTGT 6120 

GCTTGGGCTA GACCTTTTCT GTTATTCTTT TCTTAGGAGG AGAATCCAAT GAAATATATG 6180 

ATTATTCAGA CGCAGAAAAC AGTCTATAAA GTAAACATCG ACGATATCTA CTATATCCAA 62 40 

ACACATCCAA CTAAAGCCCA TACCGTACAG ATTGTTACAG AAGAAGCTAG TTTTAATATG 6300 

CTTCAAAATT TAAGTAATCT TGAGAACCAA TGTGGGGAAA CCTTGATGAG ATGTCATCGA 63 60 

AATTGTTTGG TTAATCTTGA TAAATTAAAA TCGATTGATT TTCAAGAAAG AATCCTTTTT 64 20 

CTCGGAGAAG AAGGTCAATA CGCTGTCAAG TATGCCAGAC GTCGCTATAC AGAAATTCGT 64 80 

CAAAAATGGT TGAAAGAGGG AGAGTAAGAA GATGAGAATA TTTGTTTTAG AGGATGATTT 6540 

TTCCCAACAG ACTAGAATTG AAACGACGAT TGAGAAACTT TTGAAAGCAC ATCATATCAT 6 600 

TCCTAGCTCT TTTGAGGTAT TTGGCAAGCC GGACCAACTG CTGGCTGAAG TGCATGAGAA 6660 

GGGGGCCCAT CAGCTATTCT TTTTGGATAT TGAGATTCGA AATGAAGAGA TGAAGGGACT 67 20 

GGAAGTGGCT AGAAAGATTC GGGATCGGGA TCCTTATGCC CTCATTGTCT TTGTGACGAC 6780 

TCACTCGGAG TTTATGCCCC TGTCTTTTCG CTACCAAGTG TCTGCTTTGG ACTACATTGA 6840. 

TAAGGCCTTG TCAGCAGAGG AGTTTGAATC TCGGATCGAG ACAGCCCTCC TCTATGCCAA 6900 

TAGTCAAGAT AGTAAAAGTC TGGCGGAAGA TTGCTTTTAC TTTAAATCAA AATTTGCCCA 6960 

ATTTCAGTAT CCTTTTAAAG ACGTTTACTA TCTCGAAACG TCGCCCAGAG CCCATCGTGT 7020 

TATTCTCTAT ACCAAGACAG ACAGGCTGGA ATTTACAGCG AGTTTAGAGG AGGTTTTCAA 7080 

GCAGGAGCCC CGTCTCTTGC AGTGCCACCG CTCTTTTCTC ATCAATCCTG CAAATGTGGT. 7140 
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G C ATTTGG AT AAGAAAGAAA AACTGCTTTT CTTT 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


190: 






C CACC AGGG A 


AAATCATTGA 


AGTTGGTAGT 


CACCAAGAGT 


TAATGCAGGC 


GCAAAGTTTC 


60 


T AC CATC ATC 


TATTCAATAA 


ATAAGGAGAA 


TGTCATGAAT 


CCT AAT CTTT 


TTAGAAGCGT 


120 


CGAGTTTTAT 


CAGAGACGTT 


ACCATAACTA 


TGCGACAGTG 


TTAATTATAC 


CTCTTTCATT 


180 


ACTATTTACT 


TTCATCTTGA 


TTTTCTCCCT 


TGTTGCCACA 


AAAG AAATT A 


CTGTTACTTC 


240 


CCAAGGAGAA 


ATCGCCCCTA 


CAgTGTCATT 


GCCTCCATTC 


AGTCAACCAG 


TGATAATCCT 


300 


ATCCTAGCTA 


ATCATTTAGT 


GGCAAATCAA 


GTAGTTGAAA 


AAGGGGACTT 


ACTC AT C AAA 


360 


TACT CTG AAA 


CAATGGAAGA 


AAGT CAGAAA 


ACTGCCTTAG 


CAACTCAATT 


ACAAACACTT 


420 


GAGAAGCAAA 


AAGAAGGACT 


TGGAATTTTG 


AAACAAAGCT 


TAGAAAAAGC 


GACTCATCTT 


480 


TTTTCTGGCG 


AGGATGAATT 


TGGCTACCAT 


AAT AC CTTT A 


TGAATPTTAr 


Tiun A Tf/^ 
l nAnLAni I \. \. 


C A n 


CATGATATTG 


AACTGGGTAT 


CACAAAGACT 


AACACCGAAG 


TTTCAAATCA 


AGCTAATCTT 


600 


TCCAATAGCA 


GTTCATCAGC 


TATTGAACAA 


GAAATTACAA 


AAGTTCAACA 


ACAAATTGGA 


6 60 


GAATATCAAG 


AGTTGAGAGA 


TGCTATCATA 


AATAACAGAG 


CACGCTTACC 


AACTGGCAAT 


720 


CCGCACCAGT 


CAATTTTGAA 


TCGTTATCTT 


GTAGCCTCAC 


AAGGACAAAC 


ACAAGGAACT 


780 


GCAGAGGAGC 


CATTTTTATC 


TCAAATTAAT 


CAAAGTATTG 


CAGGTCTTGA 


ATC AT CT ATC 


840 


GCAAGCCTCA 


AAATTCAGCA 


AG CTGGT ATC 


GGAAGTGTAG 


CAACTTATGA 


TAACAGTTTA 


900 


GCAACCAAAA 


TTGAAGTACT 


CCGCACTCAG 


TTTTTACAGA 


CAGCCTCACA 


GCAACAACTA 


960 


ACTGTGGAGA 


ATCAATTAAC 


AGAATTAAAA 


GTACAACTAG 


ATCAAGC C AC 


ACAG CGTTTG 


1020 


GAAAACAATA 


CCTTAACCTC 


CCC AAGT AAA 


GGTATCGTTC 


AT CTG AACAG 


CGAATTTGAA 


1080 


GGTAAAAATA 


GAATTCCAAC 


TGGTACAGAA 


ATTGCTCAAA 


TATTCCCTGT 


CATCACAGAT 


1140 


ACAAGAGAAG 


TACTAATCAC 


TTACTACGTA 


TCTTCTGACT 


ATCTACCTCT 


ACTAGATAAA 


1200 


GGACAAACTG 


TAAGATTAAA 


ACTGGAGAAG 


ATTGGAAATC 


ACGGCACCAC 


CATCATCGGC 


1260 


CAACTTCAGA CAATTGATCA 


AACTCCTACC 


AGAACAGAGC 


AAGGAAATCT 


CTTTAAATTA 


1320 
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ACCGCTCTTG CAAAACTATC TAACCAGGAT AGTAAACTCA TCCAATATGG CTTACAAGGT 1380 

CG CGTCACT A GTGTAACT AC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440 

ACACATT CTG ATTAATTTTC AGATAACACT CTATAACTAT TTATTATCTT ATCAAAAAGG 1500 

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560 

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620 

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680 

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCT GT A TGTATCACAG TAAGTACCTG 1740 

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800 

CTTCATGACG GTATAGAGAC CAACTCCTCT CTCCTCCCCT TT AG AA CTG G CTCCAAAGGA 1860 

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920 

CTGTGCTCCA TTTTTTAAAA AGGCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980 

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 204 0 

CTCGACCTGA ATCT CCTCAG GAACTTCGAC ATTAAAGACA ATGTT CTT AT CTCTGGCTTT 2100 

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160 

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACGG AGC CATAG AC 2220 

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280 

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTGTAAAGC TCCTCTATAT GCCGACTATA 2340 

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTT G TTCC AATCTCTCAT AGAGTTTTTC 2400 

CTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGACTAGGT AAAAGACTAG 2460 

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2520 

TTCCATGACT AGATAGTAGC CACCCATTAT CCAGTTAATC TGAGTCAGGG ACTTTTGAAA 2 580 

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2 640 

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG AGT AGC CATC 2700 

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760 

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG AGTTTTTTAG GAAGCCCTCT 2820 

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2 880 

ATTTCCTACT AT AT AGCT AA T CAT C ACAAA AACAAAGGCC AACAGT AT CT TCAAAAGAAA 2940 

GGCCTTAAAA ATCCT CTCG A AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000 

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060 

TAGCTCAGTG TAATTTATTG ATGGC CTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 312 0 
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GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180 
TTGCAGGATT GATGAGAAAA GAGCGGT 3 207 

(2> INFORMATION FOR SEQ ID NO: 191; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103S7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

CTGAATCAAG TGTACTGCAC CAGTTCGTGC ATCAGGCATA ACAACATCTA CAGATATAAT 60 

ATTGTTTTCT GAGTCCGCCT CATAAGTTAA AATCATAAAT TTTTCGATAT TCGAATTTTT 120 

AGTAGCTTGT TCAATTTCTT GAATCATTTC ATCAGAAACT AACTCCATCT GAATTGGAAA 180 

GGAATGACTA TTTT CAT CAT TTTTGTAGGA AGAATGTTGA TTAAGATAAA GTGTATTCAT 24 0 

CTGAGCATAT TCAAATAAGT AGCCACTCTT ATTTTTTTGT ACCAAAGGAA ATTGGTTTGT 300 

AAGTCGCTTC TTACCCTTTA TAATTAACAA TACTTTCCCA TATTTTTCTG TATTTGTTTC 360 

AAATTCTAAA TATCCCCAAG TCTGTCCTGC TAATTGTAAT TTATACTCAA ACAAATCTGC 4 20 

TGATGCAAAT GCAGTATCAA TATGATTAGG TCGCGTCCAT GCATAAC CAT TCGACACTAT 480 

CATTGTCTCT CTTTTTTCTA GACGTTCATC TACATAATCT TTTTGCCCTT TCATCAAAGT 540 

ATCTACAATT TTTTGTGCCT CAAGCGAATC AAAGAGATCC TGATTCAACA TAATTCTTCC 600 

TCCTC CAAAT ACTTTTTAAT GAATTATACC ATTTTCTTAA AGAAATTACT ACAATAATTA 660 

TCTTTTTCTT AAAGTTCTGT CTCAGAGTAA TTTAGAAAAT TATATCTTCT ATAGTAAAAT 720 

CAATTAAAAA CTGAACAAAT TTATTGGGAA ATTCAAATCG CTTTCTGAAA ATATTTTAGC 7 80 

AACCGTAGTG TAATATTCCA GATTCAATTC ACTATAAAAC TGACCTTTCT CCTGCAAAAG 840 

AAAAAGGAAA GACTTCCTTT CGTGCCTTTC CTCTTACTTG CTACTTGTTT GATTATTTTT 900 

GGTAAGCTAC TGCTTGTCTG ATAAAATCCT GAATCGGCTC TCCTTGGTGG AGAGCTTTTA 960 

CTATTTTCGA ACCGACGATA ACACCATCTG ACACCGCATT GAAGCGTTCC AGATCGGCTT 1020 

GACTAGATAC ACCAAAACCT GTCAAGACTG GGATGTCGGC CACTTGATGA AGTTGCGCCA 1080 

AGTGCTTGTC CAAAT CTGCA CGGTAATTGC CTGATTTCCC TGTCACTCCA TTGATGGCAA 1140 

CGGCATAGAT GAATCCCTCC GCCCCTTCAA TCAACTCTTT CTGGCGCTCA ATTCCTGTGG 12 00 

TCAAGCTTAC TAAAGGAATC AAGGCGATAT CTGTATTTGC CAAAAATGGT TCTACAAAGT 12 60 
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TGGCATGTTC 


ATGAGGCAGG 


TCTGGGATAA 


TCAAGCCCTT 


CACAGCTGTA 


TCAGCCAGAT 


1320 


CTTTGACAAA 


GTTCTCC AC A 


CCGTACTGAA 


AGAGGGGGTT 


GAAGTAGGTC 


ATCATGACCA 


1380 


GTGGAATCTC 


TGTTTCAATG 


GTTTTCAAGG 


TTTCAACTAA 


AGCCTGGGTA 


GAGGTCCCGT 


1440 


GGGCTAAACT 


GCGCAAGCCA 


CCTTCTTCGA 


TAACAGCTCC 


ATCTGCAACA 


GGGTCTGAAA 


1500 


AGGGAATACC 


CACTTCAATT 


GCAGAGACAC 


CCAAATCTTC 


TAAAAAGTGA 


ATT GTTTCAG 


1560 


CAAGACCGTC 


CAAACCTTTC 


TCGTGGTCAC 


CAGCCATGAT 


ATAGGGAACA 


AAAATTCCTT 


1620 


TTCCAGCTGC 


TTTAATAGCA 


TTTAATTTTT 


CTGTTAGTGT 


CTTAGGCATG 


AGCTTCTCCC 


1680 


TTCTrTGC TO 


CATCTGCTTC 


CAAGCGG TCC 


TTGACTTGAA 


CCACATCCTT 


GTCCCCACGA 


1740 


CCTGATAGGC 


AGACAATCAT 


AGACTTTTCT 


GGTCC AAGTT 


CTTTGGCCAA 


TTTCACCG C A 


1800 


AAGGCGATAG 


CATGGCTAGA 


TTCCAAGGCT 


GGGATAATCC 


CTTCCACACG 


AGACAAGAGT 


I860 


TGGAATCCTT 


CCAAGGCTTC 


TTCGTCTGTC 


ACAGGGACAT 


AGCTGGCACG 


TTTAATATCG 


1920 


TGGTAGTGAG 


AATGCTCTGG 


ACCGATACCA 


GGATAGTCCA 


AACCTGCTGA 


GATAGAGAAG 


1980 


GCTTCAAGAA 


TTTGACCATG 


GGCATCTTGG 


AGCACATCCA 


TGAGGGAACC 


GTGAAGGACA 


2040 


CCTGGACGAC 


eCTTGGTCAA 


GGTAGCTGCG 


TGGTGCTCTG 


TATCCACACC 


AAGCCCTGCT 


2100 


GCTTCAGTTC 


CATACATAGC 


TACTGACTCA 


TCTTCTACAA 


AGGGATGGAA 


GAGCCCGATA 


2160 


GCATTCGACC 


CACCACCAAC 


ACAGGCTACT 


AGGGCATCTG 


GCAGATCTCG 


ACCTGTCAAG 


2220 


TCACGGTACT 


GTTGTTTAGC 


CTCTCGACCG 


ATGACACTTT 


GGAAGTCACG 


AACGATTTCT 


2280 


GGAAATGGAT 


GAGGCCCCAA 


GGCAGAACCA 


AGGATATAGT 


GGGTATCGTC 


GATATTAGCC 


2340 


ACCCATGAAC 


GAAGGGCTGC 


ATTGACCGCA 


TCCTTGAGCA 


CGCGCGAACC 


ATCTGTTACA 


2400 


GCCTCGACCT 


TGGCTCCCAA 


AAGCTCCATG 


CGGAAGACAT 


TGAGGGCTTG 


GCGTTTGACA 


2460 


TCTTCCTCAC 


CCATGTAGAT 


GGTACATTCC 


ATGTT AAAG A 


GGGCTGCAGC 


AGTTGCAGTT 


2520 


GCCACACCGT 


GCTGACCAGC 


ACCCGTTTCT 


G CG AT AATTT 


TCTTTTTACC 


CATGCGTTTG 


2580 


GCAAGCCAAA 


CTTGTCCTAA 


GGCATTGTTA 


ATCTTGTGGG 


CTCCTGTATG 


GTTAAGGTCT 


2640 


TCCCGTTTGA 


GAT AAATCT T 


GGCTCCGCCA 


ATATGCTGGG 


TCAAGTTTTT 


TGCGTAATAA 


2700 


AGAGGAGTTT 


CACGTCCTAC 


GTACTGGCGC 


AAAAGCTGGT 


TTAATTCCTC 


TTGGAAACTT 


2760 


GGGTCTGCCT 


GACTTTCACG 


GTAGGCCTTC 


TCCAACTCCA 


AAACTGCTGT 


CATCAATGTT 


2820 


TCTGGGACAA 


AACGTCCGCC 


GAATTTTCCG 


TAAAATCCAT 


CTTTATTTGG 


TTCCTGATAT 


2880 


GCCATGCTTT 


ACCCTCTCTA 


TAAATCTTCT 


AATCTTTTCA 


TGATCTTTTT 


GTCCATCTGT 


2940 


CTCCACTCCG 


CTCGATACAT 


CTACTGCATA 


GGGAGTAAAG 


TGTTGAATTG 


CTTTTACTAC 


3000 


ATTATCTTCA 


TTAAGGCCAC 


CTGCGATAAA 


GAAGGGCTGT 


CCTAGTCCAG 


TCGTATCCAG 


3060 
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TTGACCCCAA TCAAAGGGCT GGCCACTTCC TGCCACAGGG GCATCAAAGA GTAGATAATC 3120 

TGCCTGAGAA TTGGGGACAT GCCCATTTCC ATCTACCTGC ACAGCCTGAA TACTGGCACA 3180 

AGGCAAATTC TCAAATAAAT CATCTGCCAC CTGACCGTGA ACTTGAACCA AGTCCAAGCC 3240 

AACTTTGTCA ATCGCTTCCA GCAGTTCTAC CCGACTTGGT GAAACAAATA CTCCAACCTT 3300 

TTTCACATCT GCAGGAATAA GCTTTGCCAA CTCAGCTGCC TCTTCTAAAG TCACCTGTCT 3 360 

TTTACTAGGT GCAAAGACAA AACCGATATA GTCGGCTCCT GCTGAAACGG CTGTTTCCAC 3420 

CGCTTCTTTG GTCGATAGTC CACAAATTTT AACCTTTGTC AATCTGCAAC TCCTTGATTC 3480 

TCTGGGCCAC ATTTTCTGCC TGCATAAGAG CTGTCCCTAC CAAAATTCCG TTAAAGTATG 3 540 

GGGCTAGTCG TTCCGCATCC TGC CCTGTG A AAATGGCAGA TTCAGAAATG TAATAGCGAC 3 600 

CTTCCTCAAA GTAAGGGGCT AAATCTACAC TGGTCTGCAA GTCGACCTCA AAGGTAGTCA 3 6 60 

AGTTGCGGTT GTTGACCCCG ATAATCTCAG CACCAAGTCT GTGGGCTACC TCTAGTTCAG 372 0 

CTAGATTGTG ACTCTCCACT AAGACTTCCA GACCAAGCTC TGTCGCGTAG TCATACAGTT 3780 

CCTTGAGGCG TTCTTCGGAC AAGGCTGCCA CAATG AG CAA GATAACTGTC GCACCTGCAT 3840 

TGCGAGCGCG GATGATTTGC TTTTCATCGA TGATAAAGTC TTTGTTGAGC GTCGGAATCT 3 900 

CTACCTGACT GGAAATTTCC CGTAGATAAT CCAAATGCCC TTTAAAGAAA ACCTCATCTC 3 960 

TC AACACCG A AATCATCACT GCTCCGTTTT CTTCATAAGT CTGGGCCTGT TG CACAAT AT 4020 

CCACATCGAG ATTGATATCT CC CAAACT AG GGCT AG CTTT CTTGACCTCA GCGATTACCT 4080 

GCAAGCGGTC CTGATGATTC TTCAAAAATT CTGCCAAGCG ATAGGTCTGG CGCAGAGGCT 4140 

GGATTTGCTC CAGCTTCATC TGCTCCACCT CACGCGCCTT CTGCTCTAAG ATTCGTCCTA 4200 

AAAATTCCTG ACTCATTTTT GGTACTCCTG TAACAGTCTG AGTTTTTCAA GGGCCTTGCC 4260 

TCTAGCAATC ACTTGACGGG CCAAGGCAAC CCCTTCCTTG ATGCTATCAA TCTTACCATT 4 320 

AGCATAGAAA CCAAGACCAG CATTCAAGAC TGTCGTTTCC AAGAATGGAC TTGCTTCGTT 4 380 

TTTCAGAACG CTAAGCAAAA TTTCTGCATT TTCCTGAGCA TTCCCACCAC GAATATCTTC 4 44 0 

CATAGCATAG CCTTCCATTC CCAAATCCTC TGGAGTAAAG CTTGACAAGC TGATTTCGCC 4500 

ATTTTCAAGA AGTCCAATCT TGGTTGTTCC GTTCAAGCCA GCTTC AT CCA ACCCTTCTGG 45 60 

TCCAGCAACC ACGATGGCAC GTTTGCGACC CATATTTTTC AAAACCTGAG CTGTACTTTC 4620 

TAGGAGTTCT GGACGACTAA TTCCAAGAAG CTGTGTTTCT AAAGCCATTG GATGAATCAG 4 680 

TGGACCAGTC AAGTTCATAA TCGTTGGAAT TCCCAATTCC AAACGAGCTG GCATGATGTA 4740 

TTT CAT AGCT GGGTGCATAT TTTTAGCGAA GAGAAAGACG ATTCCAGTTT TATCAAAGAC 4800 
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CTTACCTAGT TCAGCTGGTT TGAGGTCAAG ATTGATTCCC AAGGCTTCGA GGACATCTGC 4 860 

GGAACCAGAT TTAGAAGATA TCGAGCGGTT ACCGTGTTTG GCCATGTCAA TACCGCCACC 492 0 

AGCCAAGACA AAGGCTGCAG TTCTGGAAAT ATTAAAACTG AAAGACTTCT CCCCACCTGT 4980 

ACCACAGTTG TCCATGGCAT CATGAATCTC AGTTGGAATA TGCTGGGCAT GTCCTCTCAT 5040 

GACTTGGGCA ATGGCTGTGC GTTCTTCAGG TGTTTCCCCC TTCATCTTAA GAGCTAAGAG 5100 

GAGAGAAGCA ATCTGCGCTT CAGTTACACG CCCAGTTACG ATACGCTCAA TGACATCCGT 5160 

CATTTCCACA CCTGATAAAT TTTCAAATTT TGCTAGTTTT TCAATAATCT CTTTCATCCT 5220 

AGTTTCCTCA CTTTACAACC TCCTCGATAA AATTCCGAAT AGAAGACAAG CCGTCTGGCG 5 280 

TTCCAATGCT CTCTGGATGG TACTGGAAGC CATAAATCCG TAGGTTTTTA TGTTGAATCC 5 34 0 

CCATGATGGC TTGGTCATCA GTCGAACGAG CTGTCACTTC AAAGTCTTCT GGCATTTCCT 5400 

CAATCAAAAT ACTGTGATAA CGCATGACCG CACGGCCATC CTCAATACCT TGATACAAAA 54 60 

CAGATGGCGC TTCAAAGTTG ATATTGCTCT GTTTCCCATG CATGACTTTT GGAGCCAAAC 5520 

CTAGCTTACC ACCAAAGACT TCTGCAATGG CTTGGTGGCC CAAACAAATC CCAAGAATCG 5580 

GCTTCTTGCC TGCAAAATCA CGAATCATGT CTTCCATCTT TCCAGCATCA ACTCGCCAAC 5 64 0 

CAGGACCAGG AGAAAAGACC AGACCATCTG CTTTTTCAGC TTCTTCATAC AGCTTGGAAT 5700 

CATCATTTCT CAGAACCTGA ACTTCTGCAA AATTCC C AAT GTATTGGGCC AAGTTATAGG 5760 

TAAAAGAATC ATAGTTGTCA ATCAATAAAA TCATGGTCTT AGTTCTCCAA TTCT AGT CAT 5820 

AGATTTTGCT TTGTTAATGG TTTCTTGGTA TTCGTTTTGG GCGATAGAGT CGTAGACAAT 5880 

CCCTGCCCCA GCCTGCACAT AGGCTCTTTG ATTTTTGAGA ATCATGGTTC GGATGGCGAT 5940 

GGCCAAATCC AT ATCAC CCG TCGCAGACAA GTAGCCGATT GCCCCAGCGT ATACTCCCCG 6000 

TTTTTCCGTT TCCAGTTCAT AGATACGTCT CATCGCTCGA ATCTTTGGTG CTCCAGAAAC 60 60 

GGTTCCAGCA GGAAGCGTTG CTTTCAAGGC ATCCATGGCA GTGAGTTCTG GAAGCAAACG 6120 

CCCCTTGACT ACGCTGGTCA AATGCATGAC GTAGCGGAAG AGCTCCACTT CCATATACTT 6180 

AGTGACTTGG ACACTGGTCG TTTCAGAGAT GCGGCCAATA TCGTTACGCC CCAAGTCTAC 62 40 

CAACATTCGA TGTTCTGCTG TTTCCTTCTC ATCAGAGAGG AGGTCAGTCG CCAAGGCCTT 6300 

GTCTTCTTCA TCCGTAGCCC CTCTTGGTCG CGTCCCTGCA ATCGGATTGG TTGTCACGAT 63 60 

GCCATTTTTG ACAGAAACCA AA C TT T CTGG ACT AG CT CCG ATGATTTGAT AATCCCCAAA 64 20 

ATCATAGAAA TAAAGGTAAT TAGAAGGATT AGTCACGCGG AGATTTCTGT AGAAGTCAAA 6480 

TGGATTTCCA GTAACTTCTG CTGAAAAACG CTGGCTGAGT ACACATTGGA ACATATCTCC 6540 

GTTACGAATC AAGTCACGAG CTGTTTCTAC CATTCCCTCA AACTTATGTG GAGCGATATG 6600 
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CGGTTTGAAG 


TCTAACGGAG 


ATAGATCCAA 


ATCTTCAAAT 


TCATTTGGAG 


CAGGAATGCG 


6660 


TAATTCCTCA 


AGCACTTGGT 


TCAAGGATTT 


TTCCAAGGCC 


TCTTGACTGC 


GCTCACTATA 


6720 


AAGTGCATCC 


TCTATGACAT 


GTATCTTCTC 


CTTCTTGTGG 


TCAAAGACCA 


TATAGCTCTC 


6780 


AT AG ACAAAG 


AAATGCATGT 


CTGGCGTCCC 


AATTGTATCC 


TCAGGGATTT 


GACCAATTTC 


6840 


TTCATAAAGC 


GAAATCATAT 


CGTAACCCAC 


AAAACCAATG 


GCTCCACCAC 


CAAAAGGTAG 


6900 


CTCTGAGTGG 


TGCTGACTCT 


TATGAATCAC 


TTCATAAAGG 


AAATCCAAGG 


GATCCCGATC 


6960 


AATCACTTGA 


CCATTTTGAT 


AGAGAACCCC 


ATTTTCAAAC 


TTAATCTCAA 


AAACTGGATT 


7020 


ATAGGCTAGG 


ATAGAAAAAC 


GAGCTGTTTC 


CTTGTCTCTC 


GGAATACTCT 


CTAAAATAAC 


7080 


CTTATGTTGC 


CCCTTTAAGC 


GCATATAAGC 


CAAGATTCGT 


GATAAGACAT 


CTCCATGAAT 


7140 


GATTCGTTCC 


ATTGTAATTT 


CCCTTTCAGT 


TCTACTTCTA 


GTCCGTGGTG 


ACTGTATGAA 


7200 


AAATCCCCAC 


GCAAAATAAC 


TTGCGTGAGG 


ACGAAATTCG 


CGGTGCCACC 


TCAATTATAG 


7260 


GATTTCTCCT 


ATCTCTCATT 


CCTGTCTCAG 


ATATCTCCTG 


TAACAGGCTG 


TGCGATAAAG 


7320 


GGCACTCCCT 


TGAGAATGAT 


GTTTTCTTCT 


CTCGTTTCAG 


ATGAACCCAA 


CTTTACAGCT 


7380 


TTCTCTGCTT 


GTTTTCAGCA 


ACCACAAGCT 


CTCTGTGAGA 


GAAAGAACTG 


TAATTTTTCC 


7440 


ATCTATTATT 


TTTTAGCTTC 


TAGTAGTCTG 


CAATCGCAGC 


TAGGTCCTTG 


CCTCCACGAC 


7500 


CAGAGACATT 


GATGAAGAGA 


TGTTCATCTC 


GGTACACCTT 


TAT ACT CTTC 


GAAAATCTCT 


7560 


TCAAACCGCG 


TCAACGTCGC 


CTTGCCGTAG 


GTATGGTTAC 


TGACTTCGTC 


AGTTCTATCT 


7620 


GCAACCTCAA 


AACAGTGTTT 


TGAGCTGACT 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAACAG 


7680 


TGTTTTGAGC 


TGACTTCGTC 


AGTTCTATCC 


ACAACCTCAA 


AACAGTGTTT 


TGAGCTGACT 


7740 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAACAG 


TGTTTTGAGC 


AGCCTGCGGC 


TAGTTTC CT A 


7800 


GTTTGCTCTT 


TGATTTTCAT 


TGAGTATTAC 


TAGCTTTTTT 


CGTATTAGTC 


CAGCCTTTTT 


7860 


GTTTGCTTTT 


AGT AGT AGG C 


ATGG AG CTGT 


AG AT AG AACT 


CAAGTTCATC 


AAAGCGACTT 


7920 


AAGGCCCTAA 


TAAAAGATAA 


ACCAAACGAC 


GGATAGAAAA 


AAGCCCACAC 


ACAGAATATA 


7980 


CTTCCGTGTG 


AGGGCGTTGG 


TAACGCGGTG 


CCACCTCAAT 


TATAAAGGGA 


CTATCCCTTT 


8040 


ACATCTCTGC 


CTTGTTTAAC 


AACAAGCTGC 


ACTGTAAGGT 


GTGCGCACCG 


AATTTTCATT 


8100 


GTTTCAAATT 


CATTTTCAAA 


ATCAGCCCAC 


TTTCACTACT 


TCCAACCACC 


TATTCACAAT 


8160 


CACCACAGGC 


TCCCTGAAGA 


TCAAAAATAG 


TTACTTTTCT 


GATTTGTTGA 


ACTTATTTTA 


8220 


AT AC TTTGTT 


TTTTCTTTGT 


CAAGACTTTT 


TTACGATTTT 


TTTGAAAATA 


TCATTCGAAT 


8280 


ATGACCATGT 


CTTCCTTAGA 


TCGAACATGA 


ACATGTCCCA 


CTTCTTAGAA 


ATTGGATCCA 


8340 
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ACTCAATAGA 


AACTGAATGG 


AGGCTAAACA 


GAACTTATTT 


TAGAACACTC 


CATCTTTTCC 


8400 


ACT AGG ATTT 


TCAAGAATTA 


AACAATACTA 


GAAACTCTGT 


CTCCTAACAA 


ATTTAGGAGA 


8460 


AACTTCAACA 


GATGTGACAC 


TTTCCCCTTT 


AATAATTGCT 


AAAACACCTT 


CTATCATTTC 


8520 


TTTAGCCAAT 


TTAACATAAT 


TGGGAGCAAT 


TGTAGACAAA 


GCTGGAGTAT 


AATACTGAGA 


8580 


AATAGGAATA 


TTATCAAATC 


CAATGATAGA 


AATATCATCT 


GGAATAAGAA 


TTCCTTTCTC 


8640 


ATAGCACGCA 


CGAATCAACC 


CCTGAACCTT 


TTCATCTCCT 


GAAACAAAAA 


TAATGTCCGG 


8700 


ATAATTTTGG 


GTAGTCAAGT 


GCTGCATTGC 


ATAAGAATAA 


ACTGAATCAA 


TTGTAGATAA 


8760 


GCCATAAATG 


ACTTTTAAAT 


C CAT AAAGT A 


ATTTTTATCA 


TTCAGAAAAG 


AACGCACACC 


8820 


TCTTTCACGA 


TCCTTATTAA 


CATGGCATTC 


TCCTCCCATA 


AGCAACCACA 


TATTTTTAAA 


6880 


TTTTTCTTCA 


GTTACAGCTT 


TC AT CAT ATC 


ATAAGTAGCT 


TG AAAATT AT 


TATTAGATAC 


8940 


ATAGACTACT 


CCAGACGTTT 


GAGATTCACC 


GAAAACAAGA 


AAAGGCATAT 


GCTTCTTCTT 


9000 


TAAATACTGA 


ATT CTG AT AT 


CATCTACACT 


TTCATAAAAA 


ACAATAACAC 


CATCTACTAG 


9060 


GCTACCTGTG 


CTTGATATAA 


TTGAATTACT 


AATTGTATCC 


TCCTCTCCAA 


AGTACTCAAC 


9120 


TATAGCATTA 


ACACCAAATT 


CTTTACACGT 


CCGTAACACT 


TTATCTAACA 


GCGTATGAAA 


9180 


CCAAATTAAA 


GGAAAAGAGT 


CGATTTTTTT 


TACAGAAATC 


AATATATTTA 


TAGCTTCTTT 


9240 


TTTAGTTAAA 


TTTTTTGCAT 


ACGCATTTGG 


AATATACGAC 


AATTCCTCTA 


TAACTTTTTG 


9300 


AATCGCTTGA 


TAAGTTTCTT 


CTTTAACATT 


TACTCCACCA 


TTAATAACTC 


GTGAAACTGT 


9360 


TTTTGGAGAA 


AAACCTGATA 


AACGTGCAAT 


ATCATAAATA 


GTTACCTTTT 


TCCCATTTAT 


9420 


ATTTTTCATT 


TCAGTCCTCC 


ATTACGAACA 


TTCTAATATT 


ACTATACAAT 


ATTTAATTTT 


9480 


TTTTAACAAG 


AGAATTTAGT 


AAATTATTTA 


AGATCCACAA 


AT T C AC AAAA 


TTAATTTTAC 


9540 


AAATATTCTT 


CCCCTTCAAA 


AAAGTTTAAA 


TTGCATTTCA 


CACCTTTATT 


TTTAAGAATG 


9600 


TTTCCAACTT 


CACGACAAAT 


AAATTCATAT 


GAGAAAAAAC 


TGCCATAAAA 


TTGTAGATTA 


9660 


ACTTTTTCAG 


TAAAATGTGT 


AGGATTTATA 


AAAACATATA 


ATAGCCTGTC 


AATGTAACAT 


9720 


TTTAACATAG 


AGTTAATTTT 


TTCTTTAAAG 


ATAACATTTC 


TTATCAACTC 


ATCAGGAGGT 


9780 


AAATGAAAGG 


CAAACACCAT 


TTCACAAATA 


TCATAAAAAG 


AAATAAATTT 


GTATACTTGT 


9840 


ATCAAACAAT 


TATTATCAAA 


AT ATT CT ATT 


TTACCTAAAT 


CAAAATTGAT 


TTTATAATCT 


9900 


TTCATAAAAA 


CCTCTGAGCA 


AAAATCTACT 


CAAAAATTAG 


ATG ATT AAAA 


CATCTAAAAA 


9960 


GCAAAAGGAC 


AAAAACATCT 


GTCCC TTTGT 


TTACTAAATT 


TCAGCTAATT 


TCTTCGACAT 


10020 


AAATAACACC 


TACAATATTA 


GCAATTTCTT 


CCATCAGTCG 


AAGATGTTCA 


AATCTACCTG 


10080 


ATAATTCCAG 


AGTAATAAAT 


GACGCTATTT 


ttttutccgg 


AACATCAAAG 


T ATT C AATTC 


10140 
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TGTCAGAATT AACATCTCCA AACGCTGTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 10200 

CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACC ACT AT A TACAATTCAA 102 60 

T CATTTT AG A ACGATTTTGC ACATATTTTT TTACTGGTTG GAACATGGAT ATCACACCCC 10320 

AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 10357 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60 

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120 

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180 

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTCG ACTGCAAAAT 240 

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAACGA TCCGTCTTTA 300 

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360 

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420 

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGACAAAC GCTCATCACA 4 80 

ACAGATCGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG GAATTGCTGA AATTATGACA 540 ' 

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATT AAGC ACT GTGAGATTCG TGTTGGTTTT 600 ' 

GGTCCAGATG AAGAAAT CGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660 

GATTTTGCCT ACAC TGTTG A TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GA CTTTCTC A 720 

GCCGCTCGTG CTGAATTGCA TTTCCAAGGT CGTAATGTCC ACCCTGGTAC TGCCAAAGGG 780 

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840 

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTAC CAT C TAATGGATGT GACAGGTAGT 900 

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTCAAGCG 960 

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020 

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080 

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140 
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CCAATCCGGG GTGGAACAGA CGGCTCTAAG ATTTCCTTTA TGGGAATCCC AACTCCGAAT 1200 

ATCTTTGCAG GTGGCGAAAA TATGCACGGA CGTTTTGAAT ACGTTAGCCT TCAGACTATG 1260 

GAACGTGCAG TTGATACCAT CATTGGCATC GTAGCTTATA AAGGCTAAAA AGACGAGGTA 13 20 

GCTCAGCTAC TTCGCCTTTC TTTTTATTCT ACTGGTTTTT C TTGATTT CC AGTAGTTGTA 1380 

GAAGATTCTG TTGTTTCATT TTCTCAAGTT GATTCAGCAG GTTT AGAATC TCTTGTATTG 14 40 

CTTGGTTTGT TTTCCTCGCT AGCAGTTTCA ATGTTAGATT CTGCAGTTGC GTTTGGTTGG 1S00 

TTCTCAGCAC TGGTGTTATC ACCATTTGCT TCAGCATTTC TTGCTGGACT TGTTTCTTCA 1560 

CTTGCG CT AG CTTTTGACTG GATTTGATGA TTCAAAACTA GAATAGCTTT TGTCGATTCA 1620 

AGTAAAGCTG TTTTGTCTTT ACTCTT AG C A GAAACTTGAT CTAATAATGC ATCCACCTTA 1680 

TCAAAGTCCG CATCAGATCC ATTATTACTT TCTAAATAAG AGTGAAGCGA CATGAGAATA 1740 

TCGTAGAGTT TTTGATAGAG TACAAGTGTC TGAGGATCTT GCTCAGCATT TTCCTTTTCT 1800 

TGTTGAAGGG CGCTAGCGAT ACGAGTCAAG ACATCTTTTA CCTGACTGTT TACTTCATCC 18 60 

AAGTCTGCAT CAGCCTTGTT TGTGGCAGCT TTTAGATTTT CTACTTCTTC TGCCAAGGAT 1920 

TGTCTGATTC CTTCTTCATG GATTTGTTCC AAGAGTTGAT TTGCCTTGCT CAAAAGACTT 19 80 

TCTACTTCTT CCTTGCTATC TGTCG CAG AT TATTGGTTGC TATCTACCAT GTACTCCTAA 2040 

AACAGGAGAG TTATAATCCA AGATTACAAG GC CTTACAGA AATAAGAAAT CCAGATAAGA 2100 

CAATGTTCGT CCAAGACGCT ATTCGCTTCG CACAGCAGCA CGGATTCAAT ATGCTTTAAT 2160 

TTTAAAGTTT AGGTGTCAAG ACCTCTTTTT AGTGTGCCCA AAATTTACAG AAGTAATCAA 2220 

TCAACTAACT TTTATTTTTT TCAAACTTTC AGTAAACTGA CCTAAAGCTA ACTCAATCTG 2280 

TCTTTGTAGA TGCTTCTGCT ATCAGCTAGA AGTTGATCTA CTTTTGCCAA GACTGCCTTC 234 0 

TCATCAAAAG TTCCAGGTTG ATAGTTGGAT TGCAGGGATG GAATCTTGTT TTTCAAAGCC 2400 

GCTTCATATC CCTTAGTTTG AACCTTGATG TAGTGATTGT GGTCGCCATG AGGAATCACA 24 60 

AAACCTTCTG AATC TTCACT TATAATTCGA TTGGCATCAA AACCATG AC C ATCTTCTTCC 2520 

TCATGATGGA CATGTAGTGA CGGATTACTT AATACAGAAC TAGAAGAACT TCCTACCTCT 2580 

TCCGTGTTAG AGTGTGATGG GGGATTGTTA AGAGATGACT TAGGAATATA GTGATAGTGA 2640 

TCCCCATGTC TTACTATATA AGCATCACCT GTATCTCTGA CAATATCATT AGGGTTAAAG 2700 

ACATATGTGG CTGCTAATTC ACCTGCCGAC AAGTCACTCT CAGGAATGAA AT GAT AGTGA 27 60 

CCACCATCTG GTACTATAGT AGATTGAAAT AGAATATGAG CAAATTGATA AGGGGATTTT 2820 

AAAGTAATTT CTAACAATGA TTTAGAAACT ATGATGTGCT ATTCTAAATT CAACTCACTA 2 880 

TATATAACCA TCATCGGTAG TATAACGTCC CTGTAATTTT GCTACAGATA CTTCTGCACT 2940 
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AG CTCCTTT A 


TCGTCTTTAC 


CATGTTCTTG 


TTTTTGGCGA 


TTG ATTTC AT 


CTTTTGTT CG 


3000 


TACATTTTCT 


GCATGAGCTT 


GATCTTTAAG 


GTAAACATAA 


TACTTTCCAT 


CTACCTTAAT 


3060 


AATATATCCT 


CCCTTAACCT 


AACTGACG AT 


ATCTTGATCT 


TTCCGCTGAT 


AGTTGCGGGC 


3120 


TTTCATTAAT 


AGCTCTTCAC 


TAAAGAGCGC 


ATCAAAAGGA 


ACTTTACCAT 


TATAGTAGTG 


3180 


ATAATGATCG 


CCATGAGAAG 


TTACATAACC 


TTGATCTGTA 


ATCTTAATAA 


CAATTTGTTT 


3240 


TCCTTGAATT 


CCTTCTTTTT 


GACTAACCTA 


GTCTGGAGTC 


AAATTTTCAG 


TCTTCTTAGT 


3300 


GTCTTTATTA 


CTGTTTACAT 


ATGAAACACG 


ATTTTT ATCT 


GTATTGGCCT 


GTTAGCTATG 


3360 


TTGGTTCAGA 


GCATAAACAC 


ACAGACTTAA 


GGAAAGGATA 


A C AAC AG AT C 


CAGCTGCTAT 


3420 


ATATTTCTTT 


TTAAATTTCA 


TAATTACCTC 


ATTTCTATAA 


TTATTTATAT 


GATGTCTTCA 


3480 


TTATTAAATG 


ATTAAATAAA 


TTAATTAACC 


AATTAATTAA 


CTAGTAAATA 


TTCCACCTCT 


3540 


TTTTAAGTTG 


TATGTCAAGA 


AATTTTATAT 


ATTAATAATA 


AAATGAAATT 


CTCCCAAAGT 


3600 


C AG AGTTTT A 


TTTCTAACTT 


TTGAGAGAAC 


TTCATTTTTG 


ATTCAGACTT 


TTTCTACTGC 


3660 


TATTCCTTAC 


GCTATGAGAT 


CAGATAAATT 


CTTTTTTATC 


ACTTCTCCAC 


TTGGCAATCT 


3720 


TAATTCAATC 


GTTCCATCCA 


TATTGAATAT 


AAC ACT ATCT 


AAGCCTAATC 


CGTAACTAGC 


3780 


TGTAAATTTT 


TCTAATTTTT 


CTTGTACAGG 


ATCTACTGCT 


GGAGCTTCCT 


CTAATGCTGG 


3840 


ATCTAACATA 


GGGTCACTCC 


CCACATTCCC 


TTCTGGATTC 


AACATTCCAT 


TATCCGTTGA 


3900 


GTTTTCTGGT 


TTTACAGGTT 


TTTCGTTTGG 


TGCCTCTGGT 


AAAGAATCTG 


CTGGTTTATT 


3960 


TTCTGTTGGT 


TGGTTCTCAA 


CTGTTCCAGT 


AGATACTTTT 


CCATTTTCAG 


ATGGTTTATT 


4020 


TTCACCATTT 


CCTTGAGGTG 


CTTCTCCTGT 


AAAATCTGCC 


ATATTCTTTT 


T AATGACTTC 


4080 


TCCCGATGGT 


AAATATAATT 


CAATTGTTCC 


GTCCATATTA 


AACAAGACAT 


TTTCTAGCTT 


4140 


CATCCCATAA 


CTTTCAGCAA 


ATTTTGCTAC 


TTTTTCTTGT 


ACAGGATCCA 


CTGTAGGAAC 


4200 


TT CTT CT AAC 




TAfiTArTATT 


V_t_V_/\Lj 1 I X, 


GAAAGTTTTT 


CTTTTTCTAC 


42 60 


CTTCTCACTA 


GTCTTTGGTT 


CTTCTACCTT 


TTCATCAAGT 


TTTAAGTTTT 


CTTGTGCTTT 


4320 


ATTCCTTTTA 


AATTGTGGTA 


GAATACTTGG 


TTTATCACTT 


TGATTTTCTT 


TTTCCAAGAT 


4380 


AGGTACTTCC 


ACAATATAAG 


TCGATTGATT 


GTCCAAATAA 


GCATTTGCCA 


TGAAGGTTAC 


4440 


AGGAATTTTA 


TTTCCGGCCG 


TTCTGGTTGT 


TCCTTGGTTT 


AATTTCGGAA 


TCGGTAATTT 


4500 


G ATTTCAC CA 


ACTTTATAGT 


TATTTTCTAA 


ATAAGCATTT 


CCATGAAATT 


CATCAAACAC 


4560 


TCTGACTAAA 


GCATCAGTTC 


CTTTAGGCAC 


TGCAAATTGA 


GGGTTCACTC 


TTAAATAAGT 


4620 


ATCCCCTGCA 


TGGAAAGGAT 


AGAAAATCGT 


TTGACTGGCC 


ATTTTGTAAG 


CTAAAGAGGT 


4680 
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TGGAACTGTA AATGTACCAT CATAACTTAC TTCTGGATAA TCTTTTGAAG CCATAGTATA 4 740 

CTTAAATGTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA TATTCCCTAC 4 800 

TCCTTCTCCA AATACTTTAC CACATACTTT CTCCAATACT TTTCCATCTG GTGTTATTAA 4 860 

TTTTACTAGC ATATTGATAC CTAATTTTTT CTCCAATTCA GGCGGAAAAC TAAAAGAAAC 4 920 

GCGTTTTTGA CCATTGGCTA GAGTAAAGTT TTGATTATTA AACGTACTAT TTTTTAACAA 4 980 

ATTAACAACA TTCGTTAATT CTTCTCCAGT ATAAACTTTA TTCCCTTCTT TTTTAGCAAC 5040 

TCCTTCTTCG GGTTTAAACA GTTCATAGTT ACTGTGAGAA TG AC C AATT C CAACCGGTTT 5100 

ATGTTCATCA ATCGGATCTG CATGATGGTG ATCTCCATGC GGATAAATAA TCGCATTTTT 5160 

TTCTTTATTC ACGACAATAC TTTCACGTTT GACACCATAT TGTTTCATAA TGCCAGCAAT 5220 

TTTTTCTTCG ATTTTTTTAT CTAAATCTTT CATTTCTTTG GCATTACTTG GATAATCCTG S2 80 

TTCATGAGAT GACAAAGAAT CTAATCCATT ATGACTAGTT TTAACTTCCT CTAAATGTTT 534 0 

TTGCGCAsCT TAATTTGCTC TTCTGTCAAG TCCTTCTTGA AG AAATAAT G ATTGTGGTCT 5400 

CCGTGACTCA TGACAAAACC TGATTCATCT TCAGCGATAA TACGATTAGC ATCAAATCCG 54 60 

TATCCATCTT CTTCATGTTT CTCATGTGAA GTTCCTGGAT TGATTGGAAG AG ATGG AG AA 552 0 

GGTGTTGCTA GACTATTGTT TGGAAGAGTC GGTTGCCCAA TTTGATTTGA TTTTGGAATG 5580 

TAATGGAAAT GATCACCATG TCTTACAATA TAAGCTGTAG CCGTTTCTTC AACGATATCT 5640 

TTTGGATTAA AAATATAACC ATCAGATGCT GAAGAGAGCT CCTTAC TTGT CGTTAAAGAA 5700 

GAAGGATTGC TTGAAAGACT GCCTAGACTA GACACTACTT CATTAGGTTT TGCATTTGTA 5760 

GAAACTCTAG AAC CAGTTCC ACTGATAGGC ACCATTCTGG CAATCTTTTC TTCTAAGGCA 5820 

GAAAGCTTGC TGTAAGGAAT AAAGTGGTAA TGGTCG CC AT GCGGAATCGC AACTC CATTT 5880 

GGTGTACGAC TGATAATCTT AGCAGGGTCA AAGACCAGGC CATCTGATTC ACTGTAACGT 594 0 

TGGGCGCTAG GTCAATCATA GAGTTCCTTC AAAAGACTCT GGAGATTTTC AGATTTATTT 6000 

GCTGGCTTGC TAGTTGATCC TTTTGCTACA GATTGCGTGT TATTGTCACT AGCTGTTGAA 60 60 

GAATAGCTTA ACTGACTCGG TTGCATATTT TTTCCAGCCA GATGTGCTTT AGCTGCTGCT 6120 

AATTCACTAG CAGATAAATC GCTTTTGGGA ATGTAGTGAT AGTGACCTCC ATGAGGAACG 6180 

ATATAAGCAT TACCCGTATC TTCGATAATA TCAGCTGGAT TAAAGACATA ACCATCATTT 6240 

GTCGTATATC GTCCCTGAGA CCTTGCTACA GCAACATTAC AGTTAACCTT CTCATTATCT 6300 

TTGACATGTT CTTGTTTTTG ACGATTGATT TCATCTTTAG TTCGAACATT ATCAGCATGA 6360 

GCTGCATCTT TCAGGTAGAC ATAATATTTT CCATCGACCT TGATGATATA ACCACCCTTG 64 20 

ACTTCATTGA CAATATCAGC GTCTTTAAGT TGATAGTTTG GATCCTTCAT CAAGAGTTCT 64 80 
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TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATGATAGTG GTCACCGTGT 6S4 0 

GACGTTACAT AGCCCTGATC TGTAATTTTG ATTACAATTT GCT C AGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGGTCTGG TGTCAAGTTT TCA C TTT T CT GACTTGACTG GCTGCCATCC 6660 

ACATAAGACA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCACCTG CTATATATTT TTTACTAAAT 6780 

TTCAT AAATC CCTCATT TCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TTACTTTATT 6840 

AAATAGTTTT CTAAACCCGG GGGTACC 6867 

(2) INFORMATION FOR SEQ ID NO; 193: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

CGTTCTAAAA ATGCAGTACG TTTGATTGAG AAATCAGTTA AAGGTATGCT TCCACACAAT 60 

ACACTTGGAC GCGCTCAAGG TATGAAGTTG AAAGTATTTG TTGGAGCTGA GCACACTCAC 12 0 

GCTGCACAAC AACCAGAAGT TCTTGACATT TCAGGACT T A TCTAAGGAAA GGAACAATAA 180 

AGTATGTCAC AAGCACAATA TGCAGGTACT GGACGTCGTA AAAACGCTGT TGCACGCGTT 240 

CGCCTTGTTC CAGGAACTGG TAAAATCACT GTTAACAAAA AAGATGTTGA AGAGTACATC 300 

CCACACGCTG ACCTTCGTCT TGTCATCAAC CAACCATTCG CAGTTACTTC AACTGTACCT 360 

TCATACGACG TTTTCGTTAA CGTTATAGGT GGTGGATACG CTGGTCAATC AGGAGCTATC 420 

CGTCACGGTA TCGCTCGTGC CCTTCTTCAA GTAGACCCAG ACTTCCGCGA TTCATTGAAA 4 80 

CGCGCAGGAC TTCTTACACG TGACTCACGT AAAGTTGAAC GTAACAAACC AGGTCTTAAG 54 0 

AAAGCTCGTA AAGCATCACA ATTTAGTAAA CGTTAATTCG AAAGAATTAC TATACTTATA 600 

CAGAGCACCT TTCGGGGTGT TCTTTTTTTA TACTTTCTTA CTAAATTGGT GCAATTGACA 660 

CAGTTGTTGC GACTTTAGTC GCTTACAAAT GTGGCTGCAA CCTGACATGG TCAGTTGCCT 720 

CAAAACGTTA AT CAATACGA TTATATCAAC GTTTCAAAGC ACTCAAGGGT TTACCCTATG 780 

GGTGCTTTTT TCTATACTTT CTAAAAAAGT TTACCCTAAA ATTTGCCCTA AAATTACCCT 840 

ACTTATTTTT AAGATGTTGG TAGGCAACTT GTCCAGCAGA TAATGGAACT ATGTTTGAAG 900 

TATTAACATA AGTCTTAGTT GTAACGGTAT CGCTATGAGT TAATGCTTCA GAAATGGCTT 960 



WO 98/18931 



PCT/US97/19588 



1162 

CTAAGCTCAT TCCTGCTTT T TTAGCAAGTG TCGCTCCTG 
(2) INFORMATION FOR SEQ ID NO: 194: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D> TOPOLOGY: linear 



999 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTC A TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAAGTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 12C 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCTAGATCG TTT CC AT ATT 180 

AT CC AACATC TCAGCCGTGC CATGAGTCGT TTTCGTG TTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CGTT A CTGGA AACTCATCCA ACAGGATAGT 3 00 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 3 60 

GAAATTCTTG ACAAGATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 4 80 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTGTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCGTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAAGCTTAGC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 7 80 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGGATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT T C ATTTT C AC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCG AGTGT A TTTTTGAAAT 1080 

GACCT AGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAG G AC AC AG ATAT CCGTC A 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCCTTTAT CCATC C AAAA G AC ACG ACTG TCTGAACCAT 1320 

GCTCTTGATA AAAGATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380 
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TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 14 40 

TAGTAAAGAA GATATAATCT CCTCGTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500 

CCCAAATCGC CTGAGCTGGA GCACCTGCTC TTAGTTTCCC ACT AT C AG C A ACAAAATCTT 1560 

CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTC 1620 

AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTC 1680 

AATAATCCCC GCAGACTCAA GTTTACGAAG AG C ATTG AC A ATCACAGAGC GAGTGATTCC 17 4 0 

GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA ATTCCCCTAA 1800 

AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTG AC AG C 1860 

AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920 

AACAACGGTA CTGGCAATCT CAACAAGAAC CAAGTCCTCA TCTTCGAATT TTTTATCATT 19 80 

ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 204 0 

CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CAT GCTCAAC 2100 

AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACGTAGT CAT CTGGG AA 2160 

AATCTTAGTT TGGAAGAATT GCTCACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2 22 0 

CCAAGCAGAC GTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280 

TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2 315 
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATGTGTT ACCTTCTTCA 60 

TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120 

AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180 

ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 24 0 

TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC CTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300 

AATTAATTTA TCCGCCACAG TGTAG CTGT A GATTCCAGCT TCCATGTCGA CCACACTCGG 3 60 

TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 42 0 
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AGGAACCAAG AC TTCGTCCG AATCATTCAT GGTCAATCGA ATTAAATCGG ATGTCACCTT 4 80 

GCTTGGGGCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 54 0 

TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600 

TGGTAAACTG TTCAGACTCA CAGAACTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 6 60 

ATGATTTTCA C C AG AAAT AT AGTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720 

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 780 

CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 84 0 

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCGTT CCCTCAACAC GAATATCTTT 9 00 

CATGGTCGCA TAAGGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 9 60 

TAAAATCGTG AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTTTC 1020 

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080 

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140 

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 12 00 

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 12 60 

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 13 20 

CTCTTTCAAT TCTTCGAGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 13 80 

TCCTTATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT GCTAGAGATT TCAATTCCTT 14 4 0 

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAACCTTCTC 1500 

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560 

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620 

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCAC CAC CTCGTGTCAC 1680 

AACAATATCA CCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAAC ACG AAA 1740 

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800 

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGAC AC GAG CGCCTGCAGA 1860 

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGG G TT T GAA TATCCACCAA 1920 

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980 

CTTAGCCAAA CTCGAAGCTT GTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 204 0 

GATTTTATTG GCCAAGCCCA T AG ACAGGT C AGATTCGTGA ATAAAGACAG G C ACTCCTG A 2100 

CACACGCGCA GCGATAACAG GCGGTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160 

TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTG AAGAC 2220 
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GTCCAGCATA TTTTGCCAAG 


r AGAAATAGCG 


! ACGCAATTTT 


CCACTCGCAA 


TAG AATGG AA 


2280 


GGTGACATCC 


: AAACCTGACT 


TAAGGATTTC 


' TTGCTCTTCC 


AT AC C AC ACT 


TGTCCCCGAT 


2340 


ATAGTGGACT 


TCCCAACCAT 


CTTCGATGAA 


■ CTTGGGCATT 


AACAAAAGAT 


TGAGGGTAAC 


2400 


GTGTCCAACC 


GTCCCCCCAC 


CTGTAAAGAC 


AATTTTTTTC 


AT ATT ATT CT 


TTTAACTCCG 


2460 


CTACTGTGTC 


GATAAAGAGG 


TCGCCACGTA 


CTTCAAAGTT 


AGCATACATA 


TCCCAGCTAG 


2520 


CATTGGCAGG 


ACTAAGAAGA 


ACCACATCTC 


CTTGAGTCGC 


AAGCTCATAG 


GCCTTGCGGG 


2580 


TCGCATCTGC 


AATATCTGTC 


GCCTCCACAT 


AAGCGACACC 


AGCCTTGTCT 


GCTGCCCGTT 


2640 


TGACACGTTC 


TGCAGATTGA 


CCCAGGATGA 


C C ATCTTCTT 


G AGT CC ACT A 


ATGTCTGG C A 


2700 


CCAATTCGTC 


AAACTCATTG 


CCACGGTCCA 


AACCACCTGC 


AATCAAGACG 


ACCTTGCTGT 


2760 


TGTCAAATCC 


TGACAAGGCT 


TTTTGAGTAG 


CCAAGATATT 


AGTTGATTTA 


CTGTCGTTAT 


2820 


AGAATTTAAC 


ACCCTTGATG 


TCATCCACAA 


ACTGGAGACG 


GTGTTTGACA 


CCACCGAAGG 


2880 


CTGAAAGAGT 


TTCCTTGATG 


GTTTGATTGT 


CCACATCACG 


AAGCTTGGCT 


ACAGCAATAG 


2940 


TCGCAAGGGC 


ATTTTCCACA 


TTGTGGCTAC 


CTGGAACACC 


GATTTCATTC 


GCTGCCATGA 


3000 


CTACTTCACC 


ACGGAAGTAG 


AGTTGACCAT 


CTTCCAGATA 


AGCTCCATCA 


ACCTTTTCAA 


3060 


GTGTTGAAAA 


TGGTACAACA 


GTGGCTTCTG 


TCTTGGAAGT 


CAAGTCTTTT 


GCCAAGTCTT 


3120 


GATTAAAGTT 


CAAGACAAGG 


AAATCAGCTG 


CTGTCATCTT 


GTTCTGGATA 


TTCCACTTGG 


3180 


CTGCTACATA 


TTCCGAAAAT 


GACCCATGGT 


AGTCGATATG 


AGTTGGCATG 


AGGTTGGTAA 


3240 


TAACCGCAAT 


CTCTGGATGG 


AATTCTTGAA 


CACCCATGAG 


TTGGAAAGAA 


GAAAGTTCCA 


3300 


TAACAAGCGT 


GTCCTTATCT 


GATGCTATTT 


GAGCAACCTG 


ACTAGCTGGA 


TAGCCGATAT 


3360 


TCCCTGATAA 


AAGACCATGT 


TGGCCAGCAG 


CAGTCAAAAC 


TTCCCCAATC 


AT AG TCGTTG 


3420 


TGGTTGTCTT 


ACCGTTCGAT 


CCTGTGATAC 


CAATAATCGG 


TGCTTCTGAA 


ATCAAATAAG 


3480 


CCAATTCCAC 


CTCAGTCAAG 


ACTGGAATTC 


CCTTGGCCAA 






J 34 U 


TGTTGTAGGG 


GATACCTGGA 


TTTTTCACCA 


TAAGGGCAAA 


CTCTTCATCC 


AAGAGTTCCA 


3600 


AAGGATGGCC 


ACCTGTAATG 


ACCTTGATCC 


CTTCTTCCAG 


CAAACTTTGG 


GCAGCTGGAT 


3660 


TGTCCTCGAA 


AGGTTTCCCA 


TCATTTACTG 


TCACAATGGC 


ACCTAGCTTG 


TCCAACAAAC 


3720 


GAGCTGCAGA 


TTCACCAGAC 


TTGGCCAAAC 


CTAAAACAAG 


GACTTTCTTA 


TTTTTAAATT 


3780 


GATCTATTAC 


TTTCATGTCT 


CGAACTCCAT 


TTCTACTCCT 


ACTATTTTAC 


CATTTTTATG 


3840 


GAAATAAAAA 


AGCCACAAAG 


TGTGTTTGTG 


ACTCTTTCTT 


CTAACTGAAT 


CTTACCATAT 


3900 


CATCTATGTG 


ATAAATCGGT 


AACTCGAATG 


ACCTGATCCA 


CTTGCTCCCA 


AATCAGAGGA 


3960 
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TTATCGGTCG 


1 CAATAATAAT 


GGTCCGATTC 


GGATTTTTTA 


AAG ATT CT AG 


GATGGAAAGT 


4020 


AATTCCTCAG 


AGTTTrrGGG 


GTCTAAGGAA 


GCGGTTGGTT 


CATCTGCGAG 


GATCAAAGGT 


4060 


GGATCCTTTA 


AAATTATCTT 


CGCTAGTGCA 


ACACGTTGTG 


CTTCTCCTCC 


TGATAACTCA 


4140 


AATATAGGTT 


GCTTCAAATC 


CAAATAAGAG 


AGGTTTACAC 


GGTTTAGAGC 


TTGTTTCATC 


4200 


AAAGAGATTT 


TCTCTTTTTC 


CTTCAACTTT 


TTACCAACTA 


AACCCAGATT 


GAGATTCTCT 


4260 


TTGACGGTTT 


GGCTTTCAAT 


TAAGCCAAAA 


TCTTGAAATA 


AGTATCCTAA 


GTAATCTCTA 


4320 


AAGAAAACAG 


AAGGCTTGAT 


GTCCTTAAGA 


GAAGTGCCAT 


CATAGATGAT 


TTGCCCTTTG 


4380 


TCATATGGCT 


CCAATCGTCC 


AATCATATTC 


AAGAGTGTTG 


TCTTACCACA 


GCCACTTGTA 


4440 


CCGATTAAGG 


CATAAATTTT 


CCCACCTTCA 


AAATGAAGAT 


TCATATCTGA 


AAATAGCTGA 


4500 


CGGCT TCCAA 


ATTTTTTAGA 


TATATTCTTT 


AGTTCAATCA 


TCCTATTTTC 


CTTTCATAAT 


4560 


TGTCATAGAA 


ACACGAGATT 


CTTTCTGCGC 


TTGACGGTAA 


AGCGTCAAAA 


CTGCACTAGC 


4620 


TAGAAAGACC 


AATAAAGTGA 


GCAAGCCAAT 


CACCAAGTCT 


CGACTCCTTA 


AAATAAAGAG 


4680 


ACTAGCACCA 


AATACAAAAC 


TAGCAAATTG 


GCTAACCATA 


TACTGAGCAT 


GTGTTTCAAA 


4740 


AAATCGTAAA 


CCTGAAATTC 


GTTTAATCAA 


GATATCTCGG 


CGGAATTGCT 


CGAAATATAG 


4300 


AAG ATTGAC A 


GAATAAAAGA 


GTAACAAGGA 


ACTGGCTATT 


CCAACAATAG 


CTCCTAAGAT 


4860 


TAAAGTTGCT 


GTTTCAGTTT 


GAACTTCATT 


ATAACGAGTT 


AGATAAACAC 


TTCTTCCTTC 


4920 


TTTAAGATAG 


GATACTTGCT 


CATAAATTCC 


AGCTTTCTTC 


AAGAGTTCTA 


GCCCACTCTC 


4980 


ATATCCTTTG 


ATAAAGAGTT 


GTTTTCCAGC 


ATTGATAGAC 


CAACTAGATA 


AGGATATAAA 


5040 


ACTATCACCT 


GTAGAAGTCG 


GCGTGAATAC 


CACTAAAATC 


GGATCAGTCA 


AATACTGACT 


S100 


AGATACGGGA 


TTCTCACCGT 


TATTATAAAC 


AAACCGCTTT 


TCTCCCATTG 


AAAGATAACT 


5160 


AACGTGCGCT 


TTCATCTCAT 


AATCCAAAGG 


AGCACTTGCC 


TCCTCACCAG 


ATTTT C CAT A 


5220 


ATAACTCAAT 


CTTTCTTCAA 


AAACTTTCTT 


AAGTTCTGCT 


TCTCGAGAGC 


GCAAATGTTC 


5280 


TGGGAGCAAG 


AGGATAAACT 


CACCTTTTTC 


GAGATGGGCT 


AACTTCTGTT 


TGGTCTCAGC 


5340 


ATCTACCACG 


ACCTTTTCCT 


TGTCCAAATA 


ACTGGGACTA 


ACATAGAGCG 


TATTAGCATC 


5400 


TGAAC TAT AG 


GTATCCAGTG 


TCTCTCCCTG 


TTCATTTTTT 


CCTTGTGGAT 


TGGCAAAATG 


5460 


GAGCAGATTA 


TCCTTTACAT 


AAAGAGCTTG 


TTCTTCTTCG 


ATTGCTTCCT 


TGGCAAAGGC 


5520 


ATACCACTTG 


CTCTGATTTT 


CTGTATCTTT 


TCCTCTATCA 


CCTAACCCAA 


AGGAAATCTG 


5580 


GTAATAGTCT 


GCTCTGT CCT 


GCCATGCTTG 


TTTTGAAATT 


TCAAGTTCTT 


TCAATCGTTG 


5640 


GTAAGACGTC 


AAACCTGTCT 


TAACAGCGTA 


GCCTACTGTA 


AAAACAGCTA 


CTAACTGACA 


5700 


CAATAGGGTT 


AAAGC CAT C A 


AG CGTTT AAG 


GGGTAATCTT 


CCCTTAATAA 


CGGGAACTAA 


5760 
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TGCTTTGTAA CTCAAACTCA TTAGGTAAAG GAGCATTAGT AAAATTGAAA TCGCCAATAA 5820 

AAACAACAGA TAGAAACTAA TCCCAAAACC ATAGGTGGCT AACAAGATAG GATAAAACAA 5880 

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940 

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000 

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCACTA AAGCAAAGAA 6060 

AGATAAGCCA AATATTGCTA AACTAATTAA AATAACGGGA TTTAGTAATA TTCGAAAAGC 6120 

AAGAAAATAG GGCGGTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT CTCCTAATTT 6180 

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAACTAT TTAGCGGAnT 6240 

AtACGTTCAC GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300 

TAAGTTGCAT AAGTAAAGTG AGTCGTCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 63 60 

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 64 2 0 

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 64 80 

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6 54 0 

AATAATATGA ATAGTTTCTT CATAGTAGAC TC CTTGT AAA CAAAATTCCC CCTGTAATTT 6600 

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTCA TAATCACAGT AAAATGCTAC 6660 

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 669 3 
(2) INFORMATION FOR SEQ ID NO: 19 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 
{C> STRAND EDNESS : double 
{ D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60 

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC G ACGATTTG A TATAAAATGA 120 

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180 

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAACT TGTTGGAGCC 240 

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300 

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 3 60 

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGT CTCAAAT 4 20 
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CATCCTTACA 


TTCTAGTAGA 


TGATGTTTTG 


ACAGCCTTTC 


AATCCTTAGC 


ATCCTACTAT 


460 


CTTGAAAAAA 


CGACTGTTGA 


TGTCTTTGCT 


GTTACAGGTT 


CAAATGGCAA 


GACAACGACT 


540 


AAGGATATGT 


TGGCGCATTT 


ACTGTCAACA 


AGATACAAGA 


CCTACAAAAC 


ACAAGGCAAT 


600 


TACAATAATG 


AGATTGGCCT 


TCCTTACACA 


GTTCTTCATA 


TGCCTGAAGG 


AACAGAAAAG 


660 


TTGGTTTTGG 


AGATGGGACA 


GGATCACTTG 


GGCGATATTC 


ATCTCTTGTC 


TGAATTGGCT 


720 


CGTCCAAAAA 


CAGCCATCGT 


GACCTTGGTT 


GGAGAAGCCC 


ATTTGGCCTT 


TTTCAAAGAC 


780 


CGTTCAGAGA 


TTGCTAAGGG 


AAAAATGCAA 


ATTGCAGACG 


GAATGGCTTC 


AGGTTCCTTG 


840 


CTTTTAGCGC 


CGGCTGACCC 


TATCGTAGAG 


GACTATTTGC 


CAACTGATAA 


AAAGGTGGTT 


900 


CGTTTTGGGC 


AAGGGGCAGA 


GCTGGAAATT 


ACTGACTTGG 


TTGAGCGCAA 


AGATAGTCTG 


960 


AC CTTCAAGG 


CCAATTTCTT 


AGAGCAAGCC 


CTTGATTTGC 


CAGTAACTGG 


CAAGTACAAT 


1020 


GCGACAAATG 


CTATGATTGC 


ATCCTATGTT 


GCCTTCCAAG 


AAGGAGTTTC 


AGAGGAGCAA 


1080 


ATTCGTTTGG 


CCTTCCAAGA 


TCTTGAATTG 


ACGCGTAACC 


GTACCGAGTG 


GAAGAAAGCA 


1140 


GCCAATGGAG 


CAGATATCCT 


GTCAGATGTT 


TACAATGCCA 


ATCCAACTGC 


TATGAAACTG 


1200 


ATTTTAGAGA 


CTTTCTCTGC 


CATTCCAGCC 


AATGAAGGTG 


GCAAGAAAAT 


TGCAGTGTTG 


1260 


GCGGATATGA 


AGGAGCTTGG 


TGACCAGTCT 


GTTCAACTTC 


ATAATCAGAT 


GATTTTGAGC 


1320 


CTTTCTCCAG 


ATGTGCTTGA 


TACCGTGATT 


TTCTATGGAG 


AAAATATTGC 


TGAATTAGCC 


1380 


CAATTGGCCA 


GTCAAATGTT 


CCCAATCGGC 


CACGTTTACT 


ACTTCAAGAA 


AACAGAAGAC 


1440 


CAGGATCAAT 


TTGAAGACCT 


AGTCAAGCAG 


GTCAAGGAAA 


GCCTTGGAGC 


CCATGACCAA 


1500 


ATCCTGCTCA 


AAGGCTCTAA 


CTCTATGAAT 


CTAGCCAAGT 


TGGTAGAAAG 


TTTAGAAAAT 


1560 


GAAGACAAGT 


GATTTTGTCA 


AGTATTTGCA 


AAGAATGATT 


GCCATTACAG 


ATACTGGCTT 


1620 


AACCTTTACA 


AAAGATCCGT 


TTGACCGTGA 


GCGCTACGAA 


GACTTGCGAA 


GTCTGTTATC 


1680 


TGAAATGTTG 


AATCAAGCAT 


CAGACCTTGA 


TTCCGAAGAA 


GTGGCAGAAG 


TCTTGAAGCC 


1740 


AACTTCTGCT 


TATGCGACTC 


CGTTAATGGA 


CGTCCGTGCT 


TGGATTGTTG 


AGGATGAGAA 


1800 


GATTTGTCTG 


GTTAGGGGAC 


AAGGAGAGGA 


TAGTTGGGCT 


TTGCCGG 




1847 


(2) INFORMATION FOR SEQ ID NO: 197: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
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CAAGCGAAAA 


CATTTTTTAT 


TCCAAATAAA 


CAGAGCATTT 


TAGGAGAACA 


AGAGATTTTG 


60 


AATGCCAAGT 


CGATCTTGGC 


CTTGCTAGAC 


GGTTTGGAGT 


CACATAGCTA 


TGATGTAGTC 


120 


TATCTCCGTC 


AGCCTCTTAA 


TCGTCTCGAA 


TATATCGAGT 


GTGCGATAGT 


GGGGCAATCA 


180 


C AATTT CTCT 


TTAAGGTCAG 


TTATGCTGAT 


GGTCAAAAGG 


CTTACCCTGT 


CGATCTTCCT 


240 


GACCTACTAA 


CAAAGACAGA 


CTGGCAGATT 


ATCAACTCAT 


TTTTAGATGC 


TTTGCTTGCT 


300 


TATACAGGGA 


CTGATATTGA 


AGGGCTACAT 


GGTTTTGATT 


TTGAAGCTTA 


TTTCCAAGCA 


360 


AGTATTCAAG 


CCTATCTAGC 


AGACCCTGTA 


GCTC GTTTTA 


CGATTTGCCA 


AGGAATTTTT 


420 


AATCCTATTT 


TCTTTAGTCG 


TGAGAACTTG 


AAAAGCTTTT 


TAGAGGCACA 


TGGCTTGCCT 


480 


CAGTTTGAAG 


CGCGTGTGCG 


TGCGGTTCAA 


GAGACAGATG 


CCTACTTTGC 


GAGAGTTTCC 


540 


TTCTATCAGG 


ATGGAGAAGG 


AAAAGTGCAT 


GGCGTTTACC 


ATCT AG CTC A 


AGGAGTCAAG 


600 


AC AGTT TT AC 


CGAGAGAACC 


GTTTGTTCCT 


GCAGCCTATA 


TTGAGCAATT 


GGTGGATAAG 


660 


GAAGTCCAGT 


GGGAGATTGA 


CTTGGTTCAA 


ATCACAGGAC 


ATGGCTCTAA 


ACCAGAAGAC 


720 


TATGAAGCCA 


TTGCTCGCTT 


GGACTATGCA 


AAATTCTTAG 


AGCTATTACC 


CCCATCTTTT 


780 


TACCACCAAC 


TAGACGCCAA 


TCAAATAGAA 


GTGCAACCCA 


TATTAGACAA 


AGATTTTAAA 


840 


ACATTAGCAC 


AAGAAAAGTA 


AAGCAGAAGC 


AGGTCAATCG 


ACTTGCTTTT 


TTG AC AT AG A 


900 


AAAAATCCTG 


CCAAGa TGAC 


AGGATTGCTA 


CTCAATGAAA 


AT CAAAG AGC 


AAACTAGGAA 


960 


GCTAGCCGCA 


GCTGTACTTG 


AGTACGGTAA 


GGCGAAGCTG 


ACGTGGTTTG 


AATTTGATTT 


1020 


TTGAAGAGTA 


TGAAGTTTAA 


AGAAAAGCCA 


AGATACGAAG 


AT 




1062 


(2) INFORMATION FOR SEQ ID NO : 198: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6846 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 
TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCCTCAGT cTATCTACAA CCTCAAAAAC 
ATGTTTTgAa JcGCtcGTCAG t TCT ATCT AC AACCTCAAAA ACATGTTTTG AcaGCcTcGT 
CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 
CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 
TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCAGT GCTTTgagcG CTTCGTCAGT 
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TCTATCTACA ACCTCAAAAC AGTGTGTTGC GCAGCCTTTA ATCAGCCGCC TAGTCCGCTC 3 60 

TATGGTATTC ATTAAGTCAA CATCTCTTGT TTAAGAGCAC CAAATCAGGA AATCTTCTCG 4 20 

ATTCCCTGAT TTTTTCTATT T ACGTTTTCG TGTTGAGCTA CGTTCTGTCA AACCATGAGG 4 80 

TAAGAGAACT TCAC GTTCTT CCAACTCTTC CTTATGCATA ATCTTGGTCA ACATACGCAT 540 

ACTAATGGCA CCAAGGTCAT AAAGAGGTTG GGCAATCGTT GTCAAGTTTG GACGGGTAAA 600 

GCGTGAGATT TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA C AG AAACAC C 660 

CTTATCAGCC AAACCGTTCA AGACTCCTGC TGCCAACTCA TCACCTGTCA CAACTGCTGC 720 

AGTTGCATTT GATGAAATCA AACGCTCTGC TAAGGCGTAA C CAT CAT CAT AGCTATATTT 780 

AGATTCAAAT ACCAAACCCT CACTATAAGT GATTCCTGCT TTTTTCAAGG TTTCCTTGTA 840 

GCCAACTAAA CGAACCTTAC CATTGATGTC ATCCACTAGC GGACCGCTAA CGAAAGCAAT 900 

ACGCTCATTT TCTTTAGCAA GGTAACTCAC TGCATCAATT GTTGCTTGCT TATAGTCAAT 960 

ATTGACACTT GGCAACTGGT GCTCAACATC GACAGTTCCT GCGAGAACAA TCGGAGTACG 1020 

TGAACGCGAA AATTCTG AG C GAATTTTATC TGTCAAGTGA TACCCCATAT AG AT AATGC C 1080 

ATCTACCTGC TTTGAAAAGA GGGTATTGAC AACAGAAACT TCTTTCTCGT TATCTTCATC 1140 

GCTATTAGCT AGGACAATAT TGTACTTGTA CAT TTCTGC A ATATCATCAA TCCCCTTAGC 12 00 

CAAACTCGAA AAATAACCAT TGGTAATATT TGGAATCACG ACACCGACAG TGGTTGTCTT 1260 

TTTACTTGCA AGACCACGCG CAACTGCATT TGGACGATAA TCCAAACGAT CAATTACCTC 1320 

TAG C ACTTTT TTACGGGTAT TCTCTTTTAC ATTTTTATTG CCATTGACCA CACGGCTGAC 1380 

CGTCGCCATG GAAACACCTG CTTCACGAGC G ACAT CAT AA ATGGTTACTG TATCATCTGC 144 0 

ATTCATTCCT TTTCCTGTCC TTTCTATCTC ACACATTCTT TTACAAGTAG AGGTACTGAT 1500 

TGAAGCTCTA TATCTACTTA CAAAAGTGAA GATGTGAAAA TTTCGTTTTC ATATTTCTAC IS 60 

TTATTCCATT CTATCACTAA TT G T AAACAC TTTCAAGTGT TTTTTGAAGA TTGATTGAAA 1620 

AAATTTCATA GAAAACCT AG GTTTAGCTCC TTGCTACCAC CTTAGACTAA ACAAAAAGGA 1680 

GGAAACTAAG CCCTCCTAAA GTTATAGTAA AATGAAATAA GAACAGGATA AATCGATCAG 174 0 

GACAGTCAAA TCGATTTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA 1800 

TCTACTATAG GTATTGTTCC ATTCACTACC GTCAATTTTA GCACATAGTC TTCATGAAAA I860 

TATTATATCA TCATAACCAA CCAGATTCTT TCGCGATATT AGCTGCCTCT GTTCGATTAC 1920 

CTGCATCTAG TTTCGAAAGA ATATTGGTGA CATAGTTTCG GACTGTTCCG TTGGATAGAT 1980 

AAAGTTTGTC TGCAATTTCT TGGTTAGAGA AGCCCTGAGC AATTCCCTTT AAAACTGCGA 2040 

TTTCTTGCTC CGTTAATGGA TTGGGATGCA TCATCACCAC TTCCATCAAT TCAGGCGAAT 2100 
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ACTCCTTGCG 


TCCTTCGAGG 


ACGGTGTGCA 


AGGTTTGCAT 


GAGGTCTGCA 


ATGTTTCTTT 


2160 


CTTTTAATAC 


ATAAGCATCT 


ACTCCAGCCT 


TGACCGCACG 


TTCAAAATAC 


CCAGGACGCT 


2220 


TGAAGGTCGT 


CACCACAACC 


ACCTTTGTTT 


CAAGCTTTTC 


TGCTCGT AT C 


CACTCCAAGA 


2280 


CTTCAAGACC 


TGTCTTAACA 


GGCATTTCTA 


CG TCAAGG AT 


GGCGATATCT 


ACAGACTC CT 


2340 


TTTCTAATAG 


TTGGATTGCT 


TCTTGCCCAT 


TCTTGGCTTG 


AAAGACAGAC 


TCTACATCCG 


2400 


GTTGAAGCAT 


GAGCAACTGG 


C AC ATGGCAT 


CTCGCAACAT 


ACTTTGATCT 


TCTGCGACTA 


2460 


ATACTTTCAT 


CTACTTTCTC 


TCCTTATAAA 


GTAGTCGAAC 


CTGCACTTCA 


GTTGGATGTT 


2520 


TCTGACTGAT 


TACACTTACT 


TCTCCTGAAA 


ATGGAAAAAC 


ACGATTTCGG 


ACTGTATGGA 


2580 


GCTCATCCCC 


GCTTATAGAG 


GCAAAGCCAC 


AGCCATCATC 


TCTCACTGTT 


AG AATGAGT T 


2640 


CTTTCTCTGT 


CCGTTCTAAT 


TTCAAGTAGA 


CTTTAGACGC 


TTTAGCATGT 


TTGATGATAT 


2700 


TGGTCACTAA 


TTCAAGCAAA 


ATCATGGAAG 


CCGTTG ACTC 


CAATTCCTGA 


GTTAAGCTAG 


2760 


ACTTGTCCAA 


GTGATTCTCA 


ACTTGAACCT 


CAATTCCAGC 


AATTTCTAAC 


ATCTTTTTCA 


2820 


CAGTCTCTAG 


TTCGGATGTC 


AAAGTTCTAG 


ACTTAAGATT 


TTCCACAATG 


GTTCGCACTT 


2B80 


CATTCATGGA 


tCCTTGCTGA 


TCTGGTGAAT 


TTCTTTTAAT 


TCCTTPTCCA 


CCTGTGGATA 


2940 


AGCCTCCATC 


TGAAATAACT 


GCAAGGCTAA 


ATCTGTCTTG 


ACACTCAGCA 


TAGCAAAGGT 


3000 


ATGTCCCAGA 


CTATCATGCA 


AATCCTGACC 


GATACGACTA 


CGTTCATTTT 


CAGCAAGCAA 


3060 


TAGATTTATC 


TGAGCATTTT 


GCTTGACCTG 


AGCTTCTTTC 


AAATCCTCGA 


CAATACGAAT 


3120 


CCGAACCAAT 


CCAAAAGTCA 


TTAAATCGAC 


AAAAGTAAGA 


ATTACAAGTA 


GATAGAATAG 


3180 


AAACTCAACT 


TCGATTCTCT 


GAAAAATCAA 


CAGTTGCCCC 


ACAACAAGGA 


CTTGAGCAAG 


3240 


AAGAAAAGTC 


CAGACATGTA 


AAGACTTTAA 


ACTACGT AC G 


CTGAAATGAT 


AACTTAAGAG 


3300 


ATTGGATAGG 


AAAAAGAAAA 


ACCAGATATA 


ATTAACAGCA 


ACAAAGGCAG 


TATTCCCAAC 


3360 


TACATAAGTC 


AGCATGAGGC 


CCCAATATAG 


CCAAGATACG 




T Ji flTT^^TT* & A 
I /\v» 1 1 <J i i i\J\ 


J *4 £, \J 


AACACCCAAA 


TATGCCACTA 


CAAATAGAAT 


ATCAATCAAT 


AAATGCCAGG 


CAGAAAGCCA 


3480 


CCCAGTCACT 


ACAGACAGGA 


TGGGGAAAAT 


CATAAAAATT 


AAACTGATCC 


AAAACATATA 


3540 


ATGTATTCTT 


TTCAGTCTTT 


CAAGCATTAA 


GCATTCTCCT 


T ATGACCTTG 


AAGGTAAATG 


3 600 


GTCAAACCAA 


ACAAAACTAC 


TGAAAAAACA 


AGTAAATAAA 


CTGTGGCTGA 


TAGATTGATG 


3660 


CCACCCTCAT 


TTAAGAAGGT 


CTTGAGCAAC 


TCCATCAACT 


GATAGGTCGG 


GAGACACTTA 


3720 


C CT ACT ACTT 


GCATCCAGTC 


TGGAAATAAA 


GAGATAGGCA 


TCCAGAGTCC 


ACCTAAAACA 


3780 


GCCAACCCTA 


GATAAAGAAG 


ATTGCCCACG 


ACAGACATCA 


ACTGACTAGT 


TGGTAAGAGA 


3840 
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GTCAAGGTCA 


AACCAAGCGC 


TACGAAGGCA 


ATACTTCCTA 


CTATCAGCAA 


AAGTGCAGCC 


3900 


CCAATCCAAT 


TTCCAAGAGA 


CATGTCCACA 


CCTCTTACAA 


AATGCCCAAC 


TGAGAAAACC 


3960 


ACCAAGATTG 


AAACCAAATA 


ATCAACCAGC 


ATACTTGTTA 


TCTTTGATAG 


ATAATATTCT 


4020 


ACCATATTTA 


CAGGGCTATG 


ACGCAATGTT 


TTCTGCCAGT 


TGTTGATCTT 


GTCGGTATGT 


4080 


AAAACAACTG 


GG AATGAGAA 


GATAGCTGTT 


GACATCATGG 


AAAATGCAGT 


CATGGAGATA 


4140 


AGATAATCAC 


G C AT AAAATT 


CGCGAGTTCA 


CCTCGTGTGT 


CCTGATAGAT 


ACCAGAAAAA 


4200 


AATAAATAGA 


AAGCCGT CGG 


CATCCCTACT 


GACAATAGAT 


AATAGATCAA 


TTGTCGTTTG 


4260 


GTCAATAAAA 


ATTCTATCTT 


ACTAAGTGCT 


AGCCATCGTT 


TCATCTTAGT 


TATCTCCCTT 


4320 


CTGCGTTTCT 


TCAAAGATTG 


TATCCAACAA 


ACTACGATTA 


TTAACTTCAA 


TTTCTTGTAT 


4380 


GCCACATCCT 


GCTTGAACTA 


ACAGTTCCCA 


AAAAGCATCT 


GCTTCGCGTG 


TGACTACTTG 


4440 


TAG AG CATC C 


TGTTTTTGTG 


ACCAGTTTTC 


AACCAAGTTA 


GACTGCTCAA 


TGACTTCCTT 


4500 


GTATGCCAGA 


GGAAGGATAA 


AATGCTTTTC 


AATTCCCTCA 


CTACGCATAG 


CTAGAGGCGT 


4560 


CGTATCACGA 


ATC AACTCTC 


CCTTATTTAA 


AACCAAAATC 


CGGTCAGCCG 


TATGCTCTAC 


4620 


CTCTTCAATA 


TAATGAGACG 


AATAGAGAAT 


CGTGACTCCT 


TGCGCTTTTA 


GGTCCCGAAC 


4680 


GATTTCCCAA 


AAGCGTTGAC 


GAGTTGAAGT 


ATCCATGGCA 


GCAGTTGGTT 


CATCTAAAAA 


4740 


GACAAGCTTT 


GGTC GCCCAA 


TCAAGGTCAA 


GACAAAAGAG 


AAGAGACGCT 


TTTGCCCGCC 


4800 


TGACAATTTT 


TCTGCGAATT 


GCTCTTTTTG 


TTGCTGGTCA 


AACTGCAATA 


GTTGATCGAT 


4860 


TTCCTGATCG 


CTCAAGGAAT 


TTGGATAGAT 


ACGTTGAAAG 


AAAGCAATCA 


ACTCTTTGAC 


4920 


CTTTAATTTC 


TGAACGATGA 


CATTTTCTTG 


AGGCAGATAA 


CCTCTAATAT 


AGTCTAACTG 


4980 


AGAACTCGTC 


ACTGACAAGC 


CTTGGATGGA 


TACTTGACCG 


CTTGTGACCA 


GTTTATCTCC 


5040 


AAGCAGACAG 


TCCAAGAGTG 


TGGTCTTCCC 


AGCACCATTG 


GGCCCAATCA 


AGGCGACGCA 


5100 


TTCACCTTCA 


GCTACCTCAA 


AGGAAATACC 


CTTCAAAATA 


GCCTTGCCCT 


TGATGTTTTT 


5160 


ATTTAGGCTT 


TCTACCTTAA 


TCATATTCAT 


GATATTCTCC 


TTTCAACCAC 


TCCATTCTCA 


5220 


TAAGGAAAAC 


GACGAAAATC 


ATAAATCCAA 


ACCCCAAAGC 


ACCACGAATG 


AATTGGCGAA 


5280 


gCAAGGTTTG 


GTCAAACCAA 


CCTGTAAACA 


TTTCCACTAA 


CCATACCAAG 


AGTGACAGGC 


5340 


CGATAAAGAA 


ATAGATGATC 


CCTCTCTTCA 


TTCCTCAAGC 


TCCTTTTTCA 


CATCTCCCAC 


5400 


TAATTTCAAA 


CCTTCTCTAA 


CAAGCCAAGA 


CATCATTCCA 


AAGCCAGCAA 


AGAGCTCCCA 


5460 


AGGAAAATGA 


TAGAAACTCT 


CATCCAATCC 


CGAAAACATG 


AGTTAGGTCA 


TAACTCCTGC 


5520 


TACT ACT AAA 


CTCACTGCGA 


TAATCATTTT 


ATTTCTCATC 


TCTTCTTCCT 


CCATTTCATA 


5580 


CTACAATTAT 


AGTCTTTTGA 


AATCAGAGGA 


GACAGAAGCT 


TCTGTCACTA 


GAAAATATGA 


5640 
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CAAATCTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700 

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA AC AC AT AT AT 5820 

AGTGTATTGA ATCTATAACA GTAGCCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880 

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC GTTGAACAAA 594 0 

ACCCTTCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACT ACGGT AT TTATTGAAAA GACTGGAGAC AAAAAGTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 612 0 

ATCTAGTTTT AAAAATACAC ACTCACATAT CTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCGAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCCGAGTT 62 40 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TAGTCAGGTA 63 00 

CTCTATCCAG TTGAGCTAAG GGTG CTCCAT ATTATGCCGA GGACCGGAAT CGAACCGGTA 63 60 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCGTCT GCCAGTTCCG CCACCCCGGC 64 20 

CTCTCTAAGC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 64 80 

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAG ATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6 600 

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCGCCAAAC CCGCATATAT G AC CCGT ACT GGGCTCGAAC 672 0 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 67 80 

CTTGCGTTAC CTTAAACGGT CCCGACGGGA ATCGAACCCG CGATCTcGCC GTGACAAGGC 6840 

GACGTG 6846 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GAATTCATTT TAAATAAAGA TACGGGAGAG GTAAGTGAAT TAAAACCTCA TAGGGTAACT 60 

GTGACCATTC AAAATGGAAA AGAAATGAGT T CAACG AT AG TGTCGGAAGA AGATTTTATT 120 
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TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180 
G GTTTCG AAG GTAAAAAAGA CGCTGGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240 
AAACCTGTAT TCAAGAAAAT AGAGGAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300 
GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360 
AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 4 20 

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 4 80 

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 54 0 

ATAGTAGGAA TTTTTCTTGG ATTGAAGAAA AAAAATCAAG ATT AAG AT AA AAGCTATAGA 600 
AAAAAATGGT TTATGTACTG AG ATT AG ATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 6 60 

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720 
TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 7 80 

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCT AAT AG A GGTGATGAGA 840 
CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGGACAA 900 
ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 9 60 

AGTTTCGAAA ACCAAAGGCA TTGCCCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020 

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080 

AGGTTTTAGA GGATGAACTT G ATT CAG ATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140 

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATACTGGT GTTTCAAGTC 1200 

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATAGTA CACCTCTGCT 12 60 

TCTAAAACAT TGTTAGAAAT CGATTTGACT GTCCTGAATG ATTTGTCCTG TTATTATTTC 13 2 0 

ATTTT ACT A T AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTC ATCT 13 80 

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440 

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCAT CTGG A TAAGCTTGTG 1500 

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGACTCAGCC 1560 

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620 

CTATTGGACC TTTTTGTCTG GGAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680 

TCAGAGTCGG AGTGGTTCGG T AG T ACAAG A ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740 

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC T ATGCGAT AG 1800 

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CGCTAGAAGC 1860 

TTATCGTCAA CTGG AAG AAG CTGAACTTGT TGGATGTTGG GCGCATGTGA GAAGGAAATT 1920 , 
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TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAG GTTTAGCTTA 1980 

TTCTCATCAG TTATTTTCCT TGGAAAkAGA CTGGGAGGCT TTGCCACCTG ATGAACGACT 2040 

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCG 2100 

TCACTCAGTT TTAGCAGGTT CAAAACTAGG AAGGGCAATT GAATACAGCC TCAAGTATGA 2160 

AGAAACCTTT AAGACTATTT TGAAAGACGG ACATCTGGTC CTTTCCAATA ATCTAGCTGA 2 2 20 

ACGCGCCATT AAATCATTGG TTATGGGACG GAGTAAAAGA GTCCAGTGGA CTCTTTTAGC 2280 

CTGAGCTCAG TTTAAAAAAG CGAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340 

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400 

AATCTATAAC AGTACGCATC G ACTGC T AAA ACATTTCTAT AAATCAATTT TCCTTTCCTA 2460 

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CGACAAATAT CTATCCTATC 2520 

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2 580 

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 264 0 

TCCGTGAGTT CACTAATCTG G AG ATTTT TC AATAGAtTCG TTATTGGGCG GTTACGATAT 2 700 

G AT c ACT ACT TCGTCAGTCT TAT CT AC AAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 27 60 

TAGCTTCCTA GTTTACTCTT TG ATT TTC AT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820 

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2 880 



2911 



AGAGACAAAA AAGAACAGAA AGTAATTGAC A 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE ONE SS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 00: 

GAAAATAAGT CTTGACAGAA AG CG CT AT CA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGT TTC AAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT 120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAAT C AAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 3 60 
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TTTACAAAAT TCATATCCTC TCCCAACTTT GGTATCTTGT TAGCCAACAC ATTAAAATTA 4 20 

AGTATCTATG GTTTATTGCT TGGCTTTTTA CCACCAATCA TTCTCGCGAT TATGCTCAAT 4 80 

CAACTCTTGA GTGAAAAAGT CAAAAAACGA ATTCAGCTCA TTTTATACGC ACCAAACTTT 540 
ATCTCAGTCG TTGTTATTGT CGGTATGATT TTCCTCTTCT TTTCAGTGGG AGG ACCAATC 600 
AACAATTTTC TTTCTATGTT TGGAATGAAG GCTGACTTCT TGACAAATCC AGACTTCTTT 660 
AGACCTTTAT ACATCTTTAG TGGTATCTGG CAAGGAATGG GCTGGGCTTC AACGCTCTAC 720 
ACGGCAACAT TGGTAAATGT AGATCCAGCC TTAGTAGAAG CAGCCCGACT GGATGGAGCC 7 80 

AATATCTTCC AACGAATCTG GCACATTGAT ATTCCAGCTC TTAAGCCTAT TATGGTTATC 840 
CAATTTGTTT TAGCTGCAGG TGGAATTATG AATGTCGGAT ATGAAAAAGC ATTCTTGATG 900 
CAGACATCGT TAAATTTGCC AACTTCTGAA ATTATCTCGA CATATGTCTA TAAAGTTGGT 960 

CTTGTATCAG GAGACTATTC TTACTCAACA GCGGTTGGTT TGTTTAATGC AGTGATTAAC 1020 

GTAGTATTGC TTGTTGCAGT TAACCAAATC GTTAAACGCA TGAATAATGG TGAAGGAATT 1080 

TAAGGAGGAA AGTATGAAAA ATTCGATTAT GGATACAAAA TTTGATAGAC GTATCTTACT 1140 

CTTAAATAAA ATCATTATTG TCTTTATCGT TTTGATGACT TTGCTTCCTT TACTTTATAT 1200 

CGTCGTAGCA TCCTTTATGG ATCCTAAGGT TCTGGTTAGT AGAGGGATTA GCTTTAATCC 12 60 

AGCCGATTGG ACTGTAGAAG GTTACCAGCG TGTATTCAGT GACCAATCTA TTCTAAGAGG 1320 

TTTTATCAAT T CT CT ACT AT ACTCTTTTGG ATTTGCAGCT TTAACAGTCT TGCTATCTGT 1380 

GTTTACAGCT TATCCTCTTT CTAAGAAAGA CTTGGTTGGA CGTCGTTGGA TTAACTACTT 1440 

CTTGATTGTA ACTATGTTCT TTGGTGGTGG TTT AGTCCC A ACTTACTTGC TCGTAAAAGA 1500 

ATTGGGAATG CTCAATACTC CATGGGCTAT CATTGTTCCA GGTGCTGTTA ACGTTTGGAA 1560 

TATTATTCTT GCTAGGGCCT ATTTCCAAGG ATTGCCTGAA GAATTAGTTG AAGCTGCTGT 1620 

CATTGATGGT GCAAATGATT TACAGATTTT CTTCAAAATC ATGCTTCCTC TTGCAAAACC 1680 

AATTATGTTT GTTCTCTTCC TTTATGCTTT TGTAGGACAG TGGAACTCAT ACTTTGATGC 1740 

AATGATTTAT ATCAAGGATC CAAACTTGGA ACCATTGCAA CTTGTACTTC GTAAAATTCT 1800 

CATTCAGAGC CAACCAGGTC AAGACATGAT TGGAGCACAA GCGGCTATGA ATGAAATGAA 1860 

ACGTTTAGCT GAATTGATTA AATACGCAAC TATTGTCATT TCC AGCTTG C CATTGATTGT 1920 

TATGTATCCA TTCTTCCAAA AATACTTTGA TAAAGGAATT ATGGCTGGTT CACTTAAAGG 1980 

ATAAAAAAAG AAAAAATAAA AGGAGTTTTC TCATGAAATT CAAAACATTC TCAAAATCAG 2040 

CAGTTTTGTT GACAGCTAGT TTAGCAGTAC TTGCAGCCTG TGGCTCAAAA AATACAGCTT 2100 

CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT TCAAGAAAAG AAAACATTGA 2160 
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AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC AAATGAAAAG TT AATTT TGC 2 220 

AACCTTTGGA GAAGG AAAC T CGCGTTCATA TT G ACTGG AC CAACTACCAA TCCGACTTTG 2280 

CAGAAAAACG TAACTTGGAT ATTTCTAGTG GTGATTTACC AGATGCTATC CACAACGACG 2 340 

GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG TGTTATTATT CCAGTTGAAG 2400 

ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT GG ATG AG AAA CCAGAGTACA 24 60 

AGCCCTTGAT GACAGCACCT GATGCGCACA TTTACTCATT TCCATGGATT GAAGAGCTTG 2520 

GAGATCGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC TTGGATTAAC AAACATTCCC 2580 

TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA TTTGATTAAA GTCCTAGAAG 2 640 

CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA TGAAATTCCA TTTTCATTTA 2700 

TTAGTGGTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC TGCATTTGGT ATAGGGGATA 2 7 60 

ACGATGATCA TTTAGTAGTA GG AAATGATG GCAAAGTTGA CTTCACAGCA GATAACGATA 2820 

ACTATAAAGA AGGTCTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG ATTGATAAAG 2 8 80 

AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG TCATGATCAG AAATTTGGTG 2 940 

TTTACTTTAC ATGGGATAAG AAT AATGTT A CTGGAAGTAA CGAAAGTTAT GATGTTTTAC 3000 

CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG TACAAACGGT ATGGG ATTT G 3060 

CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT AGAATTGACA GCTAAATGGA 312 0 

TTGATGCACA ATACGCTCCA CTC CAATCTG TGCAAAATAA CTGGGGAACT TACGGAGATG 3180 

ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA CACTTACCAC 3 240 

TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA AGTAGGAGGA CCACTAGCTA 3 300 

TCCTAGATTC AT ACT ATGGT AAAGTAACAA CCATGCCTGA TGATGCCAAA TGG CGTTTGG 3 3 60 

ATCTTATCAA AGAATATTAT GTTCCTT AC A TGAGCAATGT CAATAACTAT CCAAGAGTCT 3420 

TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA AGCAGATATG AATGACTATA 34 80 

TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT TG AT ACT GAG TGGGATGATT 3540 

ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT CGCTATTAAA CAAAAATACT 3 600 

ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAG 3660 

TAGAAAAAGC CAATCGTTTT ATAGCAGAAA ATAAACATCT CGTTAATACT CAATATAAGC 3720 

CTGAAGAACA TTTTTC AG C T GAGATTGGTT GGATCAATGA TCCAAATGGA TTTGTCTATT 3780 

TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTCTT TGGGGGCCTA 3840 

TG C ACTGGGG ACATGCTAAA AGTAAGGACT TGGTGACTTG GGAGCACTTG CCAGTGGCAC 3900 
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TTGCTCCTGA CCAAGATTAT GACCGAAATG GTTGTTTCT C AGGCTCTGCC ATTGTCAAGG 3960 

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AG AAACCG GT GTCCCCCAAG 4020 

TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG 4080 

TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCGT GATCCAAAAC 4140 

TCTTTGAAAA AG ATGGACG C TATTACTCCG TAGTAGCTGC CAAACACAAG GATAATGTGG 4200 

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 4 260 

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATG 4 320 

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TGAGGGAGAC TCATATCATA 4 380 

ACATCAACTC ATCCCTTTTG TTCACGGGTA AGGTAGATTG GAG AG AAAAA CGTTTTATCC 44 4 0 

C AGAATC AG T TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTCTTGG 4 500 

ACGATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4 560 

CCCATGACCA AGAACACAAG TGGGCATGTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4620 

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA AT ATC AAATC CAAATAGATA 4 680 

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4 74 0 

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTT ATTC A AAAAATTCTA GGTGAAGAAG 4 800 

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860 

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920 

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT TTCTCCTAAA 4 980 

GAAAAAGTTC TCTTTCTAAA ATAGTGGAAA GAGGACTTTT TGTGTTTTGG GTATATAAGC 5040 

TTAGTTTATG GTATTTGTAA AATTGGTGTT G G ATT AT GAT TTAAGCTAGT TTTCTAAAGA 5100 

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160 

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AG ATGTTCT T GATTTTCTGG 5220 

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280 

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340 

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400 

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 54 60 

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520 

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580 

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640 

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700 
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AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760 

AAGTCTTAAT CAAGTATTGA CACTTTATCT GGATTTCGGT ATAATATGCT TAGAAAGGAA 5820 

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880 

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTAGATA AATGCAATTG 5940 

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000 

AGAAAAGGAG AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CGAAACAACA 6060 

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTCGCTAAGT TGTCGCCCGA 6120 

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180 

AGAAGCGTAT GGGGGTAAAA AT CT AAACG T AGTTTATGCG AGTCGGATGT TGGCTCTCTG 62 40 

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTAGCAG ATGCGCTTTT 6 300 

TAGTGATAAA AAAATTTGTC TAGATTCTTA CT ATT AC C AC AAGAATACCT CTAATACCAT 6360 

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 64 20 

AG AAG CG AC A CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 64 80 

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCGTTTAGT ACCATGCATT ATGCATTTTT 6540 

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CC AT AT CG AA ATTGAATTAT 6600 

TTAAATTTAT ACTCTTCGAA AATCAAATTC AAACCAAGTC AGCTTCGCCT TGCTGTACTC 6660 

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT GCTTTTTGAT TTTCATTGAG TATTACTCTT 6720 

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCGAAA ATTTTAGGTT 67 80 

CTATAATATT TGTAGTGGGT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840 

AAAGTCCCAT ATGA g a S4 
(2) INFORMATION FOR SEQ ID NO : 201: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3895 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGCT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG ACATTT T CGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180 
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CACTTCTGCA 


TCTTCATAGT 


TTGATATCAA 


AATCTGTCCA 


TTTTGGTAGA 


CTGCTGGCAA 


240 


GTCGATTTCA 


CTT CTTTAGC 


ATAAAAGTTA 


TTG AGC ACT A 


GTAACTTTTG 


ATCCTCAAAC 


300 


TGGCGTTCAA 


AAGCGTAGAC 


TTGTTTGCTA 


TCTTCAAAGG 


CTGGTTTGTA 


ACTTCCTTCT 


360 


GAAATGATTG 


GCATTTCCTT 


ACGCATCGAA 


TCAAGTCTTG 


ATAGAAGGTA 


AAAATCGGAC 


420 


CCTGGATTTC 


ATTTTCTACA 


TTGATGTATT 


TATAGGATTT 


ACCAGCTTTC 


AACCAAGGAG 


480 


TGCCTGTTGA 


AAATCCTGCA 


TTTTCCGAAG 


CATCCCACTG 


CATGGGAATG 


CGTGAATTAT 


540 


CACGCGACTT 


AGCTTGAATA 


ATCTGGAAGG 


CTTCTTGCTG 


ACTCTTTCCT 


T CTTCT AAG A 


600 


GCATCTGATA 


GGCATTAAGC 


GATTCGACAT 


CCACATAATC 


AGCCATAGAA 


TCATAGTCTG 


660 


GGTCAATCAT 


CCCGATTTCC 


TCACCCATGT 


AGATATAAGG 


TGTCCCACGT 


GACAGGTGAA 


720 


TGCTGGCTGC 


TAGCATGGTG 


GCTCCTTCCT 


TGCGGAAGTT 


TTGAATATCG 


ACAAAACGGT 


780 


TCAAGGCACG 


TGGTTGATCG 


TGATTATTCC 


AAAAGAGGGC 


ACTCCAACCG 


TCTTTATCAC 


840 


TCATTTCCTT 


ACCCCAACTA 


TGGTAAAGAC 


TCTTCAACTC 


TTCAAAATCA 


AAGGGAGCCA 


900 


AGGTCCACTT 


TTGTCCATCC 


TTATAGTCCA 


CCTTGAGGTG 


ATGAAAATTA 


AAGGTCATGG 


960 


ATAATTCCTG 


ACGATCAGGC 


GACGAATAGA 


GGACACAGTT 


TTCCATGGTG 


GTAGAAGACA 


1020 


TTTCCCCAAC 


TGTCATAAAG 


CTATCGTCGG 


ATCCAAAAGT 


GGCTTGGTTC 


ATCATACGCA 


1080 


AATAGTTATG 


AACGATGGGT 


TTGTCTGTAT 


AAGCTGGCTT 


CCCTTCATTT 


TCAGGACAGT 


1140 


CCACTGAAAC 


CTCGTCCTTA 


CCGATCAAAT 


TGATCACATC 


AAATCGGAAA 


CCTTTGACAC 


1200 


CCTTGTCGCG 


CCAGAAATTA 


ACAACCTTGA 


AAAGCTCCTT 


ACGGACATTG 


GAATTGCGCC 


1260 


AGTTAAGGTC 


AGCCTGGGTC 


TCATCAAATA 


GGTGAAGATA 


GTATTTCCCA 


GTATCCCCGA 


1320 


AAGGCGTCCA 


TGCAGAACCA 


CCAAACTTAG 


ACT G CC AATC 


TGTTGCTTCG 


TCTTGGATGA 


1380 


AGAAAAAGTC 


TTGATAATAC 


TTATCACCAG 


CTAGGGCTTT 


CTGAAACCAT 


TCATGCTCTG 


1440 


TCGAACAATG 


ATTAAGTACC 


ATGTCCAGCA 


TAAAGTCAAT 


CTTGTGCTCT 


TTACCGACAC 


1500 


ACACCATTTT 


CTCAAAATCA 


GCCATATCAC 


CAAAAAGAGG 


ATCCACTGCC 


ATATAATCTG 


1560 


AAATATCGTA 


ACCATTATCC 


CGTTGAGGGC 


TTGGATAGAA 


TGGATTGAGC 


CAGACCATAT 


1620 


CCACACCTAG 


TTTGGCTAAA 


TAGGGAATTT 


TTTCGATAAT 


C C CACGG AAA 


TCCCCAATAC 


1680 


CGTTTTCAGT 


GGTGTCTTTG 


TAAGATTTTG 


GATAGATTTG 


ATAGACTACT 


TTTCCTTTAT 


1740 


CAAGTGTCAT 


CTGTTTCTCC 


TTTTCTGATA 


AAAGGGAGGA 


AGCAGTCTTC 


CGTCCCTATT 


1800 


TGTGCTATTT 


CAATTATACT 


CAATGAAAAT 


CAAAGAACAA 


ACTAGGAAGC 


TAGCCACAGG 


1860 


TTGCTCAAAA 


CACTATTTTG 


AGGTTGCAGA 


TAGAGCTGAC 


GTGGTTTGAA 


GAG ATTTTCG 


1920 


AAGAGTATTA 


GATTCGTGTA 


GCGACCATGA 


GAG ATGCTC C 


AGCTTGGATC 


GTTGTCGGAT 


■ 1980 
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AAGTTCCCGG AATAGTCGCT GTATAAGCAT CTTGGTTGGT GATGATAACA GGAGTTTCTG 204 0 

TCACCAGACC TGCAGCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 2100 
TAACGTGATC TCCTTGGACT ACAAGACTTT CAAAACCTTT GCCATCAAGA CCTACTGTAT 2160 
CCATACCGAT GTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG ATGGCATGCT 2220 
TGGTAGGGAA AAGAACCGTC ACTGTCCCAT TAACTGCAGA GGTCAACTCA CCTTGGCTTG 2280 
GTTCAATGAC TAGACCTTGC CCCATGACAC CTGATGCAAA AATAGGATCC GTCGCTTGAC 2340 
TCAATTCTTT CACTTGGCCA GTTAGTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 2400 
GTTCATGGTT CACAAATTCT GCTTCTTCTT GAGCAACGAA TTCTGCCTGC AAGTTCGTAT 2460 
CGCCCTCTCT TTTTGTAAAG AGACCAGCCT TGCGGAAGAA GAAAGTCAAG ACCATTGGAA 2520 

CAACAATCGC AACTAGCATA GTTCCTGCAA ATGGCAGCAT GTATTGAGGT TGAATAGAGA 2 580 

GAATACCTGG CAAACCACCG ATACCAATAG AAGCCGCAGT TACATTAAAA GTAACGGATA 2 640 

ACATGCCTGC AAGGGCTGAA CCAGTCATCC CAGCAACAAA TGGATAAATA TATTTTACGT 2 700 

TAACCCCAAA AAGAGCTGGT TCTGTAACAC CGAGATAGGC TGAAATGGTT GCAGGAAGTG 27 60 

AAACCTGAGC CTCACGCTCA TCATGGCGAT GCATGAAATA ATAGGCAAAC ACGGCTGAGC 2820 

CTTGAGCAAT ATTAGAAAGA GCAATCATTC GCCATAGGGC AGTGCCACCA GCATCCGCAA 2 880 

TCAATTGTGT ATCAATGGCA TTGGTCATAT GGTGCAGACC TGTGATGACA AATGGAGCGT 294 0 

AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGC ACCAGTTAAA CCTGCCAAGA 3000 

CAACTGATGA AAGTCCTTGT CCAATTGT C C AACCGATTGG TCCCAAAACA GT ATG AG C CA 3 060 

AAATCAAGGC TGGAATCAAT GACAAGAAAG GTACAAAAAT CATAGAAATG ACTTCTGGGA 3120 

TATGCTTGTG CCAGAAGATT TCAAGATAAG ACAGACTCAA ACCTGCAAGC AACGCTGGGA 3180 

TAACTTGGGC TTGGTAACCG ATACGATTAA CAGTAAAATA GCCAAAATTC CAAACCCAGT 3240 

TTGCCGCGAT ATCAGCTGCT GGCGTTGAAG CAACCGCATA GGCATTGAGC AACTGAGGCG ' 3 300 

ATACCAAACA GATTCCGAGA ACAATTCCCA AAATTTGGCT GGTTCCCATC TTACGAGAAA 3 360 

CAGAC CAAGT AATCCCTACT GGTAAGAACT GGAAGA7ACC TTCACCAGGC AACCAGAGGA 3420 

AGTGATTGAC ACCTGCCCAA AACT GAG AGG ATTCTGTGAT GGTCTTGCCA TCCAACATCG 3 4 80 

ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCCTCCG ACTATCAAGG 3540 

CTGGAATAAT CGG AGTAAAA ATCTCCGCCA GAGTGGTCAT AACACCTTGG ACCACGTTTT 3 600 

GATTACTCTT AG CTGCAG AC TTGGCTGCTT CTTTGGAAAC ACCCTCAATA CCTGAAACGG 3 660 

■ CTGTAAAATC ATTATAAAAG ATGGGCACGT CAT TTCCAAT GATTACCTGA AATTGACCTG 3720 
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CATTTGTAAA GGTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780 

TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG AGTTACGGCA GTCACATTTT 384 0 

CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CCCGG 3895 



<2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 
(8 ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



AGGATCGCCG 


CTCCAGCTAC 


TAAGTCTCGT 


GCAGTGCCGA 


TTTATCAAAC 


AACATTTTTT 


60 


GTTTTTGATG 


ACACGTAGGA 


AGCTGCCGAT 


CTGTTTGCCT 


TGAGGAAACC 


AGGGAACATT 


120 


TATACTCGTA 


TCACCAATCC 


TACAACAGCT 


GCCCTTGAAG 


GTGGTGTTGA 


AGCGCTAgcA 


180 


ACAGCATCAG 


GTATGACTGC 


AGTGACTTAT 


ACGATTTTGG 


CGATTGCCCA 


TGCTGGTGAC 


240 


CATGTAGTGG 


CTGCTTCGAC 


TATTTACGGT 


GGAACCTTCA 


ATCTTTTGAA 


AGAACCCCTT 


300 


CCTCGTTATG 


GTATCACAAC 


AACCTTTTTC 


GATATTGATA 


ATTTGGAGGA 


AGTAGAAGCA 


360 


GCTATCAAAC 


ACAATACCAA 


GCTTGTCTTG 


ATTGAAACCT 


TGGGTAACCC 


CTTGATTAAT 


420 


ATTCCAGACC 


TGGAAAAACT 


GGCAGAGATT 


GCTCATAAAC 


ATCAAATCCC 


ACTTGTGTCA 


480 


GACAATACTT 


TTGCAACACC 


TTATTTGATT 


AACGTCTTCT 


CTCATGGCGT 


TGACATTGCC 


540 


ATTCACTCTG 


TGACTAAGTT 


TATCGGTGGG 


CATGGTACAA 


CTATTGGAGG 


AATAATTGTC 


600 


GATAGTGGTC 


GTTTTGACTG 


GACGGCTTCA 


GGGAAATTCC 


CTCAATTTGT 


TGACGAGGGT 


660 


CCAAGCTGCC 


ACAATTTGAG 


CTATACTCGT 


GATGTGGGTG 


CAGCAGCCTT 


TATTATAGCT 


720 


GTTCGAGTTC 


AATTGCTTCG 


TGATACAGGT 


GCAGCCTTGT 


CACCATTCAA 


TGCTTTCCTC 


780 


TTGCTACAAA 


GACTTGAAAC 


CTCTTCACTT 


CGTGTGGAAC 


GCCATGTACA 


AAATGCTGAG 


840 


ACAATTGTTG 


ATTTTCTTGT 


CAACCATCCT 


AAGGTAGAAA 


AGGTAAATTA 


TCCAAAACTT 


900 


GCAGATAGTC 


CTTATCATGC 


CTTGGCTGAG 


AAATACTTGC 


CAAAAGGTGT 


CGGTTCAATC 


960 


TTTACCTTCC 


ACGTCAAAGG 


TGGCGAGGAA 


GAAGCACGCA 


AGGT CATTG A 


TAATTTAGAA 


1020 


ATCTTTTCTG 


ACCTTGCAAA 


CGCGGCAGAT 


GCTAAATCGC 


TTGTTGTCCA 


TCCAGCAACA 


1080 


ACCACTCACG 


GTCAATTGTC 


AGAAAAAGAC 


CTAGAAGCAG 


C AGGTGTC AC 


ACCAAACTAA 


1140 


ATTCGTTTGT 


CAATCGGTCT 


TGAAAATGTA 


GAAGATTTGA 


TTGAAGACTT 


GCGCTTGGCC 


1200 


TTGGAAAAAA 


TTTAAAGTAA 


AAGAAGATAA 


ACAGTGGGCT 


TCGACTCACT 


GTTTTTGATT 


1260 
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TTCCCTCAGG CATGATATAA TCGTTACAGA AGTCTAGAAA GAGGAACGAT ATGAACGAAA 13 20 

TCAAATGTCC CAACTGTGGG GAAGTCTTTA CAGTAAATGA GAGTCAGTAT GCCGAACTCT 13 80 

TGTCCCAAGT GAGAACGGCA GAGTTTCATA AGGAACTACA CGATAGGATG AACCAGGAAC 14 40 

TGGCCTTGGC TGAGCAAAAG GCCATGAATG AGCAACAGAC TAAACTGGCT CACAAGGATC 1500 

AAGAAATTGC GCAATTACAG AGTCAGATCC AAAACTTTGA TACAGAAAAA GAATTGGCCA 15 60 

AGAAAGAGGT TGAACAGACA AGCCATGACG CTCTCTTCGC TAAGGACAAG GAAGTACAGC 1620 

TCTTAGAAAA TCAGTTGGCT ACCTT GCGTT TGGAGCATGA AAATCAACTA CAAAAGACCC 1680 

TTTCTGACCT AGAAAAAGAA CGGGATCAGG TTAAAAACCA ACTACTTTTG CAGGAAAAGG 1740 

AAAATGAATT ATCTTTGGCT TCTGTTAAGC AAAACTACGA AGCCCAGCTC AAGGCAGCTA 1800 

GTGAACAAGT CGAGTTTTAT AAGAATTTTA AGGCTCAACA ATCTACAAAA GCGATTGGGG 1860 

AAAGCCTAGA ACAGTATGCA GAGAGTGAGT TTAACAAGGT TCGTAGTTTC GCCTTTCCAA 1920 

ATGCTTACTT TGAGAAGGAT AACAAGGTCT CTTCGCGTGC GTCTAAAGGG GACTTTATCT 1980 

TCCGTGAGTG TGATGAAAAT GGAGTTGAAA TCATTTCTAT CATGTTTGAG ATGAAAAACG 2040 

AAGCGGACGG AACAGAGAAG AAGCACAAGA ATGCAGATTT TTACAAGGAA TTGGACAAGG 2100 

ACCGTCGGGA GAAGAACTGT GAGTATGCCG TTTTGGTGAC CATGCTTGAG GCTGATAATG 2160 

ACTACTTTAA CACAGGGATT GTTGACGTCA GTCACGAGTA TGAAAAAATG TATGTTGTTC 2 2 20 

GTCCTCAATT CTTTATCCAA TTGATTGGTC TCTTACGTAA TGCGGCGCTA AATTCCCTAA 2280 

AATACAAGCA GGAGTTGGCC TTGGTTCGCG AGCAAAATAT TGACATTACG CATTTTGAGG 2 340 

AAGAT TTGG A TGCCTTTAAG CT AG CTTTTG CTAAGAACTA TAATTCAGCT TCGACTAACT 2400 

TTGGAAAAGC TATTGATGAA ATCGACAAGG CCATCAAACG CATGGAAGAG GTTAAGAAAT 2460 

TCCTGACCAC ATCTGAAAAC CAACTCCGTT TAGCTAACAA CAAATTGGAA GATGTCTCTG 2520 

TTAAAAAATT GACCCGGAAA AATCCAACAA TGAAAGCGAA GTTCGAAGCA CTGAAGGGGG 2580 

AGTAGAAAGC AAAAATGAAC GGTATTATTA ACTTAAAAAA GGAAGCAGGA ATGACCTCGC 2 64 0 

ATGATGCGGT TTTTAAACTG CGTAAGATTT TGGGAACCAA GAAAATTGGT CATGGTGGAA 2700 

CCTTGGATCC GGATCTGGTG GGTGTTTTGC CGATTGCGGT TGGCAAGGCG ACACGCATGG 2760 

TCGAGTTTAT G CAGGACG AG GGTAAGATCT ATGAGGGGGA AATCACTCTG GGCTATTCCA 2 820 

CGAAGACTGA GGATGCTAGT GGGGAAGTGG TCGCAGAAAC CCCTGTTTTG TCTCTCTTGG 2880 

ATGAAAAGCT TGTTGATGAA GCGATTGCTA GCTTGACTGG GC CT ATTACT CAGATTCCCC ' 294 0 

CTATGTATTC GGCAGTTAAG GTTAATGGTC GCAAGCTCTA TGAGTATGCG CGTGCTGGTC 3 000 
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AGGAAGTGGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCGA ACAAGTCCGA 3060 

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGT 3120 

ACATCCGTAC TTTGTCAGTT GATTTGGGTG AAAAGCTTCG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3 240 

TTGCTGAAAA AGTAGAGGCT GGCCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3 300 

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AG AAGTTCG C TTTGGTCGTT 3 360 

TTATTGAGCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3 4 20 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA G G TT TT TAGC TAGATCGTTT 3 4 80 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3 540 

ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT 3 600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3 660 

TCCATTACAG GTTCTGATGG GAACTTAGGC CCACGATTTG GTGTGATAAT TCCATGATGC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AG AC AG AAAT GAAAAATACC 3 840 

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3 9 00 

AGGCACCTAA GTCTGTCATT GATTTTGGTT TATGGA 3936 



(2) INFORMATION FOR SEQ ID NO: 203: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3230 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 24 0 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AG CGATTC AT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 3 60 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC AC G ACTTC CT TTTTTGAACT CATCACGGCG 480 
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ACCATTCCCA 


CGGCGATCAC 


GCTCTCGACG 


GTCCTCCCCA 


CGACGGCCTC 


CACGACCTCC 


540 


CTTAGCTTTA 


CCACCGAAAC 


CATTACCTGA 


TGG TTTAAAC 


GGTAGTGGCT 


TTTCACGTGC 


600 


AATCTCCACT 


TCTGGAAGCC 


TATCTGGGTC 


TTGGACTGTC 


AGACTCAAGA 


TATACATTGC 


660 


CAATTCTTCT 


GGAGTAAACT 


CAGCAGCCAA 


TTTGCG AG C A 


TCCTTACCAA 


ATTTCTCAAA 


720 


GTTGGCACGA 


ATGGTTTCAT 


CTGCAAAATC 


ACGTTCGATT 


TTCTTGAGAG 


CTACCTGTTT 


780 


TTTTGATTGG 


AAGGATTCTT 


CTACACTTGC 


AGGTTTGAGA 


CCTTTCATGC 


GTTTCTTAGT 


840 


CAAGTTTTCA 


ATGATTTGAA 


GGTAACCCAT 


TTCGTTTGGA 


GCAACAAAAG 


TAATAGATTG 


900 


ACCTGACTTA 


CCAGCACGAC 


CTGTACGACC 


GATACGGTGA 


ACATAACTCT 


CAGGATCTTG 


960 


TGGAATATCG 


TAGTTGTAGA 


CATGGGTCAC 


ACCTGAAATA 


TCCAAACCAC 


GCGCTGCAAC 


1020 


GTCTGT CGC A 


ACCAAAACAT 


CAAGATTGCC 


ATTTTTAAAG 


TCACGAAGGA 


CACGAAGACG 


1080 


TTTGTTTTGG 


TCTAGGTCGC 


CATGAATTCC 


TTCTG CACGG 


AAGCCACGAA 


TTTTCAAACC 


1140 


ACGAGTCAAT 


TCATCCACAC 


GGCGTTTGGT 


ACGACCAAAT 


ACAATAGCGA 


GTTCTGGTTG 


1200 


TGCCACATCC 


ATGAGACGAG 


TCATGGTGTG 


AAATTTTTCT 


TGTTCCTTAA 


CACGGATATA 


1260 


GTACTGGTCA 


ACCAATTCTG 


TTGTCAATTC 


CTTAGCCGCA 


ATCTTGACAT 


GTTCAGGGGC 


1320 


TTTCATAAAC 


TGAACACCGA 


TACGTTTGAT 


GGCATCTGGC 


ATAGTTGCTG 


AGAAAAGCAA 


1380 


AGTTTGACGG 


TTCTCAGGTA 


CACGGGAAAT 


AATGGCTTCG 


ATGTCTTCAA 


GGAAGCCCAT 


1440 


GTTAACCATT 


TCATCCGCTT 


CGTCAAGGAT 


AAGGGTTTCA 


ATCTCTTGTA 


ATTTCAAGGC 


1500 


CTTGCGTTTA 


ATCAAGTCCA 


AGAGGCGACC 


TGGAGTTC CC 


ACCACAATAT 


GGGCACCAGA 


1560 


TTTAAGAGCC 


TTAATTTGTT 


TTTCAATGCT 


TGATCCG CCA 


TATACTGAAC 


GGACTTTGAC 


1620 


TCCCTTACTA 


CGACCAAAGC 


GGAAGAGTTC 


TTCTTGACTT 


TGGACAGCTA 


GTTCACGAGT 


1680 


TGGAGCGATG 


ACCAAGGCTT 


GGATAGTCGC 


TTCTTCTGTA 


CGGATTTTTT 


CAAGGGTAGG 


1740 




GCTGCAGTTT 


TTL CTGTACC 


AGTCTGAGCT 


TGACCGATAA 


CATCCTTGTC 


1800 


TTCAAGGGCC 


AAAGGAATAG 


TTTGTTCTTG 


GATAGGACTA 


GCTTCTACAA 


AAC CAGCTTT 


1860 


TTCAATTTCT 


GCTAGCAAAT 


CAGCAGACAA 


GTTTAATTCA 


TTAAATTTCA 


CGTTATTCTT 


1920 


CTTTCTAAAG 


GTGGTGCGAA 


GCCACCCTAT 


AGGGCTTAGT 


TTATACTTTT 


CTTTTTATGA 


1980 


CGTATTTTCA 


TATAACTAGA 


TATAAAATCG 


TGTTGCTTCT 


TTTCCACAAA 


AGAAAAGTAC 


2040 


TGTTTTCTTT 


GCAACCTATC 


TAGTATAACA 


CAAGACCAGA 


GCAAAAGATA 


GCCCCATTTC 


2100 


TACAGAAAAT 


CATGTAAGCG 


CTTTTTGACT 


TTCTTTTTTG 


ATTGAACGAC 


CTAGATAATA 


2160 


AGACAAAGCC 


AAGGCGATAC 


TGTATAAAAT 


GAGAAAAACG 


AACAAGGTTT 


GTGTGTACGA 


2220 
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ATGAGCCATT TTATAAGTCT CTGCTAATAA AATAGGTCCC GCTAAACCAG CCATTGCCCA 22 80 

AG CTGTT AAA ATATAACCAT GCAGAGCGGC CAATTCCTTG GTTCCAAAAA TATCACTGAG 2340 

ATAAGCTGGA ATCAAAGAAA AACCACCTCC ATAGCAAGTC ATCAAAATAG ACATAGCAAC 2400 

TACAAATAAA ACGGAATCTG TAAAGAGCCA AAGTGAGAGA GAAAAGAAAA GATTCACAAG 24 60 

CAGTAATATA CTAAAGGTTA GAGGGCGACC GATATAGTCA GACAAACTCG CCCAGAGCAA 2S2 0 

GCGACCAAAT CCATTGAAAA TCCCCAAAAC ACCCACCATT ACTGCTGCAT GACTTGTAGA 2580 

CAAGCCAGCC ATCTCCTGTG CCATTGGCGA TGCCGCTGAA ATT AAGCC T A AACCACAAGC 2640 

TATGTTGATA AAGAAAATAA TCCAAAGCAT ATAAAACCGA TTGCTTTTTA GAGCCTGATT 2700 

TG C AG CC ATT CCTTGCGTCA AAGAGGCTGT TTTTTCTTTC CCTGAAGAAG ATAAAATTGC 27 60 

AAGC TCTTGC TCATTTCGAC GCTTAATGAA TTGTGAAGCT AGGAGCATGA TAATAAAGTA 282 0 

ACTTGCTCCT AAAATATAAA AAGTTTCTAC AAGCCCTACC CCTGCGATGA GCTGTTGCGC 2 88 0 

TATGGGACTA GTCAATAAAG AAGCAAAACC AAACCCCATA ATCGCTAAAC CTGTTGCGAG 2940 

ACCACGTTTA TCAGGAAACC ATTTTATAAT CGTCGACACA GGGGTAATAT AGCCTGCTCC 3000 

CAAACCAAGC C C ACCTAAAA TGCCATAAGC GAGATACAAC AACCACAGCT CTGACGGTCT 3 060 

ATTGCAAATC CTGTT AAGAT ATTTCCACCT GCGTATAGAA AAGCAGATAG ACTTCCCATG 3120 

ACTTTCGGAC CAAATTTTTC TACCAAACGC CCCATAAATG CAGCCGATAA GCCCAAACAA 3180 

AAGATTGCTA GACTAAAGGC GAAGGCAACA GAAGCCTGAT CCCATCCCGT 3230 



(2 J INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 
<B ) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCTATGAAGA CTGTCCCAAC TGGGTGTCCT TCTAGGCTAT CTGGTCCTGC CACTCCAGTC 60 

AAACTAATTC CAAAATCAGA CTGGGTCTTG CTTCGTGCCT GCTCAG CCAT CTTCTGAGCT 120 

GTAAATTCAG ACACCACACC ATGTTCTTCC AAATTCTTGG CAGGAATATC CAACATCCTT 180 

GATTTTTCCT CCAAGCTATA GGTCACAAAA CCACCCTTAA AT AT ACTT G A AACTCCAGAA 240 

AAATTCGCCA CGGTAGCTTG GAAAAGACCT GCCGTCAAAC TCTCTGCAGC CGCGATGGTT 300 

TTCCCTTGCC TTTTCAGTTC TTCTACCACA ATGCTGGCTA AACTAGTTTC TTCCCCATAA 3 60 

C CAT ACCAAA AGTCTCGTAA AGAAATTCCT TCGAAAGTCT GGCAGTCCAA GATTTGATTT 420 
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TCCAAGATAT 


CCAGCGCTTG 


l ATTCGCCTCT 


1 TCTTGACTGC 


' TAGCCTTTGT 


TGACAGACGT 


480 


AGAGTGACTT 


CTCCTGTCTT 


GGCATAAGGG 


GCCAAGGTAG 


GATCGATCTG 


ATTATCAATT 


540 


AAATCAGCCA 


AAATCGTAAC 


CAACTGGCTC 


TCGCCAATCC 


CAAAGAAACG 


AAGAACTCGG 


600 


GAATACAGCT 


TGCTCCCTGT 


CATCAACTTG 


GGTAGAAGTT 


GGTTTAAGAC 


CATGGGTTTC 


660 


AATTCACTTG 


GCGGACCTGG 


AAGGACGACA 


TAGGTC ACT C 


CGTCTACTTC 


TAATTTTCCT 


720 


CCAACAGCCA 


GTCCTGTTTC 


GTTTGG CAGT 


GGAATCGCTC 


CTTCTACAAT 


TTGAGCTTGT 


780 


CTTTCGTTAT 


TCGGTGTTCG 


GGCATAGTCT 


GGTCGCAGGG 


TAAAAAAGAT 


ATCCAACTTC 


840 


TCCTGAGCCT 


GAGGATCAAA 


GACTAATGCT 


TTC CCTAAAA 


ATTTAGCTAG 


GGTTTGTTTG 


900 


GTTAGGTCGT 


CCTCAGTTGG 


CCCCAAACCG 


CCTGTCAAAA 


TCACCAGACT 


GCTACGTTGA 


960 


CTGGCAATCT 


CAAGCAAAGA 


CAAGAGACGA 


ACTTCATTGT 


CTCCTACAGC 


CGTCTGAAAA 


1020 


TATACATCTA 


CCCCAATCTC 


AGCTAGTTTT 


TCCGACAAAA 


ACTGGGCATT 


GGTGTTGACA 


1080 


ATCTGCCCTG 


TCAAAATCTC 


TGTTCCAACA 


GCAATGATTT 


CTGCTTTCAT 


GTTTCCTCCT 


1140 


ACCTATCTAT 


TCGTATTTTT 


TTGAAAAAAT 


CGCAGGAATT 


TTC CT AC GAT 


TGATTTTTTT 


1200 


ATTTGTATCA 


AAAGTTAATT 


ATCTTCATCA 


CCAACAGGTG 


CTCTGCCAAA 


TAAATCTTCA 


1260 


AATAAAACCG 


CATTGGTTTC 


AAGCTGAGTA 


ACTTCTTCTT 


GTCCCAAAGA 


ACGTCGGAGT 


1320 


AGATTTTGCA 


TTTCCAACAT 


ATGTGCTCTC 


GAAACAATCT 


GGTAAGAAAC 


ACCTTGAAGT 


1380 


ATCTCTCCTT 


CACCCTGCAA 


CTGCTGAGTT 


TCAATGGTTT 


TAAATGAATC 


TTTATAGCCT 


1440 


AGCAAGTTAG 


GGATACTTTT 


TGCAGACAAA 


TCAATATTGG 


TCTGCATATT 


GTCACTCAAA 


1500 


GCTTTTAGAA 


TCTCTTGATA 


ATGACCAATG 


CTATTTAAAC 


TGAGAGCTTT 


TTCCATGACT 


1560 


TTTTGAATAA 


CTTCACGTTG 


ACGTTTTTGA 


CGACCATAAT 


CCCCCTCAGG 


ATCTTGGTAA 


1620 


CGCATTCGTG 


CATAGACTAG 


GGCTTCTTCT 


CCCCCAATAT 


GTTGCTCCCC 


AACACCGATA 


1680 


GAAATAGTAT 


TAAATTCTTC 


TTGGTCACTG 


ATAGAAATTG 


GGAAACCTAG 


GATATTATTG 


1740 


ACTGTAATAC 


CTCCTACTGC 


ATCCACTAGT 


TTTTGCAATC 


CTCTCATATT 


GACCATCACA 


1800 


TAGCGATCAA 


TATGGATATT 


CAT C ATTTTT 


TGAATGGTTT 


CTATAGCAAG 


CTCTGCTCCA 


1860 


CC ATCTG CAT 


ATGCTGAGTT 


CAGTTTCGCT 


TCATGAGCCT 


GACCATTCCC 


TGATTCAATG 


1920 


CGCGTCAGAA 


TATCCCGCTC 


TAAACTCATC 


ATTGTTGTTT 


TTTTCGTTTT 


AGGATTCACT 


1980 


GTCATCAAGA 


TCATGCTATC 


ACTTCTACCG 


AC C CAAGTTT 


CAGTTCGTTC 


AACATTTCCG 


2040 


GTGTCCACTC 


C C ATT AAC AG 


AATGGTTAGA 


GGTTCAGTCG 


CTTCAATAAC 


CTTGGTTTCT 


2100 


TCACCGATTT 


TTTTATAGGT 


TTTAGCTAAG 


GTTTCTGTCC 


CTTGTTGATA 


AATAGTATAA 


2160 
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GCAAAAACAC 


CTACTCCTAC 


TACAGTTACA 


GAAAGTAAAG 


CTAGCACCAT 


TCCAATAATT 


2220 


TTTTTAACCA 


TATTTCTACT 


AACCTATCAG 


TTTACCCATC 


AAGTAAACAT 


CGATAAATTT 


2280 


CCCTTCTTCT 


ATATATGCCC 


CACGCTCTTG 


GCTACCTTCA 


ATGACAAAGC 


CATGCTTTTG 


2340 


ATAAAGATGG 


ACTGCTGCTT 


GATTACGAGT 


TTGGACAGTC 


AGTTGGAGAC 


GACGCAGAAT 


2400 


GCCACTTGCT 


TGTGCCCACT 


CTATCGCTTC 


TTCTAGCAAC 


AAACTTCCCA 


AGCCATTATT 


2460 


CCAATATCTT 


TTTCCAATCA 


CAATGAAGAG 


ATCTCC AAT A 


TGACGGACTC 


TCTTACGCTG 


2520 


ATCAGCTGTA 


ATATTTACAA 


TACCAGCAAT 


TTTGCCATTT 


AAGAATGCAA 


GTAAGGTTAT 


2580 


CTGATTGTCC 


GAACTAGCTT 


GCTTGTTGAG 


GAATATTTCC 


ATCTCCTCAC 


TAGTCAAGAG 


2640 


AATACCATCT 


CCGTCTAGGC 


TGGTAAAGTC 


TGTCTCCAAA 


CTCACACGAT 


TTAAAAAGGC 


2700 


CACTAATTCA 


GCTGCATCTT 


TGGGCTCTGC 


TTCCCTAATG 


AGCAATTCAT 


ACTCCATATT 


2760 


GAAGCTCCTC 


TAACAATTTC 


TCAGCACGCA 


AACCCTTTGC 


CTGAAAATTT 


AAACGGCGTC 


2820 


CATCTGCTTC 


TTTTAGAATT 


TCCAATTCTA 


AATAAGCATC 


TGGCAAGGCA 


TCTCCTAAGA 


2880 


GATTTCCCCA 


CTCAATAACA 


GTCACGCCGC 


CACCAAAGAT 


AAACTCATCC 


AAGTCGATAG 


2940 


AATCAGCATC 


TCCTTCAATA 


CGATAAACAT 


CTAGGTGATA 


AAGTGGAAGT 


CGACCTTCAT 


3000 


ACTCTCTCAC 


GATAGTATAG 


GTGGGACTTT 


TAATCATTTG 


AGAAATCTGT 


AATCCTTTTG 


3060 


CAAGTCCTTT 


AGTAAAGGTC 


GTTTTACCTG 


CACCCAGTTC 


TCCAGTTAAG 


ATTAAAACAT 


3120 


CATTCTTTGC 


TAATAGATGG 


CCCAAACGCT 


CCCCTAAGGC 


TTGCAACTCT 


TCTTCATTTT 


3180 


TTGTGTACAT 


ACTCTTATTA 


TACCAAAAAC 


TTTTCTTTTG 


TGTCTATTTT 


CCTACTAAAC 


3240 


TTATCATCAT 


AACATCCATA 


AAAAACAGGC 


TTTCTCTAAA 


AGAAAATGAG 


CGTAACAATG 


3300 


ACCAATACAA 


GATCTCGGAA 


AATATGACCA 


TAAAAGGAAA 


CTTCCTTCTT 


AACCGAATTT 


3360 


GGGACAAGAT 


AGGCTGCAAA 


AAACAAGCCC 


AGTCCAATAT 


AAATCAGAAG 


TG AG AC AAT G 


3420 


GTCATTGGAT 


TTCTTAAGAA 


AAGAAGTGTT 


GCTAAAATAG 


TCACCAACAC 


TGTCTTTTTT 


3480 


CTGTCCAGCA 


TAGCAAGAAA 


ATCGCGCACG 


TATTTTTTCA 


AGGGTAAAAA 


AATCAGCAAA 


3540 


TCTAGCCCAA 


ATAGGAAAAA 


GAAGGATGGC 


AATAAAAAGT 


C AACT AATTC 


TTGCTGCAGC 


3600 


GTATTTTTGA 


TGAACAAGTT 


ATCTGACAAA 


ACAAGAACAG 


CTCCTAACAA 


ATTAATTAAG 


3660 


AGTAACATAC 


TGTAAAAAAG 


CTTCACCGAC 


TTCTTACTGG 


CTAGGACACT 


ATGGACTTCT 


3720 


TGCTTACGGG 


T AT AAAGAT A 


ATTT ACT C CA 


GCACAGATTC 


CTGAAACGAA 


AACCATGCTT 


3780 


CCGATGAAAA 


AAG CTGT ACT 


TTGTTTAAAG 


GACAAGATGC 


ATTCCTTCCA 


TAGGAAACAG 


3840 


CTACTCAAAC 


TGATTTGAAT 


TAAAGCTAAC 


AAAAATAAGA 


TTCTCATTGA 


TTTCATCTTC 


3900 


TCTCTCCCTT 


CCTACCAATC 


ATTATACTAG 


GAGAAAAGAG 


AGAACTGTTT 


CTAATCTTCT 


3960 
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CAAATGTCTC TTTAAGACCC TAAACAAACA CTAGAGACTA ATACTCAATG AAAATCAAAG 4020 

ATCAAACTAC GTAGCTAGCC ACAGGTTGCT CAAAACAGTG TTTTGAGATT GCAGATAGAG 4080 

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATCATGAA AATCCGTCAA 4140 

ACGGGTGGTT GTTTT GTCTC GCACCTCACG GAGCGAGACG GACTCACACT CACATAATTA 4200 

TAAGGCTGAT AGTATTAATC TAACTATCAG CtTmCAGGTT ATTTAACGTT TCAGAAAAAC 4260 

TATAATGTCA AGATTAACTA AACAGTATCT AGTTCCTTCA AATAATTTTC TATCTTCATC 4 3 20 

AACATT AAAG GATTGTTATA AATCTTACAT AACT CTCTTG CTTCTATATA ATAATTTTTG 4 380 

ACTTGTTCTC TGTCTAGAAA TTTGGCTCCA GCATTTCCTA CAAGAATAAG TAGAGGAGCC 4440 

AATTGGTACC TTGTCTGTCT TTGTTTACAG AGTTCAATCG TTTCAAGAGC TTCTTGGATG 4 500 

GCTTCATTAT ATTTTTCCTT TGATACTAGG TAGTGAGCGT ACTTGTAACG AACTCTGATG 45 60 

TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4 62 0 

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4 680 

TTCAACGCCT TTAAATAAAT CAGAGTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740 

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAAGT CAATTATAGA TTTAATCCAT 4 800 

TCAAGGTAAG TTCGGTCTTC T AATGTT AG A AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860 

AGATATTCTA AATCGT CATA ATTTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4 920 

TTAGAGAGGT TAGACTTAAC TTCGATTTGT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4 980 

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040 

TTACTAATCT GGCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH : 2395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGTAA ACCTAAAAAG 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGG CATGG AC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GGAATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 2 40 
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GTTTTAATCC TAAGTGATCT CTTGGGAGGA TCGCCATTCA AGGTTTCTTC TACCATAATG 300 

GGAGAAAATC CAGCCAAGAC AATGAATGTT CTCTCGGGTT TGAACTTAGC CATGTTAATG 3 60 

GAAGCAGTCT TTGCTAGAAT GGCTCATAGC TTTGATGAGG TTGTTAATAA ATCAGTAGTG 420 

GCGGCCCAGG GCGGAGTCGT AAATGGTAAA GAATTGTTTT CAACGGATGC AGAGGAAGAG 4 80 

GAAGAAGATT TCGAATCGGG TATTTAAAGG GTAAAAGAAT GATAAAAAAG GTTACGATTG 54 0 

AAAAAATAAA ATCGCCTGAG CGCTTCTTAG AAGTACCACT TCTGACGAAA GAAGAAGTCG 600 

GCCAGGCAAT CGATAAGGTT ATTCGGCAGT TAGAACTCAA C CT TG ACT AT TTCAAGGAAG 6 60 

ATTTC CCG AC GCCAGCTACC TTTGATAATG TCTATCCAAT CATGGATAAC ACGGAATGGA 720 

CCAATGGTTT CTGGACAGGA GAACTGTGGT TGGCTTATGA ATACAGTCAA CAGGATGCAT 780 

TTAAAAACAT CGCTCATAAA AATGTTCTTT CTTTCCTGGA TCGTGT C AAT AAGAGAGTAG 84 0 

AATTGGATCA CCATGATCTC GGCTTCTTGT ACACACCCTC TTGTATGGCT GAATATAAGA 900 

TAAATGGAGA TGGAGAGGCT AGAGAAGCAA CCTTGAAAGC TGCAGATAAG TTGATTGAAC 960 

GCTATCAAGA AAAAGGTGGT TTTATTCAAG CTTGGGGAGA CTTGGGCAAG AAAGAGCATT 1020 

ACCGTTTGAT TATCGACTGC TTGCTCAATA TCCAACTCTT ATTCTTTGCT TATCAAGAAA 1080 

C AGGCG AT C A AAAATACTAC GATATTGCAG AAAGCCATTT CTATGCTTCA GCTAATAATG 1140 

TAATCCGTGA TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 1200 

CCTTTAAAGG TGTAACGAGA CAAGGGTATA GTGATGATTC ATGCTGGGCA CGTGGTCAAT 1260 

CATGGGGAGT CTATGGTATT CCTTTGACTT ATCGTCACTT AAAAGACGAG tCCTGCTTTG 1320 

ACTTGTTTAA GGGTGTGACC AATTATTTCT TGAATCGTCT GC CAAAAG AT CATGTGTCCT 13 80 

ATTGGGATTT GATTTTTAAT GATGGTAGTG ATCAATCACG AG ATTCTTC A GCAACAGCTA 144 0 

TCGCCGTCTG TGGGATTCAT GAAATGCTAA AACATCTCCC AGAGGTGGAT G CTG AC AAAC 1500 

ATATTTATAA ACATGCTATG C ATGC CATGC TTCGTTCCTT GATCGAACAT TATGCAAATG 1560 

ATCAATTTAC CCCTGGTGGG ACAAGTCTCC TCCACGGTGT GTACTCATGG CATTCAGGTA 1620 

AAGGAGTGGA TGAAGGCAAT ATCTGGGGTG ACTACTATTA CCTAGAAGCC CTTATCCGTT 1680 

TCTACAAAGA CTGGAACCTA TATTGGTAGG AGGAGAAATA TGACAATGCC AAATATTATT 174 0 

ATGACCCGTA TCGATGAACG GTTGATTCAT GGACAAGGAC AACTTTGGGT AAAATACCTA 1800 

GGTTGTAATA CGGTCATTGT TGCCAATGAC GAAGTAAGCA CGGACAAGAT GCAACAAACT 1860 

CTGATGAAAA CAGTTGTGCC AGACTCAGTT GCCATGCGTT TCTTCCCTTT GCAAAAGGTG 1920 

ATTGATATCA TTCACAAGGC TAATCCTGCT CAAACGATCT TTATCGTTGT AAAGGATGTG 1980 

AAGGACGCTT TAACCTTGGT AGAAGGTGGT GTCACTATCA AAGAAATCAA TATTGGGAAC 2040 
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ATTCACAATC CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTGGG TGAAGAGGAC 2100 

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGG AG C TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCT GTCA TTTGGTGTCG 2340 

GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG C C AATGGTG A ATTGG 23 9 5 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

CCTTCTTTAG AGGTTAATTT TGCAAAATCG TCGATTGTTA TATAAGGATT ATTATAGAGA 60 

CTGTTCGCAA AGAATCTCTG ATAT GTTT T T GAATCTTTTG AATACAAAAC TAT CTCTCT A 120 

ATAGCATTGC CATCTGTTCC ATCAATTGGT AAACATACCG T AACT AG AAA AAGAATTATA 180 

TTCAAAATAA AAAATTCTGA TGCGTACGGC ACAAATCCCA AAAGTGCTAA TATTGCGACA 240 

AT T AGGTTAG CTCCACCTCC CCCAAAGAAG TAGAACACCA AATTCCTATC ACTATTTTTT 300 

TCATTAGTAA TGTTTCTATT ACTCATTTGA CAATAACCGA ATGCTAATAA CACTGGAAAT 3 60 

TTGAAATATA TTTTTTTTCT GAAATAGAAG AAAAAGGGAG TAGCAAGCAT CTCTAGTTTA 420 

TAAGATAAAC ATCTTCCCAC TAAAAAATGA CCTAGTTCAT GTAATGTAAT TGATATTAAC 480 

GAAATTAAAA TCAATCGAAA ATAATAGATT AATGAATCAT TTGGAAAAAT TATCAATAAT 54 0 

AGGAACAATA ACGGAATCAA ACATAAATAT ATGACAGAGT TATTTAATAT TTTCAACATA 600 

AT ACCATTC C TCTAAACTAT TAG CTTC AAA AAGGCGTTTT TTCTCCCAAT ACATCTTCTC 660 

AAAATGTTCG GAATCATAAT TTTCTAAAAT T AATTTT A t G TCTGGTAAGC TCTTTCTTGA 72 0 

TAATCCGTTG TTTTGTACTT AATTTTCCCT TCAAGTACAT CTTCAATTTT ATAAGTTGCC 780 

TCCATCAACT GAGCCTCTGC AATATCTTTG AGTGAATTGG TAATTGAAAC TTGGTGTAAT 84 0 

ATCTGTCCCS CCATATATGA AAATATATCT CTAAGATATT CTGACACATT ATCAGAGCCG 900 

TTACTCTCAG CAACATCTAA TGTTACAACA AACTTTCCAG CTAATCGAAA AAGATGGCTC 960 

CACCCCCCAA TCCTTTCAAT AAAGTTTTTT GTGTCCACAG ATACGTTTTG TAAATATACA 1020 
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GGAGAAGAGA TAATTATAAT ATCAGACTCT AATAACTCTT TTTTTATAAC ACCTCCATCA 1080 

TCAGCATTAC TTTGCCTATC AATTCCTTTC TTAAACAACT CTTCTGAATC AGAATTAGAT 1140 

ATTTCT AG CT CTGAATTGAA AGGTGTCCTG AAAGATATAT CAACATTATT TCTACTAGAA 1200 

ATGATACTTG AAAGTCTCTT AGTATACTCT AAAGTCTTAG AGTTATGATT TCGCACTCCT 1260 

GCATATATAA ATATTTTATT CATTTTAATT CATCCTCTCA ATTTGAATTT AGTAGATTTT 1320 

TCAAGATAGT ATGGTACAAA AACAGACTTT TGTTGACTCA CATTATTACA TATGTTTTGT 1380 

ATTAAACCAA AATCAATACT ATTTTTGGAG T AATTTTG AT TTTAGTTTAA AATCATTTCT 1440 

ATAACAGTAG CATATACCTC AAGCCGTTTA GCAATTAGAA TAGAACTTTT CTTTATTATA 1500 

TTATTATCTC AACGAAAAGC TACACTATTA AAAATATTTT ATAGAATTAC AT ATT A AACT 1560 

AGTCAATCTT GGTATTTTTA TATTGCTTAA TGAGTGGACA CCTCTATTTT AGAAACAAAA 1620 

CTATAAATTA AGCTAGATTT CAAGTAATGA GGGGATAACT ATCTTTTTGT C ATTCTG AT T 1680 

CAGTGCCATA TACCTTAAAA AAGTATAAGC AATACCAGTC ACACCTGTAT ACAAAGAAAA 1740 

ATCTGGGAAA TTGCTTGTTT GGACGATACG ATACTCTCCT TCTTTTGATT TATTCATTAC 1800 

AACACTACAC AATAAAGACT CCAATTCCAT ACTAGTATCC ATTTCTTTCA TGTAGTCGAT 1860 

GTAAAAATTT ATTATGGCCA TACT TCCATG GC AAAATGT A TCATTATCTA AACTAGCTAC 1920 

AATTCCCTCT GGAACACTTT GGGGATGATT AACTAATGTC CCAAATTCTC CACTACACCA 1980 

CTTCAAAGAA TGAATTTTGA TTTTCTCCCT AGGAACTAGT TGTAAAATTA ATTCTTTATA 204 0 

TTTTTTAAGT CTTGTCACTT TATAAATATT TTTTAATGTA AAAATTACAC CTGATAGTCC 2100 

ATGGCCAAAA CTATATCCAA AATTACTATT ATCTCTCTCG CTTACATCAT TATATAGCGT 2160 

ATCACCTAAA CTTAATACTA GCCTTAGAAC ACGTTCCTTC TCTATTCCTC TCCTATAATA 2220 

TCTTACCAGT GTATTAATTA AAGGTACAAG ACCATTAATA TAGTCAGACT TGTTTGAAAC 2280 

ACTTGCAAAA TCAGTCTTTT CAAGCTCAGT TAAAACACTC TTTATATAAT TTAAGCATGC 2 340 

GAGAGTATTT GTATCGTAAT CCTCTATAAT GGATAGAACA ATGAAATATC CTATATCCCC 2400 

AGTTAAACCA AATGTGGTCT TAGATAAAGA AACAGATGGC GGAATTGCAG ATAACATTTT 2 4 60 

ATTGTACAGT TGAGTATATG ATGATTTATC TTTCAATAAT TTTACATAGT ACATAAACAG 2520 

TAATATTCCA GCTCTACCCC TATACATATC ATTmCCCGTT TGTTCAAGAC ACCATTTAGA 2580 

ACCTTTAAAA TTAACAGGTA TACTCCAAAT TGGATATTCC TC AT AAAT AT TATTAATAAC 264 0 

CAAAGAGTCT GCAATATTTT CTACTTCATT ATGCAGAATA GTAACTAAAC TTTCATTTGG 2700 

GAGTTTTTTT CT ATT AG AT A AGTTTAATTT ATATCCTTTT TTTCGCTGAT CAAAGCTTGG 2760 

AAAATAAATT TCAATGATAT CAAGTTGCTT TTCTAAATTT TCC AAATT AT TATTAGGTAA 2 820 
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ATATTTCATA AAATAGTCAT ATCCAGAAAA TTCATCTAGG GAAATAAAAT GATTTCCAAA 2880 

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 29 40 

CATTTGTTCA CATTCG CTTG CTACAATACC TTGATTAGAA AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCT CT AT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GC AT AG CT AT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AAC CAT AC AA 3120 

GTTTGCTTTT AATAGCATTT TAAATTCTTC TGTTTTATTT AACTCTTCAA ATATCAGATA 3180 

AAAATCCCT A AAACCTTTTT TGAAAT CTTT TATATACTTA TCAAATTCTA TATCACCATC 32 4 0 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 33 00 

ATCAGTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 33 42 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34S4 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 



GAGAAAAGAA 


TGTTAAAGAA 


AAATGATATT 


GTAGAAGTTG 


AAATTGTTGA 


TTTGAC CC AT 


60 


GAAGGGGCAG 


GAGTTGCCAA 


GGTAGATGGT 


TTGGTCTTTT 


TTGTAGAGAA 


TGCTTTACCG 


120 


AGTGAAAAAA 


TTCTCATGCG 


TGTCCTCAAG 


GTCAATAAAA 


AGATTGGCTT 


TGGAAAAGTT 


180 


GAAAAATACC 


TTGTCCAGTC 


ACCACACCGT 


AATCAAGATC 


TAGATTTGGC 


TTACCTGCGT 


240 


TCAGGAATCG 


CGGATTTAGG 


ACACCTTTCT 


TATCCAGAAC 


AG CT CAAGTT 


TAAAACCAAG 


300 


CAAGTCAAGG 


ACAGTCTCTA 


CAAGATTGCT 


GGAATTGCAG 


ATGTAGAAGT 


TGCTGAAACG 


360 


CTTCGTATGG 


AACATCCAGT 


CAAGTATCGC 


AATAAGGCGC 


AGGTGCCCGT 


TCGTCGAG T G 


420 


AATGGTGTCT 


TGGAAACAGG 


ATTTTTCCGT 


AAGAATTCCC 


ATAACCTCAT 


GCCCCTTGAA 


480 


GATTTCTTTA 


TCCAGGATCC 


TGTCATTGAC 


CAAGTCGTAG 


TAGCTCTTCG 


AGACCTGCTC 


540 


CGTCGTTTTG 


ATTTAAAACC 


TTATGACGAA 


AAGGAACAGT 


CTGGATTGAT 


TCGGAATCTT 


600 


GTGGTGCGTC 


GTGGTCACTA 


TTCAGGACAA 


ATCATGGTCG 


TTTTGGTGAC 


AAC TCGTCC A 


660 


AAAGTTTTTC 


GTGTTGACCA 


ATTGATTGAA 


CAAGTTATCA 


AGCAGTTCCC 


AGAGATTGTG 


720 


TCTGTCATGC 


AAAATATCAA 


CGACCAGAAT 


ACCAATGCGA 


TTTTTGGTAA GGAGTGGCGC 


780 


ACTCTTTATG 


GTCAAGACTA 


TATTACGGAC 


CAGATGTTGG 


GAAATGACTT 


CCAAATCGCT 


340 
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GGCCCAGCCT TTTACCAAGT CAATACTGAA ATCGCCGAGA AACTCTATCA AACAGCCATT 900 

GACTTTGCAG AGTTAAAAAA AGATGATGTG ATTATTGATG CCTATTCTGG TATTGGAACC 960 

ATTGGTTTAT CAGTCGCCAA GCATGTCAAA GAAGTCTACG GTGTTGAACT GATTCCAGAA 1020 

GCAGTAGAGA ATAGCCAGAA GAATGCTTCT TTGAACAAGA TTACTAATGC CCACTATGTC 1080 

TGTGACACGG CTGAAAATGC CATGAAGAAA TGGCTCAAGG AAGGTATTCA ACCAACCGTT 114 0 

ATCTTGGTTG ATCCTCCACG CAAGGGCTTG ACAGAAAGCT TTATCAAAGC AAGCGCCCAA 1200 

ACAGGAGCCG ATCGCATCGC CTATATCTCC TGCAATGTCG CAACCATGGC GCGTGATATT 12 60 

AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC AGCCGGTGGA TCTATTTCCT 1320 

CAAACGCATC ACGTCG AG AC GGTAGCACTT TTGT C C AAAC TCGATGTCGA TAAGCACATA 13 80 

AGTGTTGAAA TTGAGCTGCA TGAGATGGAT TTGACAAGTG CGGAGAGCAA AGCAACATAT 14 4 0 

GCTCAAATCA AAGAATATGT TTGGAATAAA TTTGAATTAA AAGTTTCGAC ATT AT AT ATT 1500 

GCACAGATAA AAAAGAAATG TGGAATAGAA TTACGAGAAC ATTACAACAA GTCTAAAAAG 1560 

GATAAACAAA TTATTCCACA GTGTACACCT GAAAAAGAAG AAGCCATCAT GGATGCTTTG 1620 

AGACACTTCA AAATG ATT T A ATAGAAAAGA ATGAC AGT AT ATGACTTTCT GCATTT ATT A 1680 

CATTCCTACT TGGT AT AG G A ACAGCTATTA TTCCTTTCTT GCAAGGTATC AATTAGAAAA 1740 

TAGGCTCAAT ATAAAGATTG ATAGGATCAT TTTTATATTT AAAGGAGCGT TGAAATGATT 1800 

GATAAAGGCA ACAAAAAATT TTAGGATAAA TTTGCTAAGT TGTATGCCTC TTTTATGAAA 18 60 

AAAGATAAAG AGGTTTATGA TAAAGTTTGT GAATATCTTA GTCCTCATTT GAATAAAGAT 1920 

ATGGAGGTGC TTGAACTTGC TTGTTGGTTT CGTGTCATAA CAGTTATAGA GGCAAATAGT 1980 

TATGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA AAACTCAAAA AATAAATAGT 204 0 

TGGTGTGCTG CTTACAATAT CCATTTTAAT AATGGATATT GTAAGCAGCA CCCCcAtGAA 2100 

TTTAAAGATT CTTTAAAGAG TCTTATTTTG TGATGAAAAT TTAATATGTA AATCTCAGAC 2160 

GATAGAAATT AAAAACTCTA TCGTCTTTTT TATACTCAAA ATTAGGAGGT AAAAATGGTA 2220 

AGGATAAGAG GTCCCACTTA AAACAATTTA TGGCAAAATA AGGACGGAAT AACACAACAA 22 80 

ATTCTCTAAA ACAAATCACT AAATCAATGT AAGATTGAAT GAAATCAATA TTTATGCTAT 2340 

AATTAAATAA ATTTAATGAA GAAAAAAAGA GGGATATTAT GGCACTTAAC TATAAACCAT 2400 

TATGGATACA GTTAGCAAAA AAAGGACTAA AGAAAACAGA TGTAATAGCT ATGGCAGGAC 24 60 

TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC AATTACATTT AAGAATTTAG 2S20 

AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT TATTAGTTTT GAAGATAATT 2 5 80 

TTAGTGACGA GGAATAGAAA ATGACTTTAA GGACAGAAGA TCAAGTT AGG GATTATGCAA 2 64 0 
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GAGAAGTATA GGCTTTAATG AAGTTGAAGA AAACAT CAAT CAAGGTACTG GTCAAATAAC 2700 

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAC CCAGATGGTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2 820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2 8 80 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 294 0 

GATAATTATT ATCGATACAA TAGACTTGAA ATATGTTTAA GGAGTTTTTA TGAAAaCAAA 3000 

TTTTTTCTAA TmGCTATTTT AGCTATGTGT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TT CTAAAG AG CATGCGCCTG ATAAAATAGT TTTAGATCAT 312 0 

GCTTTCGCTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTCTTG GATTTT C AAA AGCAAATTAC 32 40 

GGTGTAAGTG CTGATAAAGG AGTTTT AC C A TGGACAGAAG AAAAAATCAA AGAACTAAAT 3300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3 3 60 

AAACCAGATG TTATCTTAGC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 34 54 



(2) INFORMATION FOR SEQ 10 NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAG AAAAAAATGA TTTTAACAAG TCTAGCCAGC GTCGCTATCT TAGGGGCTGG 120 

TTTTGTTACG TCTCAGCCTA CTTTTCTAAG AGCAGAAGAA TCTCCAqAAG TTGTCGAAAA 180 

AT CTTC ATT A GAGAAGAAAT ATGAGG AAG C AAAAGCAAAA GCTGATACTG CCAAGAAAGA 2 40 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATGATCAGAA 3 00 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 3 60 

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 4 20 

TAAATCTGAC GCTGAATATC AGAAAAAATT AACAGAGGTC GACTCTAAAA TAGAGAAGGC 4 80 

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
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TGAACCAAAT GCGTTGGCTG AGACTAAGAA 
AGTAGCTAAG AGAAAATATG ATTATGCAAC 
AGAGGCTAAG GAACTTGAAA TTGAAAAACT 
AG T TGCT ACT GCTCAACATC AAGTAGATAA 
TGATGATGGC AC AG AAGTT A TAGAAGCTAA 
TAAACAAGCT GAGTTAGCAA AAAAACAAAC 
TCCTGAAGGT AAGACTCAGG ATGAATTAGA 
AAAAGCTGAT GAACTTCAAA ATAAAGTTGC 
AATATTACTT GGAGGGGCTG ATCCTGAAGA 
TGCTAAAAAA GCTGAGTTAG CAAAAAAACA 
TGATCCTGAA GGTAAGACTC AGGATGAATT 
TAAAAAAGCT GATGAACTTC AAAATAAAGT 
TGAAATATTA CTTGGAGGGG CTGATTCTGA 
AGCTACTAAA AAAGCTGAAT TGGAAAAAAC 
GTTAGGCCCT GATGGAGATG AAGAAGAAAC 
AGCTCCTGCA CCAAAACCAG AGCAACCAGC 
TGCACCAAAA CCAGAGCAAC CAGCTCCAGC 
AAAACCAGAG CAACCAGCTA AGCCGGAGAA 
ACCAGCCACT CCAAAAACAG GCTGGAAACA 
TGATGGTTCA ATGGCAAT AG GTTGGCTCCA 
TAACGGCGCT ATGGCAACAG GTTGGGTGAA 
ATCAGGTGCT ATGAAAGCAA GCCAATGGTT 
CAGCAATGGC GCTATGGCGA CAGGCTGGCT 
CGCTAATGGT GATATGGCGA CAGGATGGCT 
CGCTAATGGT GATATGGCGA CAGGATGGGC 
CGCTAACGGT GCTATGGCTA CAGGTTGGGC 
CGCTAACGGT TCAATGGCAA CAGGTTGGGT 
AGCATCAGGT GCTATGAAAG CAAGCCAATG 
CAATGGCTTA GGTGC CCTTG CAGTCAACAC 
TGGTGAATGG GTTTAAGCCG ATTAAATTAA 



1196 

AAAAGCAGAA GAAGCTAAAG CAGAAGAAAA 
TCTAAAGGTA GCACTAGCGA AGAAAGAAGT 
TCAATATCAA ATTTCTACTT TGGAACAAGA 
TTTGAAAAAA CTTCTTGCTG GTGCGGATCC 
ATTAAAAAAA G9AGAAGCTG AGCTAAACGC 
AGAACTTGAA AAACTTCTTG ACAGCCTTGA 
TAAAGAAGCA GAAGAAGCTG AGTTGGATAA 
TGATTTAGAA AAAGAAATTA GTAACCTTGA 
TGATACTGCT CCTCTTCAAA ATAAATTAGC 
AACAGAACTT GAAAAACTTC TTGACAGCCT 
AGATAAAGAA GCAGAAGAAG CTGAGTTGGA 
TGCTGATTTA GAAAAAGAAA TTAGTAACCT 
AGATGATACT GCTGCTCTTC AAAATAAATT 
TCAAAAAGAA TTAGATGCAG CTCTTAATGA 
TCCAGCGCCG GCTCCTCAAC CAGAGCAACC 
TCCAGCTCCA AAACCAGAGC AACCAGCTCC 
TCCAAAACCA GAGCAACCAG CTCCAGCTCC 
ACCAGCTGAA GAGCCTACTC AACCAGAAAA 
AGAAAACGGT ATGTGGTATT TCTACAATAC 
AAACAACGGT TCATGGTACT ACCTAAACGC 
AGATGGAGAT ACCTGGTACT ATCTTGAAGC 
CAAAGTATCA GATAAATGGT ACTATGTCAA 
CCAATACAAT GGCTCATGGT ACTACCTCAA 
CCAATACAAC GGTTCATGGT ATTACCTCAA 
TAAAGTCAAC GGTTCATGGT ACT AC CT AAA 
TAAAGTCAAC GGTTCATGGT ACTACCTAAA 
GAAAGATGGA GATACCTGGT ACTATCTTGA 
GTTCAAAGTA TC AG AT AAAT GGTACTATGT 
AACTGTAGAT GGCTATAAAG TCAATGCCAA 
ATCATGTTAA GAACATTTGA CATTTTAATT 



600 

660 

720 

780 

840 

900 

960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 

2340 * 
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TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATCTTC GTATTCTTTA GGTACCTATC 2 400 

TTATGATTTC AGGAAATGTC ATTAAAAAAA CGACTCATTT TCTCTAACCT GAAAAATAGA 2 460 

TTAGAGAAAA TGGGTTGTTT TATCTATTAT AGTTATTTGA ATGAAGmTAA GAAGAAGGTA 2520 

TACTCACATC ATTCACATAA TCTGTATATT GACTATAAGT TTTAAAAAAC AATTTTTAAG 2 580 

CTCTTCCTTG TCTTCTCTAA CCAAGCGTGT TATAATGAAT ACTGCTCAAG CGACCTTCAA 2 640 

TCGTGAAGCA CACACGACCT TCAATCGTGA ATAAACGAAT AGATGGGAGA CTTACCATGA 2700 

GTGATAACTC TAAAACACGT GTTGTCGTGG GGATGAGTGG TGCTCTTGAT TCGTCGGTGA 27 60 

CGGCTCTTTT GCTCAAGGAG C AGGG CT ACG ATGTGATCGG TATCTTCATG AAGAACTGGC 2 82 0 

ATGACACAGA TGAAAACGGC GTC TGTACGG CGACCGAAGA TT AC AAGG AT GTGGTTGCGG 2880 

TGGC AG AC C A GATTGGCATT CCCTACTACT CTGTCAATTT TGAAAAAGAG TACTGGGACC 294 0 

GCGTTTTTGA GTATTTCCTA GCGGAATACC GTGCAGGGCG CACGCCAAAT CCGGACGTTA 3000 

TGTGCAACAA GGAAATCAAG TTCAAGGCCT TTTTGGACTA TGCCATAACC TTGGGGGCAG 3060 

ACTATGTAGC GACTGGGCAT TATGCTCGAG TGGCGCGTGA TGAGGATGGT ACCGTTCACA 312 0 

TGCTTCGTGG CGTGGACAAT GGCAAGGATC AGACCTATTT CCTCAGCCAA CTTTCGCAAG 3180 

AACAACTTCA AAAAACCATG TTCCCACTAG GACATTTGGA AAAGCCTGAA GTACGCAGAC 3240 

TAGCAGAAGA AGCAGGCCTT TCGACTG CT A AGAAGAAAGA CTCGACAGGG ATTTGCTTTA 3 300 

TCGGAGAAAA GAACTTTAAA AACTTTCTCA GCAACTACCT GCCAGCTCAG CCTGGTCGCA 3360 

TGATGACTGT GGATGGTCGC GATATGGGCG AGCATGCAGG TCTTATGTAC TATACAATCG 3 420 

GTCAGCGTGG CGGACTCGGT ATCGGTGGGC AACACGGCGG TGACAATGCC CCTTGCTTCG 3480 

TTGTCGGAAA AGATCTAAGC AAGAATATTC TCTATGTACG AC AAGG ATT C TACCATGATT 3 54 0 

CGCTCATGTC AACTAGCCTA GAAGCCAGTC AAGTCCACTT TACTCGTGAA ATGCCAGAAG 3600 

AGTTTACGCT AGAATGTACG GCTAAATTCC GTTACCGTCA GCCTGACTCT AAGGTGACCG 3660 

TTCATGTCAA AGGAGAAAAG ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 3 72 0 

CAGGACAGGC AGTTGTCTTT TACGATGGCG GG 37 52 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3580 base pairs 
<B) TYPE: nucleic acid 
(C) STRAND EDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



TATTT AT ATT 


TTTTTATCTC 


TGGCATACTT 


TGATACCTTT 


TTAGACTTAA 


ACT CTTT AAT 


60 


AGTGCCTTTC 


CACCTCTTTT 


TATCTATAAA 


GATTCTCCTA 


CATCATAATT 


CATTTTTTTA 


120 


TTTAAACCTT 


TCTGTCTTAG 


TTTGTCTTTA 


TCTTCTTCAT 


ACCATTTTAA 


GATTGTCACA 


180 


TAGTGGTTTT 


GATAGGTCTT 


ACCACTGCTT 


TCCATGTATC 


TGGATAGTTT 


ATTTATCATT 


240 


ATATCTGTGT 


GTGAGTTTAA 


TTTTTCTTTT 


AGATTTTTAT 


ATTCTTCTTT 


GCTTAACCTT 


300 


ACATTTTTGA 


ATTCTCCATA 


AAAAATGGGG 


GTGGACTTTT 


TATCTATCTC 


TCCCTCTCTC 


360 


T CTTT ATCT A 


TCTCTATATC 


TTTCCATGTA 


ATTCCAATCT 


GGAGTACCTC 


TACTGTCTAT 


420 


CGGTAATTTA 


ATTTTGATAT 


CTGGCAATAC 


TGTGC TAG AT 


ATTTGATCTT 


TATATTCAGT 


480 


ATTTTTTAAA 


GCTTGCCTAA 


TAATTGAAGT 


TAAATAGAAT 


GCTACTTCTT 


TATTCAATTC 


540 


TTTATTTTTT 


AATTTTAAAC 


AATGAATTTT 


CATATCTAGG 


CTTGCTTTAT 


ATTT ATG AT A 


600 


AAAGACTGCT 


CCTAAAAATG 


AAACAGATAT 


AAAATTTTCA 


AAAACTCTAT 


AATTTTTATC 


660 


ATCT AT ATCT 


TCGTAGTAAC 


CTAAGATACC 


ATTGTCAATA 


TTTGTAGCAC 


TAATTCTAGG 


720 


AGTTTTTCCA 


TCGAGTAAAT 


ATCTTTTTGG 


AATAGATGAG 


CCTGTTGGTA 


CTTAACTCGA 


780 


TTTCCCCTTT 


TTTTCGGTAA 


TAAATATTTC 


TTTTTATTTT 


GTTGTCTGAT 


ATTTTTCCTA 


840 


CCTGTCCTTT 


GTAGGATGAG 


TATTTTCTAG 


ATTTTCyTGA 


ATAACTTTTT 


ACTTGAAGTT 


900 


TTAGCTTTTG 


AACTAGTCGT 


TGTACTTTCT 


TTTTGTTTAT 


TATCAGTCCT 


GATCTTTTTA 


960 


ATATTGCTGT 


TATTCTCTAT 


ATCCTATTTT 


TCATTCATGA 


TATTCTTTTA 


CTAATTTTAT 


1020 


CTTAAATTCT 


GTGCTGTATT 


TGCCATTAAA 


AAACTGACCT 


CCTTTAGTTA 


GTTTTTTGGC 


1080 


CTAACTTTTG 


AGGCTCAGTT 


CAAAATTTGC 


GACTTTTAAA 


TGAATTCCAA 


TATTCAATTA 


1140 


TTAAGAGTTA 


ACATGGTGCT 


TGCCAATAGG 


AATCATTAGA 


GGCGAATTGG 


AAATAGGGTC 


1200 


ACGTATAATT 


TTTGCTTCAA 


GATTAAAGAT 


ATCTTTAACT 


AGTTTATCAT 


TTAGTATATC 


1260 


TTCAGGCTTT 


CCCTCTGCAA 


CAAGTTTACC 


TTCTTTAATT 


GCAAATAGGT 


AATCAGCGTA 


1320 


TCTTGCTGTT 


AGATTTATAT 


CGTGCAAAAT 


CATGCAAATG 


GTTGTCTTAT 


ATTTTTGGTT 


1380 


TAGATCAGTC 


AAGAGGTCTA 


ATAGTTCTAT 


TTG AT ATG AG 


AT AT CCAAGT 


AAGTAGTTGG 


1440 


CTCATCTAAA 


AGTAGGATAC 


TTGTATCTTG 


GGCTAGGGCT 


AGAGCTATCC 


ATACTCTTTG 


1500 


CCTTTGACCC 


CCAGAAAGTT 


CTTCAACTAG 


GTTATTTGCT 


AGATCTTCAA 


CATTGGCCTT 


1560 


AACCATTGAT 


CTGTTTATTA 


TTTCAAGGTC 


ATCTTTTCCA 


AGACTCTTAA 


AAGGCTTTCT 


1620 


GTAGGGGAAA 


CGACCACGGC 


TTACAAGATC 


AGCTACTGTT 


ATTGATTCAG 


GGATTATTGG 


1680 


AGATTGAGGT 


AATATAGCTA 


TGTGTTTTGC 


TAAATCTTTT 


1 TCTTTATAAG 


; AATTAATTGA 


1740 
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TTTATTATCA AGCAATACTT CTCCCTCTAA T GGCTTT AT A AGTCGAGACA AG GTTTT AAT 1800 

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT G AT ATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA G ATT ATTTG A 19 20 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT A CTGGAT ATC TAGCTGGTAA AATATTTTGA 2040 

GAGAATATGT CTGATAACAA AACTAGTAAA ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCAGCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2220 

GAAAGCTCGG G ATTTG CTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 2400 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2 4 60 

AAACCAATAC CT AAT ATT AT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGC T AGT AAA 2520 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 2580 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2640 

ATT AT ATC AG GACTTCCAAG AGGATTTCTT AACATAGTTT GAAAGATAAA TCCTGCCAAT 2700 

CCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 27 60 

GAAAAACTAG CTCCAGGAAC AG TTTCACT A TTTAAGACTT TAATCAAAGT TGAAAAAGAA 28 2 0 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TG ATT ATT AT TAATAAAAAT 2880 

AGTGAGGAAA ATAGTCTTAT TCT ATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 294 0 

AT T AGTTAT T AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TT G C AGT AAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3 060 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 312 0 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGAT CTT A 32 40 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACC C AT AG CT AAAGCGTTTA 3 300 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 33 60 

ATATAGAAGA TAATGTAGCT GCTCCAAGG C TACCTATTTG CCAAAATCTA AATTTGTCTA 34 20 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TT AC AAAACT GCTTAAAGCC ATACTAACAC 3 4 80 
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AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCG TT A CAGCAATCGC 3540 

GTATACAAAA AT T GCACTT A CTAAGCCACC AATGATTGCG 3 580 

(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

<Ai LENGTH: 11378 base pairs 
(B) TYPE: nucleic acid 
<C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CCAAATTGCT CCACAATTAT TATGGAGTCG TCGTTTGGCA GATGGGCGTG AT ATGTG TGC 60 

TCAAGAATGG TTGACAGGCA AGATATTGAC CCCCTATGAT ATGAATCGTA AGCAAAT CGT 120 

CAATATTTTA ACCCGTCTTC ATCGCTCACG TCCGTTGATG ACACAATTGA GTCGTTTGGG 180 

CTATGCCATG GAAAC AC CTG TAGATTTACT AC AGTCTTGG CAGGAAACGG CTCCAGATGC 24 0 

TTTGCGTAAA AATCATTTTA TCAGTGAAGT GATCGCTGAT TTACGTCAGA CTATTCCAGG 300 

ATTTAGAGAG GACCATGCGA CCATTGTCCA TGGAGATGTA CGACATAGTA ATTGGATTGA 360 

GACAGATAGT GGCTTGATTT ATTTAGTAGA TTGGGATTCG GTT CGCTTG A CCGATCGCAT 420 

GTTTGATGTG GCCCATATGC TCTGC CATT A TATTTCAGAA CAT CAGTGG A AGGAATGGTT 480 

GACCTACTAC GGTTACAAGT ACAATCAAAC GGTATTAAGT AAATTGTATT GGTATGGTCA 540 

ATTGTCTTAT TTGAGTCAGA TTTCCAAGTA TTATATGAAC CAAGATTTAG AAAATGTCAA 600 

TCGGGAGATT CATGGTTTGC GT CATTTCCG AGACAAGTAT GGAAAGAGAA GATGAGAGTT 660 

AGAAATCGTA AAGGGGCAAC AG AATT ACT A GAGGCAAATC CCCAGTATGT GGTCCTCAAT 720 

CCCTTGGAAG CCAAGGCAAA ATGGCGGGAC TTGTTTGGCA ATGATAATCC CATTCATGTG 780 

GAAGTTGGAA GTGGAAAGGG TGCCTTTGTT TCAGGTATGG CCAAGCAAAA CCCTGACATC 84 0 

AACTATATCG GGATTGATAT TCAAAAGTCT GTTTTG AG CT ACGCTTTGGA CAAGGTGCTT 900 

GAAGTTGGAG TGCCTAACAT CAAGCTCTTG TGGGTAGATG GTTCTGACTT AACTGACTAC 960 

TTTGAAGACG GTGAGATTGA TC GCTTGT AT CTGAACTTTT CAGATCCATG GCCGAAAAAA 1020 

CGCCATGAAA AGCGTCGTTT GACCTACAAG AC CTTCTTGG ATACCTTCAA ACGTATCTTC 1080 

CCTGAAAATG GAG AAATTC A TTTCAAGACG GATAACCGTG GCTTGTTTGA GTACAGTTTA 1140 

GTGAGCTTTT CTCAATATGG CATGAAACTC AATGGTGTCT GGTTAGATTT GCATGCCAGT 1200 

GATTTTGAAG GCAATGTCAT GACAGAATAC GAGCAAAAAT TCTCAAACAA GGGGCAAGTT 1260 

ATCTACCGAG TTGAGGCAGA ATTTTAAGAG ATAACCTAAA ATTAG GCTGT ACAAGTGCTT 1320 
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TTGCTTTACA TAAGTTGGCA AACGTGCTAT ACTG AT AGT A AGAATATGAA AAGTGAGGCG 1380 

GGGAAATATC TTCGCCTCTT GCTTATGAGG AGGTGG ACGC AATCGCAACA ATCGTAGAAT 144 0 

TAGTCAGAGA AGTTGTAGAA CCTGTCATAG AAGCTCCTTT TGAACTCGTG GATATCGAGT 1500 

ATCGAAAGAT TGGCAGTGAC ATGATTCTCA GTATTTTTGT AG AT AAACC C GAAGAATTAC 15 60 

CTTGAACGAC ACGGCAGACT TGACAGAAAT TATCAGTCCT GTCCTAG AC A CCATCAAGCC- 1620 

AGATCCCTTC CCAGAACAAT ATTTCCTAGA AATTACCAGT CCAGGTTTGG AACGTCCTTT 1680 

GAAAACCAAG GATGCCGTCG CTGGAGCGGT TGGAAAATAC ATCCATGTCG GGCTCT AC C A 1740 

AGCCATCGAT AAGCAAAAGG TCTTTGAAGG AACCTTGTTG GCCTTCGAAG AGGACGAGTT 1800 

GACTATGGAA TATATGGACA AGACGCGTAA GAAAACCGTC C AAATTC CAT ACAGTTTAGT 18 60 

ATC AAAAGC A CGTTTAGCAG TTAAATTATA GAAAAAGAAA GGATAGCTTT TGAGGATTCA 1920 

AAAGTGAAGA AAACATGAGT AAAGAAATGC TAGAGGCCTT CCGCATTTTG GAAGAAGACA 19 80 

AGGGAATCAA AAAAGAAGAT ATCATCGACG CAGTAGTAGA GTCGCTTCGT TCCGCTTATC 2 040 

GCAGACGCTA TGGTCAGTCA G AC AG CGT AG CTATTGACTT CAACGAAAAA ACAGGTGACT 2100 

TTACAGTTTA TACTGTCCGT GAAGTTGTTG ATGAAGTATT TGATAGCCGT TTGGAAATCA 2160 

GCTTGAAAGA TGCTCTTGCC ATTAATTCAG CTTATGAACT TGG AG AC AAA ATCAAGTTTG 2220 

AAGAAGCACC AGCTGAGTTT GGT CGTGTAG CAGCCCAATC TGCCAAACAA ACCATCATGG 2 2 80 

AAAAAATGCG CAAgCAAACA CGTGCCATCA CTTACAATAC TTACAAAGAA CATGAGCAAG 234 0 

AAATCATGTC TGGTACAGTA GAACGCTTTG ACAACCGCTT TATCTATGTC AACCTTGGTA 2400 

GCATCGAAGC CCAATTGTCA AAACAAGACC AAATTCCTGG AGAA G TTTTT GCTTCTCATG 24 60 

ATCGTATCGA AGTTTATGTT TACAAGGTTG AAGACAACCC TCGTGGTGTG AACGTCTTTG 2520 

TTAGCCGTAG TCATCCAGAA ATGATCAAAC GTTTAATGGA GCAAGAAATT CCAGAAGTTT 2580 

ATGATGGAAC TGTTGAAATC ATGAGCGTGG CTCGTGAAGC AGGTGACCGT ACGAAGGTTG 2 64 0 

CTGTTCGTAG CCACAATCCA AACGTGGATG CTATCGGTAC AATCGTTGGA CGTGGTGGTG 27 00 

CTAATATCAA GAAGATTACT AGCAAATTCC ACCCAGCTCG TTACGATGCT AAAAATGACC 27 60 

GCATGGTACC AATCGAAGAA AATATCG AT G TTATCGAGTG GGTAGCAGAT CCAGCTGAAT 2820 

TTATCTACAA TGCCATCGCT CCTGCTGAGG TTG AC CAAGT TATCTTTGAT GAAAACGACA 28 80 

GCAAACGTGC CTTGGTGGTT GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 2 940 

GACAAAACGT GCGCTTGGCG GCTCACTTGA CTGGTTACCG TATCGATATC AAGTCTGCTA 3000 

GCGAATTTGA AGCCATGGAA GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AAC GAT ACTG 3060 
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TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 3120 

AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTG ATTT GCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 330O 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTG ATCGC TTATGTGGAT CACAAAGTGA 3 360 

AAAGAAGAGA GTTGGGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 3420 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 3480 

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3 540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3 600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT GTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 3780 

GGATGTGAAA AGCCACTCAT CAAGTGTGGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 384 0 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGT AAGTG C 3900 

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4020 

AGCTGT AG CC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4 080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 4 200 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCGTCG 4 2 60 

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 4 380 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4 4 40 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA AC CT AAC AAA CGCAAGGAAC CAGAGGAAAT 4 500 

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA GTTCAAGCAG TGGTTGAAGT 4 560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4 620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGGTCCT AGAAAACAAC AAAAGAATCG 4 680 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4 800 

ATTCCATGAA TTGCCAACAG AATTTGAATA TACAGATGGT ATGACCGTTG CGGAAATCGC 4860 



WO 98/18931 



PCT7US97/19588 



1203 



AAAACGTATC 


AAACGTGAAC 


CAGCTGAAAT 


TGTTAAGAAA 


CTTTTCATGA 


TGGGTG TC AT 


4920 


GGCCACACAA 


AACCAATCCT 


TGGATGGGGA 


AACAATTGAA 


CTCCTCATGG 


TGGATTACGG 


4980 


TATCGAAGCC 


AAACAAAAGG 


TTGAAGTGGA 


TAATGCTGAC 


ATCGAACGTT 


TCTTTGTCGA 


5040 


AGATGGTTAT 


CTCAATGAAG 


ATGAATTGGT 


TGAGCGTCCA 


CCAGTTGTTA 


CTATCATGGG 


5100 


ACACGTTGAC 


CACGGTAAAA 


CAACCCTTTT 


GGATACTCTT 


CGTAACTCAC 


GTGTTGCGAC 


5160 


AGGTGAAGCA 


GGTGGTATTA 


CTCAGCATAT 


CGGTGCCTAC 


CAAATCGTGG 


AAAATGGTAA 


5220 


GAAGATTACC 


TTCCTTGATA 


CACCAGGACA 


CGCGGCCTTT 


ACATCAATGC 


GTGCGCGTGG 


5280 


TGCTTCTGTT 


ACCGATATTA 


CGATCTTGGT 


CGTAGCGGCA 


GATGACGGGG 


TTATGCCTCA 


5340 


GACTATTGAA 


GCCATCAACC 


ACTCAAAAGC 


AGCTAACGTT 


CCAATCATCG 


TAGCTATTAA 


5400 


CAAGATTGAT 


AAACCAGGTG 


CTAACCCAGA 


ACGCGTTATC 


GGTGAATTGG 


CAGAGCATGG 


5460 


TGTGATGTCA 


ACTGCTT GGG 


GTG GAG ATT C 


TGAATTTGTT 


GAAATTTCGG 


CTAAATTCAA 


5520 


CCAAAATATC 


GAAGAATTGT 


TGGAAACAGT 


CCTTCTTGTG 


GCTGAAATCC 


AAGAACTCAA 


5580 


AGCAGACCCA 


ACAGTTCGTG 


CGATCGGTAC 


GGTTATCGAA 


GCGCGCTTGG 


ATAAAGGAAA 


5640 


AGGTGCGGTC 


GCAACCCTTC 


TTGTACAACA 


AGGTACCTTG 


AATGTTCAAG 


ACCCAATCGT 


5700 


TGTCGGAAAT 


ACcTTCGGTC 


GTGTCCGTGC 


TATGACCAAC 


G AC CTTGGTC 


GTCGTGTTAA 


5760 


AGTTGCTGGA 


CCATCAACAC 


CAGTCTCTAT 


CACAGGTTTG 


AACGAAGCAC 


CGATGGCGGG 


5820 


TGACCACTTT 


GCCGTTTACG 


AGGATGAAAA 


ATCTGCGCGT 


GCAGCAGGTG 


AAGAGCGTGC 


S880 


CAAACGTGCC 


CTCATGAAAC 


AACGTCAAGC 


TACCCAACGT 


GTTAGCCTTG 


AAAACCTCTT 


5940 


TGATACCCTT 


AAAGCTGGGG 


AACTCAAATC 


TGTTAATGTT 


ATCATCAAGG 


CTGATGTACA 


6000 


AGGTTCTGTT 


GAAGCCCTTT 


CTG CCTC ACT 


TCAAAAGATT 


GACGTGGAAG 


GTGTCAAAGT 


6060 


GACTATCGTC 


CACTCAGCGG 


TCGGTGCTAT 


CAACGAATCA 


GACGTGACCC 


TTGCCGAAGC 


6120 


TTCAAATGCC 


TTTATCGTTG 


GTTTCAACGT 


ACGCCCTACA 


CCACAAGCTC 


GT CAACAAGC 


6180 


AGAAGCTGAC 


G ATGTGG AAA 


TCCGTCTTCA 


CAGCATTATC 


TACAAGGTTA 


TCGAAGAGAT 


6240 


GGAAGAAGCT 


ATGAAAGGGA 


TGCTTGATCC 


AGAATTTGAA 


GAAAAAGTTA 


TTGGTGAAGC 


6300 


GGTTATCCGT 


GAAACCTTCA 


AGGTGTCTAA 


AGTGGGAACT 


ATCGGTGGAT 


TTATGGTTAT 


6360 


CAACGGTAAG 


GTTGCCCGTG 


ACTCTAAAGT 


CCGTGTTATC 


CGTGATGGTG 


TCGTTATCTA 


6420 


TGATGCTGAA 


CTCGCAAGCT 


TGAAACACTA 


TAAAGACGAC 


GTGAAAGAAG 


TGACAAACGG 


6480 


TCGTGAAGGT 


GGATTGATGA 


TCGACGGCTA 


CAATGATATT 


AAGATGGATG 


ATGTGATTGA 


6540 


GGCGTATGTC 


ATGGAAGAAA 


TCAAGAGATA 


AGATTTTTTG 


CTCCTTTCTT 


AGGTGGTGAG 


6600 
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GGACGCAAGC 


AAACCGATGG 


TTTCATTGCT 


TATTTTTGAG 


CCTAGGGTCT 


CAAAAATCCC 


6660 


CTGTGATGGG 


ACTGATAAAT 


CAGTTCCATC 


ACTTTCACCA 


CGGCG AAAG A 


AGCAGATGAC 


6720 


TTCAAATTGA 


ACTTCGTTTC 


AATTTAAACT 


GAAAATCAAG 


AAGTTTAAAA 


TAGCT AGGTC 


6780 


TGCTGGCCTA 


GCTTTTGGTT 


CAAAGTAGAG 


AAAGGAATAT 


CATGG C AAAT 


CATTTCCGTA 


6840 


CAGATCGTGT 


GGGCATGGAA 


ATCAAGCGTG 


AAGTCAATGA 


GATTTTGCAA 


AAGAAAGTCC 


6900 


GTGATCCACG 


TGTCCAAGGT 


GTGACCATCA 


TAGATGTTCA 


GATGCTGGGT 


GACTTGTCTG 


6960 


TTGCCAAGGT 


TTATTACACC 


ATTTTGAGTA 


ACCTTGCTTC 


GGATAACCAA 


AAAGCCCAAA 


7020 


TCGGGCTTGA 


AAAAGCAACT 


GGTACCATCA 


AACGTGAACT 


TGGTCGCAAT 


TTGAAATTGT 


7080 


ACAAAATCCC 


AGATTTGACC 


TTCGTCAAAG 


ACGAGTCCAT 


CGAGTATGGA 


AACAAGATTG 


7140 


ACGAGATGCT 


ACGCAATCTG 


GATAAGAACT 


AAAGAAGAGG 


GGTTGCCCCT 


CTTTTTTGGT 


7200 


GGAGGAAAAT 


AGGTTGAATT 


TGAAATGGAA 


AAAT ATTCTT 


TTATAATAGA 


TTGAAACTAG 


7260 


AATAGTACGC 


CTCTACTTCT 


AAAATATTGT 


T AGAAATC G A 


TTTGACTGTC 


CTGATCGATT 


7320 


TGTCCTGTTC 


TTGTTTCATT 


TT AAT AT AAA 


AAAGGGATTC 


TGTATTTTTT 


AATGTTATCT 


7380 


AAT T AGAAAA 


TGCTTTTTTT 


GTAGGAAATA 


T AATATGAT A 


AGGTGCAAAA 


AAGAAATAAG 


7440 


GAGTTTGTAT 


ATGGCTGAAC 


AAGACTTAGC 


TATGCAAGTA 


TTGCAACAAG 


TGGTGAAACT 


7500 


ACCTGTTGTT 


AAGGTTGATC 


GTTCGAAATT 


TTTAGTGGAT 


AAGTTTTCCA 


AAGAATTGGA 


7560 


TCCAAAAGAT 


ATTCCTACCT 


TATTGGAACA 


AGGTC CAACG 


ACTCTTCTAT 


CTCAAGAAAT 


7620 


ATTAGATCGT 


GTAGCTAATG 


CTTGTATTCG 


GGACAATGTA 


TTATTAGCGA 


GTGGGACTTC 


7680 


TGTTTTGGCA 


GGATTACCTG 


GAGGGCTTGC 


TATGGCAATT 


ACCATTCCAG 


CTGATGTGGC 


7740 


TCAATTTTAT 


GCTTTCTCTC 


TGAAATTGGC 


TCAAGAATTA 


GGTTATATTT 


ATGGTTATGA 


7800 


GG AT CTTTGG 


GCTTCACGAG 


AGGAGTTGAG 


TGAAGATGCT 


CAAAATACCC 


TCTTGCTTTA 


7860 


TCTAGGCGTA 


ATGTTAGGCG 


TG AAT GG AAC 


CGCTGCTTTG 


CTACGTGTTG 


GTAGTATAAC 


7920 


AATTG C C AAA 


CAGGTAATGA 


AAATAGTGCC 


TAATAAAGCT 


TTAACAAAGA 


CGCTTTGGTA 


7980 


CCCTATTTTG 


AAAAAAGTCT 


TAAAAATATT 


T GGTGTG AAT 


CTTACCAAGG 


GAGGGTTGGC 


8040 


CAAAGGAATG 


GGGAAATTTA 


TTCCTATCTT 


GGGTGGTATC 


ATTTCAGGTG 


GTTTAACCTT 


8100 


TGCAACTATG 


AAACCAATGG 


GGGAAAGCTT 


GCAGAAAGAA 


TTATCCAAGC 


TAGTCAACTA 


8160 


TAGTGAAGTT 


CAATATCAAG 


AAGATGTTGA 


AACAATCCGA 


AAAGAGGCTG 


AAATCATCAA 


8220 


AGGAGAGTAA 


TATGAATCCT 


ATCAAAGCTT 


TTGCTAAAAT 


TTATGGTAAT 


TACTTTTTGA 


8230 


CCGTGCAAGG 


TGTAAAAGTG 


ATGAAAACGA 


TAAAGAAAGC 


TGACCATGTC 


GTTGTTGGTC 


8340 


TGGGGAAACT 


TTTTATTGCC 


GACAAGTTAA 


TGGATACGGC 


TCGGTGGCTC 


ATTAAGCCAG 


8400 
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AGGAGAGAGA 


ATGAAATTTT 


TTTGGTCTTC 


TTGCTATTCT 


TTTTATCAAA 


CCGATTATTG 


8460 


GGATTGTGAA 


ATTCTTTTGG 


ATGATCATCT 


CTTTTGCAGT 


CCAATTGCTG 


TTTTACAAGA 


8520 


TAGTGTTTAA 


GATATTGGAT 


T GGCTCTTTA 


AACTTATCTA 


GATGGTAATC 


CAAGTTGCAG 


8580 


AGAACTAGCA 


GGAACTCCAC 


TGCTAGTTTT 


TTATTCTCTT 


TCCATATGGT 


ATAATATAAG 


8640 


CAGTAAAATC 


ATTTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAGCT 


TCACCTTGCA 


B700 


GTATATATGT 


TACTGACTTC 


GTCAGTTCTA 


TCCACAACCT 


CAAAACGGTG 


TTTTGAGCTG 


8760 


ACTTCGTCAG 


TTCTATCTAC 


AACCTCAAAA 


CACTGTTTTG 


AGCAACCTGC 


GGCTAGCTTC 


8820 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


TAGAACATAC 


AATGGAGGTC 


GTCATGGACA 


8880 


ATATCATCGA 


TGTGTCAATT 


CCTGTTGCAG 


AAGTGGTGGA 


CAAGCATCCA 


GAAGTCTTGG 


6940 


AAATTCTAGT 


GGAGTTGGGT 


TTTAAACCCC 


TTGCCAATCC 


CTTAATGCGC 


AATACAGTTG 


9000 


GTCGTAAAGT 


ATCACTTAAA 


CAGGGTTCTA 


AGCTAGCAGG 


AACTCCTATG 


GACAAGATTG 


9060 


T AC GC AC ACT 


GGAAGCGAAT 


GGCTACGAAG 


TGATTGGATT 


AGACTAATGA 


C AG ATGAACG 


9120 


GATTCATATC 


CTACGGGATA 


TTTTGTTAGA 


ATTGCACAAT 


GGCGCCTCTC 


CTGAGTCGGT 


9180 


TCAAGATCGC 


TTTGATGCGA 


CCTTTACGGG 


CGTGTCAGCC 


ATCGAGATTT 


CCCTTATGGA 


9240 


GCACGAGCTG 


ATGAACTCGG 


ATTCGGGCGT 


CACTTTTGAA 


GATGTTATGG 


AACTCTGTGA 


9300 


TGTCCATGCC 


AATCTTTTTA 


AAAATGCTAT 


CAAAGGTGTC 


GAAGTTTCAG 


ATACTGAGCA 


9360 


TCCAGGTCAC 


CCAGTTCGTG 


TCTTCAAAGA 


AGAAAATCTG 


GCTCTCCGTG 


CGGCCTTGAT 


9420 


TCGCATTCGT 


AGATTGTTAG 


ATACCTATGA 


GTCTATGGAA 


GACGAGGAAA 


TGCTGGCGGA 


9480 


GATGC GTAAG 


GGTTTGGTGC 


GTCAGATGGG 


ACTTGTGGGT 


CAATTTGACA 


TCCATTACCA 


9540 


ACGTAAGGAA 


GAACTCTTCT 


TTCCTATCAT 


GGAGCGCTAT 


GGACACGATT 


CACCTCCCAA 


9600 


AGTTATGTGG 


GGAGTGGATG 


ATCAGATTAG 


GGAACTCTTT 


CAAACAGCTC 


TAACGACAGC 


9660 






CAAIT AGC AG 


TGTAAAGGAA 


GCTTTTGAAG 


CTTTTGCGAC 


9720 


AGAGTTTGAA 


AGTATGATTT 


TCAAGGAAGA 


GTCCATCCTC 


CTCATGATTC 


TCCTTGAGTC 


9780 


TTTTACTCAG 


GATGACTGGC 


TTCAGATTGC 


GGAGGAGAGC 


GATGCCTATG 


GCTATGCCAT 


9840 


CATCCGTCCG 


TCAGAGAAAT 


GGGTGCCAGA 


ACGACAGAGC 


TTTATTGAGG 


AAAAGATTGC 


9900 


AGAGGAGCCT 


GTACAGCTAG 


ATACGGCAGA 


AGGTCAAGTT 


CAACAAGTCA 


TAGATACGCC 


9960 


AGAAGGCCAT 


TTTACCATTA 


CCTTTACCCC 


TAAGGAAAAG 


GAAGCTGTGC 


TGGACCGCCA 


10020 


TAGTCAACAG 


GCTTTTGGTA 


ATGGCTATCT 


TTCAGTCGAG 


CAGGCCAATC 


TCATCCTCAA 


10080 


TCATCTCCCT 


ATGGAGATTA 


CCTTTGTCAA 


TAAAGAAGAT 


ATTTTCCAGT 


ATTACAATGA 


1014O 
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CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACGCCG TCCCAAGTCG GGCGCAATGT 10200 

CCAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATCA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTCGACTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440 

AGGAGAAAAA ATGAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10690 

CTTGTTTGGT GAGCGAAATT ATT AAA CGAG AAAGATTCTT GATTT T TCAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 108 60 

TTTTCAAGTA CGGTGACTAA CGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 1104 0 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AG CCATT AGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4156 base pairs 
(3) TYPE: nucleic acid 
( C > STRANDEDNES S : doub 1 e 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 211: 

CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTG ACCTTT A TGCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180 
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ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240 

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300 

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCG CTGC TAATATCATT AAAGAAGCCA 360 

AACATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420 

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480 

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540 

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600 

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660 

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 7 20 

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780 

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840 

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG T G AC TTGGGT GTTGAATTTG 900 

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960 

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020 

TAGTAGTCGT TGTAG CCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080 

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAC AGGTCACTTG GAAGCAGGGA 1140 

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGCTTG GTCAAATATC GGTGCTGCTA 1200 

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 1260 

ACTTT AC CAA . GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 1320 

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 13 80 

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440 

TC C AAGG ACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500 

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560 

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT .1620 

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680 

TCGGTGCTAT CGGCGTTGCT AT CGCTCTT A TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 1740 

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800 

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860 

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920 
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AAACTTGGCC 


TGGGCTT AT A 


CACTCATTCC 


AGCTATCAAA 


AAACTCT AT A 


CTAAAAAAGA 


1980 


AG AT CAAATC 


GCTGCTCTTG 


AGCGTCACCT 


TGAGTTCTTC 


AACACTCATC 


CATACGTAGC 


2040 


TGCTCCAGTC 


ATGGGGGTTA 


CT CTTGCGCT 


TGAAGAAGAA 


CGTGCTAACG 


GTGTGGAAAT 


2100 


CGATGACGCT 


GCTATCCAAG 


GGGTTAAAAT 


CGGTATGATG 


GGACCTCTTG 


CTGGTATCGG 


2160 


TGACCCAGTA 


TTCTGGTTTA 


CAGTACGCCC 


AATCCTTGGA 


TereTCGGTG 


CTTCACTTGC 


2220 


CCTTACTGGC 


AATATCTTGG 


GGCCACTCCT 


CTTCTTTGTT 


GCATGGAACT 


TGATTCGTAT 


2280 


GTCATTCTTG 


TGGTATGTTC 


AAGAGATTGG 


ATACAAGGCT 


GGATCAGAAA 


TCACTAAAGA 


2340 


TATGTCTGGT 


GGTATCCTTC 


AAGATATCAC 


TAAAGGAGCT 


TCT AT CCTTG 


GGATGTTCAT 


2400 


TCTTGCTGTC 


CTTGTTCAAC 


GCTGGGTAAA 


TATTAAATTT 


GCTTTCGATG 


TTTCTAAAGT 


2460 


TCAACTAGAT 


GAAAAGGCTT 


ATATCCATTG 


GGATAAATTG 


CCAGAAGGGT 


CTAAAGGTAT 


2520 


CCAAGAAGCA 


TTCGCACAAG 


TAGGACAAGG 


ATTGTCTCAA 


ACTCCTGAAA 


AAGTTACTAC 


2580 


TTTCCAACAA 


AACTTGGATA 


TGTTGATTCC 


TGGATTATCA 


GGACTACTCC 


TT AC TTT ACT 


2640 


TTGCATGTAC 


TTACTTAAGA 


AAAAAGTATC 


TCCAATCACT 


ATTATCCTTG 


CCCTCTTCGC 


2700 


AGTGGGTATT 


GTGGCACATG 


TTCTTCACAT 


CATGTAATCA 


AGCAACTAAA 


AAGGAACCAG 


2760 


GTTCTAAAAT 


CTGATTCCTT 


TTTTCTATGC 


TTTTATTCAG 


CCAAGGCTCC 


CATTGGATCC 


2820 


CATGGTGCAA 


GTACGATTGG 


TTCTGCTCCA 


TAGGCAGCTT 


GTTCTTCTGC 


TGTCAGCAAT 


2880 


TCCTTACGAA 


CAACGATTTG 


GTATGTGTAT 


TCGTCCATCC 


AAGCGTCTGA 


GGCAACAAAG 


2940 


TAACCATCTG 


TACCGACCTT 


GTCTCCCCAT 


GAGTTTTCAA 


CCTTCCACTT 


GGTTGATTTA 


3000 


CCATTTTCGT 


CCAAGTCAAC 


ACCTGTCAAG 


ACCATGGCGT 


GGGTCATCAA 


GCTTTCACTA 


3060 


TAGTCCAAAC 


GTCCAGCCTT 


GTCTTGAGTA 


AGTTTAATGT 


CCATGCTTGA 


TTCAAAGTCA 


3120 


TAAACATCTG 


TCGCAAGGAT 


GCCAGCTTAC 


GGTTGCTGAG 


CTGGCCGACA 


TCAGAACCAA 


3180 


ACCAAACAGT 


CTCACCTGCT 


TGCATTTGGG 


CAATCGCCAA 


TTCTTTCAAG 


CGCTCCATTG 


3240 


GAACGTTGAT 


GTAGCGAACT 


GCACGGCTAC 


CAACCACATT 


CCCCAACATC 


TCAACTGTGT 


3300 


AAGATTTTCC 


GTAAGGTTTA 


TCAGCAGTTG 


GAGCATTGAT 


AACAGAAACG 


TAGTCTTCTA 


3360 


AAGGAAGATT 


GACATATTTC 


TTGTAAAACT 


CTTGTGGTGT 


GATTCCTTTT 


TCACTTTTGT 


3420 


AGTTGTTATC 


TTTATCGCGA 


TAAGCAAAGT 


CAAACTTGCG 


TGGTGGAAGT 


CCTAATGACA 


3480 


TAGCAAGAAA 


GTTAAAGATT 


TCTTGCAAGA 


GGTCTTCTTT 


CTTAGCTTGA 


ACAGTCGCTT 


3540 


GATCTGCACC 


AGAAACAAGC 


AAGTCACGCA 


AGATTTGAGC 


ATCTTGACGA 


AG C AATTT AT 


3600 


TAAGGATCGC 


ATTTAGCTCA 


CGACTGCTGC 


TAGATGAAAC 


AGACTCAGGA 


TAAACTGACT 


3660 


TAGGCACGAC 


ACCGTATTTT 


TCAAAGAGGG 


AAACGACCAT 


, ATCCCATTGA 


CCGCCATCTT 


3720 
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GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCGGCTAGTC AATTCTTGGT CTGAAGTCGC 3 7 80 

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3 84 0 

TTGTGACAAC TCAAAGTTCT CCAAT TTGT A TTGCGAGATG AGTTTGTGGC GGAAGGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCTTTCTGG TTAGTGACCT TCTCCTTGGT 3 9 60 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4 020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTG CTTCAT A 4 080 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 4156 



(2) INFORMATION FOR SEQ ID NO: 212: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3902 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(Di TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

AAAAACAACA AAATAAAACA AAAACAAAAA TATCGAGGTT TATTTTCAAA ACTTTCGATA 60 

TTTTTATTAA GTTATTATTT TGTTGTTTCT AGTTTACTTT TTGATGGTTA AGAGTGGTGG 120 

AGAATTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG CTAGCCGCAG GCTGTACTTG 180 

AGTACGGCAA GGCG AAGCTG ACGTGGTTTG AATTTGATTT TCGAAGAGTA TTAGTGCAAA 2 40 

CCGTAGTTGT AGTCATCATC TTGCATGGCT TCAACTTCGC CAAGAAGGTA ACCATTTCCG 300 

ACTTGAGAGA AGAAGTCATG GTTGGAAGTT CCTGTTGAAA TACCGTTCAT AACGATTGGG 3 60 

TTGACATCTT CAGCTGAATC TGGGAAAAGT GGATCTTGTC CCATGTTCAT GAGAGCTTTA 420 

TTGGCATTGT AGCGAAGGAA GGTTTTAACC TCTTCAGTCC AACCAACACC GTCATAAAGA 480 

CTCTCTGTGT AGCCTTCTTC ATTTTCATAA AGAGTATAGA GTAGGTCGTA CATCCATTCT 540 

TTGAGTTTTT CTTGCTCTTC TTCAGGTAAT TCATTGAAAC CAAGTTGGAA TTTGTAACCA 600 

ATGTAGGTTC CGTGAACAGA CTCGTCACGA ATAATCAATT TAATGATTTC TGCAACGTTG 660 

GCAAGTTTGT TGTTACCGAG ATAGTAGAGG GGAGTGAAGA AACCAGAGTA GAAGAGGAAG 7 20 

GTTTCGAGGA AGACGCTGGC AACTTTCTTT TCAAGTGGGC TGCCGTTTAG GTAGATTTCG 7 80 

TTGACAATCT CAGCCTTCTT TTGTAGGTAA GGATTGGTAT TGGTCCATTC GAAAATTTCT 84 0 

TCAATCTCAG CCTTAGTATT CAAGGTAGAA AAGATTGATG AGTAAGATTT AGCGTGGACA 900 
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GATTCCATAA ATTGGATGTT ATTGAAGACA GCTTCCTCAT GTGGTGTACG GATGTCTGCG 960 

CGAAGGGCTT GAACCCCAGT TTCAGATTGC ATAGTGTCAA GAAGGGTTAA ACCACCAAAA 1020 

ACTTTTCCGA CCAAGTCTTT CTCTTTGTTA GATAGCTTTC TCCAGTCATC CAAGTCGTTT 1080 

GATAAGGGAA TACGTGTATC GAGCCAAAAT TGCTCCGTCA GTTTTTCCCA AGTTGATTTG 1140 

TCGATGACAT CTTCGATGGC ATTCCAGTTA ATGGCTTTGT AGTAAGTTTC CATTTAAAAT 1200 

CTCTTTCTGT GTTTAGTATT GCGAACTCAC AATTATTTCT ACTTTACCAT AATTCTATAG 12 60 

GAGTATCGCA CAAAAAGTCG GAAGCCCGAC TTTTAAAATG TTACATAAAT TATGTTATGA 1320 

CATAGTACAT TTGATTTTAT CAGTGCTGCT TAGGGAAAAA TAGTGTTTCT ATGCTAGAAA 1380 

CTAAATCACA CAGCTTTCAC ATTGGTTGGC GCCGACTTCT CCACCGTCAT CTGTAAAGGT 144 0 

ACGGACGTAG TAGATAGACT TGATTCCCTT GTTAAAGGCA TAGTTACGAA GGATGGACAA 1500 

GTCACGTGTC GTTTGTTTAT TTTCCCTCTT CCATTCGTAA AGGCCTTTTG GAATGTCACT 1560 

GCGCATGAAG AGGGTGAGTG AAAGTCCTTG ATCCACGTGT TCAGTCGCAG CAGCGTAAAC 1620 

ATCGATGACT TTACGCATAT CCATATCGTA GGCAGAAGTG TAGTAAGGAA TGGTTTCTGT 1680 

AGACAAGCCA GC AG CAGGGT AATAGATTTT ACCAATTTTC TTCTCTTGGC GTTCTTCGAT 174 0 

ACGTTGCGTA ATCGGGTGGA T AGAAG CAG A AACGTCGTTG ATATAGCTGA TAGAACCATT 1800 

TGGCGCTACA GCAAGGCGAT TTTGGTGGTA AAGACCATCT TCTTGAACCT TGTCGCGAAG 1860 

TTCAGCCCAA TCAGCAACAC CAGGGATAAA GACATTTTTG AAGAGTTCTT TAACACGGTC 1920 

TGATGTTGGA ACAAATTCAC CAGTTACATA CTTGTCAAAG TAACTTCCGT TAGCATAGTC 19 80 

TGATTTTTCA AAGTTGTGGA AGGTAATACC ACGTTCACGT GCAATATTGT TTGACT CT AC 2040 

CAAGGTCCAG TAGTTCATAA GCATAAAGTA GATGCTTGTA AATTCAACAG ACTCAGGTGA 2100 

ACCATATTCA ATGAGTTGTT GGGCAAGGTA GCTGTGCAGT CCCATGGCAC CGAGACCAAA 2160 

GGTGTGGGCT TGGCTATTTC CATGGTCAAT CCTTGGTACA GCT ACG AT AT GTGAACTATC 2 22 0 

TGTAACGAAA GTAAGGGCAC GAACCATAGC ACGGATAGAA CGACCAAAAT CAGGTGAAGT 2280 

CAT CATGTTA ACCACGTTGG TTGAACCCAG GTTACATGAA ACATCTCTTC CCATTTGAAG 2340 

GAATTCTTGA GCATCCTTGA TCAAGCTTGG TTCTTGAACT TGAAGAATCT CAGAACACAA 2400 

GTTACTCATG ATAATCTTTC CATCAACAGG ATTTGCACGG TTAGCCGTAT CGATGTTGAC 2460 

TACATAAGGA TAGCCAGACT CTTGTTGCAA TTTAGAGATT TCAGTTTCCA AATCCCGCGC 252 0 

CTTGATTTTT GTCTTGCGAA TATTTGGATT TGCGACCAAT T CAT CGT AT T TTTCAGTAAT 2580 

GTCGATGTAA TTGAATGGCA CACCGTATTC TTTTTCTACA GAGTAAGGGC TGAAGAGGTA 264 0 

CATTTCTTCA TTTTTACGAG CCAATTCGTA GAATTTATCA GGTACTACAA CACCAAGTGA 2700 
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TAGAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 27 60 

GAT AT CTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2 82 0 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG C CTTTG AT AG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3 240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3 3 00 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 3 3 60 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3 4 20 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3 4 80 

A CGGTT AATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 3 54 0 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCC CTAA AAGCAGTTTC 3 600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660 

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AAC CGAGCTC TTTAACTTGA 3 720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3 84 0 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 390 0 

GG 3902 

(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 56 base pairs 
(8) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120 
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CAATTGAGGT TGTTTACATA AAACTCTAT A TCAGAGAAGC CTGATATAGA GTTTTTTCTT 180 

G CT AGTTTT A GCATTTTTTT GTAAAATAGA AAAAGTGAAG AGAGGTATGA AATGAGCAAG 24 0 

AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGG TAAAGACAGT 300 

TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAG AAATTGC CGAATTAGAC 3 60 

GGACAATTTG CC ATT AT AAA GAATGGGAAT GTCGATAGTT TTTATAAAAA ATTGGAAAAA 4 20 

GCTGTGGAAA TTTTGATTGA AAATTATAAT TTAGCAAAAT AAGTCTTGTT TTTTTGAAAT 4 80 

TTTCATGATA TAATAGTCCA TGTTGATTGT AGGAGAGATA GCGAAGAGGC TAAACGCGGC 54 0 

GGACTGTAAA TCCGCCCCTT CGGGTTCGGG GGTTCGAATC CCTCTCTCTC CATTTCATTA 600 

ATGGGGTATA GCCAAGCGGT AAGGCAAGGG ACTTTGACTC CCTCATGCGT TGGTTCGAAT 6 60 

CCAGCTACCC CAGTTCTTAG GTAATAATCA AGATAGAAAG CAAAATATCT TAGGGTATTT 7 20 

TATTTTTATA ATTGAAAGAC G TGAATGAT A TGAACATGTC CTTCCGGGTG CTTAGGAAAA 7 80 

AAATTATAAG TATGTCAAGT TTAAGAAAAA CTTGATTGTT GCAGGATTTT TTAGATGAAC 84 0 

GAATTTGAAG ATTTGCTAAA TAGCGTTAGT CAAGTTGAGA CTGGTGATGT TGTTAGTGCT 900 

CAAGTATTGA CAGTTGATGC GACTCAAGCT AACGTTGCAA TCTCTGGAAC TGGTGTTGAA 9 60 

GGTGTCTTGA CTCTTCGCGA ATTGACAAAC GATCGTGATG CAGATATCAA TGACTTTGTT 102 0 

AAAGTAGGAG AAGTATTGGA TGTTCTTGTA CTTCGTCAAG TAGTTGGTAA AGATACTGAT 1080 

ACAGTTACAT ACCTTGTATC TAAAAAACGC CTTGAAGCTC GCAAAGCATG GGACAAACTT 1140 

GTTGGTCGCG AAGAAGAAGT TGTTACTGTT AAAGGAACGC GTGCCGTTAA AGGTGGACTT 1200 

TCAGTAGAAT TTGAAGGTGT TCGTGGATTT ATCCCAGCTT CAATGTTGGA TACTCGTTTC 12 60 

GTACGTAACG CTGAGCGTTT TGTAGGTCAA GAATTTGATA CTAAAATCAA AGAAGTTAAC 1320 

GCTAAAGAAA ACCGCTTCAT CCTTTCACGT CGTGAAGTTG TTGAAGCAGC TACTGCAGCA 1380 

GCTCGCCCTG AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTGG TAAAGTTGCT 1440 

CGTATCACAA GCTTCGGCGC TTTCGTCGAC CTTGGTGGTC TTGACGGATT GGTTCACTTG 1500 

ACTGAATTGT CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT TGGTGAAGAA 1560 

ATTGAAGTGA AAATCCTTGA TCTTAACGAA GAAGAAGGAC GTGTATCACT TTCACTTAAA 1620 

GCAACAGTAC CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 168 0 

GAAGGAACAG TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT GCCAGGTATC 1740 

GATGGACTTG TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC AAAAGAAGCT 1800 

CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA TGCAGAACGC 1860 

GTGTCACTTT CTATTAAAGG TCTTGAAGAA CGTCCAGCCC AAGAAGAAGG ACAAAAAGAA 1920 
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GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 

CATTGCGCTT G AT AAGTTT G ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 

GAAAAATAGG ATGAACCTGC TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT CATAGAGCTG ATAGTGGTGT TTCAGG 

(2) INFORMATION FOR SEC; ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10974 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGGAT 60 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AAC ACT AT AT 12 0 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 180 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTCAT AGAGAAATTT TTATGAATAA 240 

CTCAATAATT TTATAGTAAA TGATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 300 

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 3 60 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 4 20 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCGA TGCTTATTTG 480 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 540 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAAGTAAAT 600 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTGACTC CTATTTATAT TTATGAGACA 660 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 7 20 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 7 80 

ACTGTTATAG AAGT ACT AAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 840 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2456 
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TAAAGATACG 


ATGACGAGTG 


ACTTTTTCGA 


AGCTTGCTTC 


CAAAAATTCT 


TACTACCTAC 


900 


TTTAGATACA 


CCATCCCTTA 


TCATTATGGA 


CAATGCAAGG 


TTTCACAGAA 


TGAACATGTG 


960 


TAAGGAGCAG 


GGCATAGACT 


GTTACCACTT 


CCTACCTATT 


CACCCGAGTA 


TAATCCCATT 


1020 


GAGAAAATAT 


GGGCTTACAT 


C AAAAAC AT C 


TCAGAATAAT 


ATTGTCAAAT 


TACGATGCTT 


1080 


TTCTTGAGGC 


ACTTTTGTCC 


TATTCTTGTT 


TCAGCCGACT 


ATACTCCGTT 


ATTGGGCAGC 


1140 


TACGGAACAG 


TCGATGGGAC 


GATGGGGGGA 


CATAAAAAAA 


TCCTCCAGTT 


TTGTTTTTTA 


1200 


TAACAGTATA 


CTGGAGAATT 


GACAATCTCG 


CTACATACCT 


CGTTATAGCG 


CGGTTACTTA 


1260 


TTAGGCAGTT 


ACAAAACAAC 


TGTGAACAGA 


AAACATTCCA 


GAGTCAGACA 


AG A CTTTGG A 


1320 


ATGTTTTGGC 


T CT AT AATTT 


CTGTAGTGGG 


TAATCCCACC 


CCAGGAATTA 


TAGGGTCGTT 


1380 


TCTTGTAGAA 


AAAAAGC CCC 


ATATGACCTA 


TAATGAAAAG 


CGTCTAACCA 


ACT C ATT AG A 


1440 


AAGGGTTCAT 


ATGGAACAAC 


TTAAGAATAC 


CACAGATTTG 


CTCGGATTGG 


AAGACAAAAA 


1500 


TATCAAAATC 


TTGTCTGTTC 


TGAAATACCA 


AAC CCATCT A 


GTCGTTCAGG 


CAAAGTTGGA 


1560 


TTCCCCCGCT 


CCTCCTTGTC 


CTCATTGTCA 


AGGGAAGATG 


ATCAAATACG 


ACTTCCAGAA 


1620 


AGCCTCTAAA 


ATTCCGCTTC 


TCGACTGTCA 


GGGTTTACCC 


ACGGTACTGC 


ATCTCAAAAA 


1680 


GC GCCGCTTT 


CAGTGCAAGA 


ATTGCCTTAA 


GGTGGTCGTT 


TCTCAAACAT 


CCATTGTCAA 


1740 


GAAAAATTGC 


CAGATTTCCA 


ACATGGTGAG 


ACAAAAAATC 


GCTCAGCTCC 


TCCTTGAAAA 


1800 


GCAGTCTATG 


ACTGAGATTG 


CCCACAGATT 


GGCGGTCTCA 


ACTTCCACCG 


TCATCCGAAA 


1860 


ACTGAGGGAA 


TTTAAGTTTG 


AAACCGATTG 


GACCAAGTTG 


CCAAAAGTTA 


TGAGTTGGGA 


1920 


TGAGTATAGC 


TTCAAAAAGA 


GCAAAATGAG 


CTTCATTGCC 


CAAGATTTTG 


AGTCCAAATC 


1980 


CATCCTCGCA 


ATTTTAGACG 


GGCGAACTCA 


TGCGGTGATT 


CGAAACCATT 


TCCAACGCTA 


2040 


TCAGAGAGAG 


GTTCGGGAGC 


TGGTCGAGGT 


CATCACCATG 


G ACATGT AC A 


GCCCTTATTA 


2100 


TCGGCTCCCT 


AAGCAACTCT 


TTCCAAAGGC 


GAAGATTGTT 


CTTGACCGCT 


TCCACATTGT 


2160 


CCAACATCTG 


AGCCGAGCTA 


TGAACCGAGT 


ACGAATCCAA 


ATCATGAACC 


AATTTGACCG 


2220 


AAAATCCTTG 


GAGTATCGGG 


CGCTCAAGCG 


CTTTTGGAAC 


CCTCGCTTTT 


TCGTTTCTAG 


2280 


GCTCGGGCTA 


AATCAGTCCA 


CTGGACTGAT 


TT ACT AC AC C 


AGTATAGCTT 


CAAGCTCTGT 


2340 


CAGAAACGAT 


TCTATCAGCC 


CACGTTTCGA 


ATGCACTTAA 


C C CATCGGG A 


AGTACGAGAT 


2400 


AAGCTGCTTT 


CTTACTCTGA 


GGGATTACAG 


GTTC ACT AC G 


AACTCT AT C A 


ACTCCTGCTC 


2460 


TTTCATTTTC 


AAGAGAAGAA 


TGCCGACCAT 


TTCTTTGGAT 


TGATTGAGCA 


AGAACTGCCA 


2520 


ACGGTTCATC 


CGCTTTTTCA 


AACGGTCTTT 


TGGACTTTTT 


TAAGGGATAG 


AGATAAGATT 


2580 


AT CAACGCAC 


TTAAGCTGCC 


TTATTCCAAC 


GCTAAACTTG 


AAGCGACCAA 


TAATTTGATT 


2640. 
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AAGATTATCA 
TTGATGACTT 
TTCGCCTACC 
ACTTGGTTTG 
GGCATAGGCA 
GACATCTGTG 
TGCCACAACT 
AATGTCCATA 
CTTTTGCTCG 
CTCCAGATTA 
AGCCAGCTCG 
ATCATAAGCC 
CTCCGAAACA 
CTCCAAAGGC 
C AGTGG AAGT 
TCCGACCAAG 
TGCTTCTGCC 
TACTTCGGGC 
GGACAAGAGC 
AAATGCTAAA 
CTGGGTCATG 
ATCCTTGCGC 
ATTGAGCAAA 
GCAGGACTTC 
CATCATAAGC 
GAGTCCAAGG 
CAGGCTGTTG 
CTCGGTATGG 
AAAATCATGC 



AGCGCAAAGC CTTTGGTTTC 
TGAACATCAA AAAAGAGAGT 
CACTACACTT GACAAAGAGC 
GCATTGAGGT CCCAGCCTGC 
ATCATACCTG CATTGTCTCC 
ATTTCGGCTG CTAGGCGTTC 
AGGATTTTAA CAGGATATTT 
ACTGCTGCTT GGAAGGAAGC 
GCATTGTGAT GAAGATTGAT 
TCTTCCTTAA TCATGGCACG 
TCAATCTCAC GACCTGCAGG 
TCACCAACCG CATCATCACG 
TAAACCAACT CTGTGTGTCC 
TCCACACTCT GAGCTGCCAT 
CCGTGAGCCC AAGCAAAGGC 
CCTGGTCCGT AGGTAACCGC 
AATGCCTCCT CGATACAGGC 
ACTACGCCAC CAAAACGTTT 
TCATCGTCGT TTTTCAAGAC 
ATATATCTAT CCTTCATCTA 
GTAGTAGGCC TTTCGCTCAG 
TCGTTGATTT GACTGTCTGA 
CAAGGcTGAC GCAATCCCCT 
TGCTTCAAAA AGATTCTCCT 
CAATGCATAC CAAGTCTGGT 
ACTGACTAGG TAAACAGCTG 
TTGAATTCGC TTGATTTCTA 
TTCTTGAGCC AGTTTTCCTC 
AAGGAGTCTG CTTCCTTGTC 



1215 

CGGAACTTTA ACAATTTTAA 
ACGAATTTCG TACTCTCCAG 
CACTCTTTAT TCCATGGTAT 
GAAGTTTTCT TTGTTCCACT 
GCAGAGTCGC AGAGGGGGGA 
TCTGAGACCT TTATTGGCTG 
CTCCAAAGCC TTCTTGGTTT 
ACACAAATCT TCTGTAGACA 
AAAGGC AG AT TTCAAACCTC 
GGGGAAATCA TAAATATCCT 
ATAGGTCAAG CCCATGACAC 
GGTTTCCCCA ACAATCTTAT 
GCCGCTGACC AAGAGGGCTA 
GAGCTGCCCA GCCATGTGAT 
CTTGGCAGCT GACAAACCAA 
AACAGCTGTC ACGTCCTCTT 
TGTAATGACC TCGACATGGT 
GTGACTCTCA ATTTGACTAG 
GGCG AC ACT G GTCTCATCAC 
TTTCTCTCTT CATGATAATG 
CGATAACTGT CATCTTTTCT 
CTTCGAGGAA AATTTCCTTG 
GACCCTGATA AGCTCCTTTG 
GCACAG CT AG AAATCCAATC 
CTTGGGACAG ATCTGCTTGG 
CCATAACAGC GTAGATGGCT 
TCATAGGCGT TTAATGTAAG 
AGCCTCGACT CGTTTGAGGT 
CCAGGCCAAA AGAGCTAGAT 



AAAACGGATT 2 700 

ATTGCAGCTT 27 60 

CAAAGGCAAG 2 82 0 

CGCTGACGCT 2 880 

TGATAACCTT 2 94 0 

CCACACCACC 3000 

TTGCCATGAG 3060 

GGCTTTCTCC 312 0 

AGAAGG AG AA 318 0 

GCCCCTGATG 324 0 

GGCCGACCTT 3 300 

AATCTCCTGC 3 3 60 

GCAAGGGAAA 3420 

TAACAGGAAT 34 80 

CTAGCAAGGC 3 54 0 

CGGTAATCCC 3 600 

GACGACTGGC 3660 

CAATGACATT 3 7 20 

AGGATGTCTC 3 7 80 

GCGTCCTCGA 3 840 

TTCTTGTAAA 3 900 

TCTGTCGGCA 3 9 60 

ACAGCG ATTT 4 020 

ACTTCTGCCC 4 080 

ATTTGCTCC A 414 0 

TGAGCTAGGT 4 200 

ACTCGCCAGA 4 2 60 

AATTCGGCAC 4 32 0 

TAGCTGCATT 4 3 80 
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GGCCAATCTT TCTTTGTAAT CAGTCCTTGG CAAGTGTTTT TGAATCTGCT CAACAAAGGG 4440 

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA GCACCTCTTC 4 500 

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTCGCATTT TCATAAAATC CTGCATAAAC 4560 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4 620 

AGCCAAGAGA CTCGACATAC CAACCAACTC GATGTTCAGG GTGTGAGCTA AGGTCTT AGC 4 680 

AGTTGCTACC GCAATTCGCA AGC CTGT AT A GCTACCCGGC CCTTCAGCTA CCACGATTCG 4740 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 4800 

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4 92 0 

TTCCTATCTT TTTGTCTGCT TACT ATT ATA CTACAAAAGC TGGCACATGG GAATTTTCTT 4 980 

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 504 0 

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATGACAAAT S100 

TTTGACATTC TTGACAATCA ATTTTT AT CC TTATCTGAAA ATGAATTATC AGATATTGAT 5160 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 5220 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTGGTTATT TTTTAGGAGG AGATTAATAT 52 80 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 5400 

CCTTCGTAAA AAGTTTGGTA AGTCATG CT A GATAAGAAAC ACATTTTTAG AAGGATAAAT 54 60 

TTTATTGTCT TCATCT CTT A CA G TTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 5520 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 5580 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT G AAAATTC C A TTGTCATGTC 5640 

ATGT TAG AAA AATGCAAAGA CCACCTCATC TT G ATAGATG GGGTGGAATT TTCGTGTCGT 5700 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGG CATCTAAGCA 5760 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 5820 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGGTTT GTCGTCGTTC CAGTTTGG C A 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 5940 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTCTCG TTTCCGTCTT TTACTAATTT 6120 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 
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GGAATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 624 0 

GGAACTATAT G ATTT AC AAA GTTTTTTATC AAGAAACAAA AGAACGTAGC CCACGCCGTG 63 00 

AAACAACACG CACGCTTTAC CTAGACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 63 60 

CTCCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 64 20 

TGTCTGACAA ATTGCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 64 80 

AATATGGCCT ACACTCTTAA ACCTGAAGAA GTCGGCGTTT TTGCCATCGG TGGTCTAGGA 6540 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 6600 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 66 60 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACGG ACACGAGGAC 67 2 0 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 6780 

CTTGCCTTGG CTTTGATCCG TGGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 684 0 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAAGGC AACTTTCTTT 6 900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC T C AAGGG AAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 7020 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7 080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7 2 00 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GGACGCAAGA TTGCGGTCTT TGGTCGTTCT 7 2 60 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 7 3 20 

TTTATCGAGC CAAATGAAAT CAAAGATTAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7 380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7 44 0 

GTACAATTAC AAC CAGGTG A TACCGTTATC TTCTCTTCTA GTCCCATCCC TG G AAAC ACT 7 5 00 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7 560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7 62 0 

TGCTTGATTA AGC CAAAAT A CTTCATGCCT GTCCACGGTG AATACCCCAT GCAAAAAGTC 7 6 80 

CACGCTGGAC TAGCAGTGGA . TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTGC TTG CCCTT AC TGCTGAC T C A GCTCGTATCG CAGGTCATTT CAACGCCCAA 7 800 

GATATCTATG TCGATGGAAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7 860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7 920 
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ATGATTCTAT 


CTGGTCCAGA 


CATCCTCAGC 


CGAGGCTTTG 


TCTACATGAG 


AG AGT CTGGC 


7980 


GACTTGATTC 


GCCAAAGCCA 


GCGTATCCTC 


TTCAATGCCA 


TTCGTATCGC 


ACTGAAAAAT 


8040 


AAGGATGCTA 


GCGTGCAATC 


TGTCAATGGT 


GCCATTGTCA 


ACGCTATTCG 


CCCCTTCCTC 


8100 


TATGAAAATA 


CCGAACGTGA 


ACCGATCATC 


ATCCCGATGA 


TCCTCACACC 


AGATGAAGAA 


8160 


TAAAGCAAGA 


AAACAGCCCC 


GTCCTCGGAG 


CTGTTTTTCT 


CTATGCTTTC 


TTTTGAGATT 


8220 


AAAACTCATA 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


GCTAGCCGTA 


GGTTGCTCAA 


8280 


AGCACTGCTT 


TGAGGTTGTA 


GATAGAACTG 


ACGAAGTCAG 


TAGCCATACC 


TACGGCAAGG 


8340 


CGACGTTGAC 


GCGGTTTGAA 


GAGATTTTCG 


AAGAGTATCA 


ATAAAAATCG 


AAAT CAG ACT 


8400 


AGAAGGCTAA 


G CG AAAG CAT 


AACTTGAGTT 


AGCTCCCATA 


GTTCGGGAAA 


CTATGGGAGG 


8460 


CTGGAGATGA 


ATCAAAGCCA 


AGCTTTGAAC 


TCATTCGTAA 


GAAGCCGACG 


ACGTATCATT 


8520 


TTGATTTTTG 


AAGAGTTTTA 


GAAATACTAC 


G ATTTTT ACC 


TTCCAGATAC 


ACCATCAAAA 


8S80 


TAGAAATATC 


TGCTGGGTTT 


ACTCCCGAAA 


TACGGCTGGC 


TTGGCCGATG 


GTTTCTGGAT 


8640 


TGATGAGTTT 


GAACTTCTGA 


CGGGCTTCGG 


TTGCGATAGA 


ATCAATGTCA 


TCCCAGTCGA 


8700 


T ATTGGC CGG 


AATGCGTTTT 


TCTTCCATGC 


GTTTCATCTT 


GGCAACCTGG 


TCCATGGCTT 


8760 


TGGAAATATA 


GCCTTCATAC 


TTGATTTCTG 


TTTCAATCAA 


TTCGATAATC 


TTGTCATCCA 


8820 


AGTCTTCTGC 


AGCTGGTCCG 


ATGAAGGCCA 


CCACATCTTG 


GTAAGAAACT 


TCTGGACGGC 


8880 


GAAGGAATTC 


CTTGGCTGTC 


ACTGCATCGG 


TCAAGGGTTT 


GAAGCCCATC 


TCCTCAACCT 


8940 


TGGCATTGGT 


TTCCTTGACT 


GGCTTGAGTT 


TGATACTGTC 


TAGGCGCTTC 


ATCTCATTAT 


9000 


CAAATTGATT 


TTTCTTGATT 


TCAAAACGAG 


CCCAGCGTTC 


ATCGTCCACA 


AGGCCAATCT 


9060 


CGCGTCCCAT 


CTCAGTCAAG 


CGCAT AT C AG 


CATTGTCATG 


ACGAAGAATG 


AGACGGTATT 


9120 


CAGCACGACT 


GGTCAAGAGA 


CGGTAGGGTT 


CAATGGTTCC 


CTTGGTCACC 


AAGTCGTCGA 


9180 


TCATCACCCC 


GAT ATAACC A 


TCACTGCGCT 


TCAAAATCAA 


TTCAGGCTTG 


CCTTGGATTT 


9240 


TCAGAGCCGC 


ATTGATACCC 


GCGATAATCC 


CTTGGCCTGC 


TGCCTCTTCG 


TAACCTGATG 


9300 


TTCCATTTGT 


CTGACCAGCA 


GTGAAGAGAC 


CTGAGATTTT 


CTTGGTTTCC 


AAAGTCGCAC 


9360 


GCAACTGATG 


AGGCAAGACC 


ATATCATACT 


CAATAGCATA 


ACCTGTCCGC 


ATCATCTCTG 


9420 


CATTTTCCAA 


ACCTTTGATG 


GAATGCACCA 


AGTCACGCTG 


GACATCCTCA 


GGCAGACTGG 


9480 


TTGAAAGTCC 


TTGCACATAG 


ACTTCCTCAG 


TATTGCGCCC 


TTCTGGCTCA 


AGGAAGAGTT 


9540 


GGTGACGTTC 


CTTGTCCGCA 


AAGCGCACAA 


TCTTGTCTTC 


AATCGACGGA 


C AGT AACGAG 


9600 


GCCCCACTCC 


CTTGACCACA 


CCTGTAAACA 


TAGGCGCACG 


GTGGAGGTTG 


TTTTGGATAA 


9660 


TCTCATGACT 


GGTACCATTG 


GTATAGGTCA 


ACCAGCATGG 


TACTTGGTCC 


TTGACATAAT 


9720 
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CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 97 80 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AAT AG AAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGACTACTTG AGGTCTC CG A TGATAATTTC CCCACGGAGA GCAGTCCCTG 99 60 

TCGTCACAAT AACAGCCTTA G C AGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

CCTTGCCATC TTC CACC AAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTG CGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 102 00 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 102 60 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CAT TTC AAT A TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC AT ACGGCT AG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTC CTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC T AGCTGG AT A TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTC GAT ATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC AC AAAAGCT A 10680 

GCTGCCCAGC TTG AG CTTTG AC AC CCAGTT CTTCCTTCAC TT CACGG ACT ACCGCGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAAC CGT AT 10860 

TGTCTACTTT TGTCCGAAAG TCTT G TTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAAC AGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 
(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
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ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCGCTAGgCA ATCGTCTATC AGAATATCCG 180 

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 2 40 

CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 3 00 

CAAGATAAAA ACATGTGTAA GCAAATCTGC T ACACTTT AC TGGAGGACGC CAAGAATAAG 3 60 

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420 

AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 4 80 

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 54 0 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 6 60 

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 7 80 

ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCT ATCTG A TGCGCTATTT CAACCAATTT 84 0 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 9 87 



(2) INFORMATION FOR SEQ ID NO: 216: 

(x) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 51 base pairs 
{B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 
CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 2 40 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 3 60 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 4 20 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 4 80 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 54 0 
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CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAG CAAATCCGTA 600 

TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 660 

ATC ACT CC AT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 

CAGACATAAA GTTTCTGATC CGTTGT CC AT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 780 

GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840 

ATGAAATAAA CTGGTAGTTC GGC AT CAT AC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900 

AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG C AT AACGGG C TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTCATA 1140 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 12 00 

TGCACGTCCG TCTCCACCAA GTCTGGTTTG AGTTGTGCTG TAGTTTCCAA GGACTCTACT 1260 

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAACTTG AGGTAGATGA 1320 

ACCATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 1380 

ATGACACCCC ATCGCACGAT GTGATCTTTT TCTCTCCT AG GAAGCATATC CAGCTCCTTT 1440 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAAGTCA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AAT AC CAAGA AT CAGAG ACT CTCTCTGAGT GATATCATCT 1560 

ACCAAAGTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 162 0 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGAGATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAAGA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA G AAAAAGAT A 1920 

AACAAGGCTT TAGCGCAAGT TAGCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 1980 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 2 220 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT T ATCAAG ATC GAGGTAAATC 2 280 
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TCTTTAACAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 23 40 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CGTAAACACG CGCCGTCATC CGAGCATCCT 24 60 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA .2520 

GATTGCCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTCCTAA AATAGGTAAA TCACTTTTAG 2640 



CAGCATTGTA G 

(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



2651 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA GCTATGGAGG 60 

AAAATATGGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 120 

ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 

AAGAACAAGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 24 0 

AATTGGCTTT ACATGCCTTT G AAGAAAC AG GACGTGGTCT ATTTGAAGAC AAAGCAACTA 3 00 

AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACAGTTGGCG 3 60 

TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 4 20 

GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 480 

TGAAGACACG TAACCCAATC GTCTTTGCCT TCCATCCATC AGCACAAGAA TC ATCTGCT C 54 0 

ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTGTGTGC 600 

AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT T ATGAAC C AC GAAGGTGTTG 660 

CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGGTAAAC 72 0 

CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 

GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGGTATG GTCTGTGCAT 840 

CTGAACAAGC AGTTATCATT GAT AAAG AAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 

CTTACCACAC TT ACTTTGT A AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTGCTTCG 960 

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 1020 
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AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 1080 

TTGCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTGTCAC 114 0 

CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 1200 

TGGTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 12 60 

CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTCA CCTTCTACTT 1320 

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACACTT GGATGTGGTT 1380 

CTTACGGACG CAACTCAGTT GGGGATAACG TT AGTGC CAT TAACCTCTTG AATATCAAAA 14 40 

AAGTCGGAAG ACGGAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500 

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGACGT TGAACGTGTC ATGATCGTTA 1560 

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 162 0 

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GATATCACAA 1680 

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AG AT AC CAT C ATCGCACTCG 174 0 

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 1800 

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTCC 1860 

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920 

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTAAATAC CCAATCGCTG 1980 

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 204 0 

GATTTGTTGC TGCTGATACT GGTATGGACG TATTGACTCA CGCGACAGAA GCATACGTAT 2100 

C AC AAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGC C ATT AAA TTGGTCTTTG 2160 

AAAATCTCGA AAGCTCAGTT AAGAATG C AG ACTTCCACTC ACGTGAGAAA ATGCATAACG 2220 

CTTCAACAAT CGCTGGTATG G CCTTTGCC A ATGCCTTCCT AGGTATTTCT CACTCAATGG 2280 

CC CAT AAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTC 2340 

C AT ACGTT AT CCGTTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2 400 

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2 4 60 

CTACTCCAGA AGAAGGGGTT GAATCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2 520 

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2 580 

CTCGTAAATT AGCCTTCCTG GCTTATGAAG AC CAATGTTC ACCAGCTAAC CCACGTCTTC 2 64 0 

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA CTATGGCTAC AAAGAAAGAC 2 7 00 

CAGGAC GCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG ACAAAAACTC AATTTGAGGG 2760 



WO 98/18931 



PCT/US97/19588 



1224 



AAAGATCCAG 


TAATTTTTCT 


ATGATAAAAG 


GCATCCTATC 


AAGGTTTTTG 


AACACCTGAT 


2820 


AGGATGCCTT 


TTTATGATAT 


TGAGGCCTTT 


TTGCCCTTTT 


TGAAAAACTA 


GAATAGAAAC 


2860 


AAAATATATA 


AT AG ATTG AA 


ACTAGAATAG 


TACATATCTG 


CTTCTAAAAC 


ATTGTTAGAA 


2940 


TTCGATTTGA 


CTGTCCTGAT 


CGATTTGTCC 


TGTTCTTATT 


TCATTTTGAT 


ATATAAAAAA 


3000 


T AT AGT AT AG 


TAGACTGAAT 


CTAAAATAGT 


ACGAAACAAT 


TGCTAAAACA 


TTTATAGAAA 


3060 


TTAATTTTAC 


TTTTC TG AT A 


GAGTTGTTCA 


CATCTTATTT 


CAATTCACTA 


TAGTTTAATT 


3120 


TAAGAGTAGT 


ATTTACTAAG 


GCCCAATTAA 


AATCAAAGAG 


CAAACTAGAA 


AACGAGTGCC 


3180 


ATTCAGCTCA 


AAACACTGAT 


TT GAG ATTG C 


AGATAAGACT 


AGCCCCCTCA 


TTAACAGATT 


3240 


TACGATAAAA 


CGATGACAAG 


GTGTGTTGCT 


TTTTGATTTC 


TAAAGAGTAT 


AATGATAGAT 


3300 


CTCTATAAAA 


TAAGTGCGAA 


GGAAATGAGC 


TTTTATAGTC 


CTTTCGTTTT 


AAAAT ACT A T 


3360 


CTCAGATATT 


CTTATATCGA 


CAAGAAGTTT 


TTGAGTCATT 


CCCTCATCAT 


ACATATTAAA 


3420 


TAAATAGTGG 


CTCATTCAAT 


TTTTCACTAG 


AATAATAAGC 


TAGTATAGTA 


AACTGAAATA 


3480 


AGATATAAAC 


AAATAAATTG 


GAGCTTAACA 


TCCATTTCCA 


GCAATTTTTT 


AG AAACT AC A 


3540 


GTGGACTATT 


CTAGATTCAA 


CAT ATT AT AA 


AAACT AG AGT 


AAAAGAAAAG 


GATTGGATCT 


3600 


TGTGTAATGC 


AGGATCCAAT 


CCTTTCAATC 


ATTTTGTCCA 


ACTTTTGGAG 


GTTCCTACAA 


3660 


TGTAGTCGTC 


ATTAATAAAG 


ACAGATGGGA 


ATGACAGTGT 


TCCTATTTAT 


TTTGATAGAG 


3720 


ATCGATGAAT 


T CTTT AG AT A 


GCAACTGAAT 


AATCTCTGTT 


GAAGCCATTT 


GGTCTTCTGC 


3780 


ATGCATAAAT 


AGCAAGGAGA 


ATCCTATTTT 


TTCTCCAGTA 


GCTTCTTTTT 


GTATGAGATT 


3840 


AGAGTGAATC 


TTGTGCGCTT 


CTACTAAGGA 


GTCTTCCGCT 


TCTTCAACTT 


TAATTTTCGC 


3900 


TTCTTTTAAA 


TTTCCTGCCT 


TAGCTAGTTG 


GATGGCTTCA 


ATAAAGGATG 


ATTTGGCTGC 


3960 


TCCACTATTG 


GCAATGAGCT 


GAAAACAGAT 


AT ATT CC ATT 


TCTTCTGTCA 


TCTTATTTCT 


4020 


CCTATCCATG 


CAAGTGCTTG 


TTCCAGAACT 


TTTGCTCCAT 


TCATCATTCC 


GTAATCCCGC 


4080 


ATATCAATGG 


TATCTACAGG 


GATATTTCCT 


GCAATTTCTT 


TCACAGCAAG 


TAACTCATAA 


4140 


CGAATTTGTG 


GCCCAATTAG 


AATGACATCT 


GCTTCATGGA 


T ATT CTTT TT 


AGCTTCTGTC 


4200 


ATTGATTTTG 


CTTGGATAGA 


GATTTCAATC 


CCACGTTCAG 


TCGCACTTTG 


TTGCATTTTT 


4260 


TTAACAAGCA 


TACTTGTCGA 


CATTCCCGCA 


TTACATACTA 


ATAAAATTTG 


TTTCATAATC 


4320 


TTAACCTTCC 


ATTTCTTGTT 


CAACAACTTT 


GTCATTAACT 


TTGATAAATG 


GAATGTATAG 


4380 


AAGAACTCCA 


AGTG CAAAGA 


TGATGAATTG 


AACTAGAACT 


GCTCTCACGT 


CCCCTGCTGT 


4440 


TGCTAACCAT 


GCATTTAAGA 


ATACTGGTGT 


AGTCCAAGGA 


ACTTGTATAA 


ATGCAGGACT 


4500 


CATGAATTCT 


GTAACTGTTG 


CTAAGTAGCT 


GATTAAAATA 


CCAAGGACTG 


GAACTGTGAT 


4560 
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AAATGGAATA GCTAATGAAA TGTTATAAAC GATTGGCTAA CCGAATAATA CTGGTTC AT T 4 620 

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGACA CAGCATTGCG 4 680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4 740 

GAATGTTTGT ATTTGTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800 

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 48 60 

TGCTTGGTGA ATACCAAATA G CC AT AACAT ATTTCCTAAA GAGTAAATAA TAATGACCCC 4920 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 504 0 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCT ACT AAC A GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACGTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CGtACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 5220 

GAAATGTTTA CCGC AT CTTT TGCTCCGTCA GGAACTACAG AAACTG T ATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACG GGTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGGGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 563 8 



(2) INFORMATION FOR SEQ ID NO: 218: 

(iJ SEQUENCE CHARACTERISTICS: 

(A> LENGTH : 474S base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GT AT C AG AT A TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTG AAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300 
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GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 3 60 

TTGAAATCAC CAACAACACA CCCAACAGAT AAAATCATTG CTCCTGAATT GGCTGAATTG 4 20 

GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 480 

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540 

AATGTCCGTG TTGCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600 

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA CTTTGTCTTG 660 

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 72 0 

AAGGTCGAAG CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780 

GTTTCACGTA AGAAACAAGT GGTACCTCAA TTGACTGAAA GCTTTAATGC GTAAGATTTT 840 

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG TTTCGGCTCC 900 

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCG AT TTG ATTTTTGTGA GAGATGGGTC 960 

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG CCATTTATTT 1020 

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080 

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140 

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT TCTATCCAGA 1200 

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT TTGAAACTAT 1260 

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG ACT ATT ACAT 1320 

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC AGTTGGCAGC 1380 

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTC ATGAT TCAGATTTTT AATCCATCTC 1440 

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500 

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560 

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620 

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTGAGAG AAGCTAGTCC 1680 

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740 

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800 

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860 

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920 

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TGATATCTTT GTGGACAGTT AGGTTGTCTT 1980 

AGGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2040 

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100 
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AGTATTCGCT 


ATAAACTATA 


TGTAACCGGT 


AACACATATC 


GGAATAAACT 


AAAGGAGACA 


2160 


ATCATATGTC 


ACTTGAAAAC 


AAATTGGAAC 


AAGCAACAGG 


CCCTCTCAAA 


GAAGGTTTTG 


2220 


GTAAAGTTAC 


TGGAGACAGC 


AAGACAGAAC 


TTGAAGGAGC 


TGTTGAAAAA 


AC AGTTG CT A 


2280 


AGGCAAAAGA 


CGTTGTAGAA 


GACGCAAAAG 


GTGCTGTAGA 


AGGTGCCGTT 


GAAGGTTTGA 


2340 


AAAACGTTTT 


TACTAAAGAA 


TAGGAAAAAA 


TCAAGGGTTT 


CATTTTCCCT 


TGATTTTTTC 


2400 


TATTCTTATA 


AATAATTTTC 


TGCGACGGCT 


GTATCTCCTG 


GGTAGGATTC 


TTTCTTGCCC 


2460 


TGGATGATTT 


GGTAACAATC 


GGCTCCCTTA 


CCCGCAATAA 


TAACTGCATC 


TAATTCGTGA 


2520 


TTTGTGATAG 


CCATTGCCGC 


CTTGATGGCT 


TCTTGGCGAT 


CCGCAATCTT 


TTCAACAGGA 


2580 


TGATTGATGT 


AGCTACTAAT 


TTCATCTGCA 


ATGGCCATTG 


GGTCTTCATA 


GTTAGGGTCA 


2640 


TCAGC AGTC A 


GAAAGACTTG 


AATCTCAGGG 


TGTTGATTGA 


GGAGGAGGCC 


AAAGTCCTTA 


2700 


CGACGACTTT 


CTCCCTTGTT 


TCCTGTTGAT 


CCCAGAACCA 


GAGCAATCTT 


TC CGCTTTG A 


2760 


TGAGTTTCAA 


CCACATTGAT 


GAGTTTTTTC 


AGACTATCCC 


CATTGTGGGC 


ATAGTCGATG 


2920 


AAGACCTTGG 


CTCCATTTTT 


CTGAGTGAGG 


ACTTCCATAC 


GACCAGGAAC 


GCGGGTTGCA 


2880 


GCGATGCCTT 


TTTTGATGTC 


CTCAAGACTT 


GCTCCGAGAC 


GGAGACAAGC 


AAGTCCAGCA 


2940 


GCAACTGCAT 


TTTCTTGGTT 


GAAGTTGCCA 


ATGAGTTGAA 


TATCATAATC 


TCCAGCGAGT 


3000 


TT AC CCGT AG 


CTGAAAAGCT 


AAAGGCTTTG 


GAATTCTCGA 


TTTGGTTATC 


AAATTGGCTA 


3060 


CCATAGAAAT 


CATGGTCTTG 


ATCTTCAACC 


TGTTCTTTCA 


AGACTGAGAA 


GTGGTC CATG 


3120 


TCACTGTTAA 


TGATGACTGC 


TCGGCTCTTT 


TCCATCAAGA 


GACGCTTGTG 


GTAGAAATAG 


3180 


TCTTCAAAGC 


TAGGGTGTTC 


AATCGGGCCG 


ATATGGTCTG 


GGCTGATATT 


TAGGAAAACT 


3240 


CCCACATCAA 


AGGTTAGACC 


ATAGACACGT 


TTGACCAGAT 


AGGCTTGACT 


GGAGACTTCC 


3300 


ATGATGAGGT 


GGGTACGGTC 


ATTTTGCACA 


GCCTGATTCA 


TCATGTCAAA 


GAGGTCAATA 


3360 


CTCT CAGGGG 


TTGTCAACGC 


TGACTTAAAG 


AAAGTCTCCC 


CATCAAG ACT 






GTCGACAACA 


TAGCAGGTCT 


ATGCCCTTGA 


GATAAGATGT 


TATAGGCGAA 


ATAGGCTGCT 


3480 


GTTGTCTTAC 


CCTTAGTACC 


AGTAAAGGCA 


AGGAGTTTGA 


GTTTTTCCTG 


TGGATTACCA 


3540 


TAGAACTCCA 


TGGCAATCAA 


ACTCATGGCT 


TTCTTTATAT 


CGTT C ACAAT 


GATGACAGGG 


3600 


ATACCGACTT 


CGTAGTCCTT 


TTCAGCTACA 


TACCAAGCTA 


ATCCTTGTGT 


TATAGCAGAA 


3660 


AGAAGGTATT 


CTTTTTTAAA 


GGCAGCGCCT 


TTTGCGAAAA 


AAAGAGT GTC 


TTCTGTTACT 


3720 


TTTCGGCTGT 


CGTAGCTGAT 


GCTATCAAAA 


ATAACTTTGC 


TGTAGTTGTA 


GTGGTAATGA 


3780 


CCTTGGTCAA 


TAATTTCGCG 


AAAAAGGCCA 


TCTTTCTTTA 


AAATATCTAA 


TACGGTTTCA 


3840 
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ATCTTAATCA *p ACTTTCT AT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3 900 

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3 960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4020 

GTAACTTTAT CAGT CGCCT A CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4 080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GGGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4 200 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GGCTTTATGA 4 2 60 

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTC CTTGG CTAGCAGACT 4 3 20 

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4 3 80 

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 444 0 

ATGCCATGAG CCAAATTGCT G AGCAGGT C A TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4 500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4 560 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CACTCTTGAT TTATTTCCTT GCCCAAGAAG ,4 620 

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4 6 80 

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4 740 

TTTTG 4745 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs . 
(B> TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG AT CGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 3 60 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCT G TTTGCC TTTAAATCGC 4 80 
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CATTGGAAGC GGAgCTTGTC ATAAAAGGGA AACTCGATAA ACAGGACTCC CAAGCCCACA 540 

CAGAGACTGG CAAGGACGTC TGATGGGTAA TGAACTCCCA GATAGACTCT TGATACCAGC 600 

ACACTGACTA GGTAGAGGCC AAGGACGATT TGTACGATTT TTCTCCAGAC CTGATCTTTA 660 

ATCCGCTCAC TAAGAATAAC AATCAAAGTC CCTACCATCA GCGTTACAGC TAGAGAATGC 72 0 

CCACTTGGGA AGGAAAATCC CTTCTCCTCC ACCAGATGTA AAATAGCTGG TCGTGGGCGC 7 80 

TGGTAGATAT TTTTAAAGGT CACGATTAAA AGACCTGCCA AAGCCAGATT TCCCAGCATG 840 

AAGAAACTTT CTATCTTCCA TCGCTTACGA TAAAAGACAA AAGCTGTAAT GACAACCCAA 900 

GTGATAATCA CTGGGATATC AATCAGACGT GTGAGGGCTC GAAAAAGAAT AGTCAAATAA 960 

TCTGGTAAGT CTCCTCGAAT GGCAGTCTGA ATCGATTGGT CAAAATTGAC C AAC ATT TC A 102 0 

GGGTAAAATT TGACCATGTA GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 1080 

ATTAAAAATG TTTGTTTATC TCTCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 1140 

CAACCAGAAT GAAACGGAAA AG AT AAC AC C TTCAATCAAG TTAAAAGGTA ATACCATGGT 1200 

CATTAGGTAG TTGGAAAGTC CCAAAATTTT TCCAATATCA AAGTTAGCAA AC TT AGCGT A 1260 

CAAAGGAACA G CAT AAACAT AGTTGAGAAC CAACATGGCC AAGGTTAAAC CAATAGTTCC 1320 

AGCTAGAGAG CCTAGTAGGA AACGAAGGGT TGTCCGTTCC TTTTTCCAAA TCAAAGCAAA 1380 

TACGATGACA AAAACTCCCA AAGCTACGAT ATTCATCGGC AAACCAATGT AAGTATTCAC 1440 

TCCTTGGCTG TTAAGAAGCA ATTTCAAGAG TGAGCGAAGC AAGAGCACTC CTAGAGmCsC 1500 

AGGCAAATCC ATGACCACCA GACCCACAAG GACTGGCAAG AT AC T AAATT CG AT CTTG AG 1560 

GAAAGATGCC GCTGGTAAAA GCGGAAAGTC AAAGTACATC AGCACAAATG AGATGGCTGA 1620 

T AG AATTGC A ATGGTCGAAA GTCGACGTGT GTTTGTCATA ACAGGTTCCT CCAATTTTCT 1680 

ATAAAATCAG AAGAAGTTGG AAAGGATTCC TCTATCTATT CTCACTTTTT ATATCCCAAA 1740 

AGTTCCCTCT TACTCTATTA AAGAAAAACA AAGCAAGTGG TTACAATCCG GCTATAAATC 1800 

TATCAAAACA GACAAGGCTA TTCTTTCG TC TTCTCCCATC CAGACTATAC TGTCGGTTGT 1860 

GGAATCTCAC CACATCACGT TGCGCTCACG GACTTCTTTA 1900 



(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4692 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



GGTTTTCCAG 


CAGGAGCTTC 


TCCTTTATCA 


G AATG AC CAT 


CCCATCTGCT 


CACGATAGAT 


60 


GAATAATGAT 


ATTTTTTACC 


ATGATAGTAA 


TTTGAAAAAG 


CCTAACCACC 


TCCTGAACCT 


120 


TCTCCATATG 


TCCATACTCC 


TCCATCTGGA 


TATTATACAG 


CAGCTGATGC 


AGCTCCCAAT 


180 


AATGTAAAAC 


TTGAAATAAG 


AGCTAGAGCA 


AGTAATCTAT 


GTTTTTTCGT 


TTTCATTTTA 


240 


TTTTTTCTTT 


CAAAAAAAGC 


ACACCTTGAG 


CAACAATGCA 


ACAAAATAAA 


TCCTCCTCTC 


300 


TCTTTTATTG 


AAACCGCTTT 


CTTATGTGAT 


AAGAATAACT 


TTTTTATtAT 


TTGTTGTCAA 


360 


GGAAAAAATC 


GAATTTTTTA 


GATATTTTAC 


TATATTACCT 


CTGTGAATAA 


TATTATATAG 


420 


TAGTTTTATT 


TCAAAATAAT 


ATGCAACCAG 


T ACT AAC CAA 


ATAT AAAAT A 


GATGCCATTA 


430 


ACGAATTTTA 


TTCAAGTTTT 


TCCCATTCAT 


ACTATACAAG 


TAAAAGAGAT 


GGTGTTAACT 


540 


AAAAAGCAAT 


TCAAACTATT 


GTAAAATTCC 


TAGCAAAAAG 


AGAGCCGAAA 


CTCTCTTTTT 


600 


TATCTTCTTT 


TACTTTTTTT 


GACTGGCATG 


AGTGTG AT GT 


CTCTAACACT 


AAAGTAAGCT 


660 


AGGATCAACA 


TGGCTATTGC 


TAGGAATATT 


TCTGTTGGTA 


ATTGAAAAAT 


TTTCAGAAAA 


720 


GATAGAACCA 


ATAAAATCAA 


GAGTGCCACT 


AAAAT ACAT A 


CCATAGCGAC 


GATATTGACA 


780 


GTCCCTTTAA 


TGC TTTCTGG 


TGTCGCAAAT 


ACAT AG AGT A 


GGAGCAGTAA 


AATTCCTAGG 


840 


ACTAAATAGA 


CCATCTTTCT 


CTCTTTCTAG 


CTCTTATTCA 


GCTGATTTTT 


TCTTCTTGTT 


900 


AGCTTTCTCA 


CGCTCTGCTT 


TGTTAAGGAT 


TTGTTTACGC 


AAACGGATAG 


ACTCAGGCGT 


960 


TACTTCCATG 


TACTCATCGT 


CGTTCAAGAA 


CTCAAGAGAC 


TCTTCAAGTG 


TCAAGATACG 


1020 


AGGCGTCTTG 


ATAACAGCTG 


TTTGGTCCTT 


AGTAGCTGAA 


CGAACGTTGG 


TCATTTGTTT 


1080 


TGC CTTCGTG 


ATGTTAACTG 


TCAAGTCATT 


TTCACGAGAG 


TTTTCACCGA 


TGATCATTCC 


1140 


TTCATAAACC 


TCAGTACCTG 


GGTTGACAAA 


G ATCGT AC C A 


CGTTCTTCGA 


TAGACATGAT 


1200 


TGAGTAAGTT 


GTAGCCTTAC 


CAGCATCGAT 


AGAAACAAGG 


GCACCACGGT 


GACGTCCACC 


1260 


AATTTCCCCT 


GGAATCAATG 


GCAAGTATTG 


GTCGAAGGTA 


TGGTTCATGA 


TACCGTAACC 


1320 


ACGAGTCATT 


GATAAGAACT 


CAGTTGAGTA 


TCCAATCAAA 


CCACGCGCTG 


GAACAAGGAA 


1380 


GACCAAACGA 


GTTTGACCAT 


TACCAGTTGA 


AATCATATCC 


AACATTTCAC 


CTTTACGTTC 


1440 


AGAAAGGCTT 


TGGATAACAG 


ACCCTTGGTA 


TTCTTCTGGA 


GTGTCGATTT 


GTACACGTTC 


1500 


AAATGGTTCA 


CATTTAATAC 


CGTCGATTTC 


TTTTACGATA 


ACTTCTGGAC 


GAGATACTTG 


1560 


AAGTTCATAG 


CCCTCACGAC 


GCATTGTTTC 


GATAAGGATT 


GACAAGTGCA 


ATTCTCCACG 


1620 


TCCTGAAACA 


GTCCATTTAT 


CTGGTGAATC 


AGTTGGGTCA 


ACACGAAGGG 


AAACGTCTGT 


1680 


TTGCAATTCT 


GCCTGCAAGC 


GTTCTTCCAC 


CTTACGAGAA 


GTTACCCATT 


TACCTTCTTT 


- 1740 
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ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTC ATTTG A AGAGTTGGCT CATCGATCTC 1800 

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTT CACCG A CAAAGATGTC 1860 

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGGCT TCTTGGATTT CACGACCTTC 192 0 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACCGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040 

ACGTCCAACG AAGT C ATTGT AGTCCAAAAG TG AC A CTTGG AACTGCAAAG GCTCATCTGA 210 0 

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGT C A GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2 2 20 

CACTGGGAAA TCAAG CTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2 2 80 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 23 4 0 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTT CCTTC 2400 

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 24 60 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTG AT A CGAGTTCCGT TGTAAGCAAC 2520 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2 580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTC T GATTGTTTCA ATAATTCCTC 2640 

AAC C AGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700 

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760 

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGCTT 2820 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 294 0 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AAT ATCG AT A 3000 

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATCAA CACACCTATC 3 060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AG AC AAGGAA CTTCCGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180 

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3 2 40 

TCCATCCCCA AT ATC ATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3 300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3 3 60 

C CGC AAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3 420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GCTTGGTGTT AAGGTGACTA 34 80 
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GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAAGGTAACT 3 540 

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3 600 

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3 660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AG AT ATT AAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG GCAAGACCAA TCCTATGATA 3780 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840 

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900 

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960 

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4 020 

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTC 4080 

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 414 0 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 4200 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4 260 

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4 3 20 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4 380 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440 

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500 

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4S60 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4 620 

TTTTC CTATC TTCTTTAGCG ATTCTAAGGC AAGTATCGTA CAATAAAAAC ATGGGGATTC 4 680 

AACAATTACA TT 4692 
[2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60 

GCCG CCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120 

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180 
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AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240 

GTCTGACGAA CGCCTACAAT AGTTGCTAAC AT ACTT CCT A AAAAG AATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA G AGGT C ACT A GAAAACTACC AATAAACATG 3 60 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 4 20 

CCATTTGATT TAG CTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 4 80 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTG AG ATT C A 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC AT AGT CTTGG CG AACCAAGA AAGTAAAAGG C ATGGTC AC C 660 

AAGACCATCA AAACCACAGG TCGCACAATC TCGAT AAAAG CGTCTT 706 

(2) INFORMATION FOR SEQ ID NO: 222: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
CAGCTGATGG GCAATATCAG TCATAGAAAT TTTTTCAATT AACTTTTGAG CAATTTTTTG 60 

GTTGATGATA CGAGGGATTT GGTGATTTTT CTTTACCAGG GGAGTCTCAG CAACCATCAT 120 

TTTTGAACAG TGATAGCACT TGAAACGGCG TTTTCTAAGG AG AA TT CT AG AAGGCATACC 180 

AGTTGTTTCG AGGTAAGGGA TCTTAGACGG TTTTTGAAAG TCATATTTCT TCATTAGACT 240 

TCCACAATCA GGGCAAGATG GAGCCTCATA ATCCAGCTTA GCGATAATTT CTTTGTGGGT 300 

ATCCATATTG ATGATATCTA GAATCTTGAT GTTTGGGTCT TTAATATCGA GCAGTTTTGT 3 60 

GATAAAATGT AATTGTTCCA TATGATTCTT TCTAATGAGT TGTTTTGTCG CTTTTCATTA 4 20 

TAGGTCATAT GGGACTTTTT TTCTACACAA AAATAAGCTC CATAATATCC ATAGGGGATT 480 

TACCCACTAC AAATATTATA GAGCCCGAAA ATATGGGAAA ACTGATCCTT GTTTCTGCTT 54 0 

TTGTCTATAG AAGAATAATA AAGATTATCT TCTTCAAATT CTCCGATATT CTCTAAAGTT 600 

TTGTGCAAGT TGCACAGAAC TTGTTTATTT TTTTGGTCAT CTTGCCATAG AAATATAAAG 660 

CGTTTTCATA TATAATATAA TTATCAAAAG ACAAAAGGAG TTCACCTCAT GGTAGAATTG 720 

AATCTTAAAA ATATTTACAA AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 780 

AACTTGAACA TCAAAGATAA AGAATTT AT C GTTTTCGTAG GACCTTCAGG ATGTGGTAAA 840 
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TC AAC T AC AC TCCGTATGAT TGCTGGTCTT GAAGACATTA CAGAAGGTAC TGCATCTATC 900 

GATGGCGTAG TTGTCAACGA CGT AGCTC C A AAAGACCGTG ATATCGCCAT GGTATTCCAA 9 60 

AACTACGCTC TTTACCCACA CATGACTGTT TATGACAACA TGGCTTTCGG TTTGAAATTG 1020 

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA AATACTTGGA 1080 

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG TCAACCTCTT 1140 

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA ACCTTTGTCA 1200 

AACTTGGATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT TCACCGTCGT 1260 

AT CGG AGCT A CAACTATCTA TGTAACTCAC GACCAAACAG AAGCGATGAC ACTTGCAGAC 13 20 

CGTATCGTTA TTATGTCAGC TACTAAGAAC CCTGCTGGTA C AGGT ACT AT CGGACGTGTA 13 80 

GAACAAATCG GTACTCCTCA AGAAGTTTAC AAAAATCCAG TTAACAAATT CGTTGCAGGA 14 40 

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG CGAAATTGTT 1500 

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG TGAAAAAGGC 15 60 

TACGAAGGAA AAGAATTGAT CTTTGGTATC CGTCCAGAAG ACGTGAATGC AG AACCTG CT 16 20 

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC AGAACTGCTT 1680 

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GGTAAAGACG AGTTTGTTGC AAAAGTTGAT 17 40 

GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800 

CACTTCTTCG ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860 

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA T CT AGGG AG A 1920 

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980 

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACCTA 2040 

TGATTACTGA TTTCGTCAGT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100 

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTGTGGGT ACCATCTACA 2160 

AGTGAAGCTA TATGCGTAAA CTACCTGAGC AATTGAATTC GAACTAGAGA GGTAATAATA 2 220 

AATTTATCCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 2 2 80 

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2 340 

TGTCAGAAGC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2 400 

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 24 60 

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520 

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2580 

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA T G CAATG AAA CTTCCTTATC 2640. 
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AAGATGCTAG TTTTCATATT GTTATAAATG AAGCTATGCT GACTATGCAA GCCGATCAAG 27 00 

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 27 60 

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2 820 

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGCGA ACAGGTGATG ATAGAATCAG 2 880 

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2 940 

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTCTGT AAATGCTTGT AAAAAGGAGA 3000 

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3 060 

TTATTCCGAT GGCTAGTTAT AAATCGTCAA AACGTTAGAT AATTATTGAA GTTAACTTTT 3120 

CCTTTTTTCT TTCTTAAAAA AT ATGCTAT A ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180 

AAAGATGAAT TTAAAAGATT AC A TTGC AAC AATTGAAAAT TATCCAAAGG GTACCG 32 3 6 

(2) INFORMATION FOR SEQ ID NO: 223: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2885 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
CCTGACTTTT CAAATTGGTT AGTT TGCCAC ACTTGGTTTA TATGGTCGTG GAAAGCATGG 60 

CTATTACTTC TCAAAGGGCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120 

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180 

TTGATGCCAA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240 

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300 

TCACGATTGT TTAGCACGTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 3 60 

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 420 

GTAGCTTCAA ACTCTGTAAG TCTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 4 80 

TTGATCATTT CGGTTAGGCT TACAAAAAGC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 54 0 

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600 

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGT AC CTG ACCTTTTGCA 660 

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720 

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780 
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AGTTTAGCAT TCAAATTATC AGCAATCGCA T CTTCTTC AT CTTGCTTTTT CTGTTCTTCT 840 

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC T CCGTTCT C A 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTCTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTACTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG GATTACCTTG ATGTATGCAA 1200 

AGTTTG CTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 12 60 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GG TTACTTTT TTATTTACAA 13 80 

AATCAGAAAT CAGT TGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAAT AT C 1560 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 162 0 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GGGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800 

TTTGATGGAT TT TGTCTG AC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT IB 60 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920 

TTCTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980 

TTCT AAAC AT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 204 0 

CCATATCCTC AAATGCCTGG T ACTGCTCC A ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTT CT CTTAACTGCT TAG CGTTC AT GTC TGTT ACT GCCTTTAGTA 216C 

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 2220 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 2280 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCT G TTG CTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460 

AGT CAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 252 0 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580 
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TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2640 

TCTTTTTTAC CCTCTAAAAC GG CTTTGGCT AGTGGTTAAT ATT ATTT AC C ACTTGTCTCT 2700 

ATAAACGTGT TAGAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760 

TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820 

AAATTACATC TTCTTTATCC TTGGCATCTG CTTTCTCTGA G AC AAATTAG AATGTCAATA 2880 
CTTGG 

2885 

(2) INFORMATIOK FOR SEQ ID NO: 224: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3144 base pairs 
<B> TYPE: nucleic acid 
(C> STRANDEDNESS : double 
( D ) TOPOLOGY : 1 inear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

TATCAATCCT TTCC CATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60 

ATGTATTTTT ACGAGT C AGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 12 0 

TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 180 

ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTA CTATAAAATG 24 0 

AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 300 

AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 3 60 

ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 42 0 

ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 4 80 

TCTATGTACC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 54 0 

TCCGTAAACT GCGTGATATC CGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 600 

TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTGCCA 6 60 

CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720 

GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 780 

CCTTTGT CC A TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 840 

TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACCACC 900 

TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 960 

TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020 
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GTGTGG AC AG TCTAAAAATC GAAGGACGTA TGrAGTCTAT TCACTAyGTA TCAACAGTAA 1080 

CCAACTGCTA CAAGGCGGCT GTGGATGCCT ATCTTGAAAG TCCTGAAAAG TTTGAAGCTA 114 0 

TCAAACAAGA CTTGGTGGAC GAGATGTGGA AGGTTGCCCA ACGTGAACTG GCTAC AG GAT 1200 

TTTACTATGG TACACCATCT GAAAATGAGC AGTTGTTTGG TGCTCGTCGT AAAATCCCTG 1260 

AGTACAAGTT TGTCGCTGAA GTGGTTTCTT ATGATGATGC GGCACAAACA GCAACTATTC 13 20 

GTCAACGAAA CGTCATTAAC GAAGGGGACC AAGTTGAGTT TTATGGTCCA GGTTTCCCTC 1380 

ATTTTGAAAC CTATATTGAA GATTTGCATG ATGCTAAAGG CAATAAAATC GACCGCGCTC 14 40 

CAAATCCAAT GGAACTATTG ACTATTAAAG TCCCACAACC TGTTCAATCA GGAGACATGG 1500 

TTCGAGCTCT TAAAGAGGGG CTTATCAATC TTTATAAGGA AGATGGAACC AGCGTCACAG 1560 

TTCGTGCTTA ATGTAGTTGT TTAGTTTTAA AAAACTATGC AAAGCTC C AT ATACAACACT 1620 

TAAACGAGAT TAAAGAATGG CGAAATCCCT TGATGCGCAA GAGATTAGCT GTCTTT TTT A 1680 

TTTTTTAAGT GATAAAGTCG GAGTTTAGGC ATCAAAGCCT ATCAAATTAA ACAAAGAAGC 174 0 

GATGTCTTAG AT ATTTTG AA AAAAATTAAT AAGCAGAAAA CTCTCTATTA TTTTGTTGTA 1800 

GAGAGTTTTT TGTTAATAAA ATTTCACAAA ATGACATTTA TATATTGCAT TAAGTTAGAT 1860 

ATATGATATA ATATTGTTAA AAAGAGGCGC AACTTTTTAA AATTAATGAG AATCAAAGAG 1920 

AAAACCAATA ATATTAATGG AGGAATAAAA AATGTAAGTA AGCATTATGG TCATTCAATC 1980 

ATTCTCAAAG ATATAAATTT TGCACTTAAC AAGGGTGAAA TTGTTGGTCT AGCAGGGAGA 2040 

AATGGAGTTG GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AGAATAATCA ACCGACTTCA 2100 

GGTAATATTA TAAGCAGTGA TAATGTTGGG TATTTAATCG AAGAACCAAA ATTATTTTTA 2160 

TCTAAAACAG GTTTAGAGAA TTTAAAATAT TTGTCAAATT TATATGGTGT TGACTACAAT 2 220 

CAAGAAAGAT TTAGATGTTT GATCCAAGAG TTAGATTTGA CTCAGTCTAT TAATAAAAAA 22 80 

GTAAAGACCT ATTCTTTGGG TACAAAACAA AAATTAGCTT TGCTTCTAAC TCTCGTTACG 2340 

GAACCTGATA TATTGATTTT AGATGAACCG ACTAATGGTT TAGATATTQA ATCATCACAA 2400 

ATAGTTTTAG CGGTTCTAAA AAAATTAGCT TTACATGAAA ATGTGGGAAT TTTAATATCG 24 60 

AGTCATAAAT TAGAAGACAT TGAAGAAATT TGTGAGAGAG TTCTTTTCTT GGAGAACGGG 2520 

CTTTTGACAT TTCAAAAAGT AGGAAAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 2580 

TCATCAGCTA CAGATAGAGA CATTTTCATT ACCAAACAAG AATTTTGGGA TATTGTTTAG 2 640 

GAAGAGGGAT TGAGAATTAC TATGTCTGGG AATATTCAAA ATAGTGAGCT TTTTAAATTT 2700 

TTTAACGAAA ACT CT ATT AA AGTAGTTGAT TTTGAAACTA AAAAAGAGAC GCTTAAAGAT 2760 

ATTTACCTAA ATCGTTCAAA ATAAAGGAAG GTTATAATCA TGAAATTAAA TAAACAGAAG 282 0 
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AATCGGATCA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 

GCTTTGAATG GCATTGTGTT ACT AG T CAT A GTAAGTAAAT TGGGTATTCC AGGTGATTTA 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 

CTATTATCTT ACATTTTCTA TTT AT AC AAA TTGAAAAGTG GCTTGGTATw TGGT ATTTT A 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT- 

GATTTGTTTA TCAAAGCAAT TGAA 

{2) INFORMATION FOR SEQ ID NO: 22 5: 

{i> SEQUENCE CHARACTERISTICS: 

(A> LENGTH; 3766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 



TACGGTATTA 


TTTTTAAGGA 


GAAAGAATCA 


TGAAAATCAA 


AAAATGGCTT 


GGTCTAGCAG 


60 


CCCTTGCTAC 


AGTCGCAGGT 


TTGGCTCTTG 


CAGCTTGCGG 


AAACTCAGAA 


AAGAAAGCAG 


120 


ACAATGCAAC 


AACTATCAAA 


ATCGCAACTG 


TTAACCGTAG 


CGGTTCTGAA 


GAAAAACGTT 


180 


GGGACAAAAT 


CCAAGAATTG 


GTTAAAAAAG 


ACGGAATTAC 


CTTGGAATTT 


ACAGAGTTCA 


240 


CAGACTACTC 


ACAACCAAAC 


AAAGCAACTG 


CTGATGGCGA 


AGTAGATTTG 


AACGCTTTCC 


300 


AACACTATAA 


CTTCTTGAAC 


AACTGGAACA 


AAGAAAACGG 


AAAAGACCTT 


GTAGCGATTG 


360 


CAGATACTTA 


CATCTCTCCA 


ATCCGCCTTT 


ACTCAGGTTT 


GAATGGAAGT 


GCCAACAAGT 


420 


ACACTAAAGT 


AGAAGACATC 


CCAGCAAACG 


GAGAAATCGC 


TGTACCGAAT 


GACGCTACAA 


480 


ACGAAAGCCG 


TGCGC TTT AT 


TTGCTTCAAT 


CAGCTGGCTT 


GATTAAATTG 


GATGTTTCTG 


540 


GAACTGCTCT 


TGCAACAGTT 


GCCAAC AT CA 


AAGAAAATCC 


AAAGAACTTG 


AAAATCACTG 


600 


AATTGGACGC 


T AGC CAAACA 


GCTCGTT CAT 


TGTCATCAGT 


TGACGCTGCC 


GTTGT AAAC A 


660 


ATACCTTCGT 


TACAGAAGCA 


AAATTGGACT 


ACAAGAAATC 


ACTTTTCAAA 


GAACAAGCTG 


720 


ATGAAAACTC 


AAAACAATGG 


TACAACATCA 


TTGTTGCAAA 


AAAAGATTGG 


GAAACATCAC 


780 


CTAAGGCTGA 


TGCTATCAAG 


AAAGTAATCG 


CAGCTTACCA 


CACAGATGAC 


GTGAAAAAAG 


340 


TTATCGAAGA 


ATCATCAGAT 


GGTTTGGATC 


AACCAGTTTG 


GTAATAAGAA 


ACAGCGAGCT 


900 


GGGAGAGAAA 


ATTCCACCTC 


TTGCTTTTGT 


ATAGAGTATA 


GATTGTAAAG 


AAGACTATTC 


960 


GTTCATAGAA 


AGGTAGAGAG 


AATATGGTTT 


TTCCTAGCGA 


ACAAGAACAG 


ATTGAAAAAT 


1020 



2880 
2940 
3000 
3060 
3120 
3144 
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TTGAAAAGGA TCATGTAGCC CAGCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080 

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAACCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCCCCC TTTGTCATGG 1200 

CACATTTCAA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260 

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC JcTTTACGCTT TCGGTCCGCA 1320 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380 

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500 

AC AAACTC CG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG G AC C AAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 204 0 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTT G ATG 2220 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2 34 0 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 24 00 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460 

TACGGAATCG TT GG AT ATT C TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTCACCACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 264 0 

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 27 60 

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAG CGTGTG 2820 
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GCAATTGCGC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCAGACGA GTCAACTTCT 2880 

GCCCTTCATC CGAAGACAAC C AAGC AG ATT TTCGCCTTGT TGCAAGATTT GAACCAAAAA 2940 

TTAGGCTTGA CTGTTGTCTT G ATT ACGCAT GAAATGCAGA TTGTCAAAGA CATTGCCAAC 3000 

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGG GTAGTGTGCT TGAAATCTTC 3060 

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 3120 

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGGAACACT TGTCTGAAAA CAGTCTCTTG 3 180 

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAG 32 40 

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3 300 

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3 360 

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGG GAGTACAGTA AGATGGAATC 3 420 

ATTGATTCAA ACCTATTTAC CAAATGTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3 480 

GGGAACGGCT ATCTACTTAA CT CTTT AT AT GACAGTTCTT TCCTTCATTA TCGGAGGCTT 3 54 0 

CTTGGGGCT A GTGGCAGGTC TCTTTCTCGT CTTGACACCG CCAGGTGGTG TCTTGGAGAA 3 600 

TAAACTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3 660 

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720 

AAATGCAGCC CTTGTCCCAC TTTCTTTTGC AGTCTTTGCC TTCTGG 3766 



<2> INFORMATION FOR SEQ ID NO: 2 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
C O I TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC GGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATGGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 12 0 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTC A TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC GGTGTCACTG TCTTTGGTGG 3 60 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATCA CTGTGACTTC 420 
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AGAAAGCAAG 


GGCAATGGCG 


CGATTGCTGG 


TTTGGTAATC 


GGTTTGTCAT 


TCATGGCGAT 


480 


GATTCTTGTC 


GGATTGAAGA 


TTACTGGACT 


TTCAGTAAAC 


CCAGCTCGTA 


GCTTGGCACC 


540 


AGCTGTCTTG 


GTAGGCGGCG 


CAsCCTTCAA 


CAAGTTTGGA 


TTTTCATCCT 


TGCACCAATC 


600 


GCTGGTGGAG 


TTCTTGCAGC 


CCTTGTTGCA 


AAAAATTTCC 


TTGGAACAGA 


AGAATAATTG 


660 


AAACTCAAAA 


AGCCTTGCTC 


CTCATCTTGA 


GGAACAGGGC 


TTTTTCGTAT 


GAT ACT CTTC 


720 


GAAAATCTCT 


TCAAACCACG 


TCAGCTTCAT 


CTTGCCGTAG 


TATGGTTACT 


GACTTCGTCA 


780 


GTTCTATCCA 


CAACCTCAAA 


ACAGTGTTTT 


GATCTGACTT 


CGTCAGTTCT 


ATCTGCAACC 


840 


TCAAAACAGT 


GTTTTAAGCT 


GACTTCGTCA GTTCTATCTG 


CAACCTCAAA 


ACAGTGTTTT 


900 


AACCTGACTT 


CGTCAGTTCT 


ATCTGCAACC 


TCAAAACAGT 


GTTTTAAGCT 


GACTTCGTCA 


960 


GTTCTATCTG 


CAACCTCAAA 


ACAGTGTTTT 


AAGCTGACTT 


CGTCAGTTCT 


ATCCACAACC 


1020 


TCAAAACAGT 


GTTTTGATCT 


GACTTCGTCA 


GTTCTATCCA 


CAACCTCAAA 


ACAGTGTTTT 


1080 


GATCTGACTT 


CGTCAGTTCT 


ATCCACAACC 


TCAAAACAGT 


GCTTTG AG C A 


ACcTGCCGCT 


1140 


AACTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATGACT 


TTAGCGGTTG 


TCAATTTTCT 


1200 


CTGGATAAAG 


GTCGTGTTGG 


AAGAGGCGTT 


GTTCTGCCAA 


GCCCTCATAC 


TTAGTTCCTT 


1260 


GCTTACCGTA 


GTTGT AG TAG 


GGGTCGATTG 


AAATGCCACC 


GCGCGGAGTG 


AATTTTCCCC 


1320 


AGACTTCTAA 


ATAGCGAGGG 


TCTAGCAAGT 


TGACCAAGTC 


TTTCCCGATG 


GTGTTGATAC 


1380 


AGTTTTCGTG 


GAAAT CTCCG 


TGGTTTCGGT 


AGCTAAATAG 


ATATAGTTTG 


AGGGATTTTG 


1440 


ACTCGACACA 


GAGCTTGTCA 


GGAATGTAGG 


AAATATGAAT 


CGTCGCAAAG 


TCTGGCTGAG 


1500 


CAGTGATTTG 


TCCCAGCAGA 


GACATATCGA 


GGATATGGTG 


ACGAATGCCC 


TGTTCCTTAG 


1560 


CGATTTCTCT 


AGTAATTTGA 


ATTTCGAGGT 


GATGACGTTG 


GCCGTAGGCA 


AAGGTGACAG 


1620 


CTTCGACTGT 


TTCATAGTGT 


TGCATGACCC 


AGAAAAGGCA 


GGTTGTTGAA 


TCTTGACCAC 


1680 


CACTAAAGAC 


GACCAAGGCT 


AATTGACGTT 


TCATAGTACT 


CCTTCCAAAA 


TGGGAAATGT 


1740 


TCAGAGCACG 


CAAAAAGCTC 


CCATTAGGGA 


GCTAAAAAAT 


ACCAAATCGA 


GGTTTTTTTA 


1800 


GCGATGGCAT 


ATCCCAAACA 


TCGTAATATT 


CTACTTATAT 


AGTAAAATGA 


AATAAGAACA 


1860 


GGACAAATCG 


ATCAGGACAG 


TCAAATC GAT 


TTCTAACAAT 


GTTTTAGAAG 


TAGAGGTGTA 


1920 


CTATTCTAGT 


TTCAATCTAC 


TATAGTCTAG 


CATATTTTTT 


GAAAAATGGC 


AAAGGGCAAG 


1980 


AAAAAAGAGA 


CCAAAGAAAG 


TACTTGGTCT 


CTCGTTTGAT 


TAGCTCAATT 


CAGCAATGAT 


2040 


GGCCTTGATT 


TGTTCTGCTG 


TGTGAACACC 


TGCAACTTGT 


TTGACAACTT 


GGCCGTCTTT 


2100 


TTTGAAGAGA 


AGAGTTGGAA 


TAGACATGAT 


TCCAAAAGCA 


CGAGCTGTGT 


TTGGATTTTC 


2160 


AT CAACGTCC 


ATTTTAACGA 


TTTTCAAGAC 


ATCTTCTGAA 


AGTT CTTC AG 


ACAATTTGTC 


2220 
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CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 22 80 

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATT GCTTTTG CCATTGTATT 234 0 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 24 00 

CATAAGTCCC ATCGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACACGGTAGT 2460 

AkkCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 2520 



t2) INFORMATION FOR SEQ ID NO: 227: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5278 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



ACTCAGTTAG 


ATTTTGTTTT 


CAAAAACAAC 


GAAGAAAAAG 


ACCATGTTGC 


TCTACTTGGA 


60 


AGAATTGGCT 


CCGAACGTGT 


TTATCGATAT 


ATTAATAAAA 


AATATTTAGA 


TTTACCGGAA 


120 


ACATTCGAAA 


ATTATAATGT 


TTTTGTACCA 


GAAGCTAATG 


GAAGTGGTGC 


CTTAGGTGAA 


180 


GTCTTATCAA 


CACCCCTAAT 


CGGGGAACCC 


CTAATCGGGC 


ATACAGATAC 


TTTTTTATCT 


240 


ATTGGTAATT 


TTAAAACAAA 


ATTTGAAGCC 


GATGCTTGTA 


TTAAATTTAT 


TAAAACTAAA 


300 


TTCGCTAGAG 


TATTATTAGG 


TGTTTTGAAA 


GTTACTCAGC 


ATAATTCACG 


CAAAACTTGG 


3 SO 


TATTACGTCC 


CCCTCCAAGA 


CTTTACGGTC 


AATTCGGACA 


TTGATTGGAC 


ACAATCAGTG 


420 


ACTGATATTG 


ACCGCCAGCT 


TGATCAAAAA 


TATGACTTTT 


CCCCTGAAGA 


AATTGCCTTT 


430 


ATTGAGAATC 


ATGTAAGGGA 


G ATGGAT TAG 


AAAACTATTT 


TTATTTGACA 


AATAGTGCTC 


540 


AATGATCTAA 


AATGACTATA 


TAGGATTAGG 


TCAGGAAGCA 


TACGATGCCC 


TGACCCTTTT 


600 


TGTACTTATG 


AGATGAGAAA 


GTCATTTGTT 


AGATAAATTG 


ACTCGTT AG C 


AAACGTTCAA 


660 


AAAAGGAAAA 


CTTATGCCAG 


TAGAAATTAA 


AACCACTAAA 


GAAATTCATC 


CTAAAATCTA 


720 


TCCCTACACC 


ACACCGACAG 


TAACCAGTAA 


TGAAGGCTGG 


ATTAAGATTG 


CGTATACAGA 


780 


ACGTGATGTC 


ACACAACGTA 


TCAAGGAGCA 


AACGCATACA 


GCTCATATAG 


CTACAGATGT 


840 


CTTATGGACT 


GGTGATGCAG 


CTTATACAGA 


AGAGCCTGAT 


AAGGGGAAAA 


CTTTCAAG G A 


900 


CCATGATTTC 


CACCATTTCC 


TTTCTTTCCA 


TGATGTAGAA 


CGTCGTCCCA 


AGACGGAATG 


960 


GTTCTATTTT 


AATGGAACTC 


CTGAAAAATC 


AAAAAATCTT 


TTTGATAAGT 


TTGTTCAGCA 


1020 


TGATTTGTCT 


GGTTATCAGC 


CTGGAAAAGG 


ACAGGACTAT 


ACTCTGCGAC 


AAGAGCAAGA 


1080 
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AGAAGCAGTT 


GCTAAGACAT 


TAGCTTATTT 


CCAAGAACAT 


GCTGGAGGCA 


AGTTTCTCTG 


1140 


GAATGCCAAG 


CCACGCTTTG 


GTAAAACCTT 


GTCTACCTAT 


GACCTAGCTC 


GACGGATGGA 


1200 


AGCTGTCAAT 


GTCCTAATTG 


TAACAAACCG 


CCCTGCCATT 


GCTAACTCAT 


GGTATGATGA 


1260 


TTTTGAAACA 


TTCAT AG C AG 


GTCAAACGAC 


TTACAAGTTT 


GTTTCTGAAT 


CAGATAGCCT 


1320 


TAAGAGTCGT 


CCAATCTTGT 


CACGACAAGA 


ATTTCTTGGT 


ATTTTAGCTG 


ACGATGTAAG 


1380 


ACAACTTGCT 


TTTATCAGTC 


TCCAAGACTT 


GAAAGGATCT 


GTTTATTTAG 


GTGGAGAGCA 


1440 


C G AT AAACTC 


AAATGGGTAA 


CTGATCTGCA 


TTGGGACTTG 


TTGGTTATTG 


ACGAGG CTC A 


1500 


TGAAGGAGTT 


G AT AC CTTCA 


AGACTGACCA 


AGCCTTTAAT 


AAGATTCGAC 


GAAATTTTAC 


1560 


TCTGCATTTG 


TCAGGTACAT 


C ATTT AAAG C 


ATTGGCTAAA 


GGAGATTTTA 


C AG A GGAAC A 


1620 


AATCTACAAC 


TGGTCTTATG 


CTGATGAGCA 


GGCTGCTAAG 


TATTCGTGGT 


CTCTTGAGCA 


1680 


AGAAGAGGAA 


AATCCTTATG 


AAAGCTTGCC 


TCAGTTGAAT 


CTCTTTACCT 


ATC AAATGTC 


1740 


TCAGATGATT 


GGCGAAAAGT 


TAGAAAAAGG 


CGCTCAGATC 


GATGGTGAAA 


ATATTGACTA 


1800 


TGTTTTTGAC 


TTAAGTGAAT 


TTTTCGCTAC 


AG ATG AT AAA 


GGGAAATTTA 


TTCATGAGCA 


1860 


TGATGTCAGA 


AATTGGTTAG 


ATACTCTATC 


AAGCAATGAA 


AAATATCCAT 


TTTCAACCAA 


1920 


AGAACTCCGT 


AATGAACTCA 


AGCATACTTT 


TTGGCTTTTA 


GAACGTGTCG 


CTTCGGCCAA 


1980 


AGCATTAAAA 


GCC CT ACT AG 


AAGAACACCC 


AATCTATGAA 


AACTATGAGA 


TCGTTCTAGC 


2040 


TGCTGGTGAC 


GGACGTATGT 


CCGAAGAAGA 


CGATAAAGTC 


AAACTC AAAT 


CCTTGGACTT 


2100 


GGTTAGAAAA 


GCGATAGCAG 


AGAATGACAA 


AACCATTACC 


CTATCCGTTG 


GTCAGCTGAC 


2160 


GACAGGTGTC 


ACTATCCCTG 


AATGGACAGG 


TGTATTGATG 


TTATCAAATT 


TG AAATC AC C 


2220 


AGCTCTTTAT 


ATGCAGGCCG 


CCTTCCGTGC 


TCAAAATCCT 


TACTCATGGA 


GCGATAACAA 


2280 


AGG AAATC AC 


TTTCGCAAAG 


AAAG AG CCTA 


TGTATTTGAC 


TTTGCGCCGG 


AAAGAACCTT 


2340 


GATTCTCTTT 


GATGAGTTTG 


CCAACAACTT 


ATTGCTTGTA 


ACTGCAGCTG 


GTAGAGGAAC 


2400 


TTCAGCTACA 


CGCGAAGAAA 


ATATTAGAGA 


ATT ATT AAA C 


TTCTTTCCAA 


TTATTGCCCA 


2460 


AGACCGTGCT 


GGTAAGATGG 


TTGAAATTGA 


TGCAAAGGCA 


GTTCTAACCA 


CTCCTCGCCA 


2520 


GATAAAAGCT 


AG AG AAGTT C 


TTAAACGAGG 


TTTTATGTCC 


AATCTCTTAT 


TTG AT AAT AT 


2580 


TAGTGGTATT 


TTCCAAGCAA 


GTCAAACAGT 


TTTAGATATT 


TTAAATGAGC 


TGCCAGTTGA 


2640 


AAAGGAAGGG 


AAGGTACAAG 


AT AGTTC TGA 


TTTATTAGAT 


TTTTCAGATG 


TTACAGTCGA 


2700 


TGATGAGGGA 


AATGCAGTAG 


TAGACCATGA 


AATTGTAGTT 


AATCAGCAAA 


TGCGACTTTT 


2760 


TGGTGAAAAA 


GTTTATGGAC 


TTGGTGAATC 


TGTTGCTGAG 


TTAGTCACAA 


AAGATGAGGA 


2820 


ACGAACTCAA 


AAACAGCTGG 


TCAATGACTT 


GAGTAAGACC 


GTT TCTTC AG 


TGATTGTAGA 


2880 
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GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGGGAAACT GAGCAAATTA AG AAACAAAT 294 0 

TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 3 000 

TCATATCAAG CAACACTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 3060 
TAAGATTCAA GAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 3120 
ACTAGAACAA AC ACT C AAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 3180 
TGAGATAAAA CGTGTGGAAC AGTTGAAACA ATCAGCTCAA GATGAAATTC GTG AC CATTT 324 0 

ACGAGGGTTT GCAAGAACAA TT CCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 33 00 

ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 3 360 

GATTGATCAG TTT AG AT AT T TGCGAGATGG TGGGCAGGAT TTTG C AGGGC ATCTCTTTGA 34 20 

TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 3 4 80 

TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 3 540 

AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 3 600 

AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 3 660 

TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 3 7 20 

TCCTGAAGAA CGCTTAAAAC AT ATTTTGG A AAAGCAAGTT TATGGATTTG CTCCGTCTGA 3 780 

GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAAT CTTTCTAAAG ATATCAGTAG 3 840 

GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 3 9 00 

GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 3 9 60 

GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 4 020 

CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 4080 

GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 4140 

AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 4 200 

AACTCCCTGA TAATTCAAGG AGTTGTCTAT ACTTAAATTA GTTTTTAGAA GCTTCTTGGA 42 60 

ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTT AG C A GATGCTTGCA 4 320 

TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG AC G AACG AT A CCATGTGCAC 43 80 

CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 4440 

CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGG 4500 

CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 4 560 

GAACACCTTC GAACAATTCA AT CAATTC AG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 4 620 
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ATTCTTCAAG GTTTAC AC C A GCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4 680 

CGTGTTCACC CATGATGTAG GCGTGCA CTG AACGAGCATC CACATCCAAT TTTTCAGCAA 474 0 

GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4 600 

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 4860 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GATTGATTTG TTGATAGCAA 4920 

GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 4980 

TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTCAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT 5100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 5160 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTAGTTGAAG 5220 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 5278 
(2) INFORMATION FOR SEQ ID NO: 228: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANPEDNESS : double 

(D) TOPOLOGY: linear 



Txi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 



ATAAGGAATC 


TCTAAAAAAT 


TTTAAGGAGA 


ATCTAGCAAA 


TGGATTTCAC 


ATGGGCACTG 


60 


AAGTATG CCA 


CTGAATTTTT 


GGGAACTGCC 


ATTTTGATCA 


TTCTTGGGAA 


TGGTGCAGTT 


120 


GCCAACGTTG 


AACTTAAAGG 


TACGAAAGGT 


CACCAAAGTG 


GCTGGATCGT 


CATCGCTGTT 


180 


GGTTATGGTA 


TGGGGGTTAT 


GATCCCAGCC 


TTGATGTTTG 


GTAACGTATC 


TGGGAATCAC 


240 


ATCAACCCTG 


CTTTCACTCT 


AGGGCTTGCA 


GTTAGCGGTC 


TTTTCCCTTG 


GGCACAAGTG 


300 


GT ACCTT ACA 


TTATCGCGCA 


AGTCTTGGGG 


GCTATCTTTG 


GCCAAGCCTT 


AGTTG TGGCA 


360 


ACATACCGTC 


CATTCTACTT 


GAAAACTGAA 


AACCCAAATA 


ACATCTTGGG 


AACTTTCTCA 


420 


ACTATTTCAA 


GTATTGACCA 


TGGTACAAAA 


GAAAGTCGCT 


ATGCAGCAAC 


TGTCAATGGT 


480 


TTGATTAATG 


AGTTTGTTGG 


TTCATTTGTT 


TTGTTCTTTG 


CAGCTCTTGG 


TTTGACTAAA 


540 


AACTTCTTTG 


GTGCTGAAGT 


GCTTCAATTC 


ATGAAACAAA 


AGGCAACAGA 


AGCAGGACAA 


600 


ACAGTTGATT 


TTTCTGACTT 


GGCTATTAAA 


GCACAGGTGG 


CTCCACACAC 


TGCTTCAGGA 


660 


CTTTCTGTGG 


CTCACTTGGC 


ACTTGGATTC 


CTCGTTATGG 


CTTTGGTAAC 


ATCACTTGGA 


720 


GGACCTACAG 


GACCTGCCTT 


GAACCCAGCC 


CGTGACTTGG 


GACCACGTCT 


CCTTCATGCT 


780 



WO 98/18931 



PCT/US97/19588 



1247 

TTCCTTCCCA AATCACTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTGG 840 

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC C ATCT AT AAG TAAGAGAGGA 960 

TCAGACTGGk TCTCTCTTTT kCATTTTTaG GGAAATGAAA GAAcTCTAAA CAAACTCCTC 1020 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT TCAATGGACT ATAGTAGGTT 1080 

GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA GAAATTGGTT TCAATTCTCC 114 0 

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCCG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGG T TG AAGTTGG AATAGTCCTC CCTTCTTTCT 1260 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT GTTCAGTTTC TATTTCATTT 1320 

TACTATAAAA TAAGCGATTA GGGGGGCTAT T CTTCG ACCT ACATTGACTC TCCTGAGTCC 13 80 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAACTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAACGATT 1560 

TGCCCT ATT C TTGTTTCATT TTACTATATA AAC C AG AG AC TGTTTACATT TTCAGCAAGT 1620 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCT ATT A 1680 

ATTAGTTAAA TTTTTACCAA G AAT AATTC A CAAAAACCTT GTAAAACACT TGCAATTTAG 174 0 

CTGAAATTTG ATAAAAT AG T AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGGT AAAACT AC C C TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G 1941 
< 2 > INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 29: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTACCCTGT TGTTAAAAAG ATAGGGAGTC 120 

AT AATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
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GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 240 

CGTTGTTCAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 300 

ATTTAAATTT TCCG T ATT AG TCTTCCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 3 60 

TGATTCCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 4 20 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCACTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 54 0 

AT GG TT AG AG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTCAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTCTCATTAT ATTTGCAACG 72 0 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 7 55 



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



CCAGAAAAAC 


CGTAGTGGAG 


CTCGTGGAAC 


AGTGGAATTG 


ATTTTCCAAA 


AAGAATACAA 


60 


TAAATTTTCA 


AGTATCTCAA 


AGAGGGAGGC 


ATAAGATGTC 


AGATGCATTT 


ACAGATGTAG 


120 


CCAAGATGAA 


AAAAATCAAA 


GAAGAAATCA 


AGGCACATGA 


GGGACAAGTC 


GTAGAAATGA 


180 


CTTTGGAGAA 


TGGTCGTAAG 


CGCCAAAAAA 


ATAGATTGGG 


TAAGCTAATT 


GAAGTTTATC 


240 


CATCTCTATT 


TATTGTGGAG 


TTTGGGGATG 


TGGAAGGAGA 


TAAACAAGTT 


AATGTTTACG 


300 


TTGAATCCTT 


TACTTACTCA 


GATATTCTTA 


CAGAAAAGAA 


TTTGATTCAT 


TATCTTGACT 


360 


AAAGTGAGAA 


ATTTTCTCAC 


TTTTTCTTTT 


TTCTCCGAAT 


AATTTAGGTG 


AAGGCAATCA 


420 


TCGCTTTATA 


TTATTTTTCA 


AGGAGGAAGA 


ATGAAAATTT 


TACCGTTTAT 


AGCAAGAGGA 


4B0 


ACAAGTTATT 


ACTTGAAGAT 


GTCAGTTAAA 


AAGCTTGTTC 


CTTTTTTAGT 


AGTAGGATTG 


540 


ATGCTAGCAG 


CTGGTGATAG 


TGTCTATGCC 


TATTCCAGAG 


GAAATGGATC 


GATTGCGCGT 


600 


GGGGATGATT 


ATCCTGCTTA 


TTATAAAAAT 


GGGAGCCAGG 


AGATTGATCA 


GTGGCGCATG 


660 


TATTCTCGTC 


AGTGTACTTC 


TTTTGTAGCC 


TTTCGTTTGA 


GTAATGTCAA 


TGGTTTTGAA 


720 


ATTCCGGCAG 


CTTATGGAAA 


TGCGAATGAA 


TGGGGACATC 


GTGCTCGTCG 


GGAAGGTTAT 


780 


CGTGTAGATA 


ATACACCGAC 


GATTGGTTCC 


ATTAC TTGGT 


CTACTGCAGG 


AACTTATGGT 


840 
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CATCTTCCCT GCGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT 
GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AG G ATTT ATT 
CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 1020 
GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCGGAACT 1080 
GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT TCAAAAACAC 
GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG 
GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 
TTTAAGACCA AGTCTGCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 
TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 
TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 
TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 
(2) INFORMATION FOR SEQ ID NO : 231: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
CCCGGAAAAC AAGTTAAAGT TGAAGTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 
GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 
TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 
AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 
ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CGCTTAATTT TAAGGAGAAC 3 00 

ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 3 60 

CTGGACACGC CGAGAGTGGC GAATACGGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 4 20 

TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 4 80 

TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 54 0 

AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTT ATC GG 600 
AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GG AGGAAAAC 6 60 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 720 



60 
120 
180 
240 
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TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAACC AGCTGACGGA 780 

CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 840 

GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 

TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 

CGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 1020 



TTAAAAT 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1990 base pairs 
(B) TYPE: nucleic acid 
(Ci STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



CGGTTCAAAT 


GGTGCAGGTA 


AATCTACGTT 


AATTAATTCT 


ATTGTAGGTT 


TTCAAGAGAT 


60 


TTATTTAGGA 


GAAATAGAGT 


ATTGTGATAA 


AGATTTGATA 


GTTAGTTCTC 


AACCTTTTGC 


120 


TCATTTAGGC 


TTTACTCCTC 


AAACCACAGT 


AATTGATTTT 


TAT ACT ACTG 


TGAAGGACAA 


180 


TGTAATATTG 


GGGCTGAACC 


TTGCTGGAAA 


GTTTGGGAAA 


AATGCTGAGA 


AGTTGTGTCA 


240 


AATAGCCTTA 


GAAATTGTTG 


GCTTAGCTGA 


TAAAAAAAAT 


AATTTGGT AG 


AAACATTGTC 


300 


AGGTGGACAA 


CTGCAACGCG 


TCCAGATTGC 


TAGAGCAATA 


GCTCATAATC 


CAGATTTTTA 


360 


TATTTTAGAT 


GAACCTACCG 


TTGGTTTAGA 


TACTGAATCT 


GCCGAAAAAT 


TTTTAATGTA 


420 


TTTAAAAGAT 


AAGAGTTTGG 


AAGGAAAAAC 


TATTATCATA 


TCTTCACATG 


ACATAAAT CT 


480 


ACTCGAAAAG 


TTTTGTAAAA 


AAATACTTTT 


TTTACAAAAT 


GGCTC CATAT 


CATTTTTTGG 


540 


TGATATGCGT 


GACTTTGTAG 


ATAATTCAAC 


TATCAAATTA 


AATTTTTCAA 


TGCAGAATAG 


600 


AATTTCTAGA 


TATCAAATTG 


AATTTTTAGA 


AAATTTTAGA 


TTTAAAGTTC 


ACATCGAAGA 


660 


TAATGATAGT 


TTTACAATAG 


AAGTCCCTAT 


AGAAGAAAAG 


ATCTTAGATG 


TT ATCAATG A 


720 


GGTAGGAAAA 


GCATGTGAAA 


TTAAAAACTT 


TTCAACAAGT 


AAATTAACCT 


TACAAGAAAG 


780 


TTATTTGCAA 


AGAATAGGAG 


GAGAAAAATG 


AAGGCTG AT C 


AATTAAGGCA 


CAAATCGGAC 


840 


TTAGGTTTAA 


GAGGTCTAGC 


GATTATTGCT 


AAAAATGAGA 


TTATTGCTTT 


TTTTAGAAGT 


900 


AAAGGTTTAA 


TTATTTCTCA 


GTTTCTACAA 


CCAATCTTAT 


ATGTTGTTTT 


TATAATAATA 


960 


GGATTAAATT 


CTTCGATAAA 


GAACATTCAG 


TTTAATGATA 


TAAAAACCTC 


TTATGCAGAA 


1020 


TATACAATCA 




AGCTTTATTG 


ATAATCGGGC 


AGATGACTCA 


AGT TATTTAT 


' 1080 
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AGGGTGACAA TAGATAAAAA ATATGCGCTA CTTGCTCTTA AGTTATGCAG TGG AGTT CGT 114 0 

CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TACGCTTCAT AGTTCAAGAA 12 00 

ATTATTATAT ATATAATTAC CTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 12 60 

TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGG ACTCCCTTCC AATTTTACTT 1320 

ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 13 80 

CTTGGTTTTA CACCTCCTCT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG 144 0 

ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 1500 

AATTCAACAA CTTTGGAATT AGTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 1560 

GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTT GGGTC AT 162 0 

CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 168 0 

AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 174 0 

CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAACTTTAT CACATACCAC 1800 

ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCG TC AAA TCATTTATGG 1860 

CACATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 192 0 

AATC C ATGAA CCGAATCCTT GTTCAG AC C A TATTCACATG CTTATCAGTA TTCCTCCGAA 1980 

ACTTAGTGTT 1990 
(2) INFORMATION FOR SEQ ID NO: 23 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCGCCATAT TTAGGTCCTG GACGTGGAAT 12 0 

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 18 0 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 24 0 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGT AAAG AC GC AGGT ACT A CTTGGAGTGC 360 

AC CGG AAT AT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 420 
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TTATAGCCAA TTGATTGATG GTAAAAAGGC TGTCATTTTA ACTACTCCAA ACTCCACAAA 4 80 

TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT GATGATAATA CAATTGATTC 540 

GCGTTATCAT CACGACGTTG ATTATAGTAA CTATGGATAC TCATATTCAA CATTGACAGA 600 

GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT GATTCATGGT CTCGT AATG A 660 

ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG ATTGAAGATC TGAAAAAGAA 720 

TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA ATTATGGTAA ATTACGGTAT 780 

TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA ATGAACAAAA TTGAAGATGC 840 

AAAAG TGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA GTTGCTCAAG AGTTGGGATC 900 

AGATCTTTGT GCAAGTTTAG AT G AACTTGT AGCACGTGAA GATATTGATT GTCTGATCGT 960 

AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA GCTGCTCAAC ATGGCAAACA 1020 

CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT TGTAAAGCCA TGGTTGACGC 1080 

ATGTAAAGAA AAT AATGTC A TCTTTATGGC TGGTCACATC ATGAACTTCT TTAACGGTGT 1140 

ACACCATGCT AAAGAATTGA TTACTCAAGG TAAAATCGGT AAAGTTCTTT ATTGCCATGC 1200 

TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTAT C A TGGAAG AAAC TTCGTTCTCA 12 60 

ATCTGG AG G A C ATTTGT AC C AC CAT ATTC A TGAATTAGAT TGCATTCAGT TTATCATGGG 1320 

AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA TATCATAAAG GTGAAAACTT 1380 

TGGTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC TCTGATGATC GTTATGCTGT 14 40 

TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC GTCTTGATTC AAGGAACTGA 1500 

AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT CTTCGT GTT A AAGGTGAAGG 1560 

AGAATCACAC TTCTTAGTTC ATGAAACTCA AG AGGAAG AT GATGATCGTA CAGCTATCTA 1620 

TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA CCAGGAGTAC GTTGCCCATT 1680 

ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA CATGACATCA TTAAAGGTGG 17 40 

AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTG T A GCTGCTTTAG AATCAATCCC 1800 

TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA AAAGTAAGTC TTTCAGAAAT 1860 

CACAAATGCT TAACTTTTGT AAAACAGAAT AGTAAATTCT TGTCATTATA TAATTTCTAA 1920 

AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG GGACTGGGAT AATGCCCAGT 1980 

CCCATTTTTT ATCAAAAAGT AATGAG ATCA AAAATGTGGG AGTGTTGAAA TGAAGATTAT 2 040 

AGGT ATCG AT ATTGGCGGAA CAACAATTAA GGCAGATTTA TACGATGAGT TTGGAACGAG 2100 

TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT GATTTGGGAA CGAATCAGAT 2160 

ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA AATCATTCAA TTGATGGTGT 2220 
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TGCGATTTCC 


ACTGCTGGAG 


TTGTTAATGC 


TAATACTGGA 


GAAATCATCT 


ATGCAGGCTA 


2280 


TACAATACCA 


GGGTATATCG 


GAGTAAACTT 


TACTGCCGAA 


ATAGAAAAAC 


GTTTTGGGTT 


2340 


GTATACTTTT 


GTTGAAAATG 


ATGTTAATTG 


TGCTGCATTA 


GGTGAATTGT 


GGAAGGGACA 


2400 


AGCCAAAGAT 


AAGAAAAATG 


TAGTAATGGT 


TACTATTGGA 


ACAGGTATAG 


GAGGCAGTAT 


2460 


TATTGTCAAC 


GGACAAATTG 


TTAACGGATT 


TAACTATACT 


CCTGGTGAAG 


TAGGTTATAT 


2520 


TCCTGTAGGT 


AATTCGGATT 


GGCAAAGTAA 


AGCCTCAACA 


AC CGC ATTG A 


TTCATTTATA 


2580 


TCAAAAAAAG 


AGCTTGAAAA 


CTAATCAAAC 


TGGACGTACT 


TTCTTCACTG 


ATTTAAGATC 


2640 


TGGAGATAAA 


GTTGCTGAAG 


AAACTTTTGA 


AATTTTTGTA 


GAAAATCTAA 


CAAAAGGTTT 


2700 


ATTAACGATT 


TCTTATCTAC 


TTAATCCAGA 


AATTCTCATA 


TTAGGAGGTG 


GGATTCTGGA 


2760 


TAGTAAGGAT 


ATTTTGTTAC 


CTGAAATTCA 


AAGTTCTTTA 


GCTAAAAATG 


CAATGGATAA 


2620 


TAG GTTTTT A 


CCTAAAAATC 


TTGTGGCAGC 


TACATTAGGA 


AATGAAGCTG 


GTCGTATAGG 


2880 


AGCTGTAAAA 


AATTTCTTAG 


ATAGAATTTC 


TAATAAATAG 


TATGTAAGAT 


AAGGAGGTGT 


2940 


CACAATGACT 


AACTCTGTAT 


TTTCGACAAT 


GCAAGATATT 


GAGAATGTTG 


CAACCGATAT 


3000 


TATAAAATCA 


TATGATAATG 


AGATTTATAC 


TTATAAAGCT 


GTTTCCCAAG 


AAGAATTGGA 


3060 


AAAACTAGAA 


AAAAGTTATG 


ATGAAAAAAG 


TCACGAAGAA 


TTAGTTTCAA 


TAGAAAGCAA 


3120 


TTTAGAAATG 


AAACAACAGA 


ACCTTATTGA 


TGAGGTTAAT 


AAAACAATCA 


AGGAAAATGA 


3180 


TGCAAATATT 


CAGTATATTT 


CATCAAGTAG 


GAGACGAGAA 


TTTGTAGAAA 


AAATTATTGC 


3240 


TAGGGTGGTA 


GAAAAATATG 


GCCATTAGTC 


AGATGAAAAG 


AATCTCTCTA 


CTATTTTCTA 


3300 


AAAGTAGTCT 


TGATGATGTT 


TTAAAAACTA 


TTCAAGAACT 


AGAGTCAGTG 


CAGTTCCGTG 


3360 


ATTTAAAGGT 


TCAGGATAAC 


TGGTCAGAAG 


CTCTAGAAAA 


AGATGAAGTT 


GTATTTCCAA 


3420 


CTATTCAAAT 


TTTTCATACT 


TCTAATTCCA 


AT C ATGGGGT 


TATTGAGGGA 


AATGATGCCT 


3480 


I oAU 1 ini 1 1 


IjA i tjAATCAA 


CAACAACATT 


TAGAAGCAAC 


TGTAGAGAAA 


TTACAAGAAT 


3540 


ACCTACCGAA 


AGAAAACACG 


TTTAAATTAT 


TGCAGCAACC 


TCCGATAACT 


ACCTCTTATG 


3600 


AAGAATTAGA 


GAAATTTGGT 


AAAGCTAATG 


TTGCTGAGGG 


TGTTCTTAAA 


AAAGTGAATC 


3660 


ATCAAATTAA 


CAGAGTTCAT 


GAATTAGAAA 


G AC AC ATT C A 


AAGTAATAAT 


GAGGAAATAG 


3720 


AGCGATTAAT 


AAAGTGGGAA 


AAATTAGAAA 


TTGTTCCTGC 


GAATTTAGAA 


CAATTTTCTT 


3780 


TCTGTAAAGG 


AAAAGTCGGA 


ACAATTCCAA 


GGACTGAAGA 


TAATCGCTTA 


TACAATAGTC 


3840 


TTTTAGAAAA 


CAATATTGAA 


GTTCAAGAAA 


TATTTTCTAA 


TGATAGAGAG 


TACGGTGTTG 


3900 


TTGTTTTCTA 


TCAGTCTAGT 


TACTCTATAG 


ATTTTGATGA 


ATACTTATTT 


GAACCATTTG 


3960 



WO 98/18931 



PCT/US97/19588 



1254 

ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATCC 4 020 

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCGTTGCA AGATTCAAAG AAATATTTGA 4080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 4200 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 4260 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320 

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 43 80 

CTACACCTGT ATTAGCACCA TTTTACTTTA CA T T TTT TGG AATGATGGTT GCTGATTTAG 444 0 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4S0O 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 4 560 

CTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4620 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTCTGTTTGG GTTTATTACA GTATTTGCAG 4680 

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 47 40 

CAGGATTTGC GTGGTGTGTT ATTCTG 47 66 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 34: 

CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 120 

AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 300 

TCCAAATGAT G ATGT AT ACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 3 60 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 420 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 4 80 

GGACTCTTCG CCTGGCTGCT T ATTT GAG AG AACATAACTT TGACTTTGCG GCTCTCACTC 540 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
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TGGAACTGGA 


TGAAAATGGT 


GCTGCTCGCG 


TTATCCTGAG 


TCAGAAAATC 


TTGAAACAAT 


660 


ACAATATAAC 


CGATGCTGAA 


ACTGCGGCCA 


TTGTAGGTGC 


ACCTGGACGC 


ATTGACAGAG 


720 


TGAGTCTCTG 


GGGAATTTTT 


GTCGAACAGG 


CTGATGGCCA 


CTACCGAGTT 


CGCTTACGCA 


780 


GTAAAGTCCA 


TCCTATCAAT 


GAAATTGCCA 


AGGAGCATGA 


TGGTGGAGGC 


CACCCTCTAG 


840 


CAAGTGGTGC 


TAATTCCTAT 


AGCCTAGAAG 


AAAACGAAAT 


CATCTACCAA 


AAGTTAGAAG 


900 


ACTTGCTTAA 


AAACTGATAA 


AATACTTGCC 


AAACTTTTCA 


GAATCTGATA 


GACTAGTATA 


960 


GTAACAATCT 


ATGGCTCGCA 


AAGAGACCAT 


GGCAGAAAGG 


AAATATTGCA 


AAATGAAAAr 


1020 


AGATATCCAT 


CCAGAATATC 


GCCCAGTTGT 


CTTCATGGAC 


ACAACTACTG 


GTTACCArTT 


1080 


CCTTAGCGGT 


TCAACAAAAC 


GCTCTAACGA 


AACAGTTGAG 


TTCGAAGGCG 


AAACTTACCC 


1140 


ATTGATCCGT 


GTGGAAATTT 


CATCAGACTC 


ACACCCATTC 


TACACTGGAC 


GTCAAAAGTT 


1200 


CACTCAAGCA 


GATGGACGCG 


TGGATCGTTT 


CAACAAAAAA 


TACGGTCTCA 


AATAATGATA 


1260 


AGAGAACAGT 


TTTGGCTGTT 


CTTTTTTGTT 


TCTTGAAATC 


AACTGCTGTT 


TTCATGTTCC 


1320 


AGACTCATCT 


GTAGGTTCGA 


TTTCCATGCT 


ACTAGGCAGG 


AAGGAAATAG 


CTGTTTCAAC 


1380 


ACGTCCATAA 


TGAGCTATAC 


TATTGTCACG 


AACCACACTT 


TCATTGATGG 


TCCAAGTGGA 


1440 


ATTCATTTTC 


TTAAAAGCTT 


CTCGGACTTT 


TTCCAAATCT 


TTCGAGGCAA 


TGGCCTGCTC 


1500 


TAAGGTTTCA 


AAACGAGGAC 


TT AT ACT CAT 


CTGCTTTCAA 


AAAGCATTCT 


AGTCCATCTC 


1560 


CGATTACCGA 


TGGACTTTAT 


CACCTCCTTC 


TCCAGTCCTT 


GTATGACATC 


TTGAAGTTGA 


1620 


TTCATGACAT 


CTTCCAAAGT 


TCgAAAGGCT 


TTATTCTTAA 


AT CCACGTTT 


ACGAAT CTCT 


1680 


TTCCACACTT 


GTTCAATGGG 


TTCATCTCTG 


GTGTGTATGG 


AGGAATAAAG 


GTAAAATCAA 


1740 


TATTAGTCGG 


AATATTTAAG 


GTACTTGATT 


TATGCCATAT 


AGCATTGTCC 


ATAACGAGTA 


1800 


AAAGGATAAG 


CTTGTGAAAG 


CTCTTCTAAA 


AAGGCGTTCA 


TCCACACTCC 


TTTTTATAAA 


1860 


CCTGAAATAA 


GGCATCAATT 


GTAACAAATT 










CAAGAAAGGC 


TTTCTCTTCC 


TCAACTGTCA 


T AT ATGC AT G 


GTTACGACCA 


CCACGTGTTT 


1980 


CTTGAAGGAG 


AGAGTC GAGT 


CCGAACTCCT 


CATATTTTTT 


TACGTTTCGC 


CAAATCGTTG 


2040 


TTTGATTACA 


GTCTAAAAGC 


TCTATAATCT 


CTTTATAAGA 


TTTGCCCATC 


AGACGAAATA 


2100 


TAGTAGATTG 


AAACTAGAAT 


AGTACACCTC 


TACTTCTAAA 


ACATTGTTAG 


AAATCGATTT 


2160 


GTCCTGTTCT 


TGTTTCATTT 


TACTATAGAA 


CGATTTGAAG 


GCGTTTATAA 


TATTTAGCTG 


2220 


TACGAGAGTC 


TTTTAAAAGT 


GTTTTGATGG 


TTTGGATTTC 


TTCTTTAGTT 


GATTTCATAT 


2280 


TACTATTATA 


TAATGCTTTT 


TGATTTTAGT 


CTG GT AT AAA 


TATTGCTTTC 


CTCCAAAATG 


2340 
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GTCATAGTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT 2400 

TGCTAAACAT TTTCTTTTCT TAT AG CAT AC TTTAAGATTT TGTCTTTGAG AAAGATATTT 24 60 

CCAAGAAAAA CGTTCGTTTT TTGG 24 84 

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 120 

G TTCGTG AAA TGGCAAATGA TAATAAGAGC CATTATTTGA TATACAGAGT TTTAGGTATT 180 

TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTC GTTT TTTATACAAA 2 40 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 3 00 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAG T A CAAAACCTAA GACTTTTGAA 360 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 420 

GATGGAGACC AT AT AACT AA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 4 80 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 54 0 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTCG 600 

GAACATTTAA GAGCAGTGAC CGG TATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 720 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA 780 

AAACCCAAAA ATT AT CTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGACiT 84 0 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 1020 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAAC G A AAACGAGATG GAAACTCTAA AACT AT AT AT AAGGTTGATA 1200 

ATAATGGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260 
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ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGAGTAGG TTATCCTACA CAAAAACCTA 13 20 

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 1380 

CGTTCTGTGG AAGTGGAACT ACTTTACTAG CCTCCAAGAT TTTG AAT AG A AATTATATGG 14 4 0 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGCAA AATGTTATAA 1500 

AAACAAGTTC AAATTT ATT G AATAAAGGAA TCGAAGCATA TAGAACCAAA ACTGAGGAAG 1560 

AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 162 0 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 1680 

AATGTCTGAA TGAGAGTATC TCTTTAT T AC AGAATGCTAT AAACTCCAAA AAACTTGATT 174 0 

TTGGAGTAGT TATAAAAACT CATTCG 1766 

(2) INFORMATION FOR SEQ ID NO: 23 6: 

fi> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AAT AAT ATT A 120 

GCGCGG AG AA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC 180 

T CATCTAAT A CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 24 0 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTCG 3 00 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 3 60 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 4 80 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTCTTCATCA TGTAGTAGAC AAACTCTTCA 6 60 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 74 8 
(2) INFORMATION FOR SEQ ID NO: "237: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 



AAAAGATTAC 


ATTGCAACAA 


TTGAAAATTA 


TCCAAAGGAA 


GGCATTACCT 


TCCGTGATAT 


€0 


TAGTCCTTTG 


ATGGCTGATG 


GAAATGCTTA 


TAGCTACGCT 


GTTCGTGAAA 


TCGTTCAGTA 


120 


TGCTACTGAC 


AAGAAAGTCG 


ACATGATCCT 


GGGACCTGAA 


GCTCGTGGAT 


TTATCGTGGG 


180 


TTGTCCAGTT 


GCCTTTGAGT 


TGGGAATTGG 


TTTTGCGCCT 


GTTCGTAAGC 


CAGGTAAATT 


240 


GCCACGCGAA 


GTTATTTCTG 


CTGACTATGA 


AAAAGAGTAC 


GGTGTCGATA 


CCTTGACTAT 


300 


GCACGCGGAT 


GCCATTAAGC 


CAGGTCAACG 


TGTTCTTATT 


GTAGATGACC 


TTTTGGCGAC 


360 


AGGTGGAACT 


GTTAAGGCAA 


CTATCGAGAT 


GATTGAAAAA 


CTTGGTGGTG 


TTATGGCAGG 


420 


TTGTGCCTTC 


CTTGTTGAAT 


TGGATGAATT 


GAACGGCCGT 


GAAAAAATTG 


GT G ACT ACG A 


430 


CTACAAAGTT 


CTTATGCATT 


ATTAATGAAA 


ACAGTCCCTA 


GGGCTGTTTT 


CTCTACACTA 


540 


GGATATAAAA 


ATAGACTATA 


ACTAGTTAGA 


GAAAAACTAT 


AATTGAAAAC 


TATATCTTCT 


600 


TGCAGTATAA 


TAAAAGGACT 


AAGTGTTTGA 


GATTTGTCTT 


CAAACATATG 


CAATTATTCC 


660 


TGAAAGAGTA 


CAGTTAGGAG 


AGGGTTATGC 


CCATTCGAAT 


TGATAAAAAA 


TTGCCAGCTG 


720 


TTGAGATTTT 


ACGGACAGAG 


AATATCTTTG 


TCATGGATGA 


TCAACGTGCT 


GCCCACCAAG 


780 


ATATCCGTCC 


TTTGAAGATT 


TTAATTTTAA 


ATCTCATGCC 


ACAGAAAATG 


GTCACAGAGA 


fl40 


CCCAGTTGTT 


GCGCCACTTG 


GCTAATACAC 


CCCTACAACT 


GGATATTGAT 


TTTCTCTATA 


900 


TGGAGAGCCA 


CCGTTCTAAA 


ACAACTCGTT 


CAGAGCACAT 


GGAGACCTTC 


TATAAAACTT 


960 


TTCCTGAAGT 


CAAGGATGAG 


TATTTTGATG 


GGATGATCAT 


CACGGGTGCT 


CCAGTTGAGC 


1020 


ATTTACCATT 


TGAGGAAGTG 


GACTATTGGG 


AGGAATTTAG 


ACAGATGCTT 


GAGTGGTCTA 


1080 


AGACTCATGT 


CTATTCGACC 


CTTCATATCT 


GTTGGGGGGC 


TCAGGCTGGG 


CTTTATCTGC 


1140 


GCTATGGTGT 


AGAAAAATAC 


CAGATGGACA 


GTAAGCTATC 


AGGTATTTAT 


CCTCAGGACA 


1200 


CCCTAAAAGA 


GGGTCACCTT 


CTATTTAGAG 


GCTTTGATGA 


TAGCT ATG TA 


TCCCCTCATT 


1260 


CACGGCACAC 


GGAGATTTCT 


AAGGAAGAGG 


TCTTAAACAA 


GACCAATCTC 


GAGATTTTAT 


1320 


CAGAAGGACC 


TCAGGTTGGG 


GTTTCTATTw 


TGGCCAGTCG 


TGATTTACGA 


G AAATTT ATA 


1380 


GTTTTGGTCA 


TTTGGAGTAT 


GACCGTGATA 


CTTTGGCAAA 


AGAGTATTTT 


CGAGATCGTG 


1440 



ATGCAGGTT 144 9 
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(2) INFORMATION FOR SEQ ID NO : 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 .base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 238: 
TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTACT GCTAATCTAT AGTCTACCAT 120 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 240 

AACAGCCAAA ACAAAGTT G A AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 42 0 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 4 80 

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 54 0 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TGTCTTTTTC CGTG T ATTC A CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660 » 

CTTGGCTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 7 20 |€ 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 84 0 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 



(2) INFORMATION FOR SEQ ID NO : 2 39: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 39: 
CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 



60 
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CGTAACATTC AGTTTGGTGT TCGTGAATTT GCAATGGGAA CAATCTTGAA CGGGATGGCC 120 

CTTCATGGTG GACtTCCTGT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 180 

GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 240 

TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 300 

CGTGCTATGC CAAATCTAAA TGTTTTCCGT CCAGCAGATG CGCGTGAAAC GCAAGCAGCT 3 60 

TGGTACCTTG CAGTGACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 4 20 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AAGGTTG CT A AAGGTGCTTA TGTTGTATAT 480 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCCTACT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA G CTT ACAAGG AAGAAATTCT TCCAAATGCA 660 

GTCCGCCGTC CTCTTGCACT CGAAATGGGT GCAAGTCAAA ACTCCTACAA ATATGTTGGT 720 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT 780 

TGGCAGAATA TGGCTTTACT GTAGAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 84 0 

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAAGGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 946 

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 27 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 40: 

CGGGGCTCCc TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAAC AC 12 0 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGT CGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCT CTTG AAAAATCGTG 42 0 

GTC TCAAAGT AAC CATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 
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TGGGT C ACT A TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC CTGACAACTG 600 

GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCCTGG AGAATACCTT GGGGCAACTG 660 

TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 720 

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780 

TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840 

ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900 

CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 960 

GTACAGAAGA GCCAGCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 1020 

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TT AC C AAATT CCACTGAACT 1080 

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140 

CGGATATGAC AGAATGGTCA GCCATGGTGG AC AAGGTC AT GAACCTCAAG AAACAAGTTA 1200 

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 1260 

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 13 2 0 

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 13 80 

C AGGT GGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCCC TATGCGCGTG 1440 

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500 

CTCGTCACGT TTTAGGTCTT GAAGGTG CCA ATTCTGCAGA GCTTGC AC C A GAAACAAAAT 15 60 

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 1620 

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 1680 

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 17 40 

AG CAGTTTG A GGCAGCAGGT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAC 1800 

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 18 60 

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 1920 

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 19 80 

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 204 0 

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100 

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 2160 

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 22 20 

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 22 80 
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TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2 340 

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 2 4 00 

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAG CCACT 24 60 
TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 
AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 
TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG 

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 2700 

GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACCTG 2760 
AAGA 

2764 

(2) INFORMATION FOR SEQ ID NO : 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 
<D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEO ID NO: 241: 

CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60 

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 12 0 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 24 0 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 3 00 

AACT CTT ACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 42 0 

AATGAAG AT C ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT ACT ATCT AAT AACTT AT C AT 540 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

T.TTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AAT ATCC AT A CCTCTGAGTG 720 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900 
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CTTCTTGAAT 


ACTCTCTTGC 


CCCCAGATAA 


TCTCTGTCTT 


TTCACCATAT 


ATCTCACTGG 


960 


TTTTAGCTGT 


ATTTGTATTA 


ACAGCTACTG 


TCACAACTTC 


TGGGAAATCT 


TTAACCAACT 


1020 


CTTTTACCAA 


TTGAGTTAAA 


TTAAGCTGGC 


GGTTTGTAAC 


AATAATAATC 


TGAACCTGTC 


1080 


CGGTCTTTCT 


CGCGCGTCGG 


ACCATAATAG 


TACGGACACC 


TAGAACTTTT 


CTCTCATCCG 


1140 


TGATTGGAAT 


CTGGTGATAA 


GTAAGTAATT 


CTCCTAAGCG 


ATTAGCAATC 


ACTTGGGTTT 


1200 


CCTTATCTTG 


TACCAGGCAG 


TCTTTCAACT 


CTACTAAATA 


GTGAGAGTTT 


TGTGCATATA 


1260 


AGCCCGCCTT 


G AC CTG ATTT 


TTAAATTTTC 


GAGTCTGAAA 


L IVI AALTi A 


GCTCTGTAAT 


L J £. \i 


ATTTTGGTTC 


CTGCATTCCA 


ATAGTTGGAC 


GAATTTCATA 


ATTTTCATAT 


CCTGCAGGAG 


1380 


CAAATTTTTT 


CAGCGCTTGA 


TGAAGTAAGT 


CCCTCTTGAA 


CTCCAGCTGC 


TTATCATAAT 


1440 


GCAGGTGCAT 


GATTTGGCAG 


CCTCCGCATT 


CATTATAAAT 


AGTACAAGAT 


GGCACAATTC 


1500 


GAAATTTAGA 


CTTCTTGTTG 


ACCTTCAGTA 


ATTTTGCTTC 


AACAAAGTTG 


CGTCTAATAG 


1560 


AAGTAATCTG 


ACAATAGATA 


TCTTCGCCTT 


TGAGAGCTCC 


TGGTACAAAG 


ACTAATGTTT 


1620 


TTTGGTAAAA 


GCCGATTCCC 


TCACCGTTAA 


TTCCCATGCG 


CTTGATTTTT 


AATGGTATTT 


1680 


TT 












1682 



(2) INFORMATION FOR SEQ ID NO: 24 2: 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 



TTAACTTTGG 


TCAATTCTTT 


AAAGTCATCC 


TCTGTAAGCA 


TGTCTAACCA 


TTGATGTTTC 


60 


CCTTTATTGC 


TAAAATCACC 


AATTCCGACT 


ACAGCTATAT 


CTAAATCTTT 


CCAACTATTT 


120 


TTCAAATTTT 


CAAAATATCT 


TGATTGCAAA 


AT AC CAT CTG 


CTAACAATTT 


ATTTTCTTGC 


180 


ACAATCGTTG 


CATTCATAAA 


TGTACACTCT 


CCATGAAATT 


TTCTAGACAT 


TTCATAAATC 


240 


AGTGTATTCA 


CATGGTATTT 


AG CGTGT ATG 


TGACTAGGAC 


CACCTGCTAG 


AGGATAGAAG 


300 


TGAACATTTC 


GGACACTTTT 


ACTGTGAATT 


AAATCTACTA 


AATTACTTAA 


ACTTTTCCCC 


360 


CAAGAAAAGC 


CAATTTTCAT 


ATTATCATCA 


ATTAGATTCC 


TAAGGACGCC 


TGCTGCAACT 


420 


TGAGAAATTC 


TTTCAGATAA 


AATTGTTGGA 


GTATCATCAA 


ATTCATTTGG 


AATAATTTCT 


430 


AAACTTTCCA 


AACTGTATTT 


TTCTTTTACA 


TAATTTTCCA 


ACTTAAACAT 


ATTGGTATCA 


540 
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AAATTCTCTA TTTCAATTTT AACAATTCCT ACATTCCTTG CTTCTGTTAA CATTCTACTA 600 

AT AG AGGTT C TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660 

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT AC ACTT AACC *?20 

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTGTTTA 7 80 

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 84 0 

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 900 

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCGG TAAAGCCCAT CTCTTTATCG 9 60 

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020 

AAAGGTCCTA CAGTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 10 90 

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 1140 

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 12 00 

ATT ATTGT TT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTAGTT 12 60 

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 13 20 

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 13 80 

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 14 40 

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500 

AAAGCAAAAA ACAACAAATT AACCTTTCGT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 15 60 

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620 

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGT A CAACTGTTTT TTTAAAAGGA 1680 

TGTCCTCTGC GCTGTC CATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740 

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 1B00 

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 18 60 

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGC CATCTT AAAATCAGCT 1920 

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGAT C A TGAAAAATTT 1980 

ATTGATTTAA TTCAATATGT GGATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040 

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 2100 

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT T AACAAT ACT 2160 

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 2 220 

CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 2280 

ATGGATGGAA TCAACGCACT TCAT CCwGAA GATCTTATTG ATTATCAAAA GGTATTTCTG 2340 
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AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG 
ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 
CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 



(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 120 

GGATGATTCA GTCAAGTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 300 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 3 60 

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 42 0 

TCGCTGATAA ACTTTCAGAG CTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 4 80 

AAGCCTTT AT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AG AACC AAG T CCACGACAAC 6 60 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720 

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 780 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

AT AT G AGT AT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGAC GTT AC 960 

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 1020 

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACGCC 1080 

GAACAAATTG T T AT CAAG AT TACGGATCAA GGTTATGTGA CCTCTCATGG AGACCATTAT 1140 
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CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200 

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 12 60 

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 13 20 

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380 

GCAGATAATG CTGTTGCTGC AG CCAG AG CC CAAGGACGTT AT AC AACGG A TGATGGGTAT 144 0 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 1500 

GACCATT ACC ATTACATTCC TAAGAATGAG TTATCAG CT A GCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 174 0 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860 

GAATTGGAAA AACGAATT GC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 192 0 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 19 80 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 204 0 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT AT ATCCC AGC CAAGGATCTT 2100 

TCAGCAGAAA CAGCAGCAGG C ATTG AT AGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280 

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2 34 0 

TTAGTGGAAG ATATTCTTG 23 59 
(2) INFORMATION FOR SEQ ID NO: 244: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCT TTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AG T T G TTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 12 0 

AG CG ATT CAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
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ACTAAGGTTA CGGATCCCCT GATTAAATAC TACCGCGACT ATGTTGCGGG TAACGATATT 240 

AGCAAGAACC AAAGCTGGAT TATCGACCGC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 300 

AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTGGCCAC TCTTCCTATT 3 60 

GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG CTGAAAATAA CAATGCCAAA 420 

TTGATTGCTG CCTACTTTAA CCAAAATGGT ATCCATCCAC GCTATATGCA CCCTAGAGAA 4 80 

GCTGGGATTG TGGTCACAAC TGAACCTGGT CACGCTCGCA TCATTCCATC AAGTTATGAC 540 

AAGATTCAAC AATTGACAAA CACCAATGAA GTCCTTGTCA TTC CTGGTTT CTTTGGTGTC 600 

ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660 

ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 7 20 

GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 7 80 

ATGCGCGAGT TGGCCTATGC AGGCTT CTC A GTCCTTCATG ACGAGGCTCT TCTTCCTGCC 84 0 

TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 90 0 

CG T ATCGTTC TAAAACACAG TAATGATGAA TTTCCAGTTG TGGGAATTGC TGGTGACTCA 9 60 

GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGC ATTTGGCCGC 1020 

AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 
O) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 



1052 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 45: 
CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 12 0 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 24 0 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATT AGTGC CG CAAAATCCCA 36 0 

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 4 20 

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GAT G ATGGG A 480 
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AACATTATGC GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 



540 



CTCATACTTC GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGCG CAAAATAAGC 



600 



CAGAGAAATC CTTTAGGTAT AT AG AT ACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 



660 



ACTACAAAAA TCTACAATAT C TTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 



720 



TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 



780 



TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 



840 



CTTAACGAAA CATAT 



855 



(2) INFORMATION FOR SEQ ID NO: 24 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGCA ACATTTAGAT 6 0 

ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120 

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180 

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240 

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300 

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 3 60 

AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT C CAGTT AG AT ATGTTGGAAG 4 20 

CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 480 

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 54 0 

GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA GGTCGTAGA- 600 

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660 
(2) INFORMATION FOR SEQ ID NO: 247: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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CCCGTTGCAC 


AGGATCGTGC 


ATAGTCAACT 


CTTCAAGTAT 


AGCATATCTC 


CTATTTTCTT 


60 


ACAAGTAATA 


ACACCTAAAA 


TGAAGCTTTT 


TCTTTTACTT 


TTTTCTGCCA 


AGAGGCAAAA 


120 


AGCATGCTGA 


GGTAAAAAAC 


GCTCATCATA 


ATAGGAACAC 


CAAGAATGGT 


CTTTTCATGA 


180 


TAGAAAATCG 


TCAAATAGGC 


TGAAAAGACA 


ACGCCAAGGA 


CAAAACTACT 


AAGCAGGCTA 


240 


ACAAATATGA 


ATCCTTCACG 


CAAAAAAGGA 


GTGTGCTTGG 


TTCGGAAATA 


ATCTCCAAAA 


300 


GCCAGCATGG 


TCCGTTTGAT 


ATTCCCTGTC 


ATAAAAGCGT 


TATTATAGGC 


AATACCCGAC 


360 


ACTTCTCCAA 


AAGCAGTTGT 


CACCAGTCCC 


ATACAGAAGG 


CCAAGGGCGG 


CACTAGATAG 


420 


ATATTATCCA 


CAGTTTGCGG 


CACAAAAGCA 


ATAATGATTG 


ATAAGATTGC 


CAAGGGAATC 


480 


AAGGACAGAA 


TAGGTTTTTT 


CACAATTCTC 


AATTTTTCCT 


TATAAATCCT 


TAATAAAAAG 


540 


ACTCCCATCA 


TAAACGCTAG 


CAAGGTGAGA 


ACCTTGTCCC 


TAACATCCGA 


AACATTATTT 


600 


TTAATTAATT 


CTACTGAAAG 


AAAGACAACA 


TTTCCAGTTT 


GTCCAGCTAC 


AAGGGTATTC 


660 


CCGCGAACAA 


TAAAAGTGTA 


AG C ATCCAC A 


TATCCAGCAC 


AAAACGTCAA 


AAAAAGTGCT 


720 


AACCTTTTAG 


ACTGACGTGA 


TATTTTTCTT 


ATAGGTAATA 


ACCTCATTTT 


ACCTCCCATT 


780 


GTATTTTCTC 


TTAGAAATAT 


TGTAC CATTT 


TCTTTCTAAA 


AAATCGTAGG 


CTACCATTTA 


840 


GATTTTACTA 


TT AG CAT AAA 


AATAATAATA 


GACAACTATT 


TATCCAAAAA 


TAGATAGATG 


900 


TAACATGTTT 


GCAAACAAAG 


CATACGAACC 


TTTAGTAAAA 


TCATTTCCAT 


GAAACTAGAA 


960 


TAGAGCCCTC 


TTAGCAAAAA 


TCATTATTTT 


AATTTATTTC 


TAATCACTCC 


TTGACATAAA 


1020 


TAACTCTCAC 


CAATAAAAGA 


CTATGTCTTA 


AAAAAATGGT 


ATAATAAAAT 


CAATACTTGG 


1080 


GCTTGATGGC 


TATGCTACTA 


ATAACAATTA 


GGAGAGAAAA 


TCAGGCACTT 


GTTAACAACA 


1140 


AGGATTATCC 


CCTTGAGATG 


AAAGGAACTT 


TAGAAATCTT 


ATGATGAACA 


TGCAAAACAT 


1200 


GATGCGTCAA 


GCACAAAAAC 


TTCAAAAACA 


AATGGAACAA 


AGCCAAGCTG 


AACTTGCTGC 


1260 


TATGCAATTT 


GTTG G CAAAT 








V- J. ^KjCCjA TAA 


13 20 


GAAAGTTGTC 


AGCATTGATT 


TCAATCCAGC 


TGTCGTTGAC 


CCAGAGGACC 


TTGAGACTCT 


1380 


TTCTGATATG 


ACCGTTCAAG 


CCATCAACTC 


TGCTCTTGAA 


CAAATCGATG 


AAACTACCAA 


1440 


GAAAAAACTG 


GGTGCTTTCG 


CTGGGAAATT 


ACCTTTCTAA 


AAACAAGGAG 


CTAGAACAAT 


1500 


GCTTGTCGAT 


AACAAAGGCT 


AAGAAAGGTG 


CAAAAATGAC 


TCTATAATAT 


TTGTAGTGGG 


1560 


TAAATCCCCT 


ATGGATATTA 


TGGAGCCTAT 


TTTTGTGTAG 


AAAAAAGTCC 


CATATGACCT 


1620 


ATAATGAAAA 


GCGACAAAAC 


AACTCATTAG 


AAAGAATCAT 


ATGGAACAAT 


TACATTTTAT 


1680 


CACAAAATTA 


CT AG ACATT A 


AAGACCCTAA 


TATCCAGATT 


TT AG ACAT CG 


TCAATAAGGA 


1740 
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TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800 



1805 



AAGTC 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2516 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 8: 

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA G ATTGG AG AT TATGATTTAA 60 

CGTCGCCGCG TTGGG GTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATAGTGG GTCAGATATT AC CCCGGT AT ATTCA TTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300 

CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 360 

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 4 20 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 4 80 

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTG A CCC TTCTTTGCAT 54 0 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 7 20 

ATTG TGTCG A ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 84 0 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

C AATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 9 60 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 114 0 

TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 12 00 

ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 12 60 

GCAGT ATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 13 20 
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TTGCGCtGTC GCCAATATTC AATCCATCCA AATGT ATT AG AATGGATCTT AGTTTTACTT 1380 

CAAGATATGA CGACTGGAGT ATATTGCTTT CCCTTCACAT ATATATTGTT CTTTTTTTAT 14 40 

TTGATGAATA ACTATTTTAA TAGGTTCGAG TGTCGCATTC GTCTGAAATC AATTAAGCAC 15 00 

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGCAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 1620 

GAGGTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TAGTTTCTTT 1680 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACT ATT AGC 174 0 

AGTTTTTCTT GGTTTAAAAA ATCAAACATG AGCTTAGTAT TTCTGTTTAC TTTTTTATTT 1800 

GTAGAATCCT TATTCTGGAT TTATCAGTTG GACAATGGGA TAATTGGATT ATTGCCAATT 1860 

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATT ACT AT CT 1920 

ATC AT AATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 1980 

GAAATGGGAA AGTTAAGTAG TCACATGTGG AGGTTGAATC AGATAATCTA TACCAAGTAC 2040 

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTGTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGACTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATCGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2220 

GTATATCGTA GAAGAGCATC CGATTT C TTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 2 280 

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATG GACTCATTTT AATCATGGTA 23 40 

AAAGTT CT AG TGATTCAATT TATGTTACAG TTACCAAACT GGGATATAGG AGTTTTGTTT 24 00 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA CTCTT G TTTT GCTTTATGTT ATACGCACTA 24 60 

GGAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2516 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 24 9: 

CGGTGTTTTT TTGTAAATTT T CT AGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA AT AC C AACAA TTATTTAAGC 120 

AAATCCAAGA AAC CATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 
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ACGACCTTAT 


GGAGCAATAT 


CAAGTCAGTC 


GTGATACCGT 


CCGAAAGcCC 


TCTCTCTCCT 


240 


CCAAGAGGAA 


GGATTGATCA 


AAAAGATAAG 


AGGGCAAGGT 


TCTCAAGTCG 


TCAAAGAAGA 


300 


AACCGTCAAT 


TTCCCTGT AT 


CCAACCTAAC 


CAGCTACCAA 


GAACTAGTTA 


AAGAACTTGG 


360 


ACTGCGCTCT 


AAAACCAACG 


TGGTCAGTCT 


GGACAAGATT 


ATTATTGATA 


AAAAATCCTC 


420 


ACTGATAACC 


GGTTTC CC AG 


AGTTTCGGAT 


GGTTTGGAAG 


GTGGTCCGCC 


AGCGTGTGGT 


480 


GGATGATCTG 


GTATCCGTTC 


TGGATACGGA 


CTATCTGGAT 


ATGGAACT C A 


TCCCAAATCT 


540 


CACTCGCCAA 


ATTGCTGAGC 


AGTCTATCTA 


TTCTTATATA 


GAAAATGGCC 


TCAAACTCCT 


600 


TATTGATTAT 


CCTCACAAGG 


AAATCACCAT 


TGACCACTCA 


AGCGACCGAG 


ACAAGATTCT 


660 


CATGGACATT 


GGCAAAGACC 


CTTATGTCGT 


TTCGATTAAA 


TCAAAAGTCT 


ATCTCCAAGA 


720 


CGGACGCCAA 


TTTCAGTTTA 


CCGAAAGTCG 


C C ATAAGTT A 


GAGAAATTTA 


GATTTGTAGA 


780 


TTTTGCAAAA 


CGCAAGAAAT 


AAAAGACTGA 


GACACCAGAT 


CTCAGCCTTT 


TTCGG CTCT A 


840 


TAATATTTGT 


AGTGGGTAAC 


CCCCCTATGG 


ATATTATGGA 


GCCTATTTTG 


TGTAGAAAAA 


900 


AAGTCCCATA 


TGACCTATAA 


TGAAAAG CG A 


CAAAACAACT 


CATTAGAAAG 


ATTCATATGG 


960 


AACAATTACA 


TTTTATCACA 


AAACTGCTCG 


ATATTAAAGA 


CCCAAACATC 


AAGATTCTAG 


1020 


ACATCATCAA 


TATGGATACC 


CACAAAGAAA 


TTATCGCTAA 


GCTGGATTAT 


GAGGCTCCAT 


1080 


CTTGCCCTGA 


TTGTGGAAGT 


CTAATGAAGA 


AATATGACTT 


TCAAAAACCG 


TCTAAGATCC 


1140 


C TTACCTCG A 


AACAACTGGT 


ATGCCTACTA 


GAATTCTCCT 


TAGAAAGCGT 


CGTTTCAAGT 


1200 


GCTATCATTG 


TTCTAAAATG 


ATGG TCGCTG 


AAACTTCTAT 


CGTCAAGAAG 


AATCATCAAA 


1260 


TTCCTCGTAT 


TATCAACCAA 


AAAATTGCGC 


AAAAGTTGAT 


TGACAAGATT 


TCTATGACCG 


1320 


ATATTGCTCA 


TCAGCTGGCC 


ATTTCAACTT 


CAACTGTCAT 


TCGG 




1364 


(2) INFORMATION FOR SEQ ID NO: 2 50: 









(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1227 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 
GACAATCGGT GGACGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 
CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 
TTTGTAGAGA ATCGAACCGC AAGGTTCGGT. TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 
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TAATAAACAG TATGAAAATC GTATCAGGAA TCT ATGGGG G ACGTCCCCTC AAG AC ACT AG 300 

AAGGCAAGAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 3 60 

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG AC CTTT ATGC AGGTAGTGGT GCTTTATCTA 42 0 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGG T GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 540 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT G CC AAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 6 60 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 720 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 7 80 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTCG 84 0 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 900 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG CGGGTTAGAA AAGGCTGTGA 9 60 

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTC GATGTCGCAA 1020 

AAAGACTGGG GGCTACTTCC CTACTGCGAG GTTTGAGAAA TGCGTCGGAT TTGCAATATG 1080 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAC 114 0 

AT AGTCG AC C TGAACATCTC TAT AT CAGTT CATCAGGCGT TAGAGAGCTT TTGAAGTTTG 1200 



GTCAGGATAT TGCCTGCTAT GTTCCCG 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3652 base pairs 
(B> TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 180 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 2 40 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 3 00 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA C AAGC CAGTT 3 60 
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ACACGGCTAA 


TTCGTCCACC 


TTTGACAAGA 


TTTTCCAAAG 


TTGAAACACC 


AATATAAAGC 


420 


TCTGTATGGT 


TTTTAACCTC 


TTCTACATGA 


GATAAAATTG 


CCTCCATATC 


TTTACCTTCT 


480 


TGAGCTAACT 


TCGCAGCCTC 


AACAACTTGG 


AATTTCAGGG 


CTTGGTCACT 


GAAGGAACTA 


540 


TCAACAACAG 


TCACATCTGC 


AGTAGATAGG 


CTAGCACCTT 


GGCGTGCTGC 


TTCTACCGTA 


600 


CCCGAAAGAG 


CATGGGACAT 


ATGAATAGCA 


AG AATCTGG C 


CACCATCTTT 


GCATAGGTCT 


660 


TCAAAAATCT 


CAGCAAAGAC 


ACCTACAGGT 


GGCTGACTTG 


TTTTCGGAAG 


ATTCTTACTT 


720 


TCTTGCATCA 


ACTGAAGAAA 


TTTACCTTCT 


TCTTTCAAAT 


CCGCATCAGA 


ATAAACAACA 


780 


TTATCAATCA 


TTACAGATAA 


TGGAACAATT 


GTAATATCTA 


ATTGCTTTAC 


TAGTTCAGGT 


840 


TCAATAGTAA 


CAGATGAATC 


GGTT ACAATC 


TTAATTTTTG 


TCATAGTATC 


AATCTTTCTA 


900 


TTTTAGGATT 


CAGATTGGTT 


TCCTT ACTTC 


TAATTATATC 


AAAAAAAAGA 


TTAAAAATCC 


960 


TAATGGAGTC 


AATCAAATTT 


TCCGTAAAAT 


TTGATATAAT 


CAACTTATAA 


GAAAAGAGGT 


1020 


GTCCTATGAT 


TAAAAAAATT 


TACCCCATTT 


TTACCATTTT 


ACTAGGTGCT 


GCTATTTATG 


1080 


CTTTTGGACT 


GACTTATTTT 


GTAGTTCCCC 


ATCATCTCTT 


TGAAGGAGGG 


GCGACAGGCA 


1140 


TTACCCTCAT 


CACCTTTTAT 


CTTTTTAAAA 


TCCCTGTTTC 


CCTCATGAAC 


CTGCTGATTA 


1200 


ATATTCCCCT 


TTTCATCCTA 


GCTTGGAAGA 


TTTTTGGAGC 


CAAATCCCTC 


TATTCTAGTT 


1260 


TACTAGGAAC 


CTTAGCTTTG 


TCCGGCTGGT 


TAGCTTTTTT 


TGAGCATATT 


CCCCTTCATA 


1320 


TTGATCTTCA 


AGCTGATTTA 


CTAATCACAG 


CCCTTATAGC 


GGGAATCCTA 


TTGGGAATTG 


1380 


GCCTTGGAAT 


TATTTTTAAT 


GCTGGAGGTA 


CAACTGGCGG 


AACTGATATT 


CTAGCTCGTA 


1440 


TTCTCAACAA 


ATACACTCAT 


ATATCCATAG 


GAAAACTGCT 


CTTT ATCTTA 


GATTTTTGTA 


1500 


TTCTCATGTT 


GATTCTCCTA 


ATCTTCAAGG 


AT TTGAGATT 


GGTTTCCTAC 


ACGCTTTTGT 


1560 


TTGATTTTAT 


TGTTTCTCGT 


GTTATTGATT 


TGATTGGTGA 


AGGAGGATAT 


GCCGGCAAAG 


1620 


GCTTTATGAT 


TATCACAAAA 


CGTCCTGACC 


AACTTGCTAA 


GGCGATTAAT 


GATGACCTCG 


1680 


GAAGAGGTGT 


TACTTTTATT 


TCTGGTCAAG 


GCTACTATAG 


TAAAGAAAAT 


TTGAAAATCA 


1740 


TCTACTGTAT 


TGTCGGAAGA 


AATGAAATTG 


TGAAAACGAA 


GGAAATGATT 


CATCGAATCG 


1800 


ATCCTCAAGC 


CTTTATAACT 


ATTACAGAAG 


CCCATGAAAT 


CCTAGGAGAA 


GGCTTCACCT 


1860 


TTGAAAAAGA 


ATAAAAAGAG 


GTAATGTCGT 


GACCTCAAAA 


GTTAGACTAA 


ATCATCTATC 


1920 


TTTTGGGTTA 


CAGACAACCT 


CTTTTTTATT 


TTATTTACTC 


AAGCTCTTAA 


GACCAATTCC 


1980 


GAG TT ACTTC 


TTCATCAGCC 


TTTAACTGAT 


CCACTAATTG 


GTCAACTGAG 


TCAAATTTGC 


2040 


TCATATCTCG 


AATGCGATCA 


AGCCAATAAA 


CCATGACGGT 


TTCCCCATAA 


ATATCTTGAT 


2100 


TAAAATCAAA 


AATATTGACT 


TCAAAACGTG 


CTTCTTCTCC 


ATCAAAGGTC 


ACATTTTTCC 


. 2160 
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CGACACTAGC CATAGCACGA TACTTCTGTC 


TTTGAATCTC 


AACATCAACA 


> AC AT AAACG C 


2220 


CATCTGCTGG 


i CATATAAGTA 


t CGGTCTAAAA 


GCACTAAATT 


CGCTGTCGGA 


TAACCAATTG 


2280 


TACGACCACG 


AGCATTACCA 


TGAACCACCA 


TACCTCTTGA 


TGGAAGCGGT 


GCCCCCAAAA 


2340 


GTTTTC CTGC 


TTCTTTCACA 


TTTCCATCTA 


AAATAGCTTG 


ACGGATACGA 


GTTGAACTAA 


2400 


TCTTTCCTTT 


CTCATCTTCT 


ACAGGTGGAA 


CAATGATAAC 


TTCTCCATCA 


AAGTAATTCT 


2460 


TTAAATCTTC 


TGCTGTTTTT 


TTGTCAGAAC 


CAAATGTATA 


ATCAAAACCT 


GCAACAATAA 


2520 


TTTTGGCATT 


CATAGCCTTG 


ATATAAGTTG 


CAAAGAATTC 


TTGTGCAGTG 


AG ACT AG CG A 


2580 


ATTGACTACT 


AAAATCAAGG 


AGATATAATT 


CTTCTACACC 


TTCGCGCTTT 


AATTTTCTTT 


2640 


CACGTTCACC 


AGGGTTCAAA 


ATATGCAAAA 


ACAAATCTGG 


ATGATAAGGC 


TCTAAAGCGA 


2700 


TCTTTGGAGA 


TTCATTAAAG 


GTCATAACGA 


CGATAGGCAA 


CAAATCCTTT 


CTCGCAGCCT 


2760 


TGTTGGCAAC 


ACGAAATAAT 


TCTTGATGCC 


CCTTATGTAT 


GCCATCAAAA 


TAGCCGAGAA 


2820 


CAACGACTGA 


ATCAGATGGT 


GTGCCAATAT 


CTTTTTGGTT 


TTTTATAGGA 


ATAGTAATAA 


2880 


TCATAAAATA 


ATTATATCAT 


AGCGATAGCT 


ATTTCTGGAA 


CAGAAAATCT 


GAAATGTTGT 


2940 


TTTTTTCACA 


TGAAGTGTAC CTGTTTTCAA 


AAAGCACTTT 


A TTCT ATCGT 


TGCTTAACTA 


3000 


TGAACTTTGC 


AATATTCTTC 


TCAAAAACTT 


GTAGGACATC 


TTCAAAATTT 


TGCAAGGAGT 


3060 


GATTAGACTT 


GTTCGGTAAC 


CATAAAGTGT 


CATACTATGC 


TTATGTATGA 


AAAAGCAATG 


3120 


CAACTAACTC 


CTGAGAACTT 


TAAATTACTA 


ATTGGTGCCG 


AAAAGGTAGA 


ATTTAGAATC 


3180 


GAGGTACACC 


TATGGCTGTA 


AAATTTACAA 


AATGAGACAA 


CTTGGGCAAG 


ATGTTTGAAG 


3240 


AATTTCCTAA 


ACTCCCTGAT 


TTGAAGCAAG 


TCACTTTCCC 


TAATGACAAA 


GAAAAAAGCC 


3300 


AAAACAGCAA 


AGAAAAACTA 


GATGACTGCT 


TTCCAACAAC 


TCCCATCTAG 


TGTGCTTCAG 


3360 


ACT GGG CT AT 


TTTTCTCTC C 


ATCTGTTAGC 


TTGGATTCTC 


AGACCGTTTC 


AGCTAAAGAA 


3420 


TATCTTTTCC 


CTTATCAGAA 


GGAACGGCTC 


AAGCCATTCA 


GACAAGTGAA 


GGGACGACAA 


3480 


GCCAATATTT 


GAAACCAGAT 


AGCAGTTCTT 


ATAGTCAATT 


GAAATAAAAT 


CTGAAGAAAT 


3540 


CGAGTAGGAA 


ACTCATATCA 


ATGTTTAACA 


GTGTTCTATT 


CCAGATTCAT 


ACTCAATGAw 


3600 


AATTAAAGTG 


CAAACTAGGA 


AGTTAGCCGC 


AGGTGATACT 


TTGGGTACGG 


CA 


3652 


<2> INFORMATION FOR SEQ ID NO: 252: 









(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 743 base pairs 
(B) TYPE: nucleic acid 
(C> STRANOEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 

GTACCGTGGT GCCAAAGTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 60 

TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 120 

TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTT TAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 240 

TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATGGATTTG 300 

CGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 3 60 

CTACGATTCT CATAGTCAAG CTTGCTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 4 20 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGCA AAATAGCTGA 480 

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGC T A 54 0 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATCCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 660 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATCGCTAGA 720 



TAAGGATAGA AAAAGTGATG ACA 

<2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 53: 

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAACT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA CCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

AT AC CAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 180 

TGGTTTCCGC ACTCAGGGCA AG ATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 24 0 

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 360 

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 4 80 

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 
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TTAGTAAAGT AAAACTATTG GAGGATATTT TAATGCCACA ACCT AT TGTT CCTGTAGAGA 600 

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CGAAATGCTA GAACAGCTTT 720 

TGGATAAAGT GTTGCTCTAT G ACAATT CAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 780 

TGTGGGAAAA CGGATATGAC GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAG 840 

TTTGAATTAG ATCCTTTCTC CGGTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAAGCCC TTTACTGGGA TG GTCAAGCA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 102 0 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATCAGA AACTCGTGAT 1080 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 114 0 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGAGTG CTACAATTGA TAGTCTCACC 1200 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 1260 

TCCTCTGAGA AAAGTGTTTG CCCATCTGGA CAACTCAGTC TTTTTGAAGA GGAACAAAAT 13 20 

ATGGAAGAAC ACTCTGACTT AC CC AGTTG A AAGAGAAGAA ATCACCTATA AACGTAAGAA 13 80 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 1440 

AGTAGAAGAG AGCATTTGCC CTGATTGTCA GGGAGATCTA AAAGAGATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 1620 

T ATTCCT AAA GCCCCTTTGG CGCATAGCCT TGGCTCAGCT TCTATTATCG CTCACACCAT 1680 

CCATCAGAAG TTTAATCTGA AG GT AC C C AA TTATCGCCAA GAAGAAGATT GCGCTAAGAT 174 0 

GGGTTTACCA ATCACACGTA AGGAAATTGC T AATTGG CAT ATCAAGGCGA GTCAATACTA 1800 

TTTGGAGCCC CT TT AT AATC TTTTACGAGA AAAGTTGTTA GAACAAGCTC TTCTTCATGC 1860 

GGATGAAACC TCTTATCGGG TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTGTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TAGTCCTCTA GTTCTGCCTA TGCGATAGCA GTCCAAGGTT 2100 

TAGGAGTAAG GCGACGCTAA GCTTGGTAAA CTGCGAACAG CTAGAACCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGCC 2220 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2 280 
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ATTTTCCTTG 


! GAAAGAGACT 


GGGAGGCTTT 


GCCAGCTGAT 


GAACGACTAC 


AGAAACGTCA 


2340 


AGAACATCTC 


CAGCCCCTAA 


TGGAAGACTT 


CTTTGCTTGG 


TGCCGCCGTC 


AGTCAGTTTT 


2400 


AGCAGGTTCA 


AAACTAGGAA 


GGGCAATTGA 


ATACAGCCTC 


AAGTATGAAG 


AAACCTTTAA 


2460 


GACTATTTTG 


AAAGACGGAC 


ATCTGGTCCT 


TTCCAATAAT 


CTAGCTGAAC 


GCGCCATTAA 


2520 


ATCATTGGTT 


ATGGGACGGA 


GTAAAAGAGT 


CCAGTGGACT 


CTTTTAGCCT 


GAGCTCAGTT 


2580 


TAAAAAAGCG 


AGGGTGGTTA 


TTTTCTCAAA 


GTTTTGAAGG 


AGCTAAAGCA 


AGAGCTATTG 


2640 


TTATGAGCTT 


GTTGGAAACA 


GCTAAACGTC 


ATCAATTATA 


GTGCGTTGAA 


TCTATAACAG 


2700 


TACGCATCGA 


CTGCTAAAAC 


ATTTCTATAA 


AT C AATTTTC 


CTTTCCTAAT 


CGATTTGTTC 


2760 


ATATCTTATT 


TCAATCCATT 


ATAAATAGCG 


AGAAATATCT 


ATCCTATCTT 


CTAGAATGTC 


2820 


TTCCAAACGA 


GGAAACTCTC 


GTAAACAAAG 


AGGTTTTAGA 


GGCCTATTTA 


CCGTGGACTA 


2880 


AAGTTGTACA 


AGAAAAGTGC 


AAATAAGAAA 


TCTCCAGATT 


AGGAACTATC 


CGTGAGTTCT 


2940 


CTAGTCTGGA 


GATTTTTCAA 


TAGACTTCGT 


TATTGGACGG 


TTACAATTTA 


TTATATGAAA 


3000 


ATCCCATATT 


ATTCTCCAAT 


TCTATATTTT 


ACCTTTCTAA 


ATGTATAGAT 


TAACT AC CT A 


3060 


ATTATAGCAT 


ATAACGCAGA 


TTCCTTTCAA 


TCGTATGATT 


TACTGCATTA 


AATTAAGTAA 


3120 


AAAAATAAAG 


GCAGTCCGAA 


GACTGCCGAT 


ATTTATCTCT 


CATCTCTTTA 


ATTATGGTAA 


3180 


GTAAATAAAT 


AATTTCCCTA 


AAGATATGGA 


AATTATTAAT 


ACTATAAATA 


CATATTATAA 


3240 


AGTTTATAAA 


TACTGTAAAA 


ATCCTGAAGT 


TAATTTTCTA 


ATAAATATCA 


ATATGTGTTA 


3300 


GTATCTTTTA 


AATTTTTAGA 


CAATTTACTA 


GTTCTATAGA 


CATGTTTAAC 


AGACTCTATT 


3360 


TTACAATTCA 


AAAATTTCAT 


CTGCCACTTC 


ATTTAAAAAT 


TCTATATCAT 


GGGAAACAAT 


3420 


AAAAATTATT 


TTATCCATGG 


TTTTATACTT 


ATTAATCAGT 


TCAGATATTT 


TTATCATATT 


3480 


GGAATAATCC 


ATACCACTTG 


AAGGTTCGTC 


AAAAAAGACA 


AATGGAGAAT 


TCTTGCACAT 


3540 


AACAGATGCT 


ATTGCAAGCC 


TTTGCTTTTG 


CCCTCCTGAT 


AAACTCATCG 


GATGCCTTTC 


3600 


AATAAATTCG 


TCCAGGCATA 


AATCTTTTAA 


CCCAAATCAT 


TCATACCTCT 


CTCAACTAGA 


3660 


TGTAACTTAC 


AAAACCCCTG 


ACCTCATGAG 


CCACTTTCTT 


CCTCCTCATG 


AGGTCAGTTT 


3720 


TACTTTCTGC 


TGTTCCAGTA 


TCGTTTTTCC 


TCGCTAGATT 


TCCTCAAAAG 


GGCAGACTCC 


3780 


TCCCTTGGTT 


CGTCACACGA 


TTTTTTCATC 


TCGACTGTTC 


TTTAATGCAT 


CATTAACGAC 


3840 


GCTTTTCTTC 


TAGGTGGTTC 


ATAAGGAACA 


GGAAGATTCA 


GGTTGACTTT 


TCTAATCCTA 


3900 


GAATAAAGTG 


CTGAAAACAA 


TTCGGAATAG 


GCATAGAGAC 


TAGACAATTT 


GAGGAGCTGC 


3960 


TTGCGTCCTG 


TTCGAACACA 


TTTTCCCACC 


ACGTGAAGAA 


AAAGATGGCG 




4010 


(2) INFORMATION FOR SEQ ID NO: 254: 
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{i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


254 : 






ATGCATCCGT 


' TTGTCAAGCC 


' TAAATTGTAA 


. TTTTTTTCAA TTTAAAACAG AAAAACCCAG 


60 


GAAAATGACA 


TAAAAATATC 


ATTCCTAGGC 


CTATTTATGC 


TATTTCTCTC 


TGAAAAATAT 


120 


GAGTATTCAG 


TCGGTCAAAT 


GAAGCTGAAC 


G AACTC AT TT 


TCCCTCGCCT 


AATTCAATGA 


180 


TTCGATGACA 


TTGTTGGGCT 


ACATAAGCAT 


CGTCGGTCAC 


GATAATGACT 


GTTTTCCCCT 


240 


CTCGATTCAT 


CTCTAAGAGA 


AACTTCAAGA 


CCAAATCTCT 


ATTTTCAGGA 


TCCAGAGAAC 


3 00 


CTGTCGGTTC 


ATCGGCTAAA 


ATCAGCTGGC 


TGGGTTTTAA 


GATGGCTCTA 


GCAACTGCAA 


360 


TTCGTTGTTG 


TTCGCCCCCA 


GACAACTCGG 


AGACCCTTTG 


ATGCAAAGTA 


GCTGATAAAC 


420 


CTACTCTCTC 


TAAAATCTCT 


TCCACCTTTT 


TGAGCTTGTC 


TTTCTTAGGC 


AATTTCACAT 


480 


ATTTCAGCGC 


CACATGAGAT 


TGTACTCGAC 


CGTTTCATCA 


TCAATCAGGG 


CAAAATTTTG 


540 


AAACAGATAA 


GAGATATGTT 


CACGGATTAT 


TGTTTGCGAC 


TTAGCAGAAT 


TAACCGCTAG 


600 


ATTTGTCTGA 


CCAAAAATCT 


CATACCGTCC 


GCTATAATCA 


CCATCTATCA 


AACCCAATAA 


660 


ATTTAACAAG 


GTCGACTTCC 


CACTACCACT 


CTTACCAACA 


ATAGCTACCA 


AATCCCCCTG 


720 


ATCAATCCTG 


AGAGATAAGT 


TATCCAAAAT 


CACTTTTCCC 


CCAATGGTTT 


TGGTAATATT 


7 80 


TTTCAACTCA 


AT CAT AAG AT 


GCCCCCTTTC 


AATAACTCTA 


CTAGACTTCT 


TTTCTCCATC 


840 


CTAGAAGCTA 


AGCCTAGCAC 


AAATAGTATA 


TCCAGACATG 


TAAAACCTGC 


AAACAGTAGA 


900 


AGTGGTAAGA 


ACGCATGGGC 


AAAGAAAATC 


AAGACTAGAA 


GAGGGAAACT 


ATAGCCCAGC 


960 


AAGAGCAGAA 


CGAGGAGAGG 


ACGGTAGCGA 


TCGACCACTT 


TCCACCCCAT 


AAACTTCTTG 


1020 


GTAATGATAT 


CCCTGCGCTT 


CAATAAGAAA 


GTTGTTACTA 


GTAAGAAGTA 


GGAAATCATC 


1080 


ATGCTAAGGA 


GACCAAACAA 


AGCAAAGAGT 


AGGTTAAAAT 


TCCGAACAGC 


ATCTCGATAA 


1140 


GAATCCACTT 


TCTCTTGTTG 


AATGGCTTGA 


ATAGATGAAA 


ATTTTAAATA 


ATTTCCATCT 


1200 


GACAATTTCT 


CAACTAACTC 


TGTAATCTCT 


TTTTGATGTT 


GAACCGTATT 


TTCAATTTTA 


1260 


ATCGGATTAT 


TTAAGCCAGT 


TGTTGACAGG 


GAGGCTTTCT 


CATCCCACAT 


CATATCAGAA 


1320 


TCATTGACCA 


AG CT AATAAT 


TGGATTGGAG 


AGATTTTCCT 


TTCGCTTATC 


ACTATATGGG 


1380 


AAAAATG AC C 


AATCTCCTTC 


ATAATAGGCA 


ATCTCGACAT 


CCATCTCCTC 


TATCGTTCCT 


1440 
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TTTTCCTCCT CTTCATACTT CAT CG AATG A AAGGCAATTA ACTTCCCCAA GAGCTGATTT 1500 

TTATCTTCTT CACCTTTCCT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTGCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800 

AAGGTTTCTA AGGTCAGGTA ATTACCTTCA CTTACCCACT GTTGCTGATA AGCAAGTTCT 18 60 

TTGTTTTCTT GTTCTAAACT TCTGCCCACC CCAATCAGTA ACGCCGTCAG TAAAATAGTT 1920 

GTCCCTATTT T CATC AC AT A ATTGAAGATA AGACCAAATT TGAAAGATGA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 204 0 

TAAAGCTGCA ACAGCAAAAA ATGAGACAAC CACAGCATAG GAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTC ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AG CT AAAAC A G CCCT ATCTT GAAAACCAAG TAATTTTTGT 2220 

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2 2 80 

ATAGCCAAGA CAAAAACAAT GGCAGATAAA AGCAAATCTC GATTTATGAC TTCCACTGCA 234 0 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTC CATTTC 24 00 

TGTACAATCC TATCCTTGTC CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 2 4 60 

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2 5 20 

AG AAAGGTCG GAATCGTACC AAGTTTATTG G AAATTT CTT TATTACTATA GACTCCTTCA 2580 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 264 0 

TCAAAAACAC CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTG AT GAT GGAGACTTTA 27 00 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2 7 60 

TTCAAATACA AATCC AAAG A ATCTACAGG 2 7 89 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
CTGCGAATTT TATTAAAGAT AATGTCTTAA TTACAGCGGC TCACAACTAC T A C AG ACATG 



60 
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ACTATGGGAA AGAAGCGGAT GATATTTATG TTCTTCCGGC TGTTAGTCCA AGTCAAGAAC 12 0 

CATTTGGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTACTC AAAAAAATTT GACAGGAATA ACTCTGACTA 300 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 360 

TTTTAAGTGA TGATGGCATG TTCTTGGATT ACCAAGTTGA TACTTTAGAG GGGTCTAGTG 420 

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GGAGATGGAG 4 80 

CTAATCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 540 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTGG TACCATTATA 600 

GACAACATGA TAAACAAACG G GTTGGC AGG AGATAAATGA TACCTGGTAT TATTTAGACA 660 

GTTCCGGTAA GATGCTTACA GATTCGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 720 

CAAATGGAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 7 80 

CATCTGGTGA GTCGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 84 0 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATGATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGGTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAAAGA AACCTACTGG AGAGTGATAG ATGGGAAGTA CTATTATTTT GATCCTTTAT 1020 

CCGGAGAGAT GGTTGTCGGC TGGCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCT CTT A GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACGAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATG AT AG CC AAGCAGAGAA ACGAGTCTAT TATTTTGAAG 1260 

ATCAGCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 1320 

ATTTACAGAA GGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GGAGAGCTAG 1380 

CACGTGGTTG GGTTAAGGAT T ACC CTCTT A CGTATGATGA AGAGAAQCTA AAAGCAGCTC 144 0 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 15 0O 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 15 60 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAG AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAG GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATGG AGAT AT GAAAACAGGC TGGTTCCAAG 17 40 

TCAATGGTAA CTGGTACTAT CCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800 
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GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860 

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAACT CTTCACAATC AAAAAACGCA 1920 

TAG T ATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 204 0 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTG C " TTAAG CG AC T GAAACGACTT 2100 

CTCATAACCG TAAAACATTT C CGATTTC AG AATCCCAAAG AAGGACTCCA TCATACTATT 2160 

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2 2 80 

GTGCT TCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2 340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2 4 60 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2 4 95 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 870 base pairs 

(B) TYPE: nucleic acid 
<C) STRAN0EDNESS : double 
<D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 256: 
TACCACCGTA TTCATCCAGC AAGATTGCCA TTTGTCTTTG GGTATTTCGC AGTTCTTTTA 60 

GCAAGTCATC CACAAAAATA GTTTCAGGTA CAAAAAGTGG ATCTTGTAAA ATTCTCTTCC 120 

AAACAATATT GTCAAAACCG TCCACAAAGC CTGCCTTAAG GAGACTCTTG GTGTGAATGA 180 

TTCCAATTAC ATTGTCCTTA TCCCCATCAT AAACCGGGAT ACGAGAATAA TTTTGTTTTA 240 

AAATACTTTG GATAATGGCT TGACTATCAT CCTGAATATC CACCATAAAG GCATCCGTTC 300 

GAGGAACCAT AACCTCTCGT GCCATCAGTT CATCGAGCGA AAAGACACCT TGTAGCATCT 3 60 

CAATCTCATC AGCATCCAAT GTTTCTTCAC TATTTGTCAG CATATAGGCA ATTTCATCAC 420 

GGGTCATCTT TTCATCCGCA TCATCGAATG ACATAGGAGT CAAATGGCTC AAGAAATTGG 4 80 

TCGAAGCAGC TAAAAGCCAA ACAAAAGGAC TGACTAGTTT TCCGATCCCA ATGATAATCG 540 

GCGCTGTACG AATTGCCAAG GCATCCTTTA GATTAAGAGC GATTCTCTTA GGATATAATT 600 

CCCCAAAAAC GATGGAAATA TAGGTCAAAA ATGCCAAGGA TAGAAAAGTT GCCACGGCTT 6 60 

GTGCTGTTTC GCCATTCCCA AGCCAAGAGG CAATCACACG TCCTAGAGTA TCAGTTAAAC 720 
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TCGCCCCTGA TAAGATTG T A ATCAGGGTGA TTCCTACCTG GAT GGTTG AT AAAAAGTGGT 780 
TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCCGCCTTTT 840 



GTTCAACTCG GGCACGATTA AGAGAAACGG 
(2) INFORMATION FOR SEQ ID NO: 25 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 



870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 7: 

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA SO 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGC AC T AC AG TGGAAGCAAG GAACTGGGGA 180 

ATGCCCAAGA CT AT CT CT AT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 24 0 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 3 60 

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420 

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 480 

AACAAAAGTC TTCTAACAGC AT GT AGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 54 0 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600 

CAGCTTTTTT CTTTGCCTCC TTACCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 6 60 

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 72 0 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 7 80 

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 84 0 

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 960 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 1200 
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(2) INFORMATION FOR SEQ ID NO: 258: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 
AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 
AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 
GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 
AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 
GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT ACTCTTGAGC ATCTTTTGGC 
GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGGTTTTGG 
TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA GCTGATACCA ATCAAACGGA 
AAAACCAAGC CAGGAGAAAC CTCAGACAGA AAAACCTGAG GAAGAAACCC CTCGAGAAGA 
GAAACCGCAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA GAGGAACCAG AAGAATCACC 
AGAGGAATCA GAAGAACCTC AGGT CGAG AC TGAAAAGGTT GAAGAAAAAC TGAGAGAGGC 
TGAAGATTTA CTTGGAAAAA TCCAGGATCC AATTATCAAG TCCAATGCCA AAGAGACTCT 
CACAGGATTA AAAAATAATT TACTATTTGG CACCCAGGAC AACAATACTA TTATGGCAGA 
AGCTGAAAAA CTATTGGCTT TATTAAAGGA GAGTAAGTAA AGGTAGCAGC ATTTTCTAAC 
TCCTAAAAAC AGGATAGGAG AACGGGAAAA CGAAAAATGA GAGCAGAATG TGAGTTCTAG 
TTCTCATTTT TTTCATGAAA ATGTGCAAAA TATAGTAGAT TGAAACTAGA AT ACT AT AC C 
TCTACTTCTA AAACATTGTT AGAAATCGAT TTGACTGTCC TGTTCTTATT TCATTTTACT 
ATATCTTAAC AGATAGTGTA AATAAAGATA AACTATTTAC TGGCTAATTA ATCAGTTAAA 
CACTAGTTAA GGAGTAATGA TGAAAAAAAG AACAATACTA TTATTGATGG CCAGTCTGTT 
AGCTCTTGTC TTAGGAGCAT GTGGTTTCTT GGACATATTG ATCCTGGATC ATTCTCATCA 
GGATTACTCT TTACTGCTAT TTTAGAAACT GGGGTGGTTT GAT GGAAAGT ATTGGTCTTG 
TTATCGTTTC ACATTCCAAA CACATTGCAG AAGGTGTTGT TGAACTGATT AGTAAAGTAG 
CTAAAGATGT TCCGATTACT TATGTAAGAG GAACCGAGGG CGGAGGAATT GGAACGAGTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
.1020 
1080 
1140 
1200 
1260 
1320 
1380 
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TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 1440 

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGG AgC t TGAAATCAAT AAATAAGGAA 1620 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60 

TTATTATTTT GCC ATTTCT A ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 120 

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180 

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 240 

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 3 00 

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 3 60 

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGGATTTTT CGAGTAAGGA 4 20 

GGTTGATAAA AAAATCAGAT CTTGATTGCT CAAGAAGGGC ATGGTATTGC AAAAGAGATA 4 80 

GAGTTCAAAC CAGGTCCAAG ACT CGAT AGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540 

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG GCATCTTTAA TCACGATGGC 600 

ATTCAAACGA CGATAGGTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660 

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720 

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 7 80 

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840 

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACAT CTG AC TGCTTGAGCT TTCTCGCCAA 900 

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 9 60 

CTTTTGCAGG 970 
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(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



txi) 


SEQUENCE DESCRIPTION : 


SEQ ID NO: 


260: 






GTTGACCACG 


GGTAAAACTA 


CCCTAACTGC 


AGCTATCACA 


ACTGTTTTGG 


CACGTCGCTT 


60 


GCCTTCATCA 


GTTAACCAAC 


CTAAAGACTA 


TGCGTCTATC 


GATGCTGCTC 


CAGAAGAACG 


120 


CGAACGCGGT 


ATCACTATCA 


ACAC TGCGC A 


CGTTGAGTAC 


GAAACTGAAA 


AACCTCACTA 


180 


CGCTCACATC 


GACGCTCCAG 


GACACGCGGA 


CTACGTTAAA 


AACATGATCA 


CTGGTGCTGC 


240 


TCAAATGGAC 


GGAGCTATCC 


TTGTAGTAGC 


TTCAACTGAC 


GGACCAATGC 


CACAAACTCG 


300 


TGAGCACATC 


CTTCTTTCAC 


GTCAGGTTGG 


TGTTAAACAC 


CTTATCGTCT 


TCATGAACAA 


360 


AGTTGACTTG 


GTTGACGACG 


AAGAATTGCT 


TGAATTGGTT 


GAAATGGAAA 


TCCGTGACCT 


420 


ATTGTCAGAA 


TACGACTTCC 


CAGGTGACGA 


TCTTCCAGTT 


AT CC AAGGTT 


CAGCACTTAA 


480 


AGCTCTTGAA 


GGTGACTCTA 


AATACGAAGA 


CATCGTTATG 


GAATTGATGA 


ACACAGTTGA 


540 


TGAGTATATC 


CCAGAACCAG 


AACGTGACAC 


TGACAAACCA 


TTGCTTCTTC 


CAGTCGAGGA 


600 


CGTATTCTCA 


ATCACTGGAC 


GTGGTACAGT 


TGCTTCAGGA 


CGTATCGACC 


GTGGTATCGT 


660 


TAAAGTCAAC 


GACGAAATCG 


AAATCGTTGG 


TATCAAAGAA 


GAAACTCAAA 


AAGCAGTTGT 


720 


TACTGGTGTT 


GAAATGTTCC 


GTAAACAACT 


TGACGAAGGT 


CTTGCTGGAG 


ATAACGTAGG 


780 


TGTCCTTCTT 


CGTGGTGTTC 


AACGTGATGA 


AATCGAACGT 


GGACAAGTTA 


TCGCTAAACC 


840 


AGGTTCAATC 


AACCCACACA 


CTAAATTCAA 


AGGTGAAGTC 


TACATCCTTA 


CTAAAGAAGA 


900 


AGGTGGACGT 


CACACTCCAT 


TCTTCAACAA 


CTACCGTCCA 


CAATTCTACT 


T CCGTACTAC 


960 


TGACGTTACA 


GGTTCAATCG 


AACTTCCAGC 


AGGTACTGAA 


ATGGTAATGC 


CTGGTGATAA 


1020 


CGTGACAATC 


GACGTTGAGT 


TGATTCACCC 


AATCGCCGTA 


GAACAAGGTA 


CTACATTCTC 


1080 


TATCCGTGAG 


GGTGGACGTA 


CTGTTGGTTC 


AGGTATGGTT 


ACAGAAATCG 


AAGCTTAATT 


1140 


CGATTTAGTT 


CCCAGAAGAA 


CAATTATTTA 


AGTTAGACAC 


TAAAAGAATC 


TTGCTTGGCA 


1200 


AGG TTCT TTT 


TTTAGATATT 


GAACTAATAC 


TCAATGAAAA 


TCAAAGAGCA 


AACTATAATA 


1260 


TATTGAAACT 


AGAATAGTAC 


ACATCTACTT 


CTAAAACATT 


GTTAGAAATC 


GATTTGACTG 


1320 


TCCTGATCGA 


TTTGTCTTGT 


TCTTATTTCA 


TTTTACTATA 


GAAAGTTAGC 


TACAGACTGC 


1380 


TCAAAACATT 


GTTTTTAGGT 


TGTAGATAGA 


ACTGACGAAG 


TCAGtAACAT 


CTATACGACA 


1440 
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AGGCGAAGCT GACGCGGTTT GAAGAGATTT TCGAAGAGTA TAATACTAGA CTAAAATCAA 1500 

AAAGCATTAT ACAATAGTAA TATGAAATCA ATTAAAGAAG AAATCCAAAC CATCAAAACA 15 60 

CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTCGT 1620 

CTGATGGGCA AATCTTATAA AGAGATTATA GAACTTTTAT AGTGGTTTGA AATAAGATGT 1680 

GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA 174 0 

CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA CGATTTGGCG AAATGTAAAA 1800 

AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTCGTGGT CGTAACCATG 18 60 

CTTATATGAC GGTTGAGCAA GAG AAA GTCT TTCTTGCCCG CCATTTGAAG GCTACAGAGG 1920 

CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG TTAGCTCGTT 1980 

CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA AATATTACGC 2040 

CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT TGTCGCGTCT AAAAATAAAC 2100 

TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCAAAAGTT AGACAGAAAA AATCTAACTT 2160 

TTGGGGT GTT TTTATTATGA AATTAACTTA TGATGATAAA GTTCAGATCT ATGAACTT AG 2 220 

AAAACAAGGA TATAGCTTAG AGAAGCTTTC AAATAAATTT GGGATAAACA ATTCTAATCT 2 2 80 

TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG TTCGTCAAAA AAGGAAAAAA 23 40 

TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT AAAGTCTGAC ATGAAGGCTG 24 00 

GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT CGTACGATAC TTCTTAACTG 2 4 60 

GCTAGCACAA TACAGGAAAA ACGGGTATAC T ATTGTTG AG AAAACAAGAG GGAGAGTACC 2520 

TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT CCGATTGAAG GAGGAAAAAG 2580 

AGAAAGAAGA AAGACAGAAA TTATTCAAGA ATTAATGACT GAGTTTTCGT TAGATATTCT 2 64 0 

TCTAAAAGCC ATTAAACTAG CTCGT TTG AC CTACTACTAT CACTTGAAAC AGCTAGATAA 2700 

ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC ATTTTTATCG AACACAAGGG 2 7 60 

AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAACAAAT CGTGGTTATC TGGTAAATCA 2 820 

TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA GCTAAAATGC GACAGAAACG 2 880 

AAAATATTCT TCTCATAAAG GAGACGTTGG CAAG AAGGC A GAGAATCTCA TTCAAGGACA 2 940 

ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT GTGACAGAAT TTGCCG 2 996 

(2) INFORMATION FOR SEQ ID NO: 2 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 
(8) TYPE: nucleic acid 
(C) STRAND EONESS : double 
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(D| TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 261: 

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTCCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTT A AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA CTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC G AC CAATTCT 4 20 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 4 80 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA AT GGTTTTC A 54 0 

CGGTCCGAAA GCCCAACCTT AG CC AACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600 

GATAATTTCT TGACAAC AAC CAAGCCTTCT TTCACATTAT CAAGTGCTCT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTT AC G AC GTAGGGCAAG G ATTTCTTC T 720 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCT AG AT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 



(2) INFORMATION FOR SEQ ID NO: 262: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CCGAACAAAA TGGGCTAATT AG ATT AT AGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60 

ATTGGATGTG ACC ACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 12 0 

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180 

TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 240 

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 3 00 

TTGGTT CGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 3 60 

GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 4 20 
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CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 4 80 

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 54 0 

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCG AGT AC TTTCAGAATT 7 20 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



CCGTTCAAAG 


TCTTCATAAG 


ACTCGAAAGT 


CACAGTTCTT 


TCGTTCTTGC 


TGGCATCTAT 


60 


ATAGGTAATT 


TCAATCATGT 


TTAAAACTCC 


TTTGTTTAAT 


GCTAACTTTA 


TTTTACTCCT 


120 


TATAAAAGAG 


AATGT CAAGA 


AAAATGATTG 


CGCACGCAAC 


TTTTTTTAAA 


ATCATCTTAA 


180 


ATCAAGAAAT 


CCAAACCTGC 


TTCCAAGCTT 


TCTTCGACAG 


TCTTTTGTAG 


CGAGGCCAGT 


240 


GTCTTTTGCC 


CATCATTTGT 


CAGGCAGATA 


AAACTAGAGC 


GTCTATCTTG 


ATGGCAACAC 


300 


ATGCGACTGA 


GTAGACCGCA 


ATTTTTAGCT 


TCCAAGCGAG 


CCACCATCCT 


AGAAACTGCG 


360 


CTCGGGCTCA 


GATGAAGCTT 


ATCTGGCAGG 


TCAAT CTGGC 


GTAGAGATTT 


TTCTTCAGCC 


420 


AAGTCC AG AT 


AGTAGAGCAG 


GTAGAACTCT 


TTCAAGGTCA 


CACTTTCCTC 


GCTCTGTTGG 


480 


GCAATGGTCT 


CTTCCAAGAG 


ACTTTCAATT 


TCTTTCTGAC 


GCCGATTGAA 


GTCAAACCAT 


540 


TTTTCCAAAT 


AGGTCATAGT 


GTCTCCTTTC 


TTTTTAGAGT 


CATAAATAGA 


AGAAAGTCCA 


600 


TTAACGCGCA 


GTCTCTG CGT 


CACAAGATGA 


TTGCGCATGC 


AATAATTATA 


CTACTTTTCA 


660 


AGAATGCTGG 


CAAGCTCTGT 


TTTTTAGTGG 


TTTTATTTTT 


GTGTGAATAA 


TGGGGGAATC 


720 


CTATTGTTTC 


AATTTCTAAC 


TCCTTATCAC 


ATTCGAATTC 


AGATTTTATT 


TCATTTCTCT 


780 


ATCTATAGTT 


GCTTAGTTTA 


AAATAAGCAT 


GGTCTAATAA 


AGCTATGCAT 


ATAGTACTGA 


840 


TTTTAAACAA 


GGAGCATTAG 


ATTCCATTAA 


AGGAGGGCAC 


AGACATGTCG 


AGGCGGCCAA 


900 
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AGTTTTTCAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 
GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG TTTTGATTGT 
GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 
AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT GGATAACCTA 
AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT CTATCGTCCT 
TATGCAGGGG CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAGCGGACG GCGTTTTGAG 
CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 
CAAATTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGC 
TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 
TATACAAAAT AGTAAAATGC I I TflTH T T TAGCAAAAAT ACCTCAAGTT TCTTGCTATT 
TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATCCATA CATGAAAATA 
ATACTCGAAA GGAAATTTCA AAATATTTTT TAGACGTCAG AAGGGTGAAT ATAGAGAAAC 1680 
AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG CATTGGCTAC GGGCCTCGAC 17 40 

TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC AGGTCATGAC 
CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG CTTGATATCC TCAAGGGGAT 
AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG TATTTACAAA 
TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT GCTTTGGCAA CAAATGATAC 
AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT TGTCAGCTTC 2040 
AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA CCTCAGCAAG 2100 
TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG CATCATCTAC 2160 
TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGC TCGAAGAAGA 2220 
TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 22 80 

TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2340 
TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400 
AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2 4 60 

TGTTCCAATT TACTATAAAT TGAAACTGAC AAATGATGGT TCAAAATTGA CCTTTACCTA 2S20 
TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2S80 
TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 
TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 



1800 
1860 
1920 
1980 



2640 
2700 
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ATCCTATAAT 


ACATCTACAA 


TGACGACGCA 


GGGTAGTGGG 


TATACTTGGG 


GAAATGGTGC 


2760 


CCAAATGAAT 


GGTTTCTTTG 


CTAAGAAAGG 


ATATGGATTA 


ACATCATCTT 


GGACTGTACC 


2820 


AATTACTGGA 


ACCGATACAT 


CCTTT AC ATT 


TACCCCTTAC 


GCTGCTAGAA 


CAGATAGAAT 


2880 


TGGAATTAAC 


TACTTCAATG 


GTGGAGGAAA 


GGTAGTTGAA 


TCTAGCACGA 


CCAGTCAGTC 


2940 


ACTTTCACAG 


TCTAAGTCAC 


TCTCAGTAAG 


TGCTAGTCAA 


AGCGCCTCAG 


CTTCAGCATC 


3000 


AACAAGTGCG 


TCGGCTTCAG 


CATCAACCAG 


TGCCTCGGCT 


TCAGCCTCAA 


CCAGTGCGTC 


3060 


AGCTTCAGCA 


AGTACCAGTG 


CTTCAGTCTC 


AGCATCAACA 


AGTGCTTCAG 


CCTCAGCATC 


3120 


GACAAGTGCC 


TCGGCTTCAG 


CAAGCACATC 


AGCATCTGAA 


TCAGCGTCAA 


CCAGTGCTTC 


3180 


GGCTTCAGCA 


AGTACCAGTG 


CTTCAGCTTC 


AGCATCAACC 


AGCGCCTCGG 


CCTCAGCAAG 


3240 


CACCTCAGCT 


TCTGAATCGG 


CCTCAACCAG 


CGCCTCGGCC 


TCAGCAAGCA 


CCTCAGCTTC 


3300 


TGAATCGGCC 


TCAACCAGCG 


CCTCAGCCTC 


AGCATCAACG 


AGTGCTTCGG 


(""PTC* a f*r* a. & c 


lien 
j jOU 


CACAAGCGCC 


TCGGGTTCAG 


CATCAACCAG 


TACCTCAGCT 


TCAGCGTCAA 


CCAGTGCTTC 


3420 


AGCCTCAGCA 


TCAACAAGTG 


CGTCAGCTCA 


GCAAGTATCT 


CAGCGTCTGA 


ATCGGCATCA 


3480 


ACGAGTGCGT 


CTGAGTCAGC 


ATCAACGAGT 


ACGTCAGCCT 


CAGCAACCAC 


CTCAGCTTCT 


3S40 


GAATCGGCCT 


CAACCAGTGC 


GTCACCTCAG 


CATCGACAAG 


CGCCTCAGCT 


TCAGCAAGTA 


3600 


CCAGTGCTTC 


AGCCTCAGCG 


TCGACAAGTG 


CGTCGGCCTC 


AACCAGTGCA 


TCTGAATCGG 


3660 


CATCAACCAG 


TGCGTCAGCC 


TCAGCAAGTA 


CTAGTGCATC 


GGCTTCAGCA 


TCAACCAGTG 


3720 


CCTCGGCTTC 


AGCGTCAAAC 


AGTG 








3744 



(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 795 base pairs 
(B> TYPE: nucleic acid 
(C> STRAND EON ESS : double 
<D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTAT CATTTC 120 

AC CCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCCGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AG ATCTG ACT TGGAGGCTAG 3 00 
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TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360 

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 42 0 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 4 80 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 54 0 

AG AGGTGGC A GAGCAAGTCT GTGATCGGAT TGC C ATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TT ACC C AG AC CAGTCTTTGG AAACTATCTA 660 

CCTTAGTCTT GCTGCTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 

AGCAGGCTAA GAATC 7g5 
<2> INFORMATION FOR SEQ ID NO: 2 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2 65: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60 

TACCAACGCC AGC CAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 4 20 

AAGAAC CCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAG^.G 4 80 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720 

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGCAG 900 
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AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 102 0 

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 114 0 

TG ACT AC AAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260 

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 132 0 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 13 8 0 

TAATCTTAG A GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680 

(2) INFORMATION FOR SEQ ID NO : 271: 

<i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xii SEQUENCE DESCRIPTION; SEQ ID NO: 271: 

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AG AGG AT AG A GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 12 0 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 18 0 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 2 40 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 3 00 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 3 60 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 4 20 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 4 80 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

T CAT AGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTG AT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 



CCAGCAAATC 


AATAACTGCA 


ATTGCTATAA 


AATGGATTCT 


ATAGAGTTTT 


TTCATGACAA 


60 


GACCTCCCTC 


TTTTATCTAA 


CTTCATTCTA 


CTCCAAAAGA 


ATGGGAGTTA 


CAACTAAAAT 


120 


GATAAAAATA 


GCAGAAGGGA 


GATTCTCTTA 


AGTTGGCTAG 


TATTCTTTAT 


TTGAGTTTCC 


180 


TTCTATTATC 


TAACTTCTTC 


ATCATTCCAG 


ACAAATAAAG 


CTCCGATTCC 


ATTGAGGATA 


240 


TAAAAGATGT 


ATTTACCGAT 


ATTGG CGAAG 


TTTCCTTGAA 


TACCAGCTTT 


TGTCAGCTGA 


300 


ACGAAATTGT 


AAATCAACCA 


AAAGCCCCAC 


TGAGTTGTTA 


GTTTTAATGC 


ATTCAAAGCA 


360 


TTGGCAATGA 


GGGACAGTGC 


AAAGGCAATA 


GTTGTTACGT 


AGGCAAGGAG 


ATTCATCTTG 


420 


CCCCCATATC 


CGATATAGTT 


GGTCACAAAG 


GCAAAGAGGA 


AGGCGATGAT 


GGAAATGATG 


480 


ATGGCCGCCA 


ATTTTACCTG 


TTTTTGGCTC 


ATTTGGTTGG 


GTCTGCCTTC 


TTGCGAAGCT 


540 


TCCCACTTCT 


TTATAGCAAA 


GGTATAAATG 


AGGAAGGTGA 


CGGGATAGGT 


AATGATGGCC 


600 


GCCTTATTTC 


CAAGGATATA 


ATCAATAG C A 


CCGGACAAAA 


TGGTATTAAC 


AATACCAAAG 


660 


TAATTTCCCC 


ATTTGCTTAA 


TTTCCCCGTG 


AAACGAGTGG 


ACAACATGGA 


AATCCCAACG 


720 


TTGGTTACGG 


AAATCAATCC 


AAAGGGTACA 


AGAGCTGTCC 


ATGATCCCCA 


GTCTACAAAT 


780 


TTATCGAGGT 


GTGAGTTGAG 


GTAACCAGAT 


GCAATCGCAA 


TCCCAACGAC 


CAAAGCAACC 


840 


CCGAAGAGGT 


CAAACTATTT 


AG ATGT AG CA 


AAAATTTTTA 


GTGATTTTTT 


CATAGGTTAA 


900 


ACTACCTTTC 


TTTTTTTCAA 


AT ATTCTCC C 


ACCAAATGAA 


AGTAAAATAA 


AATGATAGAA 


960 


ATAAAACCCT 


GAAAATAAAG 


GTTCTATAAT 


ATTTGTAGTG 


GGTAAATCCA 


CTATAGATAT 


1020 


TATGGAGCCT 


ATTTTATTGT 


AGAAAAAAAG 


TCCCATATGA 


CCTATAATGA 


AAAGCGACAA 


1080 


AACAACTCAT 


TAGAAAGAT 










1099 


(2) INFORMATION FOR SEQ ID NO: 273: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
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CTGGGATTCA CGTGAAAACG AAGCCCAGAG AGTAGCCAGG TGTACTGCTA GAACAGTGAG 60 
TGAAATTGAA TATTACCATA GAGAGTCAAC CCAGATAGCT CAGGCTTTAG TTGAAAATCA 
AGCTCGTATC GAGGGAATCT AT AAATAC TT TAGCCTTAGC ATGCCAGACT ATTTTTACTG 
GCAATTAGAG CGGAAAGCTT CGCCTTATAT ATCAGTCTCT CTGTATGAAA ATGTTGATGA 
CCTCTATGTT CGAAATGATT TTGTAACTGG GGTGGCCATT GCTTTTCAAG ATTACAAGGA 
AGTCTATGTT TCTACTAAAG ACAAACGTAG GkkAGAAAAA ATCAGGGCTG AGGATTTCAA 
ACCAGCAGGA AATAGTTTTG CCATTCCAGT GTCAGATCCA GTGTCAG A TC AAGACTTAGG 
AGTGATTTAC ATCTCCTTGG ATCCTGCTGT TTTATACCAT GCCATTGATA ATACTAGAGG 480 
TCATACTCCG ATGGCAGTAA CAGTGACCTC ACCTTTTGAT ACGGAGATTT TTCATATGGG 54 0 

TGAGACAGTT GATAAGGAGA GTGAAAATTG GCTAGTTGGC TTAACTTCTC ATGGATATCA 
GGTTCAGGTG GCAGTTCCTA AAAACTTTGT TTTACAAGGA AC AG TG ACT A GCTCTGCTTT 
GATTG TGGGT TTGAGCCTTC TCTTTATTGT CATTCTTTAT CTGACTTTGA GGC AG ACT TT 720 
T GCTAATT AC CAAAAGCAGG TAGTGGATTT AGTAGAATCC ATTCAAGTCA TTGCTCAAGG 780 
CGAAGAGGGG CGTCGGATTG ACATTTCCGA GAAAGATCAG GAATTACTCC TAATCGCGGA 
GACGACCAAT GATATGTTGG ATCGATTGGA AAAGAATATC CATGATATTT AC C AGTT AGA 
GCTTAGTCAA AAAGATGCCA ATATGCGAGC CTTGCAGGCG CAAATCAATC CTCATTTTAT 
GTATAATACG CTGGAGTTCT TGCGCATGTA TGCAGTTATG CAGAGTCAAG ATGAGTTGGC 
AGATATCATT TATGAATTCA GTAGTCTCTT GCGTAACAAT ATTTCCGACG AAAGAGAGAC 1080 
CCTCCTCAAA CAGGAATTAG AATTTTGCCG TAAATACAGC TATCTCTGCA TGGTTCGCTA 1140 
TCCCAAGTCC ATTCCCTATG G TTTCAAG AT AGATCCAGAG TTAGAGAATA TGAAGATTCC 1200 
CAAGTTTACC TTGCAACCGC TGGTAGAAAA CTATTTCGCG CATGGTGTTG ACCACAGGCG 12 60 

GACAGATAAT GTGATTAGCA TCAAGGCTCT TAAACAGGAT GGTTTTGTGG AAATTTTGGT 1320 
GGTCGATAAT GGTAGAGGAA TGTCGGCTGA AAAGTTGGCA AATATCCGAG AAAAATTAAG 1380 
TCAGAGATAT TTTGAACACC AAGCCAGCTA CAGTGATCAA AGG CAGTCTA TCGGGATTGT 14 40 

CAATGTACAC GAGCGTTTTG TGCTCTATTT TGGAGACCGC TATGCCATTA CTATAGAGTC 1500 
TGCAGAGCAA GCCGGTGTTC AGTATCGTAT TACAATTCAA GATGAGTAGA AAGGGAGAAA 1560 
ATGTATAAAG TATTATTAGT AGATGATGAG TACATGGTGA CAGAAGOTCT GAAGCGTTTG 1620 
ATTCCCTTTG ATAAGTGGGA TATGGAGGTC GTCGCAACAG CCAGTCATGC CGATGAAGCT 1680 
CTAGAATATG TTCAGGAAAA TCCTGTCGAT GTCATCATTT CCGATGTCAA TATGCCAGAC 1740 



840 
900 
960 
1020 
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AAAACACCGC TTGATATGAT TCGGGACATG AAAGAGATCT TACCAGATGC TGCCTATATC 1900 

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GT AAAAAG AG CAATGAACCT TAGTGTGGTG 18 60 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980 

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 222 0 

TCTGAGAGTC TCTTTGCCT A TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 22 80 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2 3 40 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 24 00 

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TT CATGCT AT TCAATCCTTC 24 60 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2S20 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATT AC CAAAA AG AGCTTTCC 2580 

CTCAAGGATA TCAG TAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 264 0 

CGTGAAACCG ATTCGACCTT TGCAGACTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2 700 

CAGCTCTTGC TTTCAACTAG TGA 2723 

(2) INFORMATION FOR SEQ ID NO: 274: 

Ul SEQUENCE CHARACTERISTICS: 

<A] LENGTH: 836 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT AT AT AAGT AT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TC CGGCGAT A TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT AC CTTGAG AG CTGTTTCGAT TGGAAACTGG 3 60 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420 
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TCT AGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CGACTGCTAC TCCCGCTTTT 480 

TGGAGCTGGA TAATTGGAGC AATTCCTGAT GCCACTTTGA GGTTACTGAT AGGATTGTGG 540 

GCGATAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCG 600 

TGAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTCCCGTATC GTTTGAGGAT AATTCCT G AC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAACTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTC GATTTTTATA TTTTAA 836 
(2) INFORMATION FOR SEQ ID NO: 275; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 5: 

ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAG CTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTGTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 300 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TTTTTTTACG 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTT T 420 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 4 80 

TATATGATTG AGCTAAGTAA GAATCCTGTT TTGGGTGTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGCCAAT 600 

CT AATTGAT C TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTG CCTCTTT AGGGGCTAAT ATT GCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCCG AACAGTTGTC TGCGTTATTT TTCTAGTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAAAT CTAGCACCGG AAATGACCAT CGCCTTTGCT 84 0 

CACGGAACCT TTAATATTAC CAACACCATT CTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960 
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CTTGATGAAC ATTTCATCAA ACAGGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAAGAG 
CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA CTACATCATT 
GACTTGGATG AAAAAGTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 
GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 
AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 
ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 
GCCG CCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 
CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 
GCAATCAATA AGATAGAACG TGTT CTC AG A AAAACCCACA TCAAACCCCT CAACAAAGGC 
GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTC ACACT A C ACT CGTGT A 
TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 
AGCTATCCAT CATAATTGGA TGG CTTTTT A CTTTTTCCTA AGCAAGACTA GGATGAATGA 
AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 
ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 
TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 
AGAAAGGAAA CTCACATGAA CACTTTACCA AATCATCACT TCCAAAACAA GT CTTTTT AC 
CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 
CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 
CGCCGCTACT CTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 
ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 
TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 
GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 
AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 
(2) INFORMATION FOR SEQ ID NO: 276: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 

(D) TOPOLOGY: linear 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2335 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 276: 
CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 
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TTGCTTTT C A TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GCGAGCTTAA 120 
GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180 
GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240 
AGCAAACAAC TTCGTCCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 300 
GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TCTAGAAGTT 3 60 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 
TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 
GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 
AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT G G CTCTTGTT 
GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660 
GTAACAATGG ATGTTGGGAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720 
GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 752 
(2) INFORMATION FOR SEQ ID NO: 27 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2643 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

GTCAACATTG ATTTCAAGCC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60 

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120 

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 180 

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGACCTCTTT G ATTTG CCTC 24 0 

GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 3 00 

CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTGG GGGTTGATAA AAAATCTCCC 3 60 

GATATGAAGA TTGCTGTCTT CT AG C CTGAT TTTCTGGATA AATCCCACAA ACTCTTGTTC 42 0 

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTGG 4 80 

TCCCATCCAT TGTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540 

AG CGTTTT CT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600 

TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATGATTGAT 6 60 
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TTTTATTTTT 


TTCTCGTCGG 


GAG CATTCT A 


GCTTCCTTTC 


TTGGTTTGGT 


CATTGACCGT 


720 


TTTCCAGAGC 


AATCCATTAT 


CAGTTCAGCC 


AGTCACTGCG 


ATTCCTGTCA 


GACTCCCTTG 


780 


CCTCCCTTAG 


ATTTGATTCC 


GATTCTCTCA 


CAGGTCTTCA 


ATCGCTTTCG 


CTGTCGCTAC 


840 


TGCAAAGTTC 


GCTATCCTGT 


CTGGTATGCC 


CTCTTTGAAT 


TAAGCTTAGG 


ACTCCTCTTT 


900 


CTGC TTT ACT 


CTTGGGGATG 


GCTCTCCTTG 


GGGCAAGTCG 


T CCTAATC AC 


CGCTGGTTTG 


960 


ACCTTGGGTA 


TCTACGACTT 


TCACCATCAG 


GAATATCCCT 


TACTGGTCTG 


GATGACTTTC 


L020 


CAGCTAATCC 


TAATAGCTTC 


CTCTGGCTGG 


AATCTGGTCA 


TGGTCTCCTT 


CCTCATACTT 


1080 


GGAATTTTGG 


CTCATTTTAT 


CGATATCCGC 


ATGGGTGCAG 


GGGATTTCCT 


CTTTTTAGCT 


1140 


TCTTGTGCTC 


TCGTCTTTAG 


CGTAACGGAG 


TTACTGATCT 


TGATTCAGTT 


CGCTTCTGCC 


1200 


ACGGGTATCC 


TGGCCTTTCT 


CCTGCAAAAG 


AAAAAGGAAA 


GACTTCCTTT 


CGTGCCTTTC 


1260 


CTCTTACTTG 


CTACTTGTTT 


GATTATTTTT 


GGTAAGCTAC 


TGCTTGTCTG 


ATAAAATCCA 


1320 


ATTTCTGCCA 


TATATCCTTC 


ATGAAATTAT 


TTCACAGTTA 


AATTATAAAT 


TATTTCTTTT 


1380 


GTACAAAGGG 


ATGATGTTAT 


CAAATCGATC 


TGTTCTTCTA 


TCTTCTTGAT 


ACTGATCAAA 


1440 


AAATTTCATT 


TCGACTGAAA 


ATATTTCGCT 


TATAAACTGT 


AAACGAATAC 


TTTGTTTAGA 


1500 


CATTATAGTC 


GCTAGACTGA 


CTAGATGATT 


ACTCAAAACG 


ACGTCCAGAA 


TACTCTTTAC 


1560 


TTTGCTTGGT 


TTTTTAACAA 


AAATTTGATC 


ATCCAAGGGT 


TCAATCATTT 


TGTAACCTTT 


1620 


TTGCGCAATT 


TGACGATAAA 


AGTAAGAATG 


TTGCTTTGGA 


GTCAATAATC 


CTAACTTAAA 


1680 


AGCTCGATAC 


TCTAAAGCCT 


GTATCGAAAC 


ATTCAAATCC 


GACTTCAATA 


AAATATAACT 


1740 


ATCAGGATTG 


CTGACACGCT 


TGCCAACCCT 


CTCTTCAAAT 


TTGACTAAAA 


ACTCTTCTTT 


1800 


TGGCAATAAA 


AAACATGATG 


CAAAATAATT 


TGCTTCTTGC 


TCCAAACGAT 


CGCCATCTTC 


1860 


ATTCATATCT 


TTATATTTAT 


GTAAAAGAAT 


ATGTCCTAGC 


TCATGAGCTA 


AGTCAAAATT 


1920 


TCGACGTACA 


GATGATTTAT 


TCGTTCCTAA 


CACAATATAA 


GGTCTTCCCA 


ATTTTGACCA 


1980 


TGCGCTATAA 


GCATCAGCTT 


GGCCATTAAT 


TAATCGTTCC 


ACGATATAGA 


TGCCTGAACt; 


2040 


TTCTAATTTA 


TAAAGCAAAT 


CATGATTATC 


TTTTGAAATA 


CCTAATTTTT 


CCCTGGCATA 


^100 


AAGAGCCAAT 


TCCTCAATGG 


ATTCTCCCTT 


ATGATAAGAT 


TCACTCACTA 


CATTACTTAG 


2160 


GTCATGAATT 


ATAATATTAG 


GTATAATTAC 


AAAACTTTCA 


AAATAATCAA 


TCAAACTATC 


2220 


TACCTTATGT 


AAATACATAG 


TTTGAATATC 


TATTGTTTTC 


CGTGTTGCTA 


GGTCTGCATT 


2280 


TCTAAAGGCA 


ATT AC AGAAG 


AATCAAATCG 


AATGCTCTCT 


TCTTCCTGTT 


CAAAATAAGT 


2340 


TAAATCAACA 


TGAAATTGGT 


TGGCCAAATG 


CATTTTGGTT 


GATAATTTAG 


GTTTCGTTTC 


2400 


GTTGGACTCA 


AACTGCCAAA 


TGGCTTGTTC 


CGTTAAATTA 


ATTCTCTGAG 


CTAATTCTGC 


2460 
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TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2 520 

ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2 580 

GTTTC CG AT A ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2 640 

GGG 

(2) INFORMATION FOR SEQ ID NO: 278: 



2643 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


278 : 






TGACCAGTGG 


CAAAATGGCT 


ATCCAAATGC 


AGATGTTATT 


ATCGATGATA 


TCATCTCAGG 


60 


GCAAGCCTAC 


GTAGCCTTGG 


AAGAGGGAGA 


ACTGCTAGCC 


TATGCTGCTG 


TGACCAAGAG 


120 


TCCAGAGGAG 


GCCTATGAAG 


CTATTTATGA 


GGG AAACTGG 


CAAGCTGGAG 


AGTCAGAGTA 


180 


TCTAGTCTTT 


CACCGTATTG 


CTGTGGCAGC 


AGATGTGCAG 


GGAAaAGGAG 


TTGCTCAAAC 


240 


CTTCTTAGAG 


GGCTTGATTG 


AAGGTTTTGA 


TTATCTTGAT 


TTTCGCTCAG 


ATACGCATGC 


300 


TGAAAACAAG 


GTTATGCAAC 


ATATTTTTGA 


AAAACTTGGT 


TTTAAACAAG 


TCGGTAAGAT 


360 


GCCAGTAGAT 


GGCGAACGCT 


TGGCCTATCA 


AAAATTAAAG 


AAATAATGCA 


AAAGAAGTAT 


420 


GTAAAAATCC 


TCTACTCCTC 


ACCAATTGGT 


ATTCTATCAC 


TTGTAGCTGA 


TGACCATTAT 


480 


TTGTATGGAA 


TTTGGGTTCA 


GGAGCAGAAG 


CATTTTGAGA 


GGGG ACTAGG 


AGATGAAACG 


540 


ATAGAAGAAG 


TTGTwAGTCA 


TCCTATTTTA 


GACCCAGTTA 


TT 




582 


(2) INFORMATION FOR SEQ ID NO: 279: 









<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 554 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 279: 
CCCAAGCTAC TAAGAGACTA AAACTTGCTA GAGAAGCAAG AGAAAGTGTG AATCTTTTTA 
ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 
CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 



WO 98/18931 



PCT7US97/19S88 



1310 

GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240 

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300 

CACATCCCTT TCCTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 3 60 

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420 

AAAGAGGATT TGAGGTCGAT ACCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480 

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540 



GCGAsCAGCC CAAT 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



S54 



{xi> SEQUENCE DESCRIPTION: SEQ ID NO: 2 80: 

CCCGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATG CT AC ACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 120 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGC GGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 3 60 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAG G 480 

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600 

ATAGAGGCGG TCAGCTGAAG ATT C ACT C AA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TT AC CG AG AA GATTTGTCAC AACCACAGGG GGATGG 766 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xii SEQUENCE DESCRIPTION: SEQ ID NO: 2 81: 
CCGCCCACGG TTCCATCCAA CTTCACAGGT GTGCACTTGA TTGTGTATGT AATTGTCACT 
AACGGTAGAA TTTCACCTAT CCCTCCTATC TGCTCGCAGT ACCCGCAGAC TTTCTGAAAG 
AAGAAGATAA CCTACTTATC CGTTGCTATG ATTATACTAA AGTTTCTACT TTTTTGCAAA 
TAGATTTTTA AATTTTTGGC TAATTGTCTG AATCAGGGTC GGAAGTTTGA CGACCTTGTC 
ATTGCCTAGT TTTTCGCGTG CAATTTTGAG AATGGCACCT GAGTCTTTTG AAGCAAAGAG 
GAATTTTCCT TTGTCTGTAA AGACTTCGAA GTGGCGGCTG ATTTTGCGTC CAGTGACATT 
GGCTCCAATC TGATTGATAT GGCTCCAAGG AATCTGGATA AATTGTTCGA CATTGACATC 
TGGGTAAAAT TCCAAAGCCT GATCTCCGAC AAGGAATTTC CCAACTTTCC CAGCGATAGA 
GAGGTAGGAA GTGCCTGTCG TACTGAGGAG TACTGTTTTG TTAAGTGATT GGGCCATGCT 540 
TAGTCTTCCT T ACTTTCT CC AAAAAAGGCA TTGTAGAGGG CTTTAATTGC TGCTTTCTCT 600 
TGGTCTTTAT TGACAACAAA CATAATAGAA ACTTCACTAG AACCTTGAGA CATCATCTGG 660 
ATGTTGATTT TGTTTTCAGA TAGAGCGCGT GTCGCAGTAG CAGTCACTCC GATATGGCTC 720 
TTC AT TTTTT CACCAACAAT CATAATGATA GAAAGGTCGT GTTCGATTTC TGCATGATCT 780 
ACTTTAGCCT TTTGAACCAA CTGACGCAGG ATTTCTTCTT CCTTGATGGG AGTTAGTTGG 84 0 

CGAGAACGGA GAATGATAGA AAGAwCGTCG ATACCTGTTG GCATATGTTC CCAACCGATG 900 
T 

901 

(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS : 

<A> LENGTH: 1765 base pairs 
<B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 12 0 

TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA A TT A TCGC AT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 24 0 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GG ACGTTCCA ACTCTTCCCC 300 



60 
120 
180 
240 
300 
360 
420 
480 



WO 98/18931 



PCT/US97/19588 



1312 

ATTTCTGGGA GTTGGGGTAA AAATGTTCAC TGGACGTTCC AACTCTTCCC CATTTCTGGG 3 60 

AGTTGGGCTG ATACAGTCTC CCAGACTGTA TCACTCCTCC ATAAAGCTGT TGAAGACTTC 420 

TTCAATCATG TTCCATTCGT CTTCTGAGTC TTCTGGGATT GGTTGCAATT CCCCTTCTGT 480 

TCCATCTTCG TTTTCGATGA ATGAGTAAGC TTGGATTTCA ACTTGTCCGT CTTCGTCTTC 540 

TTCTGCGTTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT TTCCATCAAT 600 

TGTCAAAAGG ATTTCAAACA AGGTTTCATT TCCTTGCTCA TCTACT AGTG TGATTAGTTC 660 

ACGTTCTTCG TGGT CGTGGT TATGATCGTG TGACATAGCC TCGCCTTTAT ATTAAAATTT 720 

T CT ATCT AAA TAATTTTGTA AAATCAGCTG AGCTGCTAAC TTATCAATGA CTTTCTTGCG 780 

CTTATTGCGA CTGATATCTG CTTGTTCAAT CAACATGCGC TCAGCAGCCA CTGTTGTCAA 84 0 

GCGTTCATCC TGATACTCTA CTCGTAAACC AAAAAACTCT TCTAGCTTTG CTCCGTAGCT 900 

TGACTAGCTT CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTACGCAA GCCCACTACA 960 

AATCGTTCCA CCTTGTAAGT ATCAACCAAT TCCTTAACGC GGTCAAAACC AAATTGGCCT 1020 

TGTTCTT CAT TTATCTGGAT GATTTCAAGC CCTTGAGCTG TAAAACCAAG CGGAT CGCT A 1080 

ATCGCCACCC CTACCGTTTT TGAACCGACG TCCAATCCCA TAATTCTCAT AGGTTATAGA 1140 

TCGACTCCTT GTCCTTTGAG GTAGTAGCGA ACCAATTCCT CAACGATTTC ATCACGCTCA 1200 

TACTTACGGA TTTGATTTCG TGCATTATTA TAACGAGGAA CGTAGGCAGG GTCTCCACTC 1260 

AATACGTAAC CTACGATTTG GTTAATTGGG TTGTAaCCCT TATCGTTCAA CGAAGCATAA 13 20 

ACATCTGTCA AAGTTTCGCT AATTTCTTTT TTATTGGAAT CGTCCAATTT AAAACGTACT 1380 

GTTTCTTCAG T AAAT CCC AT TCTAACACCC TCTTTCCTTA GAATAGTACC ATTATAGCAT 1440 

AATTCCTTAC CTTCTACAAT TCAGGCAGTC TATTTATTTG GATTTTCTAT TGTTCTGTCG 1500 

CGCCATTTGC CAATCTATCT GAAATATATT TGCTTGGTTC ATTTTTCAAA AGATTTTCCA 1560 

AACCAATATT CTT CAGATGT TCCAACTGGG AAGCCTTCTT GACATCCAGA ACTTGAAAAT 1620 

CAAAACTAGT CGTTGTTTGA AGTTCCGTTG CGCTCAATAG TTTTGTTTCA AGTTTGAAAC 1680 

CTGCCAATTT AC GAGCTTCA ATGATAGACT TATCCTTCTC CTCCGCTTCA AG AAG AG CTT 1740 

TTTGAGTTTC CTCCACTCCA TGTTG 1765 

(2) IN FORMAT I ON FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 6 base pairs 
<8> TYPE: nucleic acid 
<C> STRANDEDNESS : double 
( D ) TOPOLOGY : 1 inear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 3: 

CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT A T AC C AC ATT 

TGCCCCTACA TGTGAACGAG AGAAACGCT C T AG ACATTG C CAAGAAGGAA AAAAAAGGGT 

ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATCAAGAAAC AAAGCAAGTA CAAAGAGGTC 

GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCCCCTGCCT 

TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 

CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 

GAACAAGGGC AACATCAACA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 

GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 

TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 

GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 

TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 

CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 

GAACGAGGCA TTGATGGGAT TGACTTGGAG GAG CTGG AAG GCCACTTCAA AACAGGAAAA 

ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 

CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTAT AT CGTAGAGGAC 

GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 

GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 

GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTCC CCTACAAAAA TATCCTAGAC 

TACGACAGCA ACCTCATTAT GCAAAAGCCC CTGTCACTCT ATATTGACAC TCAATTGTTT 

GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 

AGGATAACTA AAACACCTTG TCCCCTTCCT C A TT ATT CC C TACACGATCG yTT ATTGCT A 

GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 

TTTGAAGAGG CCTATTTAAG CACCTG 

(2) INFORMATION FOR SEQ ID NO: 284: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 900 base pairs 
<B> TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTGCGGAGGG TTCATTTGAC 60 

ATATTTAGAA AACTCCCCCA AAGAATTAAT TTTAAGAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTGCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AG AATGG AAA TATTTTTTCC TCTCTAGTAT TTTATTTAGA AGATTTAATC 240 

TATCCTTGGA TTGTTAAACC TTTGGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 300 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 360 

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGACTCT 4 20 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTT AT C AA ATTTAGAAAA TGATTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 540 

GATACACACA GCCAAGGAAA AACTGTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GT AT ATCCG A 6 60 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAA CT A TTGCTTATTC GGATCACAGC 720 

TATGAAGAAT TTATTGATTA TCTACCTCTA G AGCAAAAG A AAAATTTACC TGAATATTAT 780 

T AT AATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT GTAATAATAG ACTTTGAAAC AATGTTACGG 900 
(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 



TTATTT AG C A 


GAGGCAGTTT 


TAAATGTGAA 


GGATTTGGTC 


AGTCAAACAG 


ttttttatc a 


60 


GCAGATTATT 


GGTTTAGAAA 


TCCTATCTCA 


AACGGATACA 


GAGGTCGTTC 


TGGGACTTGG 


120 


AGGAAAAGCC 


TTGGTACACT 


TGATTCAAGC 


ACAAGAGGGT 


GGAGAAGTAA 


GGGAACATTA 


180 


TGGTCTTTAC 


CATCTGGCTA 


TTCTTTTGC C 


GACACGAAAG 


GCTTTGGCGG 


ATGTCTTGAA 


240 


GCACCTGACG 


GATTTACAGA 


TTCCTCTTGT 


TGGCGGTGCA 


GATCACGGTT 


ACAGTGAGGC 


300 


CCTTTACTTA 


GAGGACTTGG 


AGGGAAATGG 


CATTGAACTC 


TATCGAGATA 


AGCCAGTTTC 


360 


CACATGGGAT 


ATTCGAGAAG 


ATGGACGTAT 


TATCGGGGTG 


ACTGAAGTCC 


TTGCGGCTCA 


420 


GGATATCTAT 


GAGTTGGGGG 


AAAGAGTAGA 


GCCTTTTATC 


CTAGCAGAGG 


GTACGAGAAT 


480 
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GGGGCATATT CATCTTTCTC TCAAGGATAG TCGAAAGTCC AG AC AG TTTT ATCAAACGGT 540 
GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TAGTGCTAGT TGGATCGCAG CTGGGGACTA 600 
CCAT CATC AT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 660 
CCTACCAGGT TTAGCCTACT ATGTCATCGA AG TCGC AC AT AAAGAAGAAC TGTTAACGAT 720 
TGCCCAACGA GCACAAGAAG TTG ACGC AC C AATCAAATGG ATGACATCGA TCCAATTGGA 700 
AATCACAGAC TCAGATGGCA TCGTG AC CCG TATTCGTTTA GCTAGATAGA TGGTATGTGA 
TGAAGGTAGA GCATCAATTG TA 
(2 J INFORMATION FOR SEQ ID NO: 2 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 60 

GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 120 

AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCT CATTTT A 180 

T C ATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 240 

TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 300 

ACAAATCCAA GCGACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 3 60 

ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 4 20 

TTTCCAAGTC CTCGATG TTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 480 

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 54 0 

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcT t G ACTGTAAGGA 600 

AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 6 50 
(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEC ID NO: 287: 

GAT AG CAATC CGCTTCAGAA ACTTCTCGCT TACCTCTAAC TCCGATCGCT AGTTTGGGAG 60 

AAGATACTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTGTAAAAAC AACTTTTCTC 120 

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT TTCAAAATAG TCCTGTACTC 180 

GCTCCACATC AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 24 0 

TAAAATTTTC TTGCAAATTA GTTAGATTGA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300 

TATCAGTTGC TAAAGGTGTA CCAGGATACA AATTCGCTAA AGTTGAAAAG AATAAACACG 3 60 

AATCTGGATC ATCTTGGTAC ATTTCTCGTT CTTGCTGAAT AATATCCTGC TCTGTCAGAA 42 0 

TGGAAGCTTC AGTAAAGTGT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 4 80 

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AAGGCATTAC 540 

TATCTGCACC TAG ACT CGT A AAAGCCGACA TCAAATCACT AG AAT CTTCT CTCTCAAATA 600 

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660 

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 72 0 

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 7 80 

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840 

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATT AG CT ACT CTACAAATAG CATCTTTGTC 900 

AATTTGTTCA AGCTTTGCAA TCCAACTTTT AAAGTCTGCT GAAGATTTTC CAAATAAGGC 960 

ATTTTGATAA GCACGTTCAA TCAATGAAGA ATGATTATCT TGAGAAAGTA ACAACGACCA 1020 

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080 

AAGCCGTTGA TT ATT CATC A ATTTACGAGC CTGGTTACG 1119 



(2) INFORMATION FOR SEQ ID NO: 288: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDN ESS : double 

(D) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGG CCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTT CTTC CAT CTGCAAGTGC GCCTGCGTTA TGTG AT CCCG 240 
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GTCTTTTGCA TGTG TGTGGC ATGAATGCTG TTCCCAATCC CACTCCAGAA CATTCTCCTC 
AAAAGTGCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 
GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CCCATCCTCC GGCTCAAATC GGGTGCATTT 
CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GGTGACTCAA GCGTCCTTTC 
AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1949 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


289: 






AAAGAATTCG 


ACCAATTCAA 


GGTTGAGGCA 


TCGCAAACTA 


TGGACTGTTC 


CCCCGTPAfTT 


o u 


TCTGGACAGA 


AAACGGGATA 


AGGTTGGCTG 


TGAAGCAAGC 


TGCCCTCCTA 


CCAACAATTT 


120 


TGGAAAGTAG 


GCATCAGCTG 


ACAATTCTTT 


ACAAGCATAG 


TCCGTTCCAT 


AACCTGTTAA 


180 


CAGTTGAAAG 


AGGAACTGGA 


CAAGGATATC 


TGAATCCGAA 


TAACGACAGT 


AGCGGCGTTG 


240 


GTCATTCGTT 


ACTAAATACT 


TAGAAATCCG 


CTCTTTTAGT 


TTCAACTGGG 


AAAAAAGTTC 


300 


CTGAAAAAAG 


ATAAGACCAC 


CATACTGGGT 


TAAATGACCT 


CCATCGAAAG 


ATACTTGGTA 


360 


AAAAGACTTG 


TTTTGGAAGT 


GATGATTTGG 


TAAACTGTTC 


ATCTGAGTTT 


CCTTTCTTTT 


420 


TGTGTTTTTT 


TCTACACTTA 


TACCATAAAG 


GGGAAACTCT 


TTTTTCTCTA 


GTAAAAAACA 


480 


CCCATTGGGT 


GAAAAAAGAA 


ACCATCCAGG 


ATCTAAGCTA 


AGGCAAGGAT 


TCTGGATGGT 


540 


TTTTAGATTT 


GGGGTGAATA 


ATTGGGGTTT 


TACAATATCA 


ACTCCCATGA 


TAGTCATGAG 


600 


ATGACTCTTC 


ACGAATTGAC 


GTGATGACTG 


TCCTTCCTTT 


TGCATAATTA 


CCTCCGAAAC 


660 


ACAAAAAAAG 


GGGTAGACAA 


TCTAGTGTCT 


ACCCCCGAAA 


GTTTATTAAA 


ACAAAAATCC 


720 


TGCCAAAGAA 


TTTTTGGCAG 


GAAACCAAAT 


CAATTTATCA 


GTTTCTAT CA 


ATCGCTTATC 


780 


GCTCTCAAAG 


ACTGGTAAAT 


AGGGATTCCG 


CAATCAAATT 


GCGATACTCT 


ATTATTTAAG 


840 


AGTAACTGAA 


GCTCCAGCTT 


CTTCCAATTT 


AGCTTTGATT 


TCTTCAGCTT 


CTGCAGTTGC 


900 


AACGCCTTCT 


TTAACAAGTG 


CTGGTGCACC 


GTCAACAAGT 


TCTTTAGCTT 


CTTTAAGACC 


960 


AAGACCAGTG 


ATTTCACGTA 


CAACTTTGAT 


AACGCCAACT 


TTTTTGTCGC 


CTGCAGATGT 


1020 


CAATTCAACG 


TCGAATGAAT 


CTTTAGCAGC 


ACCAGCATCA 


GCTGCATCAG 


CTGCAGCAAC 


1080 
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AGCTACAGGA GCAG CTGCAG TTACACCAAA TTCTTCTTCG AT AG CTTTT A CAAGGTCGTT 1140 

CAATTCAAGC ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200 

TGTTATTTCC TCCAAATAAG TT TTAAATTT TATAATAGTT TTTTTCGTAG CTAGksTACG 12 60 

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 1320 

AGCAACGTTG CGCACTGGCG CTTGAAGTAC AGAAAGGACC ATAGAAAGAA GTCCTTCGCG 13 80 

GTTTGGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT GAGTTTTTAA TAACTTTATA 162 0 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680 

ACG AG CGTCT ACAACGACGA TAGATGCAGC AG CTTTC ATT TTTTCAGCTA tACGTCAACT 174 0 

AGTTC CGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800 

T TGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA I860 

CGCTT CTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920 

ACTGTCTTAG GCAGTTTTTT TAGATACGG 194 9 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 90: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGGTG. 1 ^ 12 0 

CACGGCACTC GTGTG AG CAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA AT ACG CTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACT CACGG AA GTTGCGAATA TCTGGGCGAA 540 
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TAGCTCCACG 


GAAATGTCCT 


AGGTCGTACT 


CATAATCGTT 


ACGTGTGTCA 


AGGACAACGG 


600 


TATCTTTATC 


AAGAAGCGCT 


TCTTTGAACT 


CTTTTGGAGA 


CAAGTAAGCA 


CCTGTTGTTT 


660 


CAAGTGGGTT 


GATGTCATTG 


TCAAAGTCGT 


TGTCTTCCAA 


AC CAAGGTGG 


ACAATTTCTT 


720 


TCTTGTAGCG 


AACAAACATC 


TTCTTGAAGG 


CTTGTTCATT 


TTCTTCGTCA 


ATCTTGAACC 


780 


AGAGTTCTTC 


CATTCCTGGA 


AGGCTGTGAA 


CGTAgTCCAT GTATTTTTGA 


GTTGTTTCAT 


840 


AGTCACCTGA 


AACTGTTCCG 


TTAATTCCCT 


CGTCAGCGAC 


TAGGATACGG 


CCTTTAAGCn 


900 


CGATTGATTT 


ACAGAAAGCC 


AAGTGGTCTG 


CAGCAAATTG 


CTCTGCATTT 


TCAATTGGAG 


960 


TATAAAGGTA 


GTAAAGTAAG 


ACACGAATAT 


CTTTTGkCaw 


AAGATTTGTA 


TCTCTTTATC 


1020 


TAT 












1023 



(2) INFORMATION FOR SEQ ID NO : 291: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3831 base pairs 
fB) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


291 : 






ACTATGAACA 


AGACCCAGAA 


AAACTAGCCT 


TATTTCTTAA 


GAATTTTAAT 


AGTTTAAAGC 


60 


ACCTAGCACC 


TGTTTAGATT 


GACGAAACAG 


G ATTCG AT AC 


TTATTTTTAT 


CGAGAATATG 


120 


GTCGCTCATT 


AAAAGGTCAA 


TTAATAAGAG 


GCAAAGTATC 


TGGAAGAAGA 


TATCAGAGGA 


180 


TTTCTTTGGT 


TGCAGGTCTA 


ACAAATGGTG 


AATTAATCGC 


TCCAATGACT 


TACGAAGAGA 


240 


CGATGACGAG 


CGACTTTTTT 


GAAGCTTGGT 


TTCAGAATTT 


TCTCTTACCA 


ACATTAAACA 


300 


CACCATCGGT 


TATTATTATG 


GATAATGTAA 


GATTCCATAG 


AATGGGGAAG 


CTAGAACTTT 


360 


TATGCGAAGA 


GTTTGGGCAT 


AAACTTTTAC 


CTCTTCCTCC 


CTACTCGCCT 


GAGTACAATC 


420 


CTATTGAGAA 


AACATGGGCT 


CATATCAAAA 


AGCACCTCAA 


AAAGGTATTA 


CCAAGTT GC A 


480 


ATACCTTTTA 


CGAGGCTTTT 


TTATCCTGCT 


CTTGTTTCAA 


TTGACTATAT 


TAGAGGCGAG 


540 


ACATTTTTCG 


GTTCTTTGTC 


AACTGTAGTG 


GGTTGAAGAA 


AGCGAAGATC 


TAGAAAGGAC 


600 


AAATTTCGTC 


CTTTCTTTTT 


TGAAGTTTTC 


AAAGTTCCTA 


AAACCAAAGG 


CATTGTGCTT 


660 


GATAAGTTTG 


ATGAGATTAT 


TGGTGGCTTC 


CAGTTTGGCG 


TTGGAATAAG 


GTAATTGAAG 


720 


GGCGTTGACG 


ATTTTCTCTT 


TATCTTTGAG 


GAAGGTTTTA 


AACAAAGTCT 


GAAACAGAGG 


780 


TGGAAAAGCA 


AGAGCTGATA 


GAGATTATAG 


TGCTCTTTAA 


AGTCTTCGGA 


ATAGCTCAAA 


840 
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AGTTTATCTA GAATTTCTTT 


' ATTAGTCAAG 


1 TGCATACGAA 


. AAGTAGGGCG 


I ATAAAATCGT 


900 


TTATCACTCA GTTTCTGACT 


ATCTTGTTGA 


, ATGAGCTTCC 


AGTAGCGCTT 


1 GATAGCCTTG 


960 


TATTCATCGG 


ATTTCGGATG 


ATGGCTTGTG 


TTCTGCTCTC 


AAGAACAGTT 


ATGATATTGA 


1020 


GTTTATCAAA 


GTCCTGAGCA 


ATAAAGCTCA 


TCTCCATCTC 


CCGATTGAAA 


CAGTCACTCC 


1080 


CCGGACTGTT 


TCAACsTCCT 


AGGACATAAT 


CTCAGGAAGA 


eoeGAAAAAT 


CATGCTCAAA 


1140 


GTGAAAATCA 


TTGTTCTTGC 


GAATGACAGT 


TGAAGTTGAA 


ATAGACAACT 


GATGATCAAT 


1200 


GTCGGTCATA 


GAAGTCTTTT 


TAATTAGCTT 


CTGAGCAATC 


TTTTGGTTGA 


TGATACAAGG 


1260 


AATTTGATGA 


TTCTTCTTGA 


CGATAGAAGT 


CTCAGCGAGC 


TCCATTTTTG 


AGCAATGATA 


1320 


GCACTTAAAA 


CGGCCTTTTC 


TAAGAAGAAT 


TCTAGTTTGA 


ATTTTTTTAT 


ACTAGAAAAT 


1380 


CAGAACCATA 


ATACCTATAT 


AAAAATATTA 


TAGTTCTAAT 


AGGATTTACC 


CAAAAGTTTT 


1440 


AAGGCGGTCT 


TTTTAGAACT 


TT AATTGTT T 


GAAATTTAGG 


TAGCAAATTT 


GTTTCTATTT 


1500 


TGTCAACTTT 


TCCTATTTTT 


ATCTTGTTGA 


GGCTGGTATT 


TTAACAATTC 


AGGAATTGAT 


1560 


AGTGAATGTG 


TAAAATTTTT 


TGTTAGAATA 


AGTTTATAAA 


AAAGAAAAGG 


AGTATTTGAT 


1620 


TATGTTACAA 


AAAATTTATG 


AGCAGATGGC 


TAATTTCTAT 


GATAGTATTG 


AAGAAGAGTA 


1680 


TGGTCCTACA 


TTTGGTGATA 


ATTTTGACTG 


GGAACATGTT 


CATTTTAAAT 


TTTT AATTT A 


1740 


TTATTTAGTG 


AG ATATGGC A 


TTGGTTGTCG 


TAAGGATTTT 


ATTGTTT AC C 


ATTATCGTGT 


1800 


TGCTTATCGT 


TTGTATCTTG 


AAAAATTGGT 


AATGAATCGG 


GGTTTTATTT 


CTTGTTGAGG 


1860 


TAATTTTAGT 


AAATTTCCGA 


ACTAATTTAC 


TCTTTTATGG 


AAAGATGATA 


GTAAATAGCT 


1920 


AGTAATTTTT 


CTAAATCATT 


TTTTAATAGT 


TGGAAATAGC 


AAATCTTTCT 


ATTGTTTCTT 


1980 


CTTGATAAAA 


AGGCGATTTT 


TTATTATAAT 


AAATTGTAAG 


ATATAATTGC 


AGGTGAGAGT 


2040 


CCTGCCATGT 


ATGTGAGAAA 


GGAAGAGCCT 


GATGGCTCAG 


ACAAGATTAT 


GACTTCAGTT 


2100 


GTTGTTGTAG 


GTACCCAATG 


GGGTGATGAA 


GGTAAAGGGA 


AGATTACAGA 


CTTCCTTTCA 


2160 


GCGAATGCAG 


AAGTGATTGC 


ACGTTACCAA 


GGTGGTGATA 


ATGCTGGTCA 


CACGATTGTG 


2220 


ATTGACGGTA 


AGAAATTTAA 


GTTGCACTTG 


ATTCCATCTG 


GGATTTTCTT 


CCCTGAAAAA 


2280 


AT AT CTGTCA 


TTGGGAATGG 


TATGGTTGTA 


AATCCTAAAT 


CTCTTGTAAA 


AGAGTTGAGC 


2340 


TATCTTCATG 


AGGAAGGTGT 


AACAACTGAT 


AACTTGCGTA 


TTTCTGATCG 


TGCGCATGTT 


2400 


ATTTTGCCTT 


ATCATATCGA 


GTTGGATCGC 


TTGCAAGAAG 


AAGCTAAGGG 


CGACAATAAG 


2460 


ATTGGTACGA 


CAATTAAGGG 


AATTGGTCCA 


GCTTATATGG 


AC AAGGCTG C 


TCGTGTTGGA 


2520 


ATTCGTATTG 


CAGATCTTTT 


AGATAAAGAT 


ATTTTCCGTG 


AGCGTTTAGA 


ACGTAACCTT 


2580 


GCTGAAAAGA 


ATCGTCTTTT 


TGAAAAATTG 


TATGACAGTA 


AAGCGATTGT 


TTTCGATGAT 


2640 
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ATTTTTGAAC AATATTACGA ATATGGTCAA CAAATCAAGA AATACGTGAT AGATACATCT 2 700 

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2 760 

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CGTCATCAAA CCCTGTAGCT 2 820 

GGTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2880 

TGTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2 940 

GTGGGAGAAC GTATCCGTGA . AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3000 

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060 

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCCGTTTCG ATACTGTGAA AATCTGTGTG 3120 

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180 

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GGTTGGTCAG AAGATATTAC CGGAGTTCGC 32 40 

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 3 300 

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 3 3 60 

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 34 2 0 

GGTTACAAGA AGACCTCCTA ACTTGTTGTA AC AAAT AT CC TAAACTTTTC TTTTTCATAA 3 4 80 

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 3 540 

ATAATAAAAT CGATAAGTAG GAAAAGAGAA AAGAGATGTA TTATACGCTT GAAGAAAAAG 3 600 

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3 660 

CAATTGCTTG TGTG ATTGT C AAAGATGGGG AAATCATTGG TCGTGGGCAT AATGCGCGTG 3720 

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAA TT ATGGC TATAGAGGAT CCGAACTTGA 37 80 

GTGAGGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3 831 

£2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 92: 

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG AGTTGTTATT TTCAACTCTT 120 

ACTATTATAC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180 



WO 98/18931 



PCT/US97/ 19588 



1322 

CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 24 0 

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 3 00 

ATTATAGTCC ATCATGAATT CTGCATCTTT ATTTTCAGCA TTAAGCT CT A AAAGGAATTC 3 60 

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA CGTGCTTTGT CTCGAAAGGC 
TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 
TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 
TCCATCTCCT GCTTCTACTC CAGCTACAAC ATG A TTGGCA TAAACAAAGT CTGATACTTC 
ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 
ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCGAT ATGCTGAATT GATTTTTCAC 
GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCC ATG C ATTTCG CCT GAAACTGCGA 
TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 
TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 900 
TG AATGCTT C AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960 
TCAATTCTGG GAAATCTGAG AAGAAAAGAT CTGTC AATGG GATAATCTCA TCTACTGATT 
TCATTTGTGG TT TAT AG AG C TCAACTAATT TTTC AG CCTT GTCACTCAAA CGGCCTGCTT 
CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 
CATTGCTCAT CCAGTCTAGT TTTTTCTCAT CAAAGCCTGC TGGTGACTTG CTGAGGCGGT 
TTT CATC AAA AAGTTTAATG AATTCTT C AC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 
TCCAACCAAG AAGAGCAATA AAG TTAAAG A CTGCTTCTGG AAGGTAACCT TTCTTTCGGT 132 0 

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 13 80 

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGAGCTT CCTCAACCTA AGAGCGGGTA 
T 

(2) INFORMATION FOR SEQ ID NO: 293: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4398 base pairs 
fB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



420 
480 
540 
600 
660 
720 
780 
840 



1020 
1080 
1140 
1200 
1260 



1440 
1441 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 
CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACCA GGGGAGATTA GAGGCCCTAG 
AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 



60 



120 
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ATATGCACTT CATTGGACAA AGTCAAGTGG CTTTCCGTGA ATGGAATCAA AAATGGGTCG 180 

ATTTATCTCT CAACTCTTTT GCCGATATTG AAAATAATCT CTTTGAAGCA GAAGGCTATA 24 0 

ACCATTCATT T CGTTTTCT C AAGGC CAGTC ATCAAATTGA CCAAATTGAG AGTCAAATTA 3 00 

CTTTGATTGA AGAAGATATT GCGGCAATTC GCAATGCTTT GGCAGACTTA GAGAAGCAAG 3 60 

AATCTAAAAA TAGTGGTCGT GTTCTTCATG CTTTGGATTT ATTTGAGGAA CTTCAGCATA 4 20 

GAGTTGCTGA AAATTCAGAA CAGTATGGTC AAGCCTTGGA TGAAATTGAA AAA C AATT AG 480 

AAAATATCCA ATCTGAATTT TCACAATTTG TAACCTTGAA TTCATCGGGT GACCCTGTGG 540 

AAG CCGCAGT G AT TTTGGAT AATACAGAAA ATCACATTTT GGCCTTAAGT CATATTGTCG 600 

ATCGTGTTCC AGCCTTGGTT ACGACGCTTT CTACAGAATT CCCAGATCAA TTACAGGATT 6 60 

TGCAAGCCGC TTATCGTAAA CTAATTGATG CTAATTATCA TTTTGTTGAA ACGGATATTG 72 0 

AAGCGCGTTT CCACTTGCTT TATGAAGCAT TCAAGAAAAA CCAAGAGAAT ATTCGTCAGT 780 

TGGAATTGCA TAATGCCGAA TATGAGAATG GACAGGCACA AGAGGAAATC AATGCCTTGT 84 0 

ATCATATTTT TACTCGAGAA ATTGCTGCTC AGAAAGTAGT GGAAAATCTA CTTGCAACTC 900 

TTCCAACTTA TCTTCAACAT ATGAAAGAGA ATAATACTTT ATTGGGAGAA GATATTGCAC 960 

CTTTCAACAA GACCTATTTA CTTCCTGAGA CAGCTGCAAG CCATGTTCGT CGTATTCAGA 1020 

CAGAATTAGA GAGTTTTGAG GCAGCTATTG TTGAGGTAAC TTCAAATCAA GAAGAACCAA 1080 

CCCAAGCTTA TTCAGTTCTT GAAGAAAATC TTGAGGATTT ACAAACTCAA CTAAAAGATA 1140 

TTGAAGATGA GCAAATTTCA GTTAGTGAGC GCCTGACACA AATTGAGAAA GATGATATTA 12 00 

ATGCACGTCA AAAGGC CAAT GTTTATGTCA ATCGTCTCCA TACTATCAAG CGATACATGG 12 60 

AAAAACGCAA TCTGCCAGGT ATTCCACAAA CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 13 20 

ATAATACCGA GGATTTAATG GTTGAGTTAG AACAAAAAAT GATTAACATT GAATCTGTTA 13 80 

CCCGAGTTCT TGAAATTGCA ACGAATGATA TGGAAGCTTT AGAAACGGAA ACTTATAATA 14 40 

TTGTACAATA TGCAACTTTG ACAGAGCAAC TCTTGCAATA TTCTAACCGC TATCGCTCAT 1S00 

TTGATGAACG CATTCAAGAA GCATTTAACG AaGCTTTAGA TATTTTTGAA AAAGAATTTG IS 60 

ATTATCACGC TTCATTTGAC AAGATTTCTC AACCATTGGA AGTGGCAGAG CCTGGTGTAA 1620 

CCAATCGCTT TGTTACCTCA TATGAGAAAA CACGTGAAAC GATTCGTTTT TAATAAAAGA 1680 

AAAAGATTTT ATTGTGTGAG GAGCAGAATC AAATCTTTTT CTATAGTTGT GGGGAGATTT 1740 

ACTTCATTTT CTCCTGAGAT TGAGTTTTTG CCCAGCCGAT TTATCCACTA CCTCAAAACA 1800 

GTGTTTTATA CTCTTCGAAA ATCTTTTCAA ATCACGTCAG CGTCGCCTTA CCGTACTCAA i860 
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GTACAGCCTG AGGCTAGCTT CTTAGTTTGC TTTTTGATTT TCATTTAGTA TTAAAGTGAT 1920 

TTCGCCAGTC TTATCTGCAG CTTCAAATCT GTACTTTGAG TAACTTGGTA ACCGTCCAAT 1980 

AACGAAGTCT ATTGAAAAAT CTCCAGACTA GAGAACTCAC GGATAGTTCC TAATCTGGAG 2040 

ATTTCTTATT TGCACTTTTC TTGTACAACT TTACTCCACG GTAAATAGAC CTCTAAAACC 2100 

TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA AG AC ATT CT A GAAGATAGGA TAGATATTTC 2160 

TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 2220 

AAAGAAGAAA TCCAAACCAT CAAAACACTT TTAAAAGACT CTCGTACAGC TAAATATCAT 2280 

AAACGCCTTC AAATCGTTCT ATAGTAAAAT GAAATAAGAA CACTACAAAT CGA TCAGG AC 2340 

AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG T ACT ATTCT A GTTTCAATCT 2400 

ATTATATTTC GTCTGATGGG CAAATCTTAT AAAGACATTA TAGAACTTTT ATAGTAGATT 24 60 

GAAATAAGAT GTGAACAACT CTATCAGGAA AGTCAAATTA ATTT AT AC AA ATATTTTAGC 2520 

AGCCAAGGTG TACTGTTATA GATTCAATAC ACT AT AG ACT GTAATCAAAC AACGATTTGG 2580 

CGAAATGTAA AAAAATATGA GGAGTTCGGA CTCGACTCTC TCCTTCAAGA AACACGTGGT 2 640 

GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGC CATTTG 2700 

AAGGCTGCAG AGGCAGGAGA ATTTGTTACA ATTGATGCCT TATTTCAGGC TTATAAAAAG 2760 

GAGTTAGGTC GTTCCTACAC ACGTGATGCC TTCTATCAAC TGTTGAAGTG CCATGGTTGG 2820 

CGAAATATTA TGCCACGTCC AGAACATCCT AAGAAAGCAG ACGCTCAAAC CATTGTCGCG 2880 

TCTAAAAATA AAATCTCAAT TCAAGAAGAA AAGAAAGCGC TTTAAAACCA GTAGACGTTT 2940 

TCGTAAGGTT CGCTTGATGT ACCAAGATGA GGCTGGTTTC GGTAGAATCA GTAAACTGGG 3000 

ATCTTGTTGG GCTCCAATAG GAGTAGGTCC ACATATCCAT AGTCACTATA TACGAGAATT 3060 

TCGCTATTGT TATGGAGCTG TTGATG CCCA TACAGGCGAA TCATTTTTCT TAATAGCTGG 3120 

TAGATGTAAT ACTGAGTGGA TGAACGCCTT TTTAGAAGAG CTTTCACAAG CTTATCCAGA 318 0 

TGATTATCTT TTACTCGTTA TGGACAATGC TATATGGCAT AAATCAAG T A CCTTAAAGAT 3240 

TCCGACTAAT ATTGGTTTTA CCTTTATTCC TC CAT AC AC A CCAGAGATGA ACCCCATTGA 3300 

ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 3360 

AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 3420 

CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGAGTAT AAAAAGAAAG 3480 

TCCT CATTTC AATAGAAATC ACGACTTTCT GAT GG ATTT A TAGTAAAATG AAATAAGAAC 3 540 

AGGACAAATC GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 3600 

ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 3 660 
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T CT ACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720 

CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTCAC CGGAGAAAGA ATCCAATTCA 384 0 

AAGTGGGTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTCCCA 3900 

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3 960 

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TCTTCATTGA 4 080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCT AACTT TT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4 200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 42 60 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATGAT TAAATTGATT GATCGTTACG 4 32 0 



GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 
TGATTAATAA AGTCTTAC 

(2) INFORMATION FOR SEQ ID NO: 294: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 12 0 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 3 00 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 3 60 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGG T TCTCCTTGGG ACTGTGCTAC 4 80 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 54 0 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
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CAAAATAGAA TCAAGCAGTT TTGGCTACAT ACGCTTCTAA G AACCT A TAG TTCAGTGATG 660 
ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718 
(2) INFORMATION FOR SEQ ID NO : 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TCGGTACCAA AATTCTCGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGACA GTGAAGATAG TTTTGCAAAG 12 0 

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180 

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATC CTCTTT TAGGTAAAAT AAATGCAATT 240 

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGC AC TCC A 3 00 

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 3 60 

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 420 

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 480 

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 54 0 

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGC C A TAAGCGGTGT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATC C A GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCCTTTTAG GGAATTCATC AAGCATAA 718 

(2) INFORMATION FOR SEQ ID NO: 296: 

<i) SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 14 3 6 base pairs 
(B) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 296: 

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTCCTCGTT TCCACTTAAT TTTGTGTCTA CTTT ATT AT A 120 

CCAAGTCCAC sCTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTTGCTTTTT 240 
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GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT TGAATGAGAG AACCAGCTGA 

TGAAAAGGGT GAG AT AT TAG TAGATAGAGC GCCAATAACG GTGGCTGTTG TGAGTAAGTG 

AATATCAATC TGAGGATTTT GAGCACTGAT CATAGCAATG ATGGGAAAGA GGGCTGGAGC 

TACAACGGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 

AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 

TGACTTGACC GCTAGAGACA TTAGTAAGCT CATGCCGCAG AGCATGATAA TTGTAGCCCA 

GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG AGCCTTAAGG CGAGGCAGAC 

CATCAGTATT GAGACAAAGC CAATATCAAA TGTTTTTTGA TAAGTAGCTA TCCAGGCGAT 

GTTTGGGAAA ATGAGATGCA ACAAGGGAAA AAGCCAAACC AAAACCATGC TGCTG AT CAT 

GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT GGTTGGTCAA TAGTCAAGGA 

TGAGTTTGTT CTTCCCTTAC TATAGTGACT GTAACAGGAT AATAAAAGCA AGACGATGAG 

TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT GAAAAAGCTT GCTCTTCCCA 

TCCCATTTGC TTAAACAGGC CTTGAAAGAC AATGCCTGAG CTACTGGTTA TC AAATT AG C 

CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC AAAGGGTGTT TGTCCGCTTT 

TTGACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA GTAGTGAAAA ATCCAGCACC 

TAAAGCAGAC AAAAGGGTTG CCATCAGGTA TAAAATCATG TAGAGGGCGT TAGGGTGGGT 

GCGTGTGCGG TAGAGAATGT GTTGAGCCAA AACATCAAGA G T ACCGTT AG TTGTTGCAAC 

GTTATAAAAG AGAGAGACGC TAAAAATGGT AAAAAAGAGT G AGGTTGG C C AAAAATGAAG 

AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG AGGTAAGAAA AAGCAATAGC 

CAGCAGGCCA ATATTGATTT TGGTG CGGT A ACCAATTCCA ATGGCTAGAG CAATGG 

(2) INFORMATION FOR SEQ ID NO: 297: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 
GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 
CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1436 



60 
120 
180 
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AAGGCGGCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 
TTGGTGGACG AGATGTGGAA GCTTGCCCAA CGTGAACTGG CTACAGCATT TTACTATGGT 
ACACCATCTG AAAATGAGCA GTTGTTTGGT GCTCGCCGTA AAATTCCTGA GTACAAGTTT 
GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG CAACAATTCG TCAACGAAAT 
GTCATTAACG AAGGGG AC C A AGTTGAGTTT TATGGTCCAG GTTTCCGTCA TTTTGAAACC 
TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG ACCGCGCTCC AAAT CCAATG 
GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG GAGATATGGT TCCTCCATTA 
AAAGAAGGAC TCATCAATCT TTATAAGGAA GATGGAACCA GCGTCACAGT TCGAGCTTAA 
GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT GGATAAGCAA ACAACATCCA 
TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG CTATGATTGT AAAAAGTATT 
ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC TTTTGAGCCG TATCCCTTAT 
CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT AAAACTACTA ACATATAAGC 
AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG CTTTAATCCA GGTTGCCAAA 
ATCATAAGGA ACGCTATTTT AAAT AGC AAA TCATCTAGTT TTGAAGTAGG AGAAAACTCA 
ATTT CAAG AG AAAATGAAGT AAATCTTCCC ACAATAAAAC GCATAATATC AAGATTGTTC 
AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT TTTCCTTTAT CTGGTATTTT 
GACTACTTGT TAAAACTGGG TTAATTTTCG ACTGTTTAAT AGTTATTATG CAAAGTCTAA 
AAGGTTAGAA TTGTCAAAAC AATCCGTCTA GAGTATGCGT GATGCCAACC GTGGTGGATG 
TTCTCAGTCA TGCCGTTGGA AGTACGACCT TTACGATATG CCATTTGGGA AAGAACGTAA 
GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA GCCGTTGATA TGTCTATGAT 
TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT CTAAAAATCG AAGGACGTAT 
G GAG T CT ATT CACTATGTAT CAACAGTAAC CAACTGCTAC AAGGCGGCTG TGGATGCCTA 
TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC TTGGTGGACG AGATGTGGAA 
GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT ACACCATCTG AAAATGAGCA 
GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT GTCGCTGAAG TGGTTTCTTA 
TGATGATGCG GCGGTA 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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300 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 



CCGAGTTTAT 


TATGGTTTCT 


TCGGAATTTA 


TCTCAAAGAT 


* VJ/W * 1 i VjV. 1 


I Vjv-/vA 1 AAVj A 


fin 


AAGAAAGTCT 


TTATAGTCAA 


AGCAAATTTA 


AGTATGCGAT 


TCCTTrnATr: 




1 z. u 


CATTTTTAAC 


CTTCAGTACT 


GCTGCAGGTG 


C AGTTC GCC C 


Tf^AfTT f&TT 
i W/\V- i L 1 1 






CACCAGGTAG 


TGGACCCTTC 


CTCTT Tt C AT* 

W * ^ 4 4 4 Wo 4 


TfliTTI^I'i'Cr 

A V*\J 4 A 4 ^ 2 




uLL lAv.nl Ivj 




TTTTTTTGAA 


TGCCGAGTTG 


G T CACTTC AA 


ACATGATG TT 


V. i 1 U AV- 1 V*V_ 1 


VjVj 1 AO 1 1 ILi 




TAAAAAAAAT 


CTCTTGGAGA 


AAAACAGC TG 


AGATTTT AfT* 


ATAfTCTArC 




i fin 
J o u 


TTATCGGAGC 


CTTGATAGCA 




i t Vj\- 1 \. A 1 1 V— 




L»Vw\j AA 1 (_ 1 L»A 


j4 O n 


CACACGATAG 


TTTCATCTCA 


VJN3 K W A 4 \_7 X 4 


AGATCAACTT 


AC nccc f^rr* a 

A*Jijiiw\.\jL i \.A 


AA 1 u AA 1 I LKj 




TCTTGCTTGA 


GGCGATTTTG 


GCAAATATTT 


TTGTAAATAT 


TGCGATTCTG 


TCATTTATTT 


540 


TGGTCAAAGA 


TGGTGGTGCC 


AAACTTTGGC 


TTGTGTTGTC 


AGCTATTTAC 


ATGTTTGTAT 


600 


TCTTAACAAA 


CGAGCACATT 


GCGGCGAACT 


TTGCTTCTTT 


CGCGATTGTG 


AAATTCAGTG 


660 


TTGCTGCGGA 


TTCAATTGCC 


AACTTCGGTG 


TTGGAAATAT 


GCTTCGCCAC 


TGGGGTGT G A 


720 


CTTTCATCGG 


AAACTTTATC 


GGAGGAGGCC 


TCTTGATGGG 


TCTTCCATAT 


GCCTTCCTCA 


780 


ATAAAAACGA 


AGATACTTAT 


GTAGATTAAG 


AAAATGAGCA 


CGATTG AGTC 


GTGCTTTTTT 


840 


CATTTTCAAA 


ATAAGGTAAT 


AGCTATTTCT 


TAT AT C AAAA 


TATAGAAAAC 


TGATATTTGT 


900 


ArACTATAAC 


TCAAGGT GCT 


ACAATATCCT 


TAATAAAATA 


ATATGGAGGT 


CACCTTATGA 


960 


CTTGTGATTT 


TAAATnTGAA 


ACTCTACAAC 


TACATGCTGG 


TCAAGTTGTG 


GCTCCAGCTA 


1020 


CT 












1022 



(2) INFORMATION FOR SEQ ID NO : 2 99: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 663 base pairs 
(Bi TYPE: nucleic acid 
<C J STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60 

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120 

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180 
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AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 240 

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 3 60 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 4 20 

AATGTC CAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 4 80 

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600 

TTCTAGT CT A TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660 

TTC 663 



(2) INFORMATION FOR SEQ ID NO: 3 00: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 
<B> TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



CGTCGCTGAA 


CATGTCAACA 


GCAAATTAAA 


CTAAACAAAC 


TAAAATTATG 


TGATACTTCA 


60 


CATAATTTTC 


TTTAGAAAAT 


ATTATCAGAA 


GAAAGTTGAG 


AAAAATGGCA 


GAAAAAACAT 


120 


ATCCTATGAC 


CCTTGAGGAA 


AAGGAAAAAC 


TTGAAAAAGA 


ATTAGAAGAA 


TTGAAATTGG 


180 


TTCGTCGACC 


AGAAGTGGTA 


GAACGCATTA 


AGATTGCCCG 


TTCATACGGT 


GACCTTTCAG 


240 


AAAACAGTGA 


GTACGAAGCA 


GCTAAGGATG 


AACAAGCCTT 


TGTCGAAGGA 


CAAATCTCTA 


300 


GCTTAGAAAC 


AAAAATCCGC 


TATGCTGAAA 


TCGTCAATAG 


CGACGCAGTT 


GCCCAGGACG 


360 


AAGTAGCGAT 


TGGTAAAACA 


GTCACCATCC 


AAGAAATTGG 


TGAGGACGAA 


GAAGAAGTTT 


420 


ATATTATCGT 


AGGTTCAGCT 


GGTGCAGATG 


CCTTTGTAGG 


TAAGGTTTCA 


AATGAAAGCC 


480 


CAATTGGGCA 


GGCCTTGATT 


GGCAAGAAAA 


CAGGTGATAC 


AGCAAC C ATT 


GAAACGCCTG 


540 


TTGGTAGCTA 


TGATGTAAAA 


ATCTTGAAGG 


TTGAAAAAAC 


AGCCTAAAAA 


CAGAAAAAGG 


600 


ACTGGGCAGG 


CGATGTGCTT 


CACTCACTCC 


TTTTTCCATT 


TTGCTACTCT 


TCGAAAATCT 


660 


CTTCAAACCA 


CGTCAGCGTC 


GCCTTGCCGT 


ATCTATGGTT 


ACTGACTTTG 


TCAGTTTCAT 


720 


CTACAACCTC 


AAAACAGTGT 


TTTGAGCTAA 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


780 


TATGTTTTGA 


GCTGACTTCG 


TCAGTTTCAT 


CTACAACCTC 


AAAACCATGT 


TTTGAGCCGA 


840 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


TATGTTTTGA 


G 




881 
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(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
{D| TOPOLOGY: linear 



{xij SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTCAGCTGTC TGCAAAGCTG 60 

TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 12 0 

AATGAAC CTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180 

CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATAGGCA GCAGCCTTCT 24 0 

CAACGATGGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 300 

GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 3 60 

TATTTACCCC AGTCGCAATG ACCACCGTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 42 0 

G CATGAT TTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 4 80 

GAAGAGCTGT CTGT AGTGCT GTATTTGCCC TTAAAGCCAC TG AG T C AC C A ATTAACATAG 540 

TGCCATCAGC AATTCCCAAA CTGTTTGC AT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600 

CAATATTTGT TGCAGCTTGC TTCAAGCCAT TGACAGT CAA GTCTGTCTCA AACGCTCCCA 660 

CTTGTGGTGC CAACAAGGTC AC CGTGC AG A CAATGATGGT CAAGATTCCT GT AC CTG CTG 720 

CAAGAATTGC CTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 780 

CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCC AT AA GAACAAATCA 840 

GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAAGGCC 900 

AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 949 



{2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60 
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CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAAGG AAAAATAACA 120 

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180 

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 24 0 

AAAAGGAAAA ATGGAGGATA TAAATGCAAA TTCTGTCTAA AG AAAT A CAG TTACAGGGCT 300 

TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AOAAAAACAA CGATCTAGTA 3 60 

AGTTAGATTT AATTTCT CGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 420 

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 480 

AAGTAT TCTA TCGTCCGTCA GATGTGTATT T ATTTTT AG C AATAAAATAG GAGTTATGAA 540 

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 600 



ATTGACATTC TATTTGTCTC AG 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 
(3) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



622 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

CGCTAACTTG CAAAGAAAAG AAGAACGCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 60 

TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120 

ACGTAATTTC TAAATCAGGT ACAACAACTG AACCACCGAT TGCTTTCCGT GTCTTTAAAG 180 

AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 2 40 

ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 300 

TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 3 60 

CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAAGACT 4 20 

ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 4 80 

TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540 

TCTCAGAATG GTGGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600 

ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660 

GAACTCGTAT CATGTTTGAA AC AGTTGTC C GTGTTGACAA ACCTCGTAAA AACGTGCTT A 720 

TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780 

TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 840 
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ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT ATCTACTTCT 900 

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 960 

GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 1020 

TCAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 1080 

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 1140 

AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATG CTCG TACAGCATTG TTTAATTACT 1260 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CCAAGATACT GACCGTAAAC 13 20 

GCCATGTCGA GGATGGTGAA CGTTCACAAC TTGAAAACCT TCGCTGGTTA GG C ATGG ATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TCACCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACACCA CGCTACATCA 1560 

ATGAAT AC CT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATC GC AG AA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATG 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGCTGAC TGGGTTATCC 17 40 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800 

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 

CTGAAACTG 1929 

(2) INFORMATION FOR SEQ ID NO: 304: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 708 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(D> TOPOI#OGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGG AG A C ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAAGCT GAGAAAGCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCT ACCTTG ATT ATT CAG C CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 180 

TCCAACACAT CCAG CTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGC TATCTG 240 
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GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGCTACA GTGAAAAACT TGCTTGACTG 300 

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCCTT TCTGCCCTTC AAGACAAGTT 360 

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACGATCAA CTTTTCGCTA TCTACAAAGA 420 

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTG C AC CTGTTAATGA 4 80 

CTCTGCCTGG GCAs ACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540 

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600 

AAACTAkGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660 

CGAGTGTnnA ACATATATAC GGT AAGGCG A CACTGACGTG GCTTGAAn 708 



<2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 
(B> TYPE : nucleic acid 
(C> STRANDEONESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 



CTTCTTTTCT 


TGGAAATAGG 


TGTATAATAC 


GTTTATTAAA 


TTTTTGAGGA 


GTTGTCTATG 


60 


AAGAAAAGTT 


TTATCCATCA 


ACAAGAAGAA 


ATTTCCTTTG 


TCAAAAACAC 


TTTTACCCAG 


120 


TATTTGAAAG 


ATAAGCTAGA 


AGTTGTCGAA 


GTTCAAGGTC 


CTATCTTGAG 


TAAGGTCGGT 


180 


GACGGAATGC 


AGGACAACCT 


GTCTGGTGTG 


GAAAATCCAG 


TATCGGTCAA 


GGTTCTCCAA 


240 


ATCCCTGATG 


CTACTTATGA 


AGTGGTGCAC 


TCACTTGCTA 


AATGGAAACG 


CCACACCTTG 


300 


GCTCGTTTTG 


GCTTTGGTGA 


AGGAGAGGGT 


CTCTTTGTCC 


AC ATGAAAG C 


CCTTCGTCCA 


360 


GATGAGGATT 


CCTTGGATGC 


AACCCACTCT 


GTTTATGTTG 


ACCAGTGGGA 


CTGGGAGAAG 


420 


GTTATCCCAA 


ATGGTAAGCG 


TAACATCGTT 


TATCTAAAAG 


AAACAGTTGA 


GAACATTTAT 


480 


AAGGCTATTC 


GCCTGACTGA 


GCTAGCTGTT 


GAAGCCCGCT 


ATG ACATCG A 


GTCTATCTVG 


540 


CCAAAACAAA 


TTACCTTTAT 


CCATACAGAA 


GAATTGGTAG 


AACGCTACCC 


AGACTTGACA 


600 


CCGAAAGAAC 


GTGAAAATGC 


GATTTGTAAA 


GAATTTGGAG 


CCGTCTTTTT 


GATTGGTATC 


660 


GGTGGCGAGT 


TGCCAGATGG 


TAAACCGCAC 


GATGGACGTG 


CACCAGACTA 


TGAT GACTGG 


720 


ACAAGCGAGT 


CTGAGAATGG 


CTACAAGGGT 


CTAAATGGTG 


ATATTCTTGT 


CTGGAATGAG 


780 


T 












781 



(2) INFORMATION FOR SEQ ID NO: 306: 
<i> SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(O) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


306: 






CCCGCATCTT 


GTAGGGTTTT 


AACGGGCACG 


ATTTTCATAT 


CCGTCTTGAT 


TGTTTTAGCC 


60 


GCTTCTAGGG 


CTGTTTGGTA 


GTTGTTTTTC 


GCGTCCGGAT 


GCG CCTTTTG 


TTCTTCTTCG 


120 


CTAACAGGGT 


TATCAGGAGC 


AAAGAAAATA 


GCAGCACCTG 


CCCTAGCCGA 


AGCTACAACC 


180 


TTCTTATCAA 


TACCTCCAAT 


GTCTCCCACA 


TTACCATCGC 


GGTCAATGGT 


ACCTGTACCG 


240 


GCAACAATAC 


GACCATTACG 


AAGATCTGGG 


TGAGCTATTT 


GAGTATAGAT 


AGCTAGACTA 


300 


AACATGAGAC 


CAGCACTTGG 


ACCGCCAATA 


CCAGCTGTTG 


AAAAGCTAAT 


TGGGACATTG 


360 


CTGATTACCT 


CTGTACGGTC 


AATCAAGCCG 


ATTCCAATTC 


CATTTTTGCC 


ATTTTCCAAG 


420 


GTGATGATTT 


TTCCTTCTGC 


AGACTTGGTT 


TGCCCATCCT 


CTTCATAGGT 


GACCTTGACG 


480 


GAATCCCCTA 


ATTTTTGAGA 


ACTGACGTAA 


TCAATCAAGT 


CTTTGGAACT 


ATCAAAGGTC 


540 


TGATCATTGA 


CTGCTGTGAC 


TGTATCAGAG 


ATATTGAGAA 


TC CCTTT AAA 


GGTTGAATTA 


600 


TCCGTCACAT 


TCAAAACATA 


AACTCCAAAC 


TACTTGAGTT 


CGATATCCTT 


ACCAGCTCTT 


660 


TTTAGTCCTT 


GATACTTGGC 


CATATTTTGC 


GATGTTTGCA 


TGTAGAATTG 


ATTGATTCGC 


720 


ATAAATTCAA 


CATCGGAAGA 


ACCACCTGTA 


GTCTCCTGAG 


CACTACGAAT 


ATCTGTAAAA 


780 


GGTGTCAACC 


AAGCATAAAT 


CATATGAGCT 


AAAGTGGCAT 


GTTGAACACC 


AACCGTAACG 


840 


AATTGT 












846 



(2) INFORMATION FOR SEQ ID NO: 307: 



<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 
(Bi TYPE: nucleic acid 
fC) STRANDEDNESS: double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 
GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 
CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 
TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 
GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA AT CACCTTGC GCTTGACACC 
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AGTTACTTTA CAAATATCCC GATACATCAA GACGTTAGGA ATGATGAGAG CAATGGTTGT 300 

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAG CTT 3 60 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 4 20 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GC CAAACT AT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTACGATT 720 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCACCT 7 80 

CATACTATTG AT AAAGGT CA GCTGAGTCCA AATCTGGAAG AGCTGGATG 829 

(2) INFORMATION FOR SEQ ID NO: 308: 

(i> SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 464 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO : 30 8: 



CGAACATCTT 


GCTGGCTGAT 


TCGTCTGCCG 


CCATCGCAGC 


CCCGAACACA 


TTGCGACCCA 


60 


TGGCAAGCGG 


GCTCAATCCG 


CACATGGGAT 


CCGTGCCAAA 


GCCCCGCGTG 


TGCATCATTT 


120 


GCTCATCTAG 


TAACGTATGA 


GGTTTGCCTT 


CGCTGTCGAT 


AAACCGATAT 


TCAATCGCAC 


180 


CACTGCTCGT 


TCTCCGCGGA 


GGGGAAACCG 


ACTGCGGTAG 


GATGAACTCC 


AGAGAAGAGA 


240 


GATCACGACC 


TACCAGGTGC 


GGCTCGTTGA 


AGCTGTTGCC 


GCTTAGCAGC 


AGGCTCGCCA 


300 


CCACGCATTC 


CCAGAACTCA 


ACGGGGGTTT 


GATCGGCGTT 


CGGTTGCTGA 


CTAATAACTC 


360 


GGTG CACGGG 


ATGCGAAGTG 


GCCACTTCTG 


GCACACCGTT 


CTTGTCTTCG 


TAGAGAGCAA 


420 


TTGGGAGGGT 


GGCCAGCGTT 


TCGGCGATGA 


GGCGCACGCA 


GGCC 




464 



(2) INFORMATION FOR SEQ ID NO: 309: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
<D> TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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CCGTCTATAA TGGTAATAGA TTTTATTTGG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 50 

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120 

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180 

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 2 40 

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCCCA GGTCTTCGCA GTTGGCCAAT 300 

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 3 60 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 4 20 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480 

CCAAGCC ATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAACCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGC AGGG A CTGCGGTCAA 600 

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 660 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGCT 720 

CTTTCTATCC T ACTGG C AAA GTCGCCAACA AGAAAACAAG CAAAGCCTGC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 84 0 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAG 9 60 

TGGAAAATCC ATTCATCATG GA 

(2) INFORMATION FOR SEQ ID NO : 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs 
(B> TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY; linear 



982 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
CTACCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 
CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 
GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 
TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGCCTTCCC AGCTC AG AT T 
TGCCAG CATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 
GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 
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ACCTGACCAG CTGGGAATTT CTCTTCTTCC TCTGTGGGCA 
TATTTGGCTT TGGCAAGGGG ATAAACCTTG TCAAAACGGT 
CGGAAGCTGC CTTAAATCTT G AGGGTTTC A AAATCAATCG 
CTTCGATTGG AGGAATTCTT CTCTTGGCAG CTTTGGTTTc 
TTCAGGCTAA AAATTATGCC AATGTAGTCA CGGTTACGGA 
CTAAGAGTGA CACCAGTAAG GTTCCTATCC TAGATAGAAG 
ACCGCTACTT GGGTTCCCTA ACCGATAAGG TCTCGCAATA 
CCCAATTGAC AATTGATGGG AAACCTTATC GGGTCACACC 
TCAAATGGTT TAACAATCAA GCCAAGGGAA TCGGTGAGTA 
CTGGAAATGC GGATTTGGTG GACTTGAAGA CACCAATCAA 
TTAACCGTGA TGTCAAACGT CACCTGCGCT TGAAGT AC CC 
CATCTTTTGA GGTGGACCAT GAGGGCAATC CTTTCTATGT 
AATTTGGACT TGCTGTTCCT CGTCCTGCTT CAGTCATTAT 
AAACCAAGGA ATACAGCTTA TCAGATGTTC CAGAATGGGT 
AGGAAACCAT TGAGCAAATC AACTACAACG GCAAGTACAA 
TGATTTCCAA GAAAAACGTG ACCCAGACTA CCAATGGCTA 
ATGACATCTA TCTCTACACA GGTGTGACGT CGGCTAATGC 
TCATCCTTGA AAATATGCGA ACAGGAGAAA TCACTAAGTA 
AAGAATCAGC CCGTGAATCA GCAGAAGGTG CTGTTCAGGA 
TCCCAATCCT CATCAACCTC AATGACAAGC CTCTCTACAT 
CTGGCTTGGT CAAAGAGTAC GCCCTGGTAG ACGCAGTCGA 
C TACT AC ACT GGAAGAGATG CTCAGCAAGT ATGCCAATAA 
ATGCAACGAC AG AAAGCAT C AATGGAGTAG TAGCAGACCT 
GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA 
CCGATGACCT TCCTTACCTT GAAAATGGTA AAACCTTCGA 
ATTATCTCAA GACCTTTAAG CTACGGTAAA AATAGGTTTT 
AATAAGGTAA ATTAAGCCG 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 
< B) TYPE: nucleic acid 



TTTGTTAGTT 
TCATGTGCGC 
GTTAGGGAAA 
CTTGGTAACT 
AAAAGACTTT 
TACTGCTGAA 
CGTAGCGGCA 
ACTAGAATAT 
TATTAAGGTG 
GTATTCAGAC 
GACCAAAATC 
AGCAACGGTT 
CTTGGATGCT 
GGACAGGATC 
GGACGGTTTC 
TAATTACTTC 
GGATGAGAGT 
TAGCTTGGCT 
GAAATCCTAC 
CATGGCCTTG 
GTACCAAAAT 
AAACGACCTT 
CAAATCAGCT 
CAAGGTCAAG 
AGGTCAAGTA 
TTTCAGAAAG 



GTGGCAATAT 
CACGGTAAGG 
ATTCTGTTAG 
TCCAGCATGT 
ACT GAATTTC 
AAAATTCGAG 
GATACCTATA 
G C AG ACCCTA 
GACATGGTAA 
TCGGAGTATT 
TTTAAAACTC 
TACCAAAAGC 
ACAAATGCAG 
TATCCAGCAG 
TTGAATGCCA 
TCTATCGGTA 
AATCTTGGTT 
TCTGCGACAG 
AAAGCAACCT 
AAGGACAATG 
GTTATCGTTG 
GAAATTGACA 
GTTATCAACG 
G CTTCAGT AT 
GGAAAAGACA 
TATATGTTAT 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1939 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xii SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CCTCCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA T AAA TGT AAC 120 

ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA AT AAAT AC AA CAAGGAGAGA 180 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTT A ATCTGATAAA 240 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 3 00 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGCATTTTT 3 60 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 4 20 

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GGTAGTTCAT TCCTTTAACA 4 80 

TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 540 

TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 600 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 660 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGT AGCGATT GGCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCAGG 7 80 

GTATCTTCAG TCACCGGCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 840 

AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA CAACTTCGTT CATACCAGCA 900 

ACGTTGG 907 
(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CCACATAAAG GTAAATATCT TTTCTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CAT AATG ATG CGGTCTTTGA 12 0 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 24 0 
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TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300 

AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA AGTCATTCTT CTTTTCGATC 3 60 

TTATTTTGTA CCGAACAAGC CGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTG 420 

TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480 

AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540 

ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGGTC 600 

TACTACAAAA ATTTGACGTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660 

TTGAAGTGTT TCTTCATCAC GGTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720 

GTTCAAGAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGG AC G A TGGCCAATTT 7 80 

CTTACCTGCC AATTGTTTTT GAACTGTTTT TGTAATTGGT GTTTCGATTT CCACATCTTC 84 0 

TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTAGCTC 900 

ACGAAAAGCT TTTGTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960 

TGGGTGATTA ATAACTTCAA TT TT TCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020 

CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080 

ACATTAGATC AGCCTCTTTA AGAGCTGTCT GTACTCTCTC AAGTGGTAAA TGGCTCAATT ll4 0 

CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTGATAT 1200 

AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT AC AGTTG AG A CAAGGAAAAT 1260 

GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA AGGATAGCAT 1320 

TGACCTCAGC GTGAAGGGTG CGAACGCAGT GGCCTTCAAT G ACCAAA CAT TCGTGATCAA 1380 

TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440 

TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500 

CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 1560 

TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620 

TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCT CC AG AAGGAATGAA 1680 

GATAAGAGCT GACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740 

GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800 

TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860 

GCAGCTCTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920 

TGG AGT CAT A ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG AC CAATTCAG 1980 

AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGAGTTG CAAGAGGGCT G TTTC AATGG 2040 
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CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100 
C AG AG ATT CT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2160 
CTTTTGTTTC 

(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 60 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 120 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 180 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 2 40 

TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 300 

AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGCTTCT TTTGGAACGT CGTTTGGATA 360 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 420 

ACGG TC AC AT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 480 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCAGTA 53 9 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 
<B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 60 

CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 120 

AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 180 

CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 24 0 

TTGGATGACT T GGAACACC A GGAGTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 300 
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CACCTTGACT TGGCTGAGTA GGCACGGTAG GAGCTTTTCT CATAATCTCC TCTACCGTTG 360 

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AAGGTTCATT TGCAGGAGTG CGAACTACTG 4 20 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 480 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT G TT T CCCCTG CAAAGGTTGA 660 

CTGGGAA 667 



(2) INFORMATION FOR SEQ ID NO: 315: 

li) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 



GGGAAGCCAA 


GGTATTTTAT 


CGGATGAAGT 


TGTTACTAGT 


TCTTCACCGA 


TGGCTACAAA 


60 


AGAGTCTTCT 


AATGCAATTA 


CTAATGATTT 


AGATAATTCA 


CCAACTGTTA 


ATCAGAATCG 


120 


TTCTGCTGAA 


ATGATTGCCT 


CTAATTCAAC 


CACTAATGGT 


TTAGATAATT 


CGTTAAGTGT 


180 


TAATAGTATC 


AGCTCTAATG 


GTACTATTCG 


TTCCAATTCA 


CAATTAGACA 


A C AGAACAGT 


240 


TGAATCTACA 


GTAACATCTA 


CTAATGAAAA 


TAAGAGTTAT 


AAGGAAGATG 


TTATAAGTGA 


300 


CAGAATTATC 


AAAAAAGAAT 


TTGAAGATAC 


TGCTTTAAGT 


GTAAAAGATT 


ATGGTGCGGT 


360 


AGGTGATGGG 


ATTCATGATG 


ATCGACAAGC 


AATTCAAGAT 


GCAATAGATG 


CTGCAGCTCA 


420 


AGGGCTAGGT 


GGAGGAAATG 


TATATTTTCC 


TGAAGGAACT 


TATTTAGTAA 


AAGAAATTGT 


480 


TTTTTTAAAA 


AGTCATAC AC 


ACTTAGAATT 


GAATGAGAAA 


GCTACAATTC 


TAAATGGTAT 


540 


AAATATTAAG 


AATCACCCTT 


CCATTGTTTT 


TATGACAGGT 


TTATTT AC GG 


ATGATGGTUC 


600 


GCAAGTAGAA 


TGGGGCCCAA 


CAGAAGATAT 


TAGTTATTCT 


GGTGGTACGA 


TTGATATGAA 


660 


C GGTGCTTTG 


AATGAAGAAG 


GAACTAAAGC 


AAAAAATCTA 


CCACTTATAA 


ATTCTTCAGG 


720 


TGCATTTGCT 


ATTGGGAATT 


CAAATAACGT 


AACTATAAAA 


AATGTAACAT 


TCAAGGATAG 


780 


TTATCAAGGG 


CATGCTATTC 


AAATTGCAGG 


TTCGAAAAAT 


GTATTAGTTG 


ATAATTCTCG 


840 


TTTTCTTGGG 


CAAGCCTTAC 


CCAAAACGAT 


GAAGGATGGG 


CAAATCATAA 


GTAAGGAGAG 


900 


CATTCAGATT 


GAACCATTAA 


CTAGAAAAGG 


TTTTCCTTAT 


GCCTTGAATG 


ATGATGGGAA 


960 


AAAATCTGAA 


AATGTGACTA 


TTCAAAATTC 


CT ATTTTGG C 


AAAAGTGATA 


AATCTGGCGA 


1020 
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ATTAGTAACA GCAATTGGCA C AC ACT AT C A AACATTGTCG ACACAGAACC CCTCTAATAT 1080 

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 1140 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTT AAAGG AG AG AGTGT AC A 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 12 60 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 13 20 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 14 83 



(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 2453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 



CCTGAACGCT 


TTTTTATAAA 


TAT CAT AAAG 


CCAATCTGAT 


TTATCAAGTG 


TGTCTAAGCG 


60 


ACGCGAATTA 


AAATTCATTG 


CATAC TCC AT 


CGCTTCTAAA 


AAACTCATTT 


TTGAAAAGAC 


120 


GTTAAAATCA 


TCTAAATTCT 


GACTCCAATA 


TAATAACAAA 


ACCAATCCCA 


TAATATCCTC 


180 


TGGTTGATTA 


TTCAATAAAT 


TTAAGTTGGT 


TTCATAAAAC 


CCTGGAGTTC 


CAAATAGAGG 


240 


CAACTTTTTT 


TCTTCAATTT 


GAGTTTCTTT 


CCTTAGGGCA 


TGCTCAAAGT 


CTATAATATA 


300 


AATATTATTT 


CTATTATCAA 


TAAGTATATT 


ATTAAATGAT 


AAATCTCTAT 


AGGAAAGATT 


360 


ATATTTGGAG 


TTTATTATCT 


CCATATAATC 


AATTAATGTT 


AAAAACCAAT 


CATACGAGCC 


420 


ACTAACCATA 


TTATACTCGC 


TTAATTTATC 


TGCAATAATA 


AACTCAAATT 


CCACAAAATA 


480 


CGAATTCTTT 


ATGTAAAAAT 


CGTTAAAAAC 


TTTTGGAGTA 


AATTCCTCCT 


TTTCCAATTC 


540 


TACTAATATT 


TCTCTTTCAT 


TTATTAAACG 


ATTCACAGAA 


TCTCTATTTG 


TAAAATCAAC 


600 


CAACGATAAA 


TCACTAGCTT 


CTTTTAATAA 


AGAATAAACT 


CGCTTTTGAG 


TATTAAATAC 


660 


TTTATAAACT 


CCACCTTTGG 


CATTTTTAGA 


AATCACTTCC 


AAAATAATAT 


ATTGATCAGG 


720 


AATAGTGTTA 


TATCTTGGAA 


TATAGTAATC 


CCTTATTGGA 


ACATTCACAT 


TTGAAGGGAT 


780 


TTTCTTATCT 


CTTTTATCCT 


TGAAAGTGCT 


ATCTTTTACG 


AACTCCCCAT 


ATCTGTAATA 


840 


TACAACCTCG 


CTAAGTTGAA 


AT CTG AAATC 


TGATGGTATG 


TTTACACCCT 


TTACACCTTT 


900 
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ATACAATATT TCTAATTTGT GTAACAAACG TTGAAACTCT TTATTATCTT TTCGATAAAT 9 60 

TCTAATGAAT TTCCCGACTT GTGAATAACC ATTAAGCCCT GTATTTTGCA AAGAAAGTTC 1020 

TTTAATGCTA ACCAAAATTT TGAAATTTAT CTTCTTCTCT CT AG AAAAT A TAAAATCAAA 1080 

GAATTTTTTA GCAACCAAAT TAGCATTTAA TATTGAAGCG CTCAGGTGTA TTTTAAATCC 114 0 

CTTAGATTGG GTGATATTAG ACGGCAAATT ATATAACCAA" TGTTCATCAC TAAAATTATC 1200 

ACTAATTTTA TATTCTAATA ATAAATTATG GTATGCGTCT TCTATTTCAG TTTCATAGTC 1260 

CAAATAGTTT AAATACTTTT CGTAATTCAT ATTAAGAAAT CTTCTCCATA AATTTTTAGA 1320 

CCATCATTTA AAGCCAAACA ATTTAAAGCG TGATAATAAA TGTTGATAAT CAATGTAACT 1380 

TTCAGTCCTC TATTTTGTAA TTCCTTCACC AATAATTTTA TGCTATATCT ATTTTCTCGA 14 40 

GGCAATTTAT AGCACTTCAA GATAAAACCA TAAAAGAGAT AAGTATTATA ATCTGACAAT 1500 

CCAGTTTCAG AATAATTTTT TAGAAAAATA TCTAGTGATT CTGATAATTC ATCCGGAATA 1560 

ATTCTTTTAA CATCGTATTT ATTTTTCATA TCGGCCACTC TTCCTTAAAA AGCTCACAAT 1620 

AAAATTTTAA ATTTCTATAC AACAATCCGA GAGTAGTCTC ACAATTTGAA CATTTCACAT 1680 

CACTCTTAAT ATATAAAAAA TGAATTAATC AGAAACCTCT GACTAAGATT TCCTAATTAA 1740 

TTCACTTTCT ATATCATAGT AAGGAATTCT ATTATCCCTA ATTGAAAATT GAAATTTTAT 1800 

GTTTTATATA TTAACAATTA TGCGGATTCT AAATCTTGTC TAACAAAATG GCAAGTGCTA I860 

CTATGTGCCC CAGAAGGCGA TGCAACGCTA TTTTGAATTG AAAGAGCATA ATCATCCATA 1920 

TCATTTAAGT CACGGATTAG CAATGCTTCC TTCTCTCTTC CGACAATTCC AAATTTTCTA 1980 

ATTACCTTTT CAGGATTATC AAAAAATTCT CCAACAACTT CCATATTTCC TTGAAGTTCA 2040 

TTCAAGAAAG CTTTCATTTG ACT AC TC ATT ATATAGCTCC TTTTCTATTA CTTTATTTGG 2100 

AATCAAAACT TACTTGTACA TTGGAAACAC CTCTATTCTA CGCTT TCAT A TTGCTGCATG 2160 

ACACTTTCAA AATCAAATTG CTAAAAATAA TTTTTTAAAG CTTAATTTAG ATTTAATTAC 2 220 

ATATATCTCA AAAAATTGTT TTGAAATTAG TAAATTAAAA TAGGTTTCTG TACTTATAGG 22 80 

AACTAGTTAT AAAAACTTCG CCCATCATAA AATATCTATT TAAGTAAAAC AAAAATTTTA 23 40 

TAATTTTTTG ATTTTTAAGT GACTATAATC TCCTATCTAT AAAT A CCATT CGCAGGACCT 2 400 

GCATCAATCC CTCTAGCCAT CTTATGAACT TGAGTTCCTC CAGACAGTCC CGG 2453 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

CCAATTTGAA GGCTCTAAAA CAATGGAAAA GTGCTACACA GATGTGACAG AATTTGCCAT 60 

TCCAGCAgTA CTCAAAAACT TTACTTATCA CCAGTTTTAG ATGGCTTTAA CAGCGAAATT 12 0 

ATTGCTTTTA ATCTTTCTTG TTCGCCTAAT TTAGAATAAG TACAAACAAT GTTGGAACAG 180 

GCATTCAAAG AGAAGCACTA TGAGAATACG ATTCTC CAT A GTGACCAAGC CTGGCAATAC 240 

CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA TTCAAGCATC CATGTCACGC 3 00 

AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT TTGGCATTTT GAAATCGGAG 3 60 

ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA ACCTTGAACA AGCTATTGTG 4 20 

GACTACATTG ATTATTACAA C AACAAGAG A ATTAAGGTAA AGCTAAAAGG ACTTAGCCCT 4 80 

GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTGTC TAACTTTTGG GGTGCAGTAC 54 0 

ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA ATTTT AT AT T CCCAGTTTAT 600 

GGATGTAACT TACTATATTC ACAAT GTTAT CCAGTGTTTT TTCTCTAATA TTTAAGGAGT 660 

GTTCTGTTTC TCGAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 72 0 

AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT AGTATAACTT TTATCGGCTA 7 80 

ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG AG CTGTCC AT TCTCCTTCAG 840 

CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC TTTTTGCCCG TCCGATTTTC 900 

TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT AATTTTTTCC ATTTTGTATT 9 60 

TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT TTGAAAATAA ATCCTTTTTT 1020 

CTTCTTCTGA AAATAAATCC ATTTTCCGG 1049 



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

tA) LENGTH : 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AG AAT CAGAA AATCGCCGAA GTGGTTATTT ATTTT TGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
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CGAAGTATAA TGATATTGCC TTGCTTCCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 24 0 

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 300 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTCCT ATT TTTGTG A TTATGGCTCT 360 

TATTATGGGA ATATACCTAT GAATTGGGTT CTTATAAAAA TAAAAGATAT TTTTTCAATA 4 20 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAACCATTA ATAATAAAGG TGTTAGAATT 4 80 

ATACGTGGTG GTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC AT AATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 6 60 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCC AAAGTGCAGA GATGATGTCA 72 0 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 77 6 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TGCAATGCGG CGCCTGCATA CGCT TGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCCATGCCT GCTTGCCGAA TATCATGGTG 24 0 

GAAAATGGCC GCTTTTCTGG ATTCATCCAC TGTGGCCGGC TGGCTGTGGC GGACCGCTAT 3 00 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 3 60 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGC AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1475 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

CCGGCTTAAT TTTTAGAAAA CGTGGGCAGG GAACCTTTCT TCTCTCTCGT GGCAGCTCAA 60 

AAAGAAAATT AATCGTTCCA GAAAGAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 120 

CTCATTCTAC AATTGACTCG AG G ATT ATT C ACTTCAAATT AGAATTTGCA AATCAATTTT 180 

TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 240 

TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 300 

GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAAT T A CATTGAAGGA AAGTTAGGAT 3 60 

TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 4 20 

ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 480 

ATAACGGAAC TCCGTTTGAG TACTCGATTA GTCGTCATCG CTATGATTTA TTTGAATTTA 54 0 

ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 600 

CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAG GTCCCATCTA TGCTTAAATG 6 60 

GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 7 20 

AAATATCTTC GTATAGCTTA TAGAGTAGGT ACTGAAATTG TTCACCTGAT CTACTTCTTA 7 80 

TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTCAGT 84 0 

CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA 900 

ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 960 

AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 1020 

AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 1080 

AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTG GATCATCGTA AATTGTTTGA L14 0 

TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 1200 

AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 12 6 0 

TATTGTTAGA AATCGATTTG ACTAT CCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 1320 

TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC ACTTTCTTCA 13 80 

TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAACTTCAA 144 0 

CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 1475 
(2) INFORMATION FOR SEQ ID NO: 321: 



1 

V 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATAGTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTGAC GCGTACACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT AC CTG AGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 24 0 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 300 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 3 60 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTACAAAAAT CATAACCTTC CCTATCTACG 480 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGCGA AAGGAAGATA AGAAa CGT AT 540 

TTAGCCCATA ATCTATAAAG 560 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 643 base pairs 
<B> TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTC AC C TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 120 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGC CTTTTC C GAAGAACGGC TCCGGGTTTG ATGACGAGGG 240 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 300 

TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 360 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 4 20 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 480 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 540 



) 
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ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTG 600 

GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS : 

{A) LENGTH: 7 80 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 3: 

CCTACCCACT CATTCTTGAT GAATTCTGAA CAGTTCCCCT TGGGTCGTTT TGCGAGTTGA 60 

AGTCAAGAAC AGGAAAAAAA CAAAAAGGAC AAA TACT CAT GGCAGTAATT TCAATGAAAC 12 0 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACC AAACTC G TCGCTGGAAT CCTAAGATGG 180 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 240 

AATACGCTGA CC AAG CAT AC GACTTCATGC GTGATGCAGC AGCTAACGAT GCAGTTGTAT 3 00 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TG AAG AAG C A GTACGTTCAG 3 60 

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTCG GGAACAATCC 4 20 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTATGGA AGAAGATGGA ACTTTCGAAG 4 80 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 54 0 

TGGGCGGTAT CGAAGATATG CCTCGTATCC CAGATGTGAT GTACGTAtTG ACCCACATAA 600 

AGAGCAAATC GCTG TTAAAG AAGCTAAAAA ATTGGGAATC C C AGTTGT AG CGATGGTTGA 660 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG AC GCT ATCCG 720 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT AT CG AAG G AC GTCAAGGTGT 780 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 60 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 120 
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AGGAAAACCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180 

TGACTTT T CC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 24 0 

G ACT ACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCCTAT CTCTGCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCAT g AAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCACTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 1237 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

TCTTATGAAG CCGAAGCGTG ATTTATGGCG GATAGGTTTG GTCTGCAGAA ACTGACAAAT 60 

CT AGTGC CAT CAGCGTATAT GGAATCTnTG GCTGAGAAAC AGTC CCGGGG TGAACTGACT 120 

TATGAGCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180 

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 2 40 

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 300 

ATTCCGGTAG GTCAATTTCG TCACACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 360 

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTGGATTA TGATTTTAAT 420 

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCAG 4 80 

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 540 

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600 

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG AT AATGC AAA GATTTTACAC 660 

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720 

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 7 80 

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 840. 

CAAAGTTTGG CTCTTTGTCA ATTGTAGTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900- 
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AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960 

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CC AG TTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 

GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG ^37 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAGACTCA CTACTAGCTT AAAACATGTT 24 0 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTT TG TT TTCAAAAGAT 300 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 360 

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 4 20 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 4 61 



<2) INFORMATION FOR SEQ ID NO : 327: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 
{S) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
<D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
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ATAGAATTCA 


TAAAAGCCAT 


CGCTCATTAA 


AACAATTTGT 


TCACTAGTAA 


CATCTATTTG 


240 


ATTAATAATA 


GCATGGTCTA 


AAAATCTCTC 


ATCCAACGAA 


CCTATC CAGT 


ACCCACTCGG 


300 


TTGATTAGAT 


AATTTTCTGA 


TTTTTTGTAA 


AATAATTTTT 


TTATTTAAAA 


CACTATTTGT 


360 


ACCAATTGAA 


TCTTTTATCT 


CATTTTTCCC 


TTTTTCAAAT 


AAGTTATCTA 


CTCTATGATC 


420 


AGTTATTTCC 


ATTTCGTTTA 


CTAACATGAC 


GCAGTCACCT 


AGCATCATAT 


ACTCCAACTT 


480 


TTTTTCTGAA 


AGTTTAGCAA 


ATATTGGTAA 


GCGATAATAT 


ACT AT ATT GA 


AACTAGAATA 


540 


GTACACCTCT 


ACTTCTAAAA 


CATTGTTAGA 


AATCGATTTG 


ACTGTCCTGA 


TTGATTTGTC 


600 


CTATTATTAT 


TTCATTTTAC 


TATACTCTGT 


TAATTTATAT 


GAGTTTAAAC 


CGATTTCATC 


660 


TTTAACCTCG 


ACTAAAGCAG 


TTTCAAATAT 


TTGTTTAAGA 


GTTTTTGATT 


CTTTACAATT 


720 


AACCGACAAA 


CTTTCTGATA 


AAATATGTAC 


AACTTCTGAG 


ACTGAATAAC 


CTATCTCCTC 


780 


TTTAGAATTA 


TATAAATCTG 


TAGCTCCACC 


AATAATCCAA 


AAATACTGAT 


TTTGTGAACC 


840 


TACAATATCC 


TCATTTTCTA 


CGGAACTTCC 


TTCTATCGAA 


CAAATTTTAT 


TTATCTTTAC 


900 


CATAATACTT 


CAACCCTTTT 


AGTGTCAAAA 


GTAAACCAAT 


TCCTGTCACT 


GTTAAGAATA 


960 


GTTCCATAAT 


CTTATTCGAA 


CCAGTCTTTG 


GTAATTTTTG 


TTTkACATCT 


ACTATyTCTT 


1020 


TAGATTTATT 


AATATGATTT 


TCAGTTTCTC 


TGCCATCTCC 


AACTATTTTA 


TAGTTTACTT 


1080 


CTTCTGTCTT 


ATTATCTTGT 


TTATTGTCGA 


TCTTGTCATT 


CATTTGTCTA 


TTATCTTTAC 


1140 


TTGAGTTAAA 


CTCTCCGTTC 


TTCTGGTTAC 


TATCAATTAC 


ATTATTTGAA 


TTAGATTGTT 


1200 


TTTCCTCTTT 


GTTTTTTTCT 


TTTTCGTTTT 


TATCACTTAA 


ATTATTTGTT 


ACAATTTTGT 


1260 


AAAGCCCATT 


CTCCGTTACA 


ATATTGAAAT 


TACCATCGCT 


ATCACGTATA 


ACAGGTTCTT 


1320 


TCCCATTTGC 


ATTAGATTTG 


ATGAATGATA 


TATACTTACC 


GCATAAATTA 


TAAAATTGGT 


1380 


TATTTAAAAC 


GGTTATTTTA 


CCCTTTGAAT 


CCTCAATAAC 


AATTCCTTCT 


TTACCC 


1436 


(2) INFORMATION FOR SEQ ID NO: 328: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
<D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 8: 

CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60 

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG AC ACACT CTG TGAGGAGATA TTCGATTTGC 120 

CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180 
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GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AG GCTTCCTG 240 

TGTTGACAAT TCGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 420 

ATCCTTGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTCA 4 80 

TACCTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 540 

T ACGGGT AG C AACAATTTCG CTAGAGCCAC GG AG ACT A CC TTCATCTGCG TGCCCATCAC 600 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 64 6 



(2) INFORMATION FOR SEQ ID NO: 32 9: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 1653 base pairs 
<B) TYPE: nucleic acid 
(CJ STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

GTTGCAGGTG CAGTAGGTGT TACTTCAGAT ACATTTGAAC GTGCAGAGGC TCTTTTTGAG 60 

GCAGCAGCGG ATGCGATTGT TATTGATACT GCACATGGTC ATTCTGCAGG TGTCTTGCGT 120 

AAAATTGCCG AGATTCGTGC TCATTTCCCA GATCGGACTT TGATTGCTGG AAATATTGCT 180 

ACTGCTGAAG GTGCACGTGC CCTTTATGAA GCGGGTGTAG ACGTTGTTAA GCTTCGTATT 240 

GG AC CAGGTT CTATCTGTAC TACTCGTGTG ATTGCTGGTG TTGGTGTTCC GCAAGTAACA 300 

GCTATCTACG ATGCTGCAGC TGTTGCGCGC GAATATGGTA AAACG ATT AT TGCTGACGGT 3 60 

GGGATCAAGT ATTCTGGAGA TATTGTAAAA GCACTTGCTG CAGGTGGAAA TGCTGTTATG 4 20 

CTTGG AT CTA TGTTTGCTGG AACTGATGAA GCTCCAGGCG AAACTGAAAT CTTCCAAGGA 4 80 

CGTAAATTCA AGACTTACCG TGGTATGGGA TCAATTGCTG CTATGAAGAA AGGTTCAAGC 54 0 

GACCGTTATT TCCAAGGTTC TGTCAATGAA GCAAACAAGC TTGTTCCAGA AGGAATTGAA 600 

GGTCGTGTTG CTTATAAAGG AGCGGCAGCT GATATTGTTT TCCAAATGAT TGGTGGTATT 660 

CGCTCTGGTA TGGGTTACTG TGGTGCAGCT AACCTTAAAG AACTACACGA TAATGCTCAA 720 

TTTATTGAAA TGTCTGGTGC TGGTTTGAAA GAAAGCCATC CTCATGATGT GCAAATTACT 7 80 

AATGAGGCAC CAAATTATTC TATGTAAAAA ACAATGAAAA GAACTCCAGT GAAAACAGGA 8 40 

GTTCTTTTAC AATGTTGTCA ATTTCCATTT ACAGCAG CTT TACCATCCTG AATAGTGAAG 900 
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ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC TTGTTGTTGT GATAAAGGTT 9 60 

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTCTC TAGTGTTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA T ATAGGAG AT TCTGTGGTAA TGCTTT CCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 1140 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TGAAAGATTC GGATAAGTTT 1260 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGGATATTG ACAACTCTTC GATCTGATTA 1320 

GAGAGTTCAA AATGTTTCTT ACGCCCAAAT GATTCTAGTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATGTT 1440 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAAGT CAGATAAATA GATTGGCTTA ATTTCCCCAA GTTCCATATC AATGAATTTT 1560 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 1620 

TTCATGTGTC CT ACATAATC TGAAAGGCGT GCC 1653 



(2) INFORMATION FOR SEQ ID NO: 330: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA C GTTTTCAG A TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 240 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 300 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 3 60 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 4 20 

TATGCACATA TACACTTAAA TT CAT AT AAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
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CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720 

AATTTAGTAC CACTCATTTT AG AT ATTTT T TCAAGCCTGC CTTGATTTAC ATTCGTTG C A 780 

AAATGATCAT CTAAGAATCT TG CT AC AT CT TCAAGTGCTT TAT CATC ATC AATTTCAATT 84 0 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 9 60 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGACTTT GCGGTAATAC TTTGATACGG ATTAAACTTT 12 00 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CAT AGTCTG A CAT ACT AT AA 12 60 

GTGCT ATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 1320 

CCTCCTTCAT AAAACCGGTA 1340 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTG ATC C GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TT CTTTTT AT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTCTTCA ATTGG AAT AT 240 

AGTGATTTAC GAAAT CGT AT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 3 60 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 4 80 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 6Q7 
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(2) INFORMATION FOR SEQ ID NO: 3 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 900 base pairs 

(B) TYPE : nucleic acid 
(C> STRANDEDNESS : double 
<Q> TOPOLOGY: linear 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



TTAAAATACC 


GAATTTTGTT 


TTGTCCTCTA 


TTTCAACATT 


GTGAATCGCC 


TCAGGCAGAG 


60 


AACCGATACT 


AAAGATATAA 


CCAAAATAGT 


TGTCATTTGC 


TTTACCGATA 


T CAATCTT AT 


120 


TGGTTAAATC 


AAAATCCAGT 


TCGTCAATTG 


CGCCATCGAT 


GTCTTGATTG 


ATTTCCAAAA 


180 


GTTTTGTAAT 


GAGGTTACCC 


GTACCGCCTG 


GGATAATCCC 


TAACTTAGGA 


ATGTAGTCTC 


240 


TCTCATCAAT 


ACCTGAAATG 


ACTTCATTGA 


CACTTCCATC 


TCCACCAAAC 


ACAACCACTG 


300 


CATCATACTG 


CTCACGAGAA 


GCTTCTTCAG 


CAAAATGTGT 


TGCATCCAGC 


GCTTTTTCGG 


360 


TAATTTTGGT 


TTCAACATAT 


TCAAAGTATT 


CTTTTGCTTT 


ATTCTCCAGC 


TTTTCTTTGT 


420 


AATC CAAAGC 


CTTCTCGCCA 


CCAGAAGTAG 


CGTTGATAAT 


TACCATTGCT 


TTTTTCATTG 


480 


ATTTTAT CCT 


TAATTTTAAA 


CAGAAATGTT 


TACATTTCGT 


CGTATGCAAG 


TAAATGTAAT 


540 


CCTATTATAC 


AATGAAAATA 


CAGAAAAGAG 


AAATCTGACG 


TACTGGAGAT 


TAATACGCTT 


600 


TTATTCTATT 


TTCCCATCGC 


CTAACTACAT 


CCTTTAAGGG 


TTCATCCAAG 


TAAGAATAGG 


660 


CCTTATCCTT 


GATCCAATCA 


GGAATACCGT 


AAGCTGCCTC 


TGCTAwGCTA 


CAAGTGATTG 


720 


CTGCGAGAGT 


ATCACTGTCG 


CCACCAAGTG 


AGATGGCATT 


TCTTATCGCA 


TCTTCGAAGT 


780 


CTCTACTTTC 


AAGAAAGGCG 


ATAATGGCTT 


GAGGGACAGT 


TTCCTGACAT 


GTTTCGTTAA 


840 


AACGATAGTT 


AGGACGGATT 


TCATCTAAAG 


TTTGAGATAG 


ATTGTAATCG 


TATTCTTTTT 


900 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
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CGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCCC TGGTGAGTTC 24 0 

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTGG CCTACACTCT CAGAAAGAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 3 60 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGC AG AC CG TAGAAATGCC 420 



AGAGCTGCTG AGCAAGCGTG CCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 

(2) INFORMATION FOR SEQ ID NO: 3 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGACCT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGC AG AT AG A ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGC AG A TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 240 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 3 60 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGC GTTTTT A GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 480 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 



ATGA 544 

(2) INFORMATION FOR SEQ ID NO: 3 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 
(B> TYPE : nucleic acid 
(C> STRANDEDNESS: double 
{D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 5: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGC CTCT A 60 
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CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTT AG AAT AT 

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGCTATATT CAAAATACAA 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGG AT AC C TATCCTACC 

(2) INFORMATION FOR SEQ ID NO: 3 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 
(9> TYPE : nucleic acid 
<C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 36: 

CTCCGATAAC CACACCAGCA ATGGAAATAA TTCCATCGTT AGCATCAAGA ACACCCGCAC 60 

G C AGG AT ATT TAAACGACCT GCAAAATTTG AATCAATTTC GTGATTTCTT TCTGACGCTA 120 

AATTTCAAGT TCAAGTTAGC CATCAAGAAG TCTTCTCTGG GTGACTTGTA GT CCAAG CAT 180 

TTTTTAGGAT AGTTCTTAAT CCACTTTTCG ATGAATGCGA CTTCTTTGGG AGTCATTTTC 2 40 

TTGCTTCCCT TAGGTAACCA TCTACGAATG AGCCTGTTGT GATTCTCATT AGTTCCCGGG 300 

ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTGGCACT GGCCGTCGTT TTACAACGTC 3 60 

GATGACTGGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 420 

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 4 80 

CCTGAATGGC GAATCGGGCC TGATGCGGTA TTTTCTCCTT AC GCATCTGT GCGGTATTTC 540 

ACACCGCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC 600 

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 660 

TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAAGTTTT CACCGTCATC 720 

ACCGAAACGC GCGAAACGAA AGGGCCTCGT GATACCCCTA TTTTTATAGG TTAATGTCAT 780 

GATAAGGATG GTTTCTTAGA CGTCAAGTGG CACTTATCGG GGAAATGTGC GCCGAGACCC 840 

TATTTGTTTA TTTGTCTAAA TACATTCAAA TATGTATCCG CTCGTGAGAA AATAAACCTG 900 

ATAAATGCGT CAATAATATT GAAAAATGAA GAGTATGAGT ATT CT AC ATT TCCGTGTCGC 960 

CCTTATACCC TTTTTTGCGG CATGTTGCCT TCCTGTTTTT GCTCACCCAG AAAACGCTGG 102 0 

TGAAAGTTTA AGATGCTGAA AAATCATTTG GGTGCACAAC TGGGGTTACA TCCAACTGGA 1080 

ATCTCCAnCA GCAGTTAAGA TCCTCTGACA GTTGTACACG CCGCAAGAAC TATTCCCGAT 1140 
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GAATCAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200 
GCCCGG 1206 
(2) INFORMATION FOR SEQ ID NO: 3 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 37: 

CTGCTCAACT C AG ACAGTCA AATTTCTCAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCAGGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TGGGCTTAGA CACTCCTATG AT T AACTTC A ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT TGAAAAGTGT TTGACTGTAG ACAAAAAATA TCTGCTAGAT 2 40 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAAGCT 3 60 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTACCAG 480 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 54 0 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TGAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTT AC CAACG ACCAAGATGG GGTTGCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 7 80 

CTGATACTCA ATGAGGGGCA AAGAGCGAAC TTA 813 



{2) INFORMATION FOR SEQ ID NO: 33 8: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 38: 
C CT AG AT AAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
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ACAAATTATT TAAACTAAAA GAGAACGGTA C AG AC GTTCG TACAGAGGTT CTCGCTGGTT 120 

TAACAACTTT CTTTGCAATG AGCTATATTC T C TTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CG ACGATT AT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTT T ACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 4 80 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACT CTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

C C ATT ACT AT CTTCTTTGTC ATC 683 



(2) INFORMATION FOR SEQ ID NO : 3 39: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 39: 

CT ACTTT AC A TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60 

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120 

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180 

ATGgACAGAC AAT CAT AT CT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 240 

TTA T TTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 300 

CGGGGCAATA TCTAGGCACA GGCTT ACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 3 60 

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT AT CT ATT T AG 4 20 

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 4 80 

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT T AC ATTGCT G ACAATTG CGT 540 

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600 

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCGAG ATTAGTCGGG 660 

TGTTATCCTC TATTCCGTTA ATATTCGAGA CAATTGAAAA ATACGAGCGA ATCATTGTGC 720 

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 780 
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TTCTGATCCT GTAGATTTTT TTGTTTCACT AGGGATTTAG CCCGAGCTCA AAT CAGCTCT 
CTGATTTTCA GA 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE; nucleic acid 

(C) STRAND EDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

CCGCACAAAA GCGCATAGTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG 60 

GAT AAATAG T TAGTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120 

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACATCA TAACAACAAG 180 

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATCC ACATAGTAAT TGAGATAACT 240 

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTAGGGT 300 

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATAGC 3 60 

AAAGAGGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 420 

ATCTTTTCTC CAATAATGAG CAAGTAAAGC C AAAATC AT C TGACCAAATA AGATCAAAAA 4 80 

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTGACTA GGAGACTTAG CATCTGATGG S40 

GAAATAAGTC CACGACTCTT TTCGACGCCA TAAGCCTTGT TAAAAGCTTT TTGCAAGAAA 600 

TTCATAGATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660 

GAAGGTTGCG TCAAGACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTGGGGGG 720 

CAGACGTCTT TCATAAAGCC CAAAAATTCT CCCA 754 
(2) INFORMATION FOR SEQ ID NO: 341: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 1: 
GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG ACTGCACAAG AAGTCAGGAT 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 



60 



120 
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CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180 

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTCCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 3 60 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480 

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCAT AG AC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 660 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 707 



(2) INFORMATION FOR SEQ ID NO : 342: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GGATTTTGAA AAACCATACC GATTT G AC G A CGTATATTCC AAACATTTTC CTCAGTCAAA 60 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACC GTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACGTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTGATT TTCTTTATAG CGAAAAGAAA 240 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300 

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 360 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACT A_\A 4 20 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 480 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 540 

AGCCATACCT AACTCAATCA A CAT AAT AAA AGCCG ACAT A ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GAT AAT AGCT G C AACT AT AT GACTCTCTAT 7 20 

CGAATTTCCT ATCGTTAAAA TAAAGAT AAA AAT AGGT AT A AA 762 
(2) INFORMATION FOR SEQ ID NO: 34 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
ID) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTC AG C GTTTTGCTTC 60 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 18 0 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 24 0 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 3 60 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACT GGGC AGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 

TC 4 82 



<2» INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 
TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 2 40 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 3 00 

CTACACTTAA AGATTATGGA TTTACCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 3 60 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 520 
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(2) INFORMATION FOR SEQ ID NO : 34 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 5: 

GCATCAAATC CG C CATC AAA GAAGTTCTCT GGATTTACCA ACACCACTCA AATAGCTTAG 60 

AAGTG CTT AA TGACAAGTAC AATGTTCACT ACTGGAATGA CTGGGAAGTT GGAGACACGG 120 

GAACCATTGG TGAGCGCTAT GGTGCCGTTG TTAAGAAACA CGACATTATC AATAAGCTTC 180 

TCAAACAGTT GGAAACCAAT CCTTGGAACC GCCGCAATAT TATTTCGCTC TGGGATTACC 240 

AAGCTTTCGA AGAAACAGAT GGGCTGCTCC CGTGCGCCTT TCAGACCATG TTTGATGTTC 3 00 

GGCGTGTTGA TGGGGAAATC TATCTGGATG CGACCTTGAC CCAGCGCTCC AATGATATGC 3 60 

TGGTGGCCCA CCACATCAAC GCTATGCAGT ATGTGGCTTT GCAGATGATG ATTGCCAAAC 42 0 

ATTTTGGCTG GAAGGTTGGG AAGTTCTTCT ACTTCATCAA CAACCTCCAT ATCTATGATA 4 80 

ATCAATTTGA ACAAGCTCAG GAATTGCTCC GTCGGGAgCC GTCAAACTGC CAACCACGCT 54 0 

TGGTTTTAAA TGTTCCTGAT GCGACTAATT TCTTTGATAT CAAAGCAGAA GATTTTGAGT 600 

TGGTGGATTA TGACCCTGTT AAGCCACAGT TGAAGTTTGA CCTAGCTATT TAAAAGAATA 660 

GAAAAAAGAA GTTGAGAATA ATCC CAACTT CTTTTGTTTC TTAACGTGAT ACGCGGCGAC 7 20 

GAGCTGCTTT TTTACGGTTT TCTTCGATGA AAGCTGCTTT TTGCTCTTCT GGTTCGATTA 780 

CTTTCTTTTT AAATGCGTAT ACTGC AC CTG CAACGGCAGC GACAGTTCCT GCGACACCTG 840 

TTACAAGACC TTT AG CGAAT CCTTTAGCCA TGAGTCTTCC TCCTTTATAT TCTCAATCAG 900 

CCAGCCTCCT CAAGAGGTCA CATTTTTCTG ACTGACCTTT TTGTGTTATA ATAATAGTAA 960 

CGAAAAAATG GGAATTTTTC AAGGAAAAAA GATGAGAACA AAA 1003 



(2> INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 6: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGCTC 60 
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TCAACGCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA AT AC AT ACT C CGCCATGCTA 12 0 

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGCAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCT CT TTA TGGCGTTCGT AAGCCTT GAG . 24 0 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGCAGCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

CCCCATATTG GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420 

AAAACTGCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTCTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATCAAC 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTCACCTTAT TGAAATGATA 720 

GTCACGGCTG ACAAAGACTT TTTCTGAAGA 7 50 

(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 
<B> TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAACCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 24 0 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 3 00 

TACGAAGAAG GTCGATAGAT GTCAkTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 3 60 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 42 0 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAG C CAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATC AG C 54 0 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
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(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 673 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CAGAGTCAAC AGCCTGAGTT GAAGGCAACT TTAGACACAG CAGTTACGAC AGCTGAATGA 60 

GCTCCTCCAT CAGTTTTTTC TTTAATGAGT CCAGCTACAT CTTCAACTTC GAGGCCGTTA 120 

ATCACAATGT CAGCGCCTAC TTCTTTTGCA AGGGCAAGTT TGTCATTGTT GATATCGACT 1B0 

GCGATAACAT GAGCATTGAA TACTTTTTTA GCGTATTGAA CAGCGAGGTT ACCAAGTCCA 2 40 

CCAGCACCGT AAAGAACAAC CCATTGGCCT GGTTCAACTT TTGCTTCTTT G AT AG CTTT A 300 

TAGGTTGTTA CTCCAGCACA TGTGATAGAA GAAGCTTGGG CTGGATCAAG TCCGTCACGA 3 60 

ACTTTGACAG CATAGTCAGC AGTTACGATA CATTGTTCAG CCATACCACC GTCTACTGAG 420 

TAGCCAGCAT TTTTCACTGT ACGGCAAAGG GTTTCGCGAC CAGTTGTACA GTATTCGCAA 4 80 

GTGCCACATC CTTCAAAGAA CCAAGCAACG CTGACGCGGT CACCGACTTT AAGGCTTTTC 540 

ACATCTGGAG CAATCTCTTT AACGATACCG ATACCTTCGT GCCCAAGAAC ACGTCCTGGG 600 

ACTTGACCAA AG T CACCATG AGCAACGTGG AGGTCGGTGT GGCAAACGCC CACAGTATTC 660 

ACTTCTACAA GTG 673 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 
{ B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
{□) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 34 9: 

GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 60 

TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGAT AT CA TACAGCAACT 120 

GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 180 

TCCCTCACCA AGATTCCG 198 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATAGC TAGAAGGAGG 60 

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120 

CTGGCACTGA TAACGGCAAC CACCAAAGGA GAAACCAAGG ACACAGCCTT CATGGAAATG 180 

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG AT AATGAGG C AAATCAAGAT ACTCCAAATA 240 

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 3 00 

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 3 60 

ATAACATTAA ACATAACACC CATGGC ACTC AGCAAACCTG TTATAAGAGC TAGCGGCGTC 4 2 0 - 

ATGGATAACT GAGAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACC C AG CATGGCAACC 480 

AAAACATAGA AAACAGCGCT TTTTGACGCT CGTTTTTGAT AAACCAAGCG ATTGTAAAAG 540 

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 600 

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 660 * 

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTGCA 72 0 

GACCAAATGA GTACAAGACT CCCTGCCAGT GTCAAACGCA TAGAGGTAAT CCAGCCCGAA 7 90 

G AC ACCT GAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACAGATTCC CCATATTAAG 840 

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891 



(2) INFORMATION FOR SEQ ID NO : 351: 

U> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTG TTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 300 
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ATGCGTTTGA CTGTCAAATG GAGTG 325 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA AAATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180 

AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 240 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344 

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: double 
( D > TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCTTTG GCACAAATCA GTCTTCCAAT 60 

TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 120 

CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180 

GTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTG GCCCTACTGC AGGCTATCTT 240 

TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300 

GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CGGGATTCTC 3 60 

AGCTTGCATT TCCTAGCTGG AATGGCATTT GAAAAAGCTC TTGCTGTGGG GGTTCTTCCC 420 

TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480 

CAACGCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 54 0 

TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600 

GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGGTT GTGGATGAAA CTGACGAGTA 660 
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AnATCTCATA CATACGGCAA GGCAAAGCTG AC 692 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60 

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120 

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180 

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240 

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 3 00 

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 3 60 

TTCACATGCT TTGCAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGG TCATT 420 

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 4 80 

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 540 

TCATT AGAAC TTGATGGAAA GATTGGCATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600 

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 660 

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 72 0 

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 7B0 

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGC AC AAGC AAATCCTGAG 840 

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900 

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 960 

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005 



(2) INFORMATION FOR SEQ ID NO : 355: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDN ESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 355: 

CCGACAAGCA AT AT T AAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 60 

TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 120 

AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGTTCC TATCAGCTTG GACGTTACCA 180 

AGCTGGTCAG GATAAGAAAG AGTCTAATCC AGTTGCTTAT ATAGATGGTG AT C AG GCTGG 240 

TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 300 

ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 3 60 

TTACTATAAT GGCAAGGTTC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 420 

TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 480 

TAAGGTAAAC GGTAAATACT ATGTTT AC CT TAAGGATGCA GCTCATGCGG ATAATATTCG 54 0 

GACAAAAGAA GACATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 600 

AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 660 

CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 72 0 

CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 78 0 

CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCT AGITATA ATGCAAATCC 84 0 

AGCTCAACCA AGATTGTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 900 

TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 960 

CATGTGGGAT CTG 973 
(2) INFORMATION FOR SEQ ID NO: 3 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 56: 

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 12 0 

C AC CAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 180 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 300 

GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 
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GTCGCAAGGG CATTTTCCAC ATTGTGGCTA CCTGGAACAC CGATTTCATT CGCTGCCATG 420 

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AAGCTCCATC AACCTTTTCA 4 80 

AGTGTTGAAA ATGGTACAAC AGTGGCTTCT GTCTTGGAAG TCAAGTCTTT TGCCAAGTCT 54 0 

TGATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTCCACTTG 600 

GCTGCTACAT ATTCCGAAAA TG AC CCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 

ATAACCGCAA TCTCTGGATG GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC 720 

ATAACAACCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCGGGn TCCTCTAGAG 840 

TCG 84 3 



(2) INFORMATION FOR SEQ ID NO: 3 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 
( B > TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 



TTTTTTTTAT 


ATTTTTTTTA 


TTTATTATTT 


TTTGGCAAAA 


AAGACCAATT 


TGCTTTGGAG 


60 


CATTGCTTCT 


GCATTAAATT 


GTCTATTTTT 


GCTCGTGCTG 


TTACGCTCTT 


TGTATCATGT 


120 


ATTAACTAGC 


AAGTGCAACT 


TGCAAACTAC 


TAGTAAGAGG 


AGAAAAACAA 


AATGGTTATG 


180 


ACTGACCCAA 


TCGCAGACTT 


CCTAACTCGT 


ATTCGTAATG 


CTAACCAAGC 


TAAACACGAA 


240 


GTACTTGAAG 


T ACCTG CATC 


AAACATCAAA 


AAAGGGATTG 


CTGAAATCCT 


TAAACGCGAA 


300 


GGTTTTGTAA 


AAAACGTTGA 


AATCATTGAA 


GATGACAAAC 


AAGGCGTCAT 


CCGTGTATTT 


360 


CTTAAATACG 


GACCAAATGG 


TGAGAAAGTT 


ATCACTAACT 


TGAAACGTGT 


TTCTAAACCA 


420 


GGACTTCGTG 


TCTACAAAAA 


ACGTGAAGAC 


CTTCCAAAAG 


TTCTTAACGG 


ACTTGGAATT 


480 


GCCATCCTTT 


CAACTTCTGA 


AGGTTTGCTT 


ACTGATAAAG 


AAGCACGCCA 


AAAGAATGTT 


540 


GGTGGTGAGG 


TTATCGCTTA 


CGTTTGGTAA 


AATCAAGATA 


CAAAGCTCGT 


AAAGAACAAA 


600 


GCAAAATTAG 


GAAGTTGGAG 


AAGTTTGTTT 


AC AAACAGGC 


CAACTTATCT 


ATTTTGCACA 


660 


GTTCTTAGAG 


CGTGTTCAGT 


TCAGCTCTTG 


AGCTAAGTAA 


GTATCTGAAC 


CCCGTGAAAA 


720 


CTGGCCGTGC 


TGGCATGTTC 


GGGTAACAGG 


AGAnAATAAA 


CATGTCACGT 


ATTGGTAATA 


780 


AGTTCAGCTA 


AGGCCTTCGT 


AAAAGTT 








807 
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(2) INFORMATION FOR SEQ ID NO: 358: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EON ESS : double 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGG ATGAT A CACATCTGGA ATTGGCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT C AGGTC ACT A TTTT CGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATG CATC AAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 3 00 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 3 60 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 4 20 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 4 80 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 6 53 



(2) INFORMATION FOR SEQ ID NO: 359: 

<i> SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT G AC CAGTC AG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 3 00 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 3 60 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 
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TCAACCAAGT AGGAAACTTT CTTAAAAGTA ACTTGCTCAA CGACTGGAAG GGTAAAATTG 4 80 

CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 54 0 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGCCTG CTC TTATCTTGCC G 641 



(2) INFORMATION FOR SEQ ID NO: 3 60: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 1958 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 60 

TTTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 120 

AGCGAAATTA TTGCTTTTAA TCTTTCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 180 

TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCT C C AT AG TGACCAAGGC 24 0 

TGGCAATATC AACACGATTC TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 300 

ATGTCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATCTTTCTT TGGCATTTTA 3 60 

AAATCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA ATTGGAACAA 4 20 

GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAAGGTAAA ACTAAAAGGA 4 80 

CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTGTCT AACTTTTTGG 540 

GGTCAGTACA AAACTCTTGC T ACT AT GCGT TTTATTATTG AAAG ACTT AT TGGACTTTCT 600 

CTCAAATCGA GTTTTTACTC AATTTTCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 660 

CTCTGAGT AG AGTGTCTTGA TATTGGCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 720 

CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTTGTTTGA 7 80 

GATGAGGATT TCATAGTGAA GCGGAGCTTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 84 0 

ATAAACCTCA ACAAGGGTTG CATCGGTACT TTCT AGCTGA CTTTTTACAA GTTGCGAGTG 900 

TGAGTTTGTC GTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 960 

CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAA CT AGG AA 1020 

GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAATTTTATT 1080 

TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 1140 
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CTACCACGTT TAAGGTTTTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT ATGATTAAAG 1200 

TTTTCTAGTG GCTTTTGTAC TTCTTTGAAA GGAGTTCCTT CATAGAGGTA GGCAGGATTG 12 60 

TAGTAGTAGG TATCACGATG ACGAGAATAG AAATATTCCT CAGCTTGTCC GTAATAGGTA 13 20 

CCAATTTCTG CTGTGAAACC AAGCTCTTCA AT C AACTC AT GCTTTAGGGC TTCCTGATGA 13 80 

TTTTCACCTG CTTCAATTTC TCCACATGGT AGGAACCAAG CACCATTTGG TTCTTGAACA 1440 

AGAACAATTT GTTTTTGTTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 1500 

TAGTCTGTAT TCACTTTTTT TCTCCGAAAG TTGGGTTTGC CATTGCATTT TCCTCATTAT 1560 

CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1620 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 1680 

TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTCGTT 17 40 

AAAGCTAAAA GACAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 1800 

GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTGACTGA 1860 

AATAGCTTCT GAATTTGGCT GTC AT CCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 19 20 

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 1958 



<2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 851 base pairs 
<B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 
TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGAAG CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 120 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 180 

TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 2 40 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 300 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 3 60 

CCATCCTAAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 42 0 

AGAAATTGTT TAAGAATTAA TG AC TGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 4 80 

ACTAGCTCGT TCG AC CT ACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 540 

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 600 
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TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 6 60 

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTG CT AC A CAGATGTGAC AG AATTT GCC ATTCCAGTAA GTACTTAAAA 840 



GCTTTACTTA T 

(2) INFORMATION FOR SEQ ID NO: 3 62: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 62: 



GGGTAGAATC 


GATATCTCCA 


ATGAGTTGGT 


t TAG CTGGTG 


AAACTGTAAA 


AAGATTTCGw 


60 


CCAATTCAAG 


GTTGAGCCAT 


CGCAAACTAT 


GGACTGTTTC 


CTCGTCAGTT 


CTGGAAAGAA 


120 


AACGGGATAA 


GGTTGGCTGT 


GAAGCAAGCT 


GCCCTCCTTC 


C AAC AATTT T 


GGAAAGTAGG 


180 


CATCAGCTCA 


CAATTCTTTA 


CAAGCATAGT 


CCGTTCCATA 


ACCTGTTAAC 


AGTTGAAAGA 


240 


GGAACTGGAC 


AAGGAT AT CT 


GAATCCGAAT 


AACGACAGTA 


GCGGCGTTGG 


T C ATTCGTT A 


300 


CTAAATACTT 


AG AAATC CGC 


TCTTTTAGTT 


TCAACTGGGA 


AAAAAGTTCC 


TGAAAAAAGA 


3 60 


TAAGACCACC 


ATACTGGGTT 


AAATGACCTC 


CATCGAAAGA 


TAGTTCGTAA 


AAAGACTTGT 


420 


TTTGGAAGTG 


ATGATTTGGT 


AAACTGTTCA 


TGTGAGTTTC 


CTTTCTTTTT 


GTGTTTTTTT 


480 


CTACACTTAT 


AC C AT AAAGG 


GGAAACTCTT 


TTTTGTCTAG 


TAAAAAACAC 


CCATTGGGTG 


540 


AAAAAAGAAA 


CCATCCAGGA 


TCTAAGCTAA 


GGCAAGGATT 


CTGGATGGTT 


TTTAGATTTG 


600 


GGGTGAATAA 


TTGGGGTTTT 


AGCTGCTTGC 


GGCCAATCAG 


GTTCAGATAC 


AAAAACTTAC 


660 


TCATCAACCT 


TTAGTGGAAA 


TCCAACTACA 


TTTAACTATC 


TATTAGACTA 


TTACGCTGAT 


720 


AATATAGTCA 


ATTGAAACAA 


GAACAAGACA 


AAAGAGCCTC 


ATAAAAGGTA 


TTGCAACTTG 


780 


GTAATACCTT 


TTTGAGGTGC 


TTTTTGATAT 


GAGCCCATGT 


TTTCTCAATA 


GGATTGTACT 


840 


CAGGTGAGTA 


GGGAGGAAGA 


GGTAAAAGTT 


TATACCCAAA 


CTCTTCACAC 


AAGAGTTCTA 


900 


ACTTACCCAT 


TCTATGGAAT 


CTTGCATTAT 


CCATAATAAT 


AACCGATGGT 


GTGTTTAATG 


960 


TTGGTAAGAG 


AAATTTCTGA 


AACCAAGCTT 


CAAAAAAGTC 


GCTCGTCATC 


GTCTCTTCGT 


1020 


AAGTTATTGG 


AGCGATTAAC 


TCACCATTTG 


TTAGACCTGC 


AACCAAAGAA 


ATC CTCTGAT 


1080 
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ATCTTCTTCC AGATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC LI 40 

GATAAAAATA AGTATCGAAT CCTGTTTC 1168 

(2) INFORMATION FOR SEQ ID NO: 363: 

U> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

GTCAGCTTCA GCAAGCCCAT CAGCTTCTCA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60 

GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCC GCTTCAGCAT CAACGAGTGC 12 0 

GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180 

TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 24 0 

GAGTG CGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300 

ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 360 

CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420 

ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTG CATC A GCTTCAGCAT CAACGAGTGC 480 

ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 54 0 

AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 600 

CTCGG CTTC A GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 660 

AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720 

ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 7 80 

AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 84 0 

GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900 

AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960 

CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 1020 

AT CAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080 

ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140 

GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200 

GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 12 60 

ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320 
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TCrCCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCkTC AGCCTCAGCG 13 80 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 1440 

TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC AGC AAGT ACT 1500 

AGTGC AT CAG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCGACAAG CGCCTCAGCT 1560 

TCAGCAAGTA CCAGTGCGTC AGCCTCAGCG TCGACAAGTG CGTCCGCTTC AGCAAGTACC 162 0 

TCAGCGTCTG AATCAGCATC AACAAGTGCG TCGGCTTCAG CATCAACGAG TGCATCAGCT 1680 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 174 0 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCCG CATCAACAAG TGCCTCGGCT 1800 

TCAGCAAGCA CCAGTGCTTC GGCTTCAGCG TCAACGAGTG CGTCTGAGTC AGCATCAACG 1860 

AGTGCGTCAC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCACTTC 1920 

CGCATCAACA AGCGCCTCGG CCTCAGCAAG TACAAGTGCT TCAGCCTCAG CATCAACCAG 1980 

TGCATCAGCT TCAGCCTCAA CAAGTGCTTC AGCCTCAGCG TCAACCAGTG CCTCGGCTTC 204 0 

AGCAAGTACC AG TGCGTC AG CTTCAGCAAG CACAAGTGCG TCAGCTTCAG CATCAACCAG 2100 

TGCTTCGGCT TCGGCATCAA CAAGTGCCTC AGCATCAGCA TCAACGAGTG CGTCAsCTCA 2160 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACCAGTGCAT CAGCCTCAGC AAGT AT CTC A 2220 

GCGTCTGAAT CGGCATCAAC GAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA 2280 

GCGTCAACCA GTGCATCAGT CTCAGCAAGC ACCAGTGCGT CGGCTTCAGC ATCAACCAGT 2 340 

GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT CAACGAGTGC GTCAGcCTCA 2 4 00 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACGAGTGCAT CGGCTTCAGC AAGTACCAGC 24 60 

GCCTCAGCTT CAGCAAGCAC CAGTGCGTCA GCCTCAGCAA GTACCAGCGC CTCAGCCTCA 2520 

GCAACCACCA GTGCCTCAGC TTCAGCAAGT ACCAGTGCGT CAGCCTCAGC GTCGACAAGT 2 580 

GCGTCGGCTT CAGCAAGTAC CTCAGCGTCT GAATCAGCAT CAACGAGTGC ATCAGCTTCA 2640 

GCATCAACAA GTGCTTCAGC TTCAGCAAGT ACCAGTGCGT CGGCTTCAGC ATCAACGAGT 2700 

GCTTCAGT CT CAGCGTCAAC CAGTGCCTCT GAATCAGCAT CAACAAGTGC CTCGGCTTCA 2 7 60 

GCAAGCACCA GTGCGTCGGC TTCAGCAAGT ACTAGTGCAT CGGCTTCAGC ATCGACAAGT 2 820 

GCGTCTGAAT CGGCATCAAC GAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCAGCCTCA 2880 

G CAAGCAC AT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC GTCAACCAGT 2940 

GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCGGCCTCA 3000 

GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCCGC TT CAGCAAGC 3060 
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ACAAGTCCGT CAGCCTCAGC AAGTATCTCA GCCTCTCAAT CGGCATCAAC GAGTGCGTCG 3120 

GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT CAACCAGTCC GTCGGCTTCA 3180 

GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT 3240 

GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCG 3300 

GCATCAACCA GTGCGTCAGC CTCAGCATCG ACAAGCGCCT C AG CTTCAGC AAGTACCAGT 3 360 

GCTTCACCCT CAGCGTCGAC AAGT GCGTCG GCCTCAACCA GTGCATCTGA ATCGGCATCA 3420 

ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT CAGCATCAAC GAGTGCATCG 34 80 

GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGT CAACC A GTGCGTCAGC TTCAG CAAGT 354 0 

ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT CAGCATCGAC AAGTCCCTCG 3 600 

GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA GCGCCTCAGC TTCAGCAAGT 3 660 

ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA 3 720 

GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCCTCAACCA GTGCATCAGA GTCAGCAAGT 3780 

ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT CAGCAAGCAC C AGTG CGTCG 384 0 

GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA GTGCGTCAGC CTCAGCAAGT 3900 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA 39 60 

GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA GTGCGTCTGA ATCGGCATCA 4020 

ACCAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT CAGCGTCAAC AAGTGCATCG 4 080 

GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCGTCA 4140 

ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTGCGTCA 4200 

CCTCAkCAAG CACATCAGCT TCTGAATCTG CAT CAACC AG TGCGTCACTT CCGCATCAAC 4260 

AAGCGCCTCG GCCTCAGCAA GTACAACTGC TTCAGC CTC A GCATCAACCA GTGC AT CAGC 4 320 

TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG TGCCTCGGCT TCAGCAAGTA 4380 

CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA GCATCAACCA GTGCTTCGt; Z 4440 

TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT GCG 44 83 

(2) INFORMATION FOR SEQ ID NO: 3 64: 

(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 25S0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 



WO 98/18931 



PCT/US97/ 19588 



1379 

GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATC AACG AG TGCGTCCCCC TCAGCAAGCG 120 

CAACTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 3 60 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 4 80 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

T CAG CATC AA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT AGTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TG CATC AG AG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 84 0 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCAC CTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGGTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 12 60 

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACGAGTGCG 13 20 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 13 80 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGC CTCAG CTTCAGCAAG TACCAGTGCG 144 0 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AG CACATC AG CAT CTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 
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CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC TCAACCAGCG CCTCGGCCTC 1800 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860 

TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 1920 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCCTCTGAA TCGGCATCAA CGAGTGCGTC TGACTCAGCA TCAACGAGTA CGTCAGCCTC 2 040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCACTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 
AACCAGTGCA T CTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 
GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2 340 

GGCTTCAGCA AGCACATCAG CATCTGAATC AG CGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 
TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAACTA CTAGTGCATC 



2220 
2280 



2460 



AGCTTCAGCA TCAACCAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2 520 



2S50 



TACCAGTGCG TCAGTTCACG CATCAACAAG 

(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doub 1 e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 65: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA 60 

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAC 120 

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180 

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 240 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 4 20 

AG TGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 4 80 

TCAGCCTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 

TCAGCTTCTG AATCTGCATC AACCACTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600 
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TCAGCAAGTA 


CCACTGCCTC 


AgCCTCAGCA 


AGTACCAGTG 


CTTCAGCCTC 


AGCGTCGACA 


660 


AGTGCGTCGG 


CCTCAACCAG 


TGCATCTGAA 


TCGGCATCAA 


CCAGTGCGTC 


AGCCTCAGCA 


720 


AGTACTAGCG 


CCTCAGCCTC 


AGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


T ACT AGTGCA 


780 


T CAGCTTC AG 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCGA 


CAAGCGCCTC 


AGCTTCAGCA 


840 


AGTACCAGTG 


CGTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


CTTCAGCAAG 


TACCTCAGCG 


900 


TCTGAATCAG 


CATCAACAAG 


TGCGTCGGCT 


TCAGCATCAA 


CGAGTGCATC 


AGCTTCAGCA 


960 




LI 1 Lnuiw III— 




AGTGCGTCGG 


CTTCAGCATC 


AACGAGTGCT 


1020 


TCAGTCTCAG 


CGTCAACCAG 


TGCCTCTGAA 


TCCGCATCAA 


CAAGTGCCTC 


GGCTTCAGCA 


1080 


AGCACCAGTG 


CTTCGGCTTC 


AGCGTCAACG 


AGTGCGT CTG 


AGTCAGCATC 


AACGAGTGCG 


1140 


TCAGCCTCAG 


CAAGCACATC 


AGCTTCTGAA 


TCTGCATCAA 


CCAGTGCGTC 


AGCTTC CGC A 


1200 


TCAACAAGCG 


CCTCGGCCTC 


AGCAAGTACA 


AGTGCTTCAG 


CCTCAGCATC 


AACCAGTGCA 


1260 


TCAGCTTCAG 


CCTCAACAAG 


TGCTTCAGCC 


TCAGCGTCAA 


CCACTGCCTC 


GGCTTCAGCA 


1320 


AGTACCAGTG 


CGT CAGCTTC 


AGCAAGCACA 


AGTGCGTCAG 


CTTCAGCATC 


AACCAGTGCT 


1380 


TCGGCTTCGG 


CATCAACAAG 


TGCCTCAGCA 


TCAGCATCAA 


CCAGTGCGTC 


AGCCGG 


1436 



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 
(B> TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 66: 



GCAGTTGCCA 


CACCGTGCTG 


ACCAGCACCC 


GTTCCTGCGA 


TAATTTTCTT 


TTTACCCATG 


60 


CGTwTGGCAA 


GCCAAACTTG 


TCCTAAGG C A 


TTGTTAATCT 


TGTGGGCTCC 


TGTATGGTTA 


120 


AGGTCTTCCC 


GTTTGAGATA 


AATCTTGCTC 


CGCCAATATG 


CTGGGTCAAG 


TTTTTTGCGT 


180 


AATAAAGAGG 


AGTTTCACGT 


CCTACGTACT 


GGCGCAAAAG 


CTGGTTTAAT 


TCCTCTTGGA 


240 


AACTTGGGTC 


TGCCTGACTT 


TCACGGTAGG 


CCTTCTCCAA 


CTCCAAAACT 


GCTGTCATCA 


300 


ATGTTTCTGG 


GACAAAACGT 


CCGCCGAATT 


TTCCGTAAAA 


TCCATCTTTA 


TTTGGTTCCT 


360 


GATATGCCAT 


GCTTTACCCT 


CTCTATAAAT 


CTTCTAATCT 


TTTCATGATC 


TTTTTGTCC A 


420 


TCTGTCTCCA 


CTCCGCTCGA 


TACATCTACT 


GCATAGGG AG 


TAAAGTGTTG 


AATTGCTTTT 


480 


ACTACATTAT 


CTTCATTAAG 


GCCACCTGCG 


ATAAAGAAGG 


GCTGTGCTAG 


TCCAGTCGTA 


540 
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TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 600 

TAATCTGCCT GAGAATTGGG GACATGCCCA TTTC C ATCTA CCTGCACAGC CTGAATACTG 660 

GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 7 20 

AAGCCGGGGA TCCTC 735 



(2) INFORMATION FOR SEQ ID NO: 367: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1702 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 
{D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 67: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 60 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 120 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAACCA CATCAGCTTC 180 

TGAATCTGCA TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG 24 0 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 300 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 360 

TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA AGTACTAGTG CATCAGCTTC 420 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCCGCATCAA CAAGTGC CTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTT C AG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 660 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCGCG TCTGAATCCG CATCAACCAG 7 20 

TGCCTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGCCTl'C 780 

AGCAAGCACA AGTGCTT CAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 114 0 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 
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AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGC CTCAG CTTCAGCAAG 12 60 

CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 132 0 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 13 80 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 144 0 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AAC CAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA C ATC AG CTTC 1680 

TGAATCTGCA TCAACCAGTG CG 1702 
(2) INFORMATION FOR SEQ ID NO: 3 68: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 68: 

ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCC 60 

GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAG TGCGTCACTT CAGCATCAAC 120 

CAGTGCTTCG GCTTCGGCAT CAACAAGTGC CTCAGCATCA GCAT CAACG A GTGCGTCACC 180 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 240 

TCAGCGTCTG AATCGGCATC AACGAGTGCA TC AG CATC AG CATCAACGAG TGCATCGGCT 3 00 

TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 3 60 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 4 20 

TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACC 480 

AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 54 0 

TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 600 

AGCGCCTCAG CTTCAGCAAG CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 660 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 720 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 7 80 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 840 
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AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 
TCAGCAAGCA C CAGTG CGTC GGCTTCAGCA AGTACTAGTG C 
(2) INFORMATION FOR SEQ ID NO: 369: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 12 0 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCCCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA 300 

GTGCGTCTGA ATCGGCATCA ACCAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTG CATCG GCTTCAGCAT CAACGAGTGC GTCCG CTTC A GCAAGT ACTA 420 

GCGCCTCAGC CTCAGCCTCA ACAAGTGCAT CGGCTTCAGC GTCAACCAGT GCCTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCCTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGT ACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTG CAT CT GAATCGGCAT 720 

CAACCAGTGC CTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGACTGCAT 7 80 

CGGCTTCAGC AT CAACC AGT GCATCAGAGT CAGCAAGTAC CAGTG CGTCA GnTTCCGCAT 840 

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869 
(2) INFORMATION FOR SEQ ID NO: 370: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 



60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 2 40 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTC AG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTCCCTC AGCCTCAGCA 360 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAC CCTCAGCAAG TACTAGTGCA 4 20 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 4 80 

AGCACCAGTG CCTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 54 0 

TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTC AG CATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 



(2) INFORMATION FOR SEQ ID NO: 3 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 
{B) TYPE: nucleic acid 
<C) STRAND EDNESS : double 
(O) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCCTATT GCTAGTGGTG ACGTGCCTGC 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGT CGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 240 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 3 00 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 3 60 

TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 4 20 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 4 80 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 
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ACCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 72 0 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 84 0 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTAT GTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 372: 

CAAAGCGCCT CAG CTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60 

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120 

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180 

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240 

ACCAGCGCCT CGGCCTCAGC AAGCACCTCA GCTTCTGAAT CGGCCTCAAC CAGCGCCTCG 300 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 3 60 

ACGAGTCCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420 

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 4 80 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 54 0 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600 

ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG 6 60 

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTAGT 720 

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA 7 80 

GCAAGTACCA TGTGCTTCAT GTCTCAG 8 07 



(2) INFORMATION FOR SEQ ID NO: 373: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 373: 



CATCGGCTTC 


AGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTACCGCC 


T CAG CCTCAG 


60 


CGTCAACAAG 


TGCATCGGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTG 


120 


CGTCACCTCA 


GCAAGCACAT 


CAGCTTCTGA 


ATCTGCATCA 


ACCAGTGCGT 


CACCTCAGCA 


180 


TCGACAAGCG 


CCTCAGCTTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


ACAAGTGCGT 


240 


CGGCTTCAGC 


AAGTACCAGT 


GCGTCASCTC 


AGCAAGTACC 


AGTGCGTCAC 


CTCAGCGTCG 


300 


ACAAGTGCGT 


CGGCCTCAAC 


CAGTGCATCT 


GAATCGGCAT 


CAACCAGTGC 


GTCACCTCAG 


360 


CAAGTACTAG 


TGCATCAGCT 


TCAGCATCAA 


CGAGTGCATC 


GGCTTCAGCA 


TCAACCAGTG 


420 


CATCAGAGTC 


AGCAAGTACC 


AGTGCGTCAG 


cTTCCGCATC 


AAC AAGTGC C 


TCGG CTTC AG 


480 


CAAGTACTAG 


CGCCTCAGCC 


TCAGCGTCAA 


CAAGTGCTTC 


AGCTTCCGCG 


TCAAC CAGCG 


540 








AAT CGGC ATC 


AAC AAGTG C C 


TCGGCTTCAG 


600 


CATCAACGAG 


TGCATCAGTC 


TCAGCAAGCA 


C C AGTGCGTC 


GGCCTCAGCA 


AGCACCAGCG 


660 


CGTCTGAATC 


CGCATCAACC 


AGTGCCTCAG 


CTTCAGCAAG 


TACCTCAGCA 


TCTGAATCAG 


720 


CATCAACAAG 


TGCATCGGCT 


TCAGCAAGCA 


CAAGTGCTTC 


AGCCTCAGCA 


AGTATCTCAG 


780 


CGTCTGAATC 


GGCATCAACG 


AGTGCGTCCG 


CTTCAGCAAG 


TACTAGCGCC 


TCAGCATCAG 


840 


CGTCAACAAG 


TGCTTC GGCT 


TCAGCGTCAA 


CGAGTGCGTC 


TGAGTCAGCA 


TCAACGAGTA 


900 


CGTCAGCCTC 


AG C AAGC AC A 


TCAGCTTCTG 


AATCTGCATC 


AACCAGTGCG 


TCAGCCTCAG 


960 


CATCGACAAG 


CGCCTCAGCT 


TCAGCAAGTA 


CCAGTGCGTC 


AGCCTCAGCA 


AGT ACCAGT G 


1020 


CTTCAGCCTC 


AGCGTCGACA 


AGTGCGTCGG 


GCTCAACCAG 


TGCATCTG 




1068 



(2) INFORMATION FOR SEQ ID NO: 374: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TT CAG C AAGC CCCAGTGCGT CGGCTTCAGC AACTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 24 0 
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CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 3 60 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 4 80 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 



CGGCATCAAC CAGTGCGTCA 
(2) INFORMATION FOR SEQ ID NO: 375: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



txi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 75: 

GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATG^GGATA 60 

AACCATTTAG AAGAATTACC AGTGATTGAT GAGCTTGAGA TTCAAGCCCA AGAAAGCCAA 120 

TTATTTGGTG AAAGGATAGA AGAAGATGAG AATCAATAAG TATATTGCCC ACGCAGGTGT 180 

GGCCAGTAGG AGAAAAGCAG AAGAGCTGAT TAAGCAAGGC TTGGTGACGG TTAACGGCCA 2 40 

AGTGGTGCGT GAACTAGCAA CCACTATCAA GTCAGGCGAC AAGGTCGAAG TTGAAGGTCA 300 

ACCTATCTAC AACGAAGAAA AGGTCTACTA TCTGCTTAAC AAACCACGCG GTGTGATTTC 360 

CAGTGTGACA GATGATAAGG GTCGCAAGAC GGTTGTCGAC CTCTTGCCCA ATGTCAAAGA 420 

GCGT ATT T AC CCTGTGGGTC GTTTGGACTG GGATACATCA GGTGTCTTGA TTTTGACCAA 4 80 

TGATGGGGAC TTTACAGACG AGATGATTCA CC CTCGTAAT GAGATTGACA AGGTTTATGT 540 

CGCGCGTGTT AAAGGTGTGG CCAATAAGGA CAATCTCCGC CCCTTGACCC GTGGTCTTGA 600 

GATTGATGGT AAGAAAACCA AGCCATAATA TATAGGTTTT GTAGCCTCTA CACCATAAAT 660 

ATTTGCTAAT AAAAATACTG T ATT ATT AC C CTCTTAAGGT GCGAAATTAT TCAAGTTCTT 720 



(21 INFORMATION FOR SEQ ID NO: 376: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 6: 

CCCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACT s JcATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 3 00 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA C CATC ATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 4 80 

AG ATAAG AAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 54 0 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 64 8 



(2) INFORMATION FOR SEQ ID NO: 377: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAACTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTG CGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAG CGTCA ACCACTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 2 40 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 300 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 3 60 

CCGCCTCAAC GAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 4 20 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 480 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 54 0 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 
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CAGCTTCAGC ATCAACGAGT GCATCGGCTT 690 
(2) INFORMATION FOR SEQ ID NO: 378: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37 8: 



CGAGATTCTC 


TGGAGTTATG 


GATGTCGTTC 


CAATATGTGC 


ACGTTGGAAT 


GTTAGTGCTT 


60 


ATATGGGGGG 


AAC AGAATC C 


TCTCTTGATT 


GAAGACAAGC 


TAGTCATTAG 


GCTGGTTTGT 


120 


CTTTTTGTCA 


ACTGTAGTGG 


GTTGATATAA 


TAGTATTAGT 


GAGTGGGATA 


AAAGTTTCAT 


180 


TTAGTTTATT 


CAGTACAAAT 


TTAACGGGTC 


AAG ATTT AT A 


TACTAGTGGT 


GTTTTTGGGG 


240 


CTGAGAGAAG 


TATCTTGATT 


TTATGTGTGG 


TTTTTATACT 


TACAGTTGTT 


CTGCTCCAAA 


300 


GAGCTTGTAG 


AGAAGAATTA 


GCTCATAAAG 


GAGATTGATT 


ATTTTG AT AT 


CAAAAAAATG 


360 


CACAGGATAA 


CCTGATGCAT 


TTTTTTAGCG 


ACAATGCTTG 


CTACTTCCTT 


CTGTCGAATT 


420 


TAGACAATTT 


TAAACCCCAA 


TTATTCACCC 


CAAATCTAAA 


AACCATCCAG 


AATCCTTGCC 


480 


TTAGCTTAGA 


TCCTGGATGG 


TTTCTTTTTT 


CACCCAATGG 


GTGTTTTTTA 


CTAGACAAAA 


540 


AAGAGTTTCC 


CCTTTATGGT 


ATAAGTGTAG 


AAAAAAACAC 


AAAAAGAAAG 


G AAACT C AC A 


600 


TGAACAGTTT 


ACCAAATCAT 


CACTTCCAAA 


ACAAGTCTTT 


TTACCAACTA 


TCTTTCGATG 


660 


GAGGTCATTT 


AACCCAGTAT 


GGTGGTCTTA 


TCTTTTTTCA 


GGAACTTTTT 


TCCCAGTTGA 


720 


AACTAAAAGA 


GCGGATTTCT 


AAGTATTTAG 


TAACGAATGA 


CCAACGCCGC 


TACTGTCGTT 


780 


ATTCGGATTC 


AGATATCCTT 


GTCCAGTTCC 


TCTTTCAACT 


GTTAACAGGT 


TAT GGAACGG 


840 


ACT ATG CTTG 


TAAAGAATTG 


TCAGCTGATG 


CCTACTTTCC 


AAAATTATTG 


GAAGGAGGGC 


900 


AGCTTGCTTC 


ACAGCCAACC 


TTATCCCGTT 


TTCTTTCCAG 


AACTGACGAG 


GAAACAGTCC 


960 


ATAGTTTGCG 


ATGCCTCAAC 


CTTGAATTGG 


TCGAATTCTT 


TTT 




1003 



(2) INFORMATION FOR SEQ ID NO: 379: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 60 

TG AAA TACT A ATTGTTTTAG CTATTATCCT ATCTCTTGCT TTGATTGTAT TGGTAACTAT 120 

ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 180 

CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 

TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 300 

CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 3 60 

TAGGAAGCTA GCCGCAckGC TCAAAACACC G TTTTGAGGT TGTAGATATA ACTGACGAGc 420 

GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 4 80 

TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 54 0 

CTGACGAAGT CGc TCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 

ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC AT AT AT AC AG CAAGGCGACG 660 

CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 7 20 

ATGAAGGTTT TTTTTTT T 738 



(2) INFORMATION FOR SEQ ID NO: 3 80: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 360 

CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 4 80 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGC CAT AAGG GCAAGAGTTC 600 
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CC G C AC AG AT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660 



GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 
(2) INFORMATION FOR SEQ ID NO : 3 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



695 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 360 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 42 0 

TTTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAACGAA TGACCAACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 54 0 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 
(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTG C GTAATGGTCC TCAGATAACT CTGATGATGT CTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA G ATTC CGACT 180 

AATATTGGCT TTG CATTT AT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 24 0 
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TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 300 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG G ACT AGAATG 3 60 

CTTTTTGAAA ACAGATGACT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 4 20 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 4 80 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC C TTTTCTGTA 600 

CTGGACCAAG TCA G TTTTCC GTTCTCAAAG C GTTT AT AT A ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 720 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 7 50 



(2) INFORMATION FOR SEQ ID NO: 3 93: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 
{ B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
{ D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 383 : 
TCAAATTCTT CGTGCTCCGC ATATCTnTCT TCGTACACGG CACTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GCCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAA T TCTTTG 240 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAG ACTCT A CCATGTTGAT AGCGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 360 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 4 80 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 
(2) INFORMATION FOR SEQ ID NO: 384: 
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(i) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



txi> SEQUENCE DESCRIPTION: SEQ ID NO: 38tr 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA AT CTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 4 20 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 4 80 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTC CATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAG CAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657 



(2) INFORMATION FOR SEQ ID NO: 385: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE : nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
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TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540 
AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 3 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE; nucleic acid 
(C> STRAND EDN ESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 



CGTCGGCTTC 


AGCATCAACG 


AGTGCATCAG 


CTTCAGCATC 


AACAAGTGCT 


TCAGCTTCAG 


60 


CAAGTACCAG 


TGCGTCGGCT 


TCAGCATCAA 


CGAGTGCTTC 


AGTCTCAGCG 


TCAACCAGTG 


120 


CCTCTGAATC 


CGCATCAACA 


AGTGCCTCGG 


CTTCAGCAAG 


CACCAGTGCT 


TCGGCTTCAG 


180 


CGTCAACGAG 


TGCGTCTGAG 


TCAGCATCAA 


CGAGTGCGTC 


ACCTCAGCAA 


GCACATCAGC 


240 


TTCTGAATCT 


GCATCAACCA 


GTGCGTCAGC 


TTCCGCATCA 


AC AAG CGCCT 


CGGCCTCAGC 


300 


AAGTACAAGT 


GCTTCAGCCT 


CAGCATCAAC 


CAGTGCATCA 


GCTTCAGCCT 


CAACAAGTGC 


360 


TTCAGCCTCA 


GCGTCAACCA 


GTGCCTCGGC 


TTCAGCAAGT 


ACCAGTGCGT 


CAGTTCAGCA 


420 


AGCACAAGTG 


CGTCAATTTA 


GCATCAACCA 


G 






451 



(2) INFORMATION FOR SEQ ID NO: 3 87: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 



TCTCAGCAAG 


CACCATTGCG 


TCGGCTTCAT 


CAAGCACCAG 


CGCGTTTGAA 


TCCGCATCAA 


60 


CGAGTGCTTC 


AGCTTCAGCC 


AAGTTACCTC 


AGCATCTGAA 


TCAGCATCAA 


CAAGTGCATC 


120 


GGCTTCAGCA 


AGCACAAGTG 


CTTCAGCtCA 


GCAAGTATCT 


CAGCGTCTGA 


ATCGGCATCA 


180 


ACGAGTGCGT 


CCGCTTCAGC 


AAGTACTAGC 


GCCTCAGCAT 


CAGCGTCAAC 


AAGTGCTTCG 


240 


GCTTCAGCGT 


CAACGAGTGC 


GTCTGAGTCA 


GCATCAACGA 


GTACGTCAGC 


CTCAGCAAGC 


300 


ACATCAGCTT 


CTGAATCTGC 


ATCAACCAGT 


GCGTCAGCCT 


CAGCATCGAC 


AAGCGCCTCA 


3 60 


GCTTCAGCAA 


GTACCAGTGC 


GTCAGCCTCA 


GCAAGTACCA 


GTGCTTCAGC 


CTCAGCGTCG 


420 
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ACAAG 425 
(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTACT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TT T TTT TG CA ATTTAGCTGA TTTlTCTri T TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC AC CGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 3 60 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGJcCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 
CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAAcAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 
CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 
CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 3 60 

GTGCGTCAGC CTCAGCAAGT ACT AGCGC CT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 480 

GCGCCTCAGC TTCAGCAAGC ACCGG 505 

(2) INFORHATION FOR SEQ ID NO: 390; 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 
(8) TYPE: nucleic acid 
(C) STRAND EDN ESS : double 
{ D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 
GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA AC AGT A T AT C ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 3 00 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 3 60 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 



<2) INFORMATION FOR SEQ ID NO: 391: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 57 2 base pairs 
< B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 
AG CACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTT CTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCT AT AATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAG AGCT AAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 4 20 
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AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 4 80 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 57 2 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the aft. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for the International 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever two dates occurs earlier. 
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SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention .* 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Faicui CfSce 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected in the art. 
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What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ED NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1 -391 ; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ED NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ED NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said- 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
degenerate variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ED NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5* to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 
to any one of the fragments of the Streptococcus pneumoniae genome depicted in 
SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a hpmolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

b) identifying members of said library which contain sequences that 
hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 



16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

J 7. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 

18. An isolated polynucleotide molecule encoding any one of the - 
polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

130 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 

135 pneumoniae genome of SEQ ID NOS:l-391 and depicted in Tables 2 and 3, under 

conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 
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nucleotide sequence depicted in SEQ ID nos. 1-391, a 
representative fragment thereof or a nucleotide sequence at 
least 95% identical to a nucleotide sequence depicted in SEQ 
ID nos. 1-391; a computer-based system for identifying 
fragments of the Streptococcus pneumoniae genome of 
conmercial importance comprising: a) a data storage means 
comprising said nucleotide sequence(s); b) search means for 
comparing a target sequence to the nucleotide sequence of 
the data storage means of step (a) to identify homologous 
sequence(s), and c) retrieval means for obtaining said 
homologous sequence(s) of step (b); a method for identifying 
comnercially important nucleic acid fragments of the 
Streptococcus pneumoniae genome comprising the step of 
comparing a database comprising said nucleotide sequence! s) 
with a target sequence to obtain a nucleic acid molecule 
comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not 
randomly selected; a method for identifying an expression 
modulating fragments of the Streptococcus pneumoniae genome 
comprising the step of comparing a database comprising said 
nucleotide sequence(s) with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary 
nucleotide sequence to said target sequence, wherein said 
target sequence comprises sequences known to regulate gene 
expression; 



2. Claims: (8-20) partially 

An isolated protein-encoded nucleic acid fragment of the 
Streptococcus pneumoniae genome, wherein said fragment 
consists of the nucleotide sequence of the fragment of 
SEQ ID no.l depicted in Tables 2 and 3, or a degenerate 
variant thereof; a vector comprising the fragment of the 
Streptococcus .pneumoniae genome SEQ ID no.l; an isolated 
fragment of the Streptococcus pneumoniae genome, wherein 
said fragment modulates the expression of an operably linked 
open reading frame, wherein said fragment consists of the 
nucleotide sequence from about 10 to 200 bases in length 
which is 5 1 to any one of the open reading frame of SEQ ID 
no.l depicted in Tables 2 and 3 or a degenerate variant 
thereof; a method for regulating the expression of a nucleic 
acid molecule comprising the step of covalently attaching to 
said nucleic acid molecule a nucleic acid molecule 
consisting of the nucleotide sequence from about 10 to 100 
bases 5* to any one of the open reading frame of SEQ ID 
no.l and Tables 2 and 3 or a degenerate variant thereof; an 
isolated nucleic acid molecule encoding a homolog of SEQ 
ID no.l; an isolated polypeptide encoded by SEQ ID, no.l and 
depicted. in Table 2 and 3; an antibody which selectively 
binds to any one of said polypeptides, a method for 
producing a polypeptide* in a host cell comprising a) 
incubating a host containing a heterologous nucleic acid 
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molecuie whose nucleotide sequence consists of SEQ ID no.l 
and depicted in Table 2 and 3, under conditions where sad 
heterologous nucleic acid molecule is expressed to produce 
said protein, and b) isolating said protein- 
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